Hi,
Thank you for sharing the code and results for this project. It's very important work.
I have successfully located the trajectories, preds.json and eval_results.json. However, I noticed that the detailed execution logs(specifically stdout.log and stderr.log) for each instance seem to be missing.
Without these logs, it is difficult to determine exactly which specific test cases failed for the instances marked as False in eval_results.json.
Would it be possible to release these log files? Or is there a specific place I should look for them? Having access to the logs would be extremely helpful for performing a detailed error analysis.
Thanks in advance for your help!