Getting Intermittent failed Runs in Assistant API when generating images with code interpreter

I have deployed app using the beta assistant API to build graphs from an uploaded CSV to a thread. The assistant uses code interpreter to parse/graph the CSV. It uses matplotlib to generate these graphs and exports them as an image.

After deploying my changes, I noticed that runs have been failing somewhat frequently(~1/10 tries). The interesting thing is it appears code interpreter had run fine(it generated a valid image/graph) but the run step says it failed(with last_error set to {“code”: “server_error”,“message”: “Sorry, something went wrong.”}). This results in no message being generated(with the link to the image), leaving a bad user experience for my users.

I tried asking for the same graph in a separate thread to verify if it was an issue with the code generated and the run ran fine(the code it wrote to generate the graph was functionally identical).

I verified these errors didn’t occur during OpenAI incidents. My guess is this is a provisioning issue with the sandboxes code interpreter uses to run code.

Really appreciate all the hard work you guys have put into this new endpoint/code interpreter, the results we’ve been getting have been truly stunning! If you have any other questions for more context on the issue don’t hesitate to ask!

ThreadIds with failed runs:
thread_25clHfEhwfxl7g2TE0dUhGby
thread_p5dPvXC0LxjLti1o3ffnEIa3
thread_1Lq6guiecDJLgTMEVXwMlamB
thread_rpLVIOqiJRefsSP8AAsFbAQ9

I am seeing this issue occur more frequently now. I’ve had 6 runs fail in a row yesterday. Could anyone from OpenAI help me troubleshoot this?

Here are the latest threadIds:
thread_sTYCzC7nj3xGpksWXK0E65uM
thread_ONaJF21V5SyPoKObjcrGne4G
thread_hx039iEEdSYsE97oyOOge1rR
thread_RkbHyO7mKzKyB8jNgoU4bT1m
thread_fMtFeR4IE1Vk1yndKJ2nt4bh
thread_mr0vhYIB0wuRYuhFRDHsCub7
thread_7Y094HWHDpwMyqia7NLX10up
thread_OUtVPRrgx0CDaLa3vbjJGWPW