Assistant API - Run won't cancel, keeps billing usage

The same thing happened to me. I tried to have it read a 100 MB PDF (OpenAI says the limit is 500 MB). It waited for 5 minutes, then said “Run cancelled”.

However, the interesting thing is that the dashboard shows 500 output tokens for the API call (I only used the API once, so it must be the call I just made), yet no output was ever shown.
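One way to cross-check the dashboard figure is to look at the run object itself. Below is a minimal sketch, assuming the run exposes a `status` field and a `usage` object with `prompt_tokens` and `completion_tokens`. The helper name `summarize_run` and the numbers in the stand-in object are made up for illustration, and the stand-in replaces a real API response so the snippet runs without a key.

```python
from types import SimpleNamespace

# Statuses after which a run is assumed to stop doing work (assumption based
# on the run lifecycle; adjust if your API version differs).
TERMINAL_STATUSES = {"completed", "cancelled", "failed", "expired", "incomplete"}


def summarize_run(run):
    """Return the run's status and token usage as a plain dict.

    `run` is any object with a `status` attribute and an optional `usage`
    attribute (hypothetical helper, not part of any SDK).
    """
    usage = getattr(run, "usage", None)  # may be None while the run is active
    return {
        "status": run.status,
        "terminal": run.status in TERMINAL_STATUSES,
        "prompt_tokens": getattr(usage, "prompt_tokens", None),
        "completion_tokens": getattr(usage, "completion_tokens", None),
    }


# Stand-in for a real run object; the values are illustrative only.
fake_run = SimpleNamespace(
    status="cancelled",
    usage=SimpleNamespace(prompt_tokens=1200, completion_tokens=500),
)
print(summarize_run(fake_run))
```

With the official `openai` Python package, the run object would presumably come from something like `client.beta.threads.runs.retrieve(thread_id=..., run_id=...)`; I have not verified that against this exact case, so treat it as an assumption. If a cancelled run reports non-zero `completion_tokens`, that would at least match what the dashboard shows.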

Another thing is that I didn’t see any embedding model calls. Given the size of my PDF, I was expecting large embedding usage, since:

performs a vector search for longer documents

And that requires an embedding model call.

I have no idea what happened.