I need to use API (whatever ChatCompletions or Assistants or some else) to extract structured data from Amazon product page. First I tried to extract just by link, but it returns some bullshit, for example https://www.amazon.com/-/de/dp/B0BZ8X9HGT/ :
So the first thing to note is that the model can’t actually browse the web. You need to implement some browsing functionality and include it as a tool for your assistant.
Extracting text from images isn’t super reliable, but it’s getting better. You’ll probably have more success with OCR tools if you need a high degree of reliability.
probably using a scraper? depends on what exactly you’re after.