The new GPT functionality of ChatGPT offers some kind of LLM agent capability, which is far simpler to setup than using platforms like Langchain. But I see several problems.

First, browsing is too slow to be really useful for those that just want swing by for a quick reply. Often one needs to browse several websites for one prompt (and each one needs first “google” and then the actual website retrieval).

Second, with APIs alone it is difficult to obtain more complex functions. Often the returned json is far too big. Then, APIs often use ids which require some kind of sql-join. This could in principle be done by chatGPT, but it requires two large jsons in the prompt. The join should better be done before the result is send to ChatGPT. If anyone has a solution to this, please let me know.