Urgent: Persistent Issues with Training File Formatting and Upload for Hugging Face AutoTrain via ChatGPT Agent (Zero)

Hello OpenAI Team,

I’m reaching out for urgent help with a recurring issue that’s been blocking my progress for the past three days. I’ve been trying to format and upload a 150MB .csv training file for Hugging Face AutoTrain, working through my custom ChatGPT agent, Zero. Although Zero successfully handled similar tasks a few weeks ago, this time the formatting process keeps failing, with a different unexpected issue cropping up on each attempt that I can’t seem to resolve.

Background and Specific Challenges

  1. Disappearing and Failing Files: I’ve repeatedly encountered situations where the training files disappear during or after upload, often resulting in errors. When the upload does complete, Zero has been unable to properly format the data despite using the same structure that worked successfully in the past.
  2. Previous Successes, Current Failures: Just a few weeks ago, Zero formatted and structured similar datasets perfectly, with no issues uploading, formatting, or using the files on Hugging Face’s AutoTrain. I don’t know what has changed, but this recent dataset, though structured similarly, simply won’t go through. The system keeps hitting roadblocks, and I’m unsure if it’s due to file size, specific data content, or a change in Zero’s capabilities.
  3. Frustration with Inconsistency: I’ve been stuck on this problem for three days now, trying to troubleshoot and reformat the file repeatedly without success. My dataset is a .csv file with simple prompt and response fields, and it adheres to the formatting guidelines I’ve used previously. The goal is straightforward: to get this file formatted and ready for AutoTrain, but each attempt results in different kinds of failures—upload issues, disappearing files, or incomplete formatting.
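For reference, this is roughly how the structure of the file can be checked locally before any upload. It is only a minimal sketch: it assumes the header names are literally `prompt` and `response`, and the function name `check_prompt_response_csv` is made up for illustration.

```python
import csv
import io


def check_prompt_response_csv(handle, required=("prompt", "response")):
    """Return (record_count, problems) for a CSV given as an open text handle.

    Assumption: the required column names are 'prompt' and 'response'.
    """
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in required if name not in header]
    if missing:
        # Without the expected columns there is nothing else worth checking.
        return 0, ["missing column(s): " + ", ".join(missing)]

    problems = []
    count = 0
    for record_no, row in enumerate(reader, start=1):
        count += 1
        for name in required:
            # Short rows yield None for absent fields, so fall back to "".
            if not (row.get(name) or "").strip():
                problems.append(f"record {record_no}: empty '{name}'")
    return count, problems


# Tiny in-memory sample standing in for the real file.
sample = io.StringIO("prompt,response\nHi,Hello\nBye,\n")
count, problems = check_prompt_response_csv(sample)
print(count, problems)
```

For the real file, the handle would come from something like `open(path, newline="", encoding="utf-8")`, which reads the 150MB file record by record rather than loading it all at once.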

Request for Assistance

I’d appreciate it if someone from OpenAI could:

  • Provide guidance on why the file formatting suddenly fails or disappears, even when following the same process as previous, successful attempts.
  • Investigate potential issues with Zero’s file handling to understand if there are new limitations or bugs affecting large .csv training files.
  • Help with concrete steps to get this specific file formatted and uploaded as intended. At this point, I’m looking for a reliable way to get this file ready for AutoTrain as I had successfully done before.

Summary of Expectations

All I’m looking for is consistent, successful file formatting and upload through Zero, just as it worked a few weeks ago. The repeated failures are not only delaying my work but causing serious frustration. Any support to resolve this quickly would be greatly appreciated.

Thank you for your attention. I look forward to your help.

Another issue: I don’t trust AI companies in general, but I want to stick with OpenAI. I do not want every AI company to have access to all of my datasets.

Error with GPT-4o:
Failed upload to files.oaiusercontent.com. Please ensure your network settings allow access to this site or contact your network administrator.

Day 5 of error loops. I am supposed to be launching my new LLM on 11-11-24 :(

It seems there was an issue generating the download link. I’ll reattempt creating and saving the file for you.

Let me try another approach.

It seems I’m unable to provide a direct download link at the moment. However, I can guide you through creating the file on your own system if that would work for you. Here’s the process:

  1. Copy the table data above or use the dictionary structure provided in the code.
  2. Paste it into a script or Jupyter Notebook on your system.
  3. Use Python’s pandas library to convert the data into a .csv file by running the code provided.

Let me know if you need further help with these steps or if you’d like the content in another format!
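The three steps suggested above can be sketched roughly as follows. This is only an illustration under assumptions: the dictionary contents are placeholder rows rather than my real data, and the helper name `write_training_csv` and the output filename are hypothetical.

```python
import os
import tempfile

import pandas as pd


def write_training_csv(data, path):
    """Write a dict of equal-length column lists to a CSV file at `path`."""
    frame = pd.DataFrame(data)
    # index=False keeps pandas from adding an extra unnamed index column.
    frame.to_csv(path, index=False, encoding="utf-8")
    return len(frame)


# Placeholder rows; the real prompt/response pairs would go here.
example_data = {
    "prompt": ["What is AutoTrain?", "Say hello, please"],
    "response": ["A Hugging Face training tool.", "Hello!"],
}

# Written to a temporary directory here; any local path works.
out_path = os.path.join(tempfile.mkdtemp(), "training_data.csv")
written = write_training_csv(example_data, out_path)
print(written, "rows written to", out_path)
```

Reading the file back with `pd.read_csv(out_path)` is a quick way to confirm that fields containing commas were quoted correctly.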

There must have been an update, because it seems to be working now. Even the 150MB .csv was formatted correctly, and all the issues appear to be gone.