Fine-tuning fails for gpt-4.1-nano

Once I scale my dataset size up, I repeatedly see:
“The job failed due to an internal error.”
The job runs on partitions of the dataset (~4k examples). Stumped on what to do!?