Hi! I’m not actually a developer, but sometimes I want to train a model on my own data. From the documentation, I’ve learned that I need to organize my “dataset” in .jsonl
format (which, as I understand, means long strings of text).
My data is quite large, and it’s very inconvenient for me to handle it in editors like VSCode or PyCharm.
- Are there any good methods for managing large
.jsonl
files? - Wouldn’t it be great if platform.openai.com had an user-friendly interface where we could upload and manage this data before deciding to train a model?