Is documentation for the upcoming reinforcement fine-tuning available?

Reinforcement fine-tuning is currently only available to selected alpha testers and will be made available in early 2025. It’s a complex, compute-intensive, potentially dangerous thing, I understand that.

But is there an argument against already making documentation available on how it will work when it’s released? I am talking about things like how to define graders and what evaluators are available. This would enable developers to already prepare their datasets in anticipation.