Reinforcement Fine-Tuning
This fine-tuning technique is designed to improve the performance of our reasoning models on verifiable domain-specific tasks. We find it's most useful on tasks that can be easily graded against the “correct” answer and where accuracy is important—in industries like math, science, legal, healthcare, and financial services.