Research
Hard Examples Are All You Need
Eternis Team·March 22, 2026
Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets
Focusing on the hardest examples yields up to 47% performance gains when post-training with GRPO under budget constraints.
This work was presented at the NeurIPS Efficient Reasoning Workshop.
