APRIL Strategy¶
**A**ctive **P**artial **R**ollout for **I**efficient generation with **L**ong-tail handling.
Strategy¶
- Oversample: Generate more rollouts than needed
- Abort: Cancel long-running generations
- Reuse: Save and reuse partial trajectories
Configuration¶
Benefits¶
- Reduces waiting for slow generations
- Improves GPU utilization
- Maintains training throughput