This is a really beautiful piece of work. WARP makes great use of properties of model merging I don't often see combined. Catastrophic forgetting mitigation, capability enhancement, KL/reward balancing, and low-bandwidth parallelization all at once? Hell yeah.