Post Content Post navigation Decoding Order Matters in Autoregressive Speech Synthesis PRPO: Aligning Process Reward with Outcome Reward in Policy Optimization