PRISM: Parallel Reward Integration with Symmetry for MORL

By Admin

Feb 23, 2026

THE AI TODAY

arXiv:2602.18277v1 Announce Type: cross
Abstract: This work studies heterogeneous Multi-Objective Reinforcement Learning (MORL), where objectives can differ sharply in temporal frequency. Such heterogeneity allows dense objectives to dominate learning, while sparse long-horizon rewards receive weak credit assignment, leading to poor sample efficiency. We propose a Parallel Reward Integration with Symmetry (PRISM) algorithm that enforces reflectional symmetry as an inductive bias in aligning reward channels. PRISM introduces ReSymNet, a theory-motivated model that reconciles temporal-frequency mismatches across objectives, using residual blocks to learn a scaled opportunity value that accelerates exploration while preserving the optimal policy. We also propose SymReg, a reflectional equivariance regulariser that enforces agent mirroring and constrains policy search to a reflection-equivariant subspace. This restriction provably reduces hypothesis complexity and improves generalisation. Across MuJoCo benchmarks, PRISM consistently outperforms both a sparse-reward baseline and an oracle trained with full dense rewards, improving Pareto coverage and distributional balance: it achieves hypervolume gains exceeding 100% over the baseline and up to 32% over the oracle. The code is at https://github.com/EVIEHub/PRISM.
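To make the idea of a reflectional equivariance regulariser concrete, here is a minimal sketch in Python. It is not the SymReg implementation from the PRISM repository. It assumes a generic penalty of the form "the policy applied to a mirrored state should equal the mirrored action of the original state", and every name in it (`reflection_equivariance_penalty`, `SWAP`, `make_linear_policy`, the toy weight matrices) is hypothetical and chosen for illustration only.

```python
import numpy as np


def reflection_equivariance_penalty(policy, states, state_reflect, action_reflect):
    """Mean squared gap between policy(mirror(s)) and mirror(policy(s)).

    Assumed generic form of an equivariance penalty; the abstract does not
    specify the exact loss used by SymReg.
    """
    mirrored_states = states @ state_reflect.T        # mirror each state
    actions_of_mirrored = policy(mirrored_states)     # act in the mirrored world
    mirrored_actions = policy(states) @ action_reflect.T  # mirror original actions
    gap = actions_of_mirrored - mirrored_actions
    return float(np.mean(np.sum(gap ** 2, axis=1)))


# Hypothetical reflection: swap the two coordinates (e.g. left and right limb).
SWAP = np.array([[0.0, 1.0],
                 [1.0, 0.0]])


def make_linear_policy(weights):
    """Toy deterministic policy a = W s, applied row-wise to a batch of states."""
    return lambda batch: batch @ weights.T


rng = np.random.default_rng(0)
states = rng.normal(size=(64, 2))

# W_SYM commutes with SWAP, so the policy is exactly reflection-equivariant.
W_SYM = np.array([[1.0, 2.0],
                  [2.0, 1.0]])
# W_ASYM does not commute with SWAP, so the penalty is strictly positive.
W_ASYM = np.array([[1.0, 2.0],
                   [0.0, 3.0]])

penalty_sym = reflection_equivariance_penalty(
    make_linear_policy(W_SYM), states, SWAP, SWAP)
penalty_asym = reflection_equivariance_penalty(
    make_linear_policy(W_ASYM), states, SWAP, SWAP)

print(f"equivariant policy penalty:     {penalty_sym:.6f}")
print(f"non-equivariant policy penalty: {penalty_asym:.6f}")
```

In a training loop, a term like this would typically be added to the policy loss with a weighting coefficient, so that minimising it pulls the policy toward the reflection-equivariant subspace the abstract describes.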
