Who Owns the Text? Design Patterns for Preserving Authorship in AI-Assisted Writing
arXiv:2601.10236v1 Announce Type: cross Abstract: AI writing assistants can reduce effort and improve fluency, but they may also weaken writers’…
Introduction to optimization methods for training SciML models
arXiv:2601.10222v1 Announce Type: cross Abstract: Optimization is central to both modern machine learning (ML) and scientific machine learning (SciML), yet…
Reward Learning through Ranking Mean Squared Error
arXiv:2601.09236v2 Announce Type: replace-cross Abstract: Reward design remains a significant bottleneck in applying reinforcement learning (RL) to real-world problems. A…
Bridging the Trust Gap: Clinician-Validated Hybrid Explainable AI for Maternal Health Risk Assessment in Bangladesh
arXiv:2601.07866v1 Announce Type: new Abstract: While machine learning shows promise for maternal health risk prediction, clinical adoption in resource-constrained settings…
ES-Mem: Event Segmentation-Based Memory for Long-Term Dialogue Agents
arXiv:2601.07582v2 Announce Type: replace-cross Abstract: Memory is critical for dialogue agents to maintain coherence and enable continuous adaptation in long-term…
PRPO: Aligning Process Reward with Outcome Reward in Policy Optimization
arXiv:2601.07182v2 Announce Type: replace-cross Abstract: Policy optimization for large language models often suffers from sparse reward signals in multi-step reasoning…
