Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super Computer
arXiv:2604.00785v1 Announce Type: cross Abstract: Pretraining Large Language Models (LLMs) from scratch requires a massive amount of compute. Aurora super computer…
ChartDiff: A Large-Scale Benchmark for Comprehending Pairs of Charts
arXiv:2603.28902v1 Announce Type: new Abstract: Charts are central to analytical reasoning, yet existing benchmarks for chart understanding focus almost exclusively…
A Convex Route to Thermomechanics: Learning Internal Energy and Dissipation
arXiv:2603.28707v2 Announce Type: replace-cross Abstract: We present a physics-based neural network framework for the discovery of constitutive models in fully…
LLM-Meta-SR: In-Context Learning for Evolving Selection Operators in Symbolic Regression
arXiv:2505.18602v3 Announce Type: replace-cross Abstract: Large language models (LLMs) have revolutionized algorithm development, yet their application in symbolic regression, where…
Evaluation of Generative Models for Emotional 3D Animation Generation in VR
arXiv:2512.16081v2 Announce Type: replace-cross Abstract: Social interactions incorporate nonverbal signals to convey emotions alongside speech, including facial expressions and body…
Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling
arXiv:2603.14841v2 Announce Type: replace-cross Abstract: Road crashes remain a leading cause of preventable fatalities. Existing prediction models predominantly produce binary…
ZeroFlood: Flood Hazard Mapping from Single-Modality SAR Using Geo-Foundation Models
arXiv:2510.23364v2 Announce Type: replace-cross Abstract: Flood hazard mapping is essential for disaster prevention but remains challenging in data-scarce regions, where…
Agenda-based Narrative Extraction: Steering Pathfinding Algorithms with Large Language Models
arXiv:2603.29661v1 Announce Type: cross Abstract: Existing narrative extraction methods face a trade-off between coherence, interactivity, and multi-storyline support. Narrative Maps…
Concept frustration: Aligning human concepts and machine representations
arXiv:2603.29654v1 Announce Type: cross Abstract: Aligning human-interpretable concepts with the internal representations learned by modern machine learning systems remains a…
6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management
arXiv:2603.29656v1 Announce Type: cross Abstract: Autonomous 6G network management requires agents that can execute tools, observe the resulting state changes,…
