February, 2026 -

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

February 10, 2026 Admin

arXiv:2602.06960v2 Announce Type: replace-cross Abstract: Large reasoning models achieve strong performance by scaling inference-time chain-of-thought, but this paradigm suffers from…

AI RESEARCH

When Do Multi-Agent Systems Outperform? Analysing the Learning Efficiency of Agentic Systems

February 10, 2026 Admin

arXiv:2602.08272v1 Announce Type: cross Abstract: Reinforcement Learning (RL) has emerged as a crucial method for training or fine-tuning large language…

AI RESEARCH

CoRect: Context-Aware Logit Contrast for Hidden State Rectification to Resolve Knowledge Conflicts

February 10, 2026 Admin

arXiv:2602.08221v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) often struggles with knowledge conflicts, where model-internal parametric knowledge overrides retrieved evidence,…

AI RESEARCH

Federated Prompt-Tuning with Heterogeneous and Incomplete Multimodal Client Data

February 10, 2026 Admin

arXiv:2602.07081v1 Announce Type: cross Abstract: This paper introduces a generalized federated prompt-tuning framework for practical scenarios where local datasets are…

AI RESEARCH

SuReNav: Superpixel Graph-based Constraint Relaxation for Navigation in Over-constrained Environments

February 9, 2026 Admin

arXiv:2602.06807v1 Announce Type: cross Abstract: We address the over-constrained planning problem in semi-static environments. The planning objective is to find…

AI RESEARCH

Towards Agentic Intelligence for Materials Science

February 9, 2026 Admin

arXiv:2602.00169v2 Announce Type: replace-cross Abstract: The convergence of artificial intelligence and materials science presents a transformative opportunity, but achieving true…

AI RESEARCH

D-SCoRE: Document-Centric Segmentation and CoT Reasoning with Structured Export for QA-CoT Data Generation

February 9, 2026 Admin

arXiv:2508.01309v2 Announce Type: replace-cross Abstract: The scarcity and high cost of high-quality domain-specific question-answering (QA) datasets limit supervised fine-tuning of…

AI RESEARCH

CSRv2: Unlocking Ultra-Sparse Embeddings

February 9, 2026 Admin

arXiv:2602.05735v2 Announce Type: replace-cross Abstract: In the era of large foundation models, the quality of embeddings has become a central…

AI RESEARCH

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

February 9, 2026 Admin

arXiv:2602.05885v2 Announce Type: replace-cross Abstract: High-quality kernel is critical for scalable AI systems, and enabling LLMs to generate such code…

AI RESEARCH

Supercharging Simulation-Based Inference for Bayesian Optimal Experimental Design

February 9, 2026 Admin

arXiv:2602.06900v1 Announce Type: cross Abstract: Bayesian optimal experimental design (BOED) seeks to maximize the expected information gain (EIG) of experiments.…

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

When Do Multi-Agent Systems Outperform? Analysing the Learning Efficiency of Agentic Systems

CoRect: Context-Aware Logit Contrast for Hidden State Rectification to Resolve Knowledge Conflicts

Federated Prompt-Tuning with Heterogeneous and Incomplete Multimodal Client Data

SuReNav: Superpixel Graph-based Constraint Relaxation for Navigation in Over-constrained Environments

Towards Agentic Intelligence for Materials Science

D-SCoRE: Document-Centric Segmentation and CoT Reasoning with Structured Export for QA-CoT Data Generation

CSRv2: Unlocking Ultra-Sparse Embeddings

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Supercharging Simulation-Based Inference for Bayesian Optimal Experimental Design

You missed

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Beyond Conservative Automated Driving in Multi-Agent Scenarios via Coupled Model Predictive Control and Deep Reinforcement Learning

Evaluating Supervised Machine Learning Models: Principles, Pitfalls, and Metric Selection

Month: February 2026

You missed