March, 2026 - - Page 10

Residual Stream Analysis of Overfitting And Structural Disruptions

March 17, 2026 Admin

arXiv:2603.13318v1 Announce Type: cross Abstract: Ensuring that large language models (LLMs) remain both helpful and harmless poses a significant challenge:…

AI RESEARCH

Probe-then-Plan: Environment-Aware Planning for Industrial E-commerce Search

March 17, 2026 Admin

arXiv:2603.15262v1 Announce Type: new Abstract: Modern e-commerce search is evolving to resolve complex user intents. While Large Language Models (LLMs)…

AI RESEARCH

Interference-Aware K-Step Reachable Communication in Multi-Agent Reinforcement Learning

March 17, 2026 Admin

arXiv:2603.15054v1 Announce Type: new Abstract: Effective communication is pivotal for addressing complex collaborative tasks in multi-agent reinforcement learning (MARL). Yet,…

AI RESEARCH

Why AI systems don’t learn and what to do about it: Lessons on autonomous learning from cognitive science

March 17, 2026 Admin

arXiv:2603.15381v1 Announce Type: new Abstract: We critically examine the limitations of current AI models in achieving autonomous learning and propose…

AI RESEARCH

s2n-bignum-bench: A practical benchmark for evaluating low-level code reasoning of LLMs

March 17, 2026 Admin

arXiv:2603.14628v1 Announce Type: cross Abstract: Neurosymbolic approaches leveraging Large Language Models (LLMs) with formal methods have recently achieved strong results…

AI RESEARCH

ResearchPilot: A Local-First Multi-Agent System for Literature Synthesis and Related Work Drafting

March 17, 2026 Admin

arXiv:2603.14629v1 Announce Type: cross Abstract: ResearchPilot is an open-source, self-hostable multi-agent system for literature-review assistance. Given a natural-language research question,…

AI RESEARCH

Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation

March 17, 2026 Admin

arXiv:2603.12983v2 Announce Type: replace-cross Abstract: Error Span Detection (ESD) is a crucial subtask in Machine Translation (MT) evaluation, aiming to…

AI RESEARCH

Failure Detection in Chemical Processes Using Symbolic Machine Learning: A Case Study on Ethylene Oxidation

March 17, 2026 Admin

arXiv:2603.06767v2 Announce Type: replace-cross Abstract: Over the past decade, Artificial Intelligence has significantly advanced, mostly driven by large-scale neural approaches.…

AI RESEARCH

Self Voice Conversion as an Attack against Neural Audio Watermarking

March 17, 2026 Admin

arXiv:2601.20432v2 Announce Type: replace-cross Abstract: Audio watermarking embeds auxiliary information into speech while maintaining speaker identity, linguistic content, and perceptual…

AI RESEARCH

Sample-efficient generative molecular design using memory manipulation

March 17, 2026 Admin

Nature Machine Intelligence, Published online: 17 March 2026; doi:10.1038/s42256-026-01200-4 Guo et al. train a Mamba-based language model for molecule generation…

Residual Stream Analysis of Overfitting And Structural Disruptions

Probe-then-Plan: Environment-Aware Planning for Industrial E-commerce Search

Interference-Aware K-Step Reachable Communication in Multi-Agent Reinforcement Learning

Why AI systems don’t learn and what to do about it: Lessons on autonomous learning from cognitive science

s2n-bignum-bench: A practical benchmark for evaluating low-level code reasoning of LLMs

ResearchPilot: A Local-First Multi-Agent System for Literature Synthesis and Related Work Drafting

Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation

Failure Detection in Chemical Processes Using Symbolic Machine Learning: A Case Study on Ethylene Oxidation

Self Voice Conversion as an Attack against Neural Audio Watermarking

Sample-efficient generative molecular design using memory manipulation

You missed

Dynamic feature pyramid network for real-time gesture recognition

Probing the Origins of Reasoning Performance: Representational Quality for Mathematical Problem-Solving in RL vs. SFT Fine-Tuned Models

Knowledge-Guided Multimodal Reasoning over Interacting Streams for Video-Level Ambivalence and Hesitancy Recognition

BayesAME: Bayesian Active Model Evaluation

Month: March 2026

You missed