- by ODEFTO Labs

Code Review Agent Benchmark

March 31, 2026 Admin

arXiv:2603.23448v2 Announce Type: replace-cross Abstract: Software engineering agents have shown significant promise in writing code. As AI agents permeate code…

AI RESEARCH

A Step Toward Federated Pretraining of Multimodal Large Language Models

March 31, 2026 Admin

arXiv:2603.26786v1 Announce Type: cross Abstract: The rapid evolution of Multimodal Large Language Models (MLLMs) is bottlenecked by the saturation of…

AI RESEARCH

Kill-Chain Canaries: Stage-Level Tracking of Prompt Injection Across Attack Surfaces and Model Safety Tiers

March 31, 2026 Admin

arXiv:2603.28013v1 Announce Type: cross Abstract: We present a stage-decomposed analysis of prompt injection attacks against five frontier LLM agents. Prior…

AI RESEARCH

CARLA-Air: Fly Drones Inside a CARLA World — A Unified Infrastructure for Air-Ground Embodied Intelligence

March 31, 2026 Admin

arXiv:2603.28032v1 Announce Type: cross Abstract: The convergence of low-altitude economies, embodied intelligence, and air-ground cooperative systems creates growing demand for…

AI RESEARCH

CPUBone: Efficient Vision Backbone Design for Devices with Low Parallelization Capabilities

March 31, 2026 Admin

arXiv:2603.26425v2 Announce Type: replace-cross Abstract: Recent research on vision backbone architectures has predominantly focused on optimizing efficiency for hardware platforms…

AI RESEARCH

SmaAT-QMix-UNet: A Parameter-Efficient Vector-Quantized UNet for Precipitation Nowcasting

March 31, 2026 Admin

arXiv:2603.21879v2 Announce Type: replace-cross Abstract: Weather forecasting supports critical socioeconomic activities and complements environmental protection, yet operational Numerical Weather Prediction…

AI RESEARCH

FUSAR-GPT : A Spatiotemporal Feature-Embedded and Two-Stage Decoupled Visual Language Model for SAR Imagery

March 31, 2026 Admin

arXiv:2602.19190v3 Announce Type: replace-cross Abstract: Research on the intelligent interpretation of all-weather, all-time Synthetic Aperture Radar (SAR) is crucial for…

AI RESEARCH

MLLM-based Textual Explanations for Face Comparison

March 30, 2026 Admin

arXiv:2603.16629v3 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) have recently been proposed as a means to generate natural-language…

AI RESEARCH

When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models

March 30, 2026 Admin

arXiv:2603.25960v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed in medical settings, yet their sensitivity to prompt…

AI RESEARCH

How Open Must Language Models be to Enable Reliable Scientific Inference?

March 30, 2026 Admin

arXiv:2603.26539v1 Announce Type: cross Abstract: How does the extent to which a model is open or closed impact the scientific…

Latest Post

FBS: Modeling Native Parallel Reading inside a Transformer

Can VLMs Unlock Semantic Anomaly Detection? A Framework for Structured Reasoning

Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

In-Context Decision Making for Optimizing Complex AutoML Pipelines

On the Robustness of Diffusion-Based Image Compression to Bit-Flip Errors

FBS: Modeling Native Parallel Reading inside a Transformer

Can VLMs Unlock Semantic Anomaly Detection? A Framework for Structured Reasoning

Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

In-Context Decision Making for Optimizing Complex AutoML Pipelines

AI Education: How to Get Ahead in the 21st Century Job Market!

The Power of NLP: How to Transform Text into Actionable Insights!

The AI Breakthrough That Could Change the World!

Mastering NLP: How to Use Language to Your Advantage in the Digital Age!

FBS: Modeling Native Parallel Reading inside a Transformer

Can VLMs Unlock Semantic Anomaly Detection? A Framework for Structured Reasoning

Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

In-Context Decision Making for Optimizing Complex AutoML Pipelines

Code Review Agent Benchmark

A Step Toward Federated Pretraining of Multimodal Large Language Models

Kill-Chain Canaries: Stage-Level Tracking of Prompt Injection Across Attack Surfaces and Model Safety Tiers

CARLA-Air: Fly Drones Inside a CARLA World — A Unified Infrastructure for Air-Ground Embodied Intelligence

CPUBone: Efficient Vision Backbone Design for Devices with Low Parallelization Capabilities

SmaAT-QMix-UNet: A Parameter-Efficient Vector-Quantized UNet for Precipitation Nowcasting

FUSAR-GPT : A Spatiotemporal Feature-Embedded and Two-Stage Decoupled Visual Language Model for SAR Imagery

MLLM-based Textual Explanations for Face Comparison

When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models

How Open Must Language Models be to Enable Reliable Scientific Inference?

You missed

FBS: Modeling Native Parallel Reading inside a Transformer

Can VLMs Unlock Semantic Anomaly Detection? A Framework for Structured Reasoning

Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

In-Context Decision Making for Optimizing Complex AutoML Pipelines