Admin - Page 16

HumanMCP: A Human-Like Query Dataset for Evaluating MCP Tool Retrieval Performance

March 2, 2026 Admin

arXiv:2602.23367v1 Announce Type: new Abstract: Model Context Protocol (MCP) servers contain a collection of thousands of open-source standardized tools, linking…

AI RESEARCH

MoDora: Tree-Based Semi-Structured Document Analysis System

March 2, 2026 Admin

arXiv:2602.23061v2 Announce Type: replace-cross Abstract: Semi-structured documents integrate diverse interleaved data elements (e.g., tables, charts, hierarchical paragraphs) arranged in various…

AI RESEARCH

Conformalized Neural Networks for Federated Uncertainty Quantification under Dual Heterogeneity

March 2, 2026 Admin

arXiv:2602.23296v2 Announce Type: replace-cross Abstract: Federated learning (FL) faces challenges in uncertainty quantification (UQ). Without reliable UQ, FL systems risk…

AI RESEARCH

Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping

February 28, 2026 Admin

AI RESEARCH

Instruction-based Image Editing with Planning, Reasoning, and Generation

February 27, 2026 Admin

arXiv:2602.22624v1 Announce Type: cross Abstract: Editing images via instruction provides a natural way to generate interactive content, but it is…

AI RESEARCH

dLLM: Simple Diffusion Language Modeling

February 27, 2026 Admin

arXiv:2602.22661v1 Announce Type: cross Abstract: Although diffusion language models (DLMs) are evolving quickly, many recent models converge on a set…

AI RESEARCH

Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training

February 27, 2026 Admin

arXiv:2602.21189v2 Announce Type: replace-cross Abstract: Pass@k is a widely used performance metric for verifiable large language model tasks, including mathematical…

AI RESEARCH

Graph Your Way to Inspiration: Integrating Co-Author Graphs with Retrieval-Augmented Generation for Large Language Model Based Scientific Idea Generation

February 27, 2026 Admin

arXiv:2602.22215v1 Announce Type: new Abstract: Large Language Models (LLMs) demonstrate potential in the field of scientific idea generation. However, the…

AI RESEARCH

Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning

February 27, 2026 Admin

arXiv:2602.21670v2 Announce Type: replace-cross Abstract: Multi-robot task planning requires decomposing natural-language instructions into executable actions for heterogeneous robot teams. Conventional…

AI RESEARCH

A scalable framework for evaluating health language models

February 27, 2026 Admin

Author: Admin
