AI RESEARCH - - Page 35

Sun. Aug 2nd, 2026

Chatterbox-Flash: Prior-Calibrated Block Diffusion for Streaming Zero-Shot TTS

June 2, 2026 Admin

arXiv:2605.30748v2 Announce Type: replace-cross Abstract: We present Chatterbox-Flash, a zero-shot text-to-speech model obtained by fine-tuning a pretrained autoregressive TTS decoder…

AI RESEARCH

Pocket-Dentist: On-Device Dental Image Understanding via Efficient Multimodal Large Language Models

June 1, 2026 Admin

arXiv:2605.29299v2 Announce Type: replace-cross Abstract: Evaluations of dental vision-language models remain fragmented across datasets, task definitions and metrics, and often…

AI RESEARCH

DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval

June 1, 2026 Admin

arXiv:2605.31377v1 Announce Type: cross Abstract: Agentic Retrieval-Augmented Generation improves retrieval by integrating planning, tool use, and iterative reasoning, but existing…

AI RESEARCH

Target-Side Paraphrase Augmentation for Sign Language Translation with Large Language Models

June 1, 2026 Admin

arXiv:2605.31393v1 Announce Type: cross Abstract: Sign language translation (SLT) remains constrained by limited paired sign-video/text corpora and heavy-tailed target vocabularies.…

AI RESEARCH

Neural Network Verification using Partial Multi-Neuron Relaxation

June 1, 2026 Admin

arXiv:2605.30155v2 Announce Type: replace-cross Abstract: The increasing integration of deep neural networks in critical systems has spawned a theoretical and…

AI RESEARCH

PhyDrawGen: Physically Grounded Diagram Generation from Natural Language

June 1, 2026 Admin

arXiv:2605.30512v1 Announce Type: new Abstract: Generating physics diagrams from text requires strict adherence to physical laws. While current generative models…

AI RESEARCH

Mental Damage: Caption Poisoning Attacks on Retrieval-Augmented Text-to-Music Generation

June 1, 2026 Admin

arXiv:2605.30365v1 Announce Type: cross Abstract: Retrieval-augmented text-to-music (TTM) systems augment underspecified user prompts using captions retrieved from a music caption…

AI RESEARCH

PInVerify: An Offline Embodied Benchmark for Active Instance Verification

June 1, 2026 Admin

arXiv:2605.30639v1 Announce Type: cross Abstract: Embodied agents have made strong progress in navigating to target objects, but reaching the goal…

AI RESEARCH

Investigating Detection and Obfuscation of Prompt Injection Attacks Against Software Reverse Engineering AI Agents

June 1, 2026 Admin

arXiv:2605.30677v1 Announce Type: cross Abstract: Agentic software reverse engineering systems are vulnerable to prompt injection attacks placed into the source…

AI RESEARCH

Seeing Isn’t Knowing: Do VLMs Know When Not to Answer Spatial Questions (and Why)?

June 1, 2026 Admin

arXiv:2605.30557v1 Announce Type: cross Abstract: Spatial reasoning is a fundamental capability for vision-language models (VLMs) deployed in real-world environments. However,…