DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
arXiv:2603.11076v1 Announce Type: new Abstract: Recent work synthesizes agentic tasks for post-training tool-using LLMs, yet robust generalization under shifts in…
arXiv:2603.11076v1 Announce Type: new Abstract: Recent work synthesizes agentic tasks for post-training tool-using LLMs, yet robust generalization under shifts in…
arXiv:2603.09689v2 Announce Type: replace-cross Abstract: Visual Question Answering (VQA) is a fundamental multimodal task that requires models to jointly understand…
arXiv:2603.10695v1 Announce Type: cross Abstract: Being trained on large and diverse datasets, visual foundation models (VFMs) can be fine-tuned to…
arXiv:2603.10697v1 Announce Type: cross Abstract: Neural text-to-SQL models, which translate natural language questions (NLQs) into SQL queries given a database…
arXiv:2603.09827v2 Announce Type: replace-cross Abstract: As embodied models become powerful, humans will collaborate with multiple embodied AI agents at their…
arXiv:2603.10133v1 Announce Type: new Abstract: Data products enable end users to gain greater insights about their data by providing supporting…