Evaluating Control Protocols for Untrusted AI Agents
arXiv:2511.02997v1 Announce Type: new Abstract: As AI systems become more capable and widely deployed as agents, ensuring their safe operation…
arXiv:2511.02802v2 Announce Type: replace-cross Abstract: Tabular foundation models represent a growing paradigm in structured data learning, extending the benefits of…
arXiv:2510.26899v2 Announce Type: replace-cross Abstract: The launch of Grokipedia, an AI-generated encyclopedia developed by Elon Musk’s xAI, was presented as…
arXiv:2510.27629v3 Announce Type: replace-cross Abstract: Open-weight bio-foundation models present a dual-use dilemma. While holding great promise for accelerating scientific research…
arXiv:2511.00020v1 Announce Type: new Abstract: In the current digital commerce landscape, user-generated reviews play a critical role in shaping consumer…
arXiv:2511.01354v1 Announce Type: cross Abstract: Recently, the demand for small and efficient reasoning models to support real-world applications has driven…
arXiv:2511.01357v1 Announce Type: cross Abstract: Medical visual question answering (Med-VQA) is a crucial multimodal task in clinical decision support and…
arXiv:2510.26852v1 Announce Type: new Abstract: Large Language Model (LLM) agents have evolved from basic text generation to autonomously completing complex…
arXiv:2510.26776v2 Announce Type: replace-cross Abstract: How can we explain the influence of training data on black-box models? Influence functions (IFs)…
arXiv:2510.27442v1 Announce Type: cross Abstract: Vision Transformers (ViTs) have demonstrated strong potential in medical imaging; however, their high computational demands…