AI RESEARCH - - Page 118

Wed. Jun 17th, 2026

Scheduling Your LLM Reinforcement Learning with Reasoning Trees

October 30, 2025 Admin

arXiv:2510.24832v1 Announce Type: new Abstract: Using Reinforcement Learning with Verifiable Rewards (RLVR) to optimize Large Language Models (LLMs) can be…

AI RESEARCH

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

October 29, 2025 Admin

arXiv:2510.23691v1 Announce Type: new Abstract: We present Game-TARS, a generalist game agent trained with a unified, scalable action space anchored…

AI RESEARCH

Eigen-Value: Efficient Domain-Robust Data Valuation via Eigenvalue-Based Approach

October 29, 2025 Admin

arXiv:2510.23409v2 Announce Type: replace-cross Abstract: Data valuation has become central in the era of data-centric AI. It drives efficient training…

AI RESEARCH

Closing Gaps: An Imputation Analysis of ICU Vital Signs

October 29, 2025 Admin

arXiv:2510.24217v1 Announce Type: cross Abstract: As more Intensive Care Unit (ICU) data becomes available, the interest in developing clinical prediction…

AI RESEARCH

PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling

October 29, 2025 Admin

arXiv:2510.24235v1 Announce Type: cross Abstract: Reward models (RMs) are central to reinforcement learning from human feedback (RLHF), providing the critical…

AI RESEARCH

BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents

October 29, 2025 Admin

arXiv:2510.23458v2 Announce Type: replace-cross Abstract: Confidence in LLMs is a useful indicator of model uncertainty and answer reliability. Existing work…

AI RESEARCH

Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge

October 28, 2025 Admin

arXiv:2510.20819v2 Announce Type: replace-cross Abstract: Recent advances in generative modeling have positioned diffusion models as state-of-the-art tools for sampling from…

AI RESEARCH

Nested AutoRegressive Models

October 28, 2025 Admin

arXiv:2510.23028v1 Announce Type: cross Abstract: AutoRegressive (AR) models have demonstrated competitive performance in image generation, achieving results comparable to those…

AI RESEARCH

Efficient and Encrypted Inference using Binarized Neural Networks within In-Memory Computing Architectures

October 28, 2025 Admin

arXiv:2510.23034v1 Announce Type: cross Abstract: Binarized Neural Networks (BNNs) are a class of deep neural networks designed to utilize minimal…

AI RESEARCH

WhaleVAD-BPN: Improving Baleen Whale Call Detection with Boundary Proposal Networks and Post-processing Optimisation

October 28, 2025 Admin

arXiv:2510.21280v2 Announce Type: replace-cross Abstract: While recent sound event detection (SED) systems can identify baleen whale calls in marine audio,…

You missed

AI RESEARCH

Scheduling Your LLM Reinforcement Learning with Reasoning Trees

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

Eigen-Value: Efficient Domain-Robust Data Valuation via Eigenvalue-Based Approach

Closing Gaps: An Imputation Analysis of ICU Vital Signs

PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling

BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents

Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge

Nested AutoRegressive Models

Efficient and Encrypted Inference using Binarized Neural Networks within In-Memory Computing Architectures

WhaleVAD-BPN: Improving Baleen Whale Call Detection with Boundary Proposal Networks and Post-processing Optimisation

You missed

Data Augmentations for Data-Constrained Language Model Pretraining

Detect Before You Leap: Mirage Detection in Vision-Language Models

Upper Bounds on the Generalization Error of Deep Learning Models via Local Robustness and Stability

daVinci-kernel: Co-Evolving Skill Selection, Summarization, and Utilization via RL for GPU Kernel Optimization

Category: AI RESEARCH

You missed