Q-Flow: Stable and Expressive Reinforcement Learning with Flow-Based Policy
arXiv:2605.13435v1 Announce Type: cross Abstract: There is growing interest in utilizing flow-based models as decision-making policies in reinforcement learning due…
arXiv:2605.12350v2 Announce Type: replace-cross Abstract: Lack of transparency in AI systems poses challenges in critical real-life applications. It is important…
arXiv:2605.12620v1 Announce Type: new Abstract: Building generalist embodied agents capable of solving complex real-world tasks remains a fundamental challenge in…
arXiv:2605.07649v2 Announce Type: replace-cross Abstract: Over the last few years, research on autonomous systems has matured to such a degree…
arXiv:2603.01960v2 Announce Type: replace-cross Abstract: TiledAttention is a scaled dot-product attention (SDPA) forward operator for SDPA research on NVIDIA GPUs.…
arXiv:2412.18798v3 Announce Type: replace-cross Abstract: Transformer-based models have achieved remarkable success in multivariate time series forecasting (MTSF) by capturing long-range…
arXiv:2511.06894v3 Announce Type: replace-cross Abstract: Reconstruction-based methods are a dominant paradigm in time series anomaly detection (TSAD), however, their near-universal…
arXiv:2605.08681v1 Announce Type: cross Abstract: We study solving large-scale fixed-point equation \(x^\star=\bar F(x^\star)\) with decomposition. Standard strict decomposition assigns each…
arXiv:2605.02948v3 Announce Type: replace-cross Abstract: Diffusion-based talking head generation has achieved remarkable visual quality, yet scaling it to long-term videos…
arXiv:2605.09534v1 Announce Type: cross Abstract: Engineering managers increasingly must decide how to introduce generative artificial intelligence (AI), retrieval-augmented generation, and…