Controllable Flow Matching for Online Reinforcement Learning
arXiv:2511.06816v2 Announce Type: replace-cross Abstract: Model-based reinforcement learning (MBRL) typically relies on modeling environment dynamics for data efficiency. However, due…
arXiv:2511.06816v2 Announce Type: replace-cross Abstract: Model-based reinforcement learning (MBRL) typically relies on modeling environment dynamics for data efficiency. However, due…
arXiv:2601.00993v1 Announce Type: cross Abstract: Wildlife monitoring is crucial for studying biodiversity loss and climate change. Camera trap images provide…
arXiv:2601.01839v1 Announce Type: cross Abstract: Despite the growing popularity of AI coding assistants, over 80% of machine learning (ML) projects…
arXiv:2510.25781v3 Announce Type: replace-cross Abstract: Kolmogorov-Arnold Networks (KANs), whose design is inspired-rather than dictated-by the Kolmogorov superposition theorem, have emerged…
arXiv:2601.01011v1 Announce Type: cross Abstract: Every act of language generation compresses a rich internal state into a single token sequence.…
arXiv:2601.01835v1 Announce Type: cross Abstract: In this paper, a deep learning approach for Mpox diagnosis named Customized Residual SwinTransformerV2 (RSwinV2)…
arXiv:2601.00012v1 Announce Type: cross Abstract: Electroencephalography (EEG) data present unique modeling challenges because recordings vary in length, exhibit very low…
arXiv:2511.20859v5 Announce Type: replace-cross Abstract: We present an algorithm for computing all evolutionarily stable strategies in nondegenerate normal-form games with…
arXiv:2511.11990v4 Announce Type: replace Abstract: The convergence of deep learning and formal mathematics has spurred research in formal verification. Statement…
arXiv:2507.22411v2 Announce Type: replace-cross Abstract: Recent reports suggest that LLMs can handle increasingly long contexts. However, many existing benchmarks for…