MOSS-TTSD: Text to Spoken Dialogue Generation
arXiv:2603.19739v1 Announce Type: cross Abstract: Spoken dialogue generation is crucial for applications like podcasts, dynamic commentary, and entertainment content, but…
arXiv:2603.19757v1 Announce Type: cross Abstract: Few-shot 3D semantic segmentation aims to generate accurate semantic masks for query point clouds with…
arXiv:2603.19121v2 Announce Type: replace-cross Abstract: The creation of high-fidelity, customizable 3D indoor scene textures remains a significant challenge. While text-driven…
arXiv:2603.19429v1 Announce Type: new Abstract: Classical planning problems are typically defined using lifted first-order representations, which offer compactness and generality.…
Nature Machine Intelligence, Published online: 23 March 2026; doi:10.1038/s42256-026-01177-0 We introduce a framework to analyse interpretability in deep learning, by…
Nature Machine Intelligence, Published online: 23 March 2026; doi:10.1038/s42256-026-01194-z Madduri et al. introduce a computational framework grounded in control and…
arXiv:2603.17973v2 Announce Type: replace-cross Abstract: AI coding agents can resolve real-world software issues, yet they frequently introduce regressions — breaking…
arXiv:2603.18048v1 Announce Type: new Abstract: Recent Audio Multimodal Large Language Models (Audio MLLMs) demonstrate impressive performance on speech benchmarks, yet…
arXiv:2603.17380v2 Announce Type: replace-cross Abstract: Virtual cell models aim to enable in silico experimentation by predicting how cells respond to…