SmaAT-QMix-UNet: A Parameter-Efficient Vector-Quantized UNet for Precipitation Nowcasting
arXiv:2603.21879v2 Announce Type: replace-cross Abstract: Weather forecasting supports critical socioeconomic activities and complements environmental protection, yet operational Numerical Weather Prediction…
FUSAR-GPT : A Spatiotemporal Feature-Embedded and Two-Stage Decoupled Visual Language Model for SAR Imagery
arXiv:2602.19190v3 Announce Type: replace-cross Abstract: Research on the intelligent interpretation of all-weather, all-time Synthetic Aperture Radar (SAR) is crucial for…
MLLM-based Textual Explanations for Face Comparison
arXiv:2603.16629v3 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) have recently been proposed as a means to generate natural-language…
When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models
arXiv:2603.25960v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed in medical settings, yet their sensitivity to prompt…
How Open Must Language Models be to Enable Reliable Scientific Inference?
arXiv:2603.26539v1 Announce Type: cross Abstract: How does the extent to which a model is open or closed impact the scientific…
The Multi-AMR Buffer Storage, Retrieval, and Reshuffling Problem: Exact and Heuristic Approaches
arXiv:2603.26542v1 Announce Type: cross Abstract: Buffer zones are essential in production systems to decouple sequential processes. In dense floor storage…
Efficient Detection of Bad Benchmark Items with Novel Scalability Coefficients
arXiv:2603.24999v2 Announce Type: replace-cross Abstract: The validity of assessments, from large-scale AI benchmarks to human classrooms, depends on the quality…
Neuro-Symbolic Process Anomaly Detection
arXiv:2603.26461v1 Announce Type: cross Abstract: Process anomaly detection is an important application of process mining for identifying deviations from the…
Gelina: Unified Speech and Gesture Synthesis via Interleaved Token Prediction
arXiv:2510.12834v3 Announce Type: replace-cross Abstract: Human communication is multimodal, with speech and gestures tightly coupled, yet most computational methods for…
