Exploiting LLM-as-a-Judge Disposition on Free Text Legal QA via Prompt Optimization
arXiv:2604.20726v2 Announce Type: replace-cross Abstract: This work explores the role of prompt design and judge selection in LLM-as-a-Judge evaluations of…
arXiv:2604.20726v2 Announce Type: replace-cross Abstract: This work explores the role of prompt design and judge selection in LLM-as-a-Judge evaluations of…
arXiv:2604.20860v1 Announce Type: cross Abstract: Despite the success of Retrieval-Augmented Generation (RAG) in grounding LLMs with external knowledge, its application…
arXiv:2509.24239v4 Announce Type: replace-cross Abstract: Recent large language models (LLMs) have shown strong reasoning capabilities. However, a critical question remains:…
arXiv:2604.21312v1 Announce Type: cross Abstract: This paper presents the NTIRE 2026 Remote Sensing Infrared Image Super-Resolution (x4) Challenge, one of…
arXiv:2604.20846v1 Announce Type: cross Abstract: Next point-of-interest (POI) recommendation requires modeling user mobility as a spatiotemporal sequence, where different behavioral…
arXiv:2604.20862v1 Announce Type: new Abstract: The automation system for Course of Action (CoA) planning is an essential element in future…
arXiv:2604.20789v2 Announce Type: replace-cross Abstract: We investigate the integration of human-like working memory constraints into the Transformer architecture and implement…
arXiv:2604.21473v1 Announce Type: cross Abstract: In the treatment of complex diseases, treatment regimens using a single drug often yield limited…
arXiv:2604.21464v1 Announce Type: cross Abstract: Standard reinforcement learning (RL) optimizes policies for reward but imposes few constraints on how decisions…
arXiv:2604.20688v2 Announce Type: replace-cross Abstract: Storm surge forecasting remains a critical challenge in mitigating the impacts of tropical cyclones on…