Pseudo Contrastive Learning for Diagram Comprehension in Multimodal Models
arXiv:2602.23589v2 Announce Type: replace-cross Abstract: Recent multimodal models such as Contrastive Language-Image Pre-training (CLIP) have shown remarkable ability to align…
