AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding
Yidan Wang, Chenyi Zhuang, Wutao Liu, Pan Gao, Nicu Sebe. (2025). "AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding." ACMMM 2025.
