AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding

Published in ACM International Conference on Multimedia (ACMMM) 2025, 2025

Recommended citation: Yidan Wang, Chenyi Zhuang, Wutao Liu, Pan Gao, Nicu Sebe. (2025). "AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding." ACMMM 2025.
Download Paper