Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Posts

Blog Test Ai Bullet Time

less than 1 minute read

Published: November 29, 2025

title: Bullet Time of AI Video Generation Tools date: 2025-11-29 permalink: /posts/2025-11-29-blog-test-ai-bullet-time/ tags:

Emotional companionship of AI

1 minute read

Published: November 18, 2025

测试的时间是2025年11月18日，因为AI日新月异，更新很快，所以，测试结果仅供参考

RAG-SEG For Medicine

less than 1 minute read

Published: September 06, 2025

Paper Link: https://arxiv.org/abs/2508.15313

RAG-SEG( First RAG, Second SEG: A Training-Free Paradigm for Camouflaged Object Detection )

1 minute read

Published: July 29, 2025

Paper Link: https://arxiv.org/abs/2508.15313

publications

Unified Unsupervised Salient Object Detection via Knowledge Transfer

Published in International Joint Conference on Artificial Intelligence (IJCAI) 2024, 2024

A unified framework for unsupervised salient object detection (USOD), featuring curriculum learning-based saliency distilling and knowledge transfer across tasks.

Recommended citation: Yao Yuan, Wutao Liu, Pan Gao, Qun Dai, Jie Qin. (2024). "Unified Unsupervised Salient Object Detection via Knowledge Transfer." IJCAI 2024.
Download Paper

ACGFormer: Attribute Classification Guided Transformer for Camouflaged Object Detection

Published in PRCV 2025, 2025

ACGFormer introduces an Attribute Classification Guided Transformer for camouflaged object detection, leveraging attribute-aware guidance and feature refinement to achieve state-of-the-art performance.

Recommended citation: Wutao Liu, Yao Yuan, Pan Gao, Zheng Lin, Jie Qin. (2025). "ACGFormer: Attribute Classification Guided Transformer for Camouflaged Object Detection." PRCV.

First RAG, Second SEG: A Training-Free Paradigm for Camouflaged Object Detection

Published in Conference Paper (Preprint on arXiv), 2025

RAG-SEG presents a training-free paradigm for camouflaged object detection (salient obect detection) by decoupling the task into retrieval-augmented generation of coarse masks and SAM-based refinement, eliminating conventional training while achieving competitive results on benchmark datasets with only a personal laptop.

Recommended citation: Wutao Liu, Yidan Wang, Pan Gao. (2025). "First RAG, Second SEG: A Training-Free Paradigm for Camouflaged Object Detection." Conference Paper (Preprint on arXiv).
Download Paper

AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding

Published in ACM International Conference on Multimedia (ACMMM) 2025, 2025

AlignCAT introduces a query-based semantic matching framework for weakly supervised visual grounding, employing coarse-grained category alignment and fine-grained attribute alignment to enhance visual-linguistic reasoning and achieve state-of-the-art performance on RefCOCO, RefCOCO+, and RefCOCOg.

Recommended citation: Yidan Wang, Chenyi Zhuang, Wutao Liu, Pan Gao, Nicu Sebe. (2025). "AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding." ACMMM 2025.
Download Paper

Wutao Liu (Sherry APTX489)

Sitemap

Pages

Page Not Found

Welcome to Wutao Liu's Homepage

Archive Layout with Content

Posts by Category

Posts by Collection

CV

CV

Markdown

Page not in menu

Page Archive

Portfolio

Publications

Sitemap

Posts by Tags

Talk map

Talks and presentations

Teaching

Terms and Privacy Policy

Blog posts

Jupyter notebook markdown generator

Posts

Blog Test Ai Bullet Time

Emotional companionship of AI

测试的时间是2025年11月18日，因为AI日新月异，更新很快，所以，测试结果仅供参考

RAG-SEG For Medicine

RAG-SEG( First RAG, Second SEG: A Training-Free Paradigm for Camouflaged Object Detection )

publications

Unified Unsupervised Salient Object Detection via Knowledge Transfer

ACGFormer: Attribute Classification Guided Transformer for Camouflaged Object Detection

First RAG, Second SEG: A Training-Free Paradigm for Camouflaged Object Detection

AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding