Yang, Guan ; Ji, Cheng ; Liu, Xiaoming ; Zhang, Ziming ; Wang, Chen
Korbicz, Józef (1951- ) - red. ; Uciński, Dariusz - red.
Visual question answering (VQA) is a pivotal topic at the intersection of computer vision and natural language processing. This paper addresses the challenges of linguistic bias and bias fusion within invalid regions encountered in existing VQA models due to insufficient representation of multi-modal features. To overcome those issues, we propose a multi-feature enhancement scheme. This scheme involves the fusion of one or more features with the original ones, incorporating discrete cosine transform (DCT) features into the counterfactual reasoning framework. ; This approach harnesses finegrained information and spatial relationships within images and questions, enabling a more refined understanding of the indirect relationship between images and questions. Consequently, it effectively mitigates linguistic bias and bias fusion within invalid regions in the model. Extensive experiments are conducted on multiple datasets, including VQA2 and VQACP2, employing various baseline models and fusion techniques, resulting in promising and robust performance.
Zielona Góra: Uniwersytet Zielonogórski
AMCS, volume 34, number 3 (2024) ; kliknij tutaj, żeby przejść
Biblioteka Uniwersytetu Zielonogórskiego
5 sie 2025
5 sie 2025
9
https://www.zbc.uz.zgora.pl/publication/101875
| Nazwa wydania | Data |
|---|---|
| DCF-VQA: Counterfactual structure based on multi-feature enhancement | 5 sie 2025 |
Stępka, Ignacy Lango, Mateusz Stefanowski, Jerzy Korbicz, Józef (1951- ) - red. Uciński, Dariusz - red.
Cheng, Chao Wang, Meng Wang, Jiuhe Shao, Junjie Chen, Hongtian Wiśniewski, Remigiusz - ed. Gomes, Luis - ed. Wan, Shaohua - ed.
Xiang, Zhengrong Wang, Ronghao Chen, Qingwei Korbicz, Józef (1951- ) - red. Uciński, Dariusz - red.
Chen, Jingying Liao, Mengyi Wang, Guangshuai Chen, Chang Kołodziej, Joanna - ed. Pllana, Sabri - ed. Vitabile , Salvatore - ed.
Chen, Lei Zielińska, Teresa Wang, Jikun Ge, Weimin Korbicz, Józef (1951- ) - red. Uciński, Dariusz - red.
Liu, Xinbo Wang, Wen Chen, Xin Sterna, Małgorzata Blazewicz, Jacek Kitowski, Zygmunt - ed. Piskur, Paweł - ed. Hożyń , Stanisław - ed.
Wang, Wen Chen, Xin Musial, Jedrzej Blazewicz, Jacek Kołodziej, Joanna - ed. Pllana, Sabri - ed. Vitabile , Salvatore - ed.