Debiasing Multimodal Models via Causal Information Minimization

Abstract

Most existing debiasing methods for multimodal models, including causal intervention and inference methods, rely on approximate heuristics to represent biases, such as shallow features from early stages of training or unimodal features for multimodal tasks like VQA, and these heuristics may not be accurate. In this paper, we study bias arising from confounders in a causal graph for multimodal data and examine a novel approach that leverages causally motivated information minimization to learn the confounder representations. Robust predictive features contain diverse information that helps a model generalize to out-of-distribution data. Hence, minimizing the information content of features obtained from a pretrained biased model helps learn the simplest predictive features that capture the underlying data distribution. We treat these features as confounder representations and use them, via methods motivated by causal theory, to remove bias from models. We find that the learned confounder representations indeed capture dataset biases, and that the proposed debiasing methods improve out-of-distribution (OOD) performance on multiple multimodal datasets without sacrificing in-distribution performance. Additionally, we introduce a novel metric that quantifies the sufficiency of spurious features in models' predictions and further demonstrates the effectiveness of our proposed methods. Our code is available at: https://github.com/Vaidehi99/CausalInfoMin
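
The information-minimization idea in the abstract can be illustrated with a variational information bottleneck objective. The following is a minimal sketch under assumptions the abstract does not state: the bottleneck code is a diagonal Gaussian, the prior is a standard normal, and the KL term stands in for "information content". All function names and the parameters `mu`, `log_var`, and `beta` are hypothetical. This is not the authors' implementation, which is in the repository linked in the abstract.

```python
import math
from typing import List, Sequence


def gaussian_kl_to_standard_normal(mu: Sequence[float],
                                   log_var: Sequence[float]) -> float:
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ).

    In a variational bottleneck this term upper-bounds the information
    the compressed code keeps about the features it was computed from.
    """
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0
                     for m, lv in zip(mu, log_var))


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of logits."""
    shift = max(logits)
    exps = [math.exp(x - shift) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits: Sequence[float], label: int) -> float:
    """Negative log-likelihood of the correct label."""
    return -math.log(softmax(logits)[label])


def bottleneck_loss(logits: Sequence[float], label: int,
                    mu: Sequence[float], log_var: Sequence[float],
                    beta: float = 0.1) -> float:
    """Prediction loss plus a beta-weighted information penalty.

    Assumption: `logits` are predicted from a code sampled from
    N(mu, diag(exp(log_var))), where mu and log_var are computed from
    the features of a pretrained biased model. A larger `beta` pushes
    the code toward the simplest features that are still predictive.
    """
    return cross_entropy(logits, label) + beta * gaussian_kl_to_standard_normal(mu, log_var)
```

In this sketch, `beta` sets the trade-off: at `beta = 0` the code is free to keep all information in the biased features, while larger values keep only what is needed to predict the label. The abstract treats features learned under such a constraint as the confounder representation.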

Author and article information

Publication date Created: 28 November 2023

Article

arXiv ID: 2311.16941

SO-VID: df2cb12c-9d2d-4d58-b756-c9e978685293

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Custom metadata

Comments EMNLP 2023 Findings (16 pages)

Categories cs.LG cs.AI cs.CL cs.CV stat.ME

ScienceOpen disciplines: Computer vision & Pattern recognition, Theoretical computer science, Artificial intelligence, Methodology

