Multimodal vision-language models are increasingly deployed in high-stakes settings, which makes understanding the biases they encode both a technical and ethical priority. This tutorial addresses that challenge directly, using counterfactual fairness as its core evaluative framework. The concept of counterfactual fairness is familiar in machine learning: it captures the intuitive idea that if decisions should not be made on the basis of protected characteristics (race, gender, disability, age, and others) then changing those attributes in the input to a model should not change the output. This tutorial focuses on the question of how we can develop and use counterfactual image datasets to assess bias in large vision-language models (LVLMs). Our discussion will begin with the process of image dataset creation and the trade-offs of collecting versus generating images. We will summarize which protected characteristics have been studied (and overlooked) in the literature, and provide examples of numerical, closed-class, and open-ended generation tasks and corresponding bias metrics. Furthermore, we will address some of the more contested questions around counterfactual fairness, such as whether mapping social constructs like race and gender onto discrete counterfactual classes risks reinforcing the very categories we seek to interrogate, and whether the use of synthetic images introduces new sources of bias to the analysis. The session will culminate in a hands-on session so that attendees can test and critique the methodology for themselves. This tutorial is aimed at researchers and practitioners in all disciplines — no prior background knowledge is required.
Register now →Kusner, Loftus, Russell, Silva. Introduces the counterfactual fairness framework using tools from causal inference.
→ Read paper Paper · NAACL 2025Howard, Fraser, Bhiwandiwalla, Kiritchenko. Large-scale counterfactual study of bias in LVLM-generated text.
→ Read paper Paper · EACL 2024Fraser, Kiritchenko. Introduces the PAIRS dataset of parallel AI-generated images to probe LVLM bias.
→ Read paper Paper · EMNLP 2025Huang, Qin, Zhang, Yuan, Wang, Zhao. Evaluates explicit and implicit biases in VLMs across direct questioning and indirect tasks.
→ Read paper Paper · arXivExtends counterfactual evaluation to cultural dimensions of bias in LVLMs.
→ Read paper Paper · ICCV 2021Zhao, Wang, Russakovsky. Analyzes racial bias in image captioning systems trained on COCO.
→ Read paper Paper · NAACL 2018Zhao, Wang, Yatskar, Ordonez, Chang. Introduces WinoBias and a data-augmentation debiasing approach.
→ Read paper Paper · *SEM 2018Kiritchenko, Mohammad. Introduces the Equity Evaluation Corpus (EEC) for auditing sentiment systems.
→ Read paper Paper · FAccT 2023Evaluates LLMs and language-vision models for bias against a broad set of stigmatized groups.
→ Read paper Paper · ACL 2023Cheng, Durmus, Jurafsky. Prompt-based method grounded in markedness to measure intersectional stereotypes in LLMs.
→ Read paper Paper · SurveyGan, Li, Li, Wang, Liu, Gao. Comprehensive survey of vision-language pre-training methods.
→ Read paper Paper · Law ReviewKohler-Hausmann. Critiques the counterfactual causal model of discrimination from a legal and constructivist perspective.
→ Read paper Paper · CSCW 2020Scheuerman, Wade, Lustig, Brubaker. Examines how race and gender are operationalized in facial-analysis training data.
→ Read paper Paper · FAccT 2023Bianchi, Kalluri, Durmus, Ladhak, et al. Documents how text-to-image models amplify stereotypes about gender, race, and nationality.
→ Read paper Paper · FAccT 2021Kasirzadeh, Smart. Argues for caution when applying counterfactuals to social categories like race or gender.
→ Read paper Paper · FAccT* 2020Hanna, Denton, Smart, Smith-Loud. Examines race as a social construct in algorithmic fairness, drawing on critical race theory.
→ Read paper
Kathleen is an associate professor of computer science at the University of Ottawa. Her research focuses on responsible use of NLP and AI technologies, including evaluation of bias and safety in large language models. Kathleen has numerous publications on bias in vision-language systems.

Phillip is an AI researcher who leads Thoughtworks AI Labs, an industry research team focused on interpretability, robustness, and evaluation of generative AI models. Phillip has published multiple papers on the generation of synthetic counterfactual examples for improving model generalization as well as evaluating and mitigating social biases

Jieyu is an assistant professor in the Department of Computer Science at the University of Southern California. Her research focuses on fairness in machine learning and NLP. Her work has garnered attention from major media outlets, including Wired and The Daily Mail. She has delivered talks to a range of international audiences at various venues, including AAAI,2 NAACL, and IJCAI.

Margaret is a Program Leader at the National Research Council of Canada. Her research focuses on AI safety, primarily from the perspectives of enabling governance, assessment and real-world accountability. Margaret has spoken at academic conferences, educational sessions for the public, and expert presentation at the OECD. She holds a Juris Doctor from the University of Toronto, and a Master of Laws in Cybersecurity & Privacy.

Morgan is a research scientist in AI Ethics at Sony AI, with a broad focus on responsible AI, data curation, identity theory, infrastructure studies, and digital identity. Morgan has published extensively on the real-world implications of machine learning for those with historically marginalized identities, and has won numerous best paper, honorable mention, and diversity & inclusion awards.
If you find this tutorial useful, please cite it as:
@inproceedings{fraser2026counterfactual,
author = {Fraser, Kathleen and Howard, Phillip and Zhao, Jieyu and McKay, Margaret and Scheuerman, Morgan},
title = {Translation/Dialogue Tutorial: Counterfactual Fairness Analysis in Language-Vision Models},
booktitle = {Presented at the 2026 ACM Conference on Fairness, Accountability, and Transparency (FAccT)},
year = {2026},
month = {June},
address = {Montreal, Canada},
publisher = {Association for Computing Machinery},
}