Status: open / Type of Thesis: Master's thesis / Location: Dresden
Interpretability methods for machine learning (ML) models, such as Variable Importance (VI), Partial Dependence Plots (PDP), and SHAP values, play a crucial role in explaining model predictions. However, most of these methods are correlational in nature: they capture associations between input features and predictions, but they do not provide information about causal relationships. That is, they indicate how variables move together, not whether changing a variable would actually cause a change in the outcome.
A classic example illustrating this limitation is the positive correlation between the number of firefighters dispatched and the amount of fire damage. While more firefighters are often present at larger fires, they do not cause the increased damage; the true cause is the fire’s size. This highlights the importance of causal reasoning in interpreting model behavior.
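As a concrete illustration, the following is a minimal simulation sketch of this scenario. The variable names, distributions, and the use of scikit-learn's permutation importance as a stand-in for VI are illustrative assumptions, not part of the thesis topic: a model trained on observational data assigns high importance to the number of firefighters, even though dispatching more firefighters does not cause more damage.

```python
# Sketch (hypothetical simulation): fire_size causes both the number of
# firefighters dispatched and the damage, so firefighters and damage are
# strongly correlated without any causal link between them.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.inspection import permutation_importance

rng = np.random.default_rng(0)
n = 5_000
fire_size = rng.gamma(shape=2.0, scale=1.0, size=n)          # hidden common cause
firefighters = 3 * fire_size + rng.normal(0, 0.5, size=n)    # dispatched in proportion to fire size
damage = 10 * fire_size + rng.normal(0, 1.0, size=n)         # damage depends only on fire size

# A model trained only on the observable proxy "firefighters".
X = firefighters.reshape(-1, 1)
model = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, damage)

# Permutation importance flags "firefighters" as highly predictive of damage,
# even though intervening on it would not change the damage at all.
result = permutation_importance(model, X, damage, n_repeats=10, random_state=0)
print("importance of 'firefighters':", result.importances_mean[0])
```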
Causal analysis is especially critical in high-stakes domains such as healthcare, economics, and public policy, where understanding the effect of interventions (e.g., changing treatment or policy) is essential for trustworthy and actionable AI systems. Incorporating causality into interpretability methods can bridge the gap between black-box ML models and decision-making based on real-world cause-and-effect relationships.
This thesis aims to integrate traditional ML interpretability techniques with causal inference frameworks, specifically Structural Causal Models (SCMs), counterfactual analysis, and do-calculus.
The goal is to extend or adapt interpretability methods like VI, PDP, and SHAP to their causal counterparts and compare how well they reflect the true causal effects of features on predictions.
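To indicate the kind of comparison this involves, here is a minimal sketch under a toy linear SCM with a hidden confounder; the structural equations, coefficients, and the choice of scikit-learn's GradientBoostingRegressor are illustrative assumptions, not a prescribed part of the thesis. It contrasts an observational PDP for a feature X with its interventional counterpart E[Y | do(X = x)], obtained by cutting the confounder's edge into X and setting X by hand.

```python
# Sketch (toy SCM chosen for illustration): observational PDP vs. do-intervention.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.default_rng(1)
n = 10_000

# Ground-truth structural causal model:
#   Z ~ N(0, 1)                      hidden confounder
#   X := Z + noise                   X is driven by Z
#   W ~ N(0, 1)                      extra observed covariate, unrelated to Y
#   Y := 0.5 * X + 2 * Z + noise     true causal effect of X on Y is 0.5
Z = rng.normal(0, 1, n)
X = Z + rng.normal(0, 0.5, n)
W = rng.normal(0, 1, n)
Y = 0.5 * X + 2 * Z + rng.normal(0, 0.5, n)

# The fitted model observes X and W, but not the confounder Z.
model = GradientBoostingRegressor(random_state=1).fit(np.column_stack([X, W]), Y)

grid = np.linspace(-1.5, 1.5, 7)

# Observational PDP for X: fix X = x and average predictions over the empirical
# distribution of the remaining observed features. The slope comes out near 2,
# because the omitted confounder Z inflates the association.
pdp = [model.predict(np.column_stack([np.full(n, x), W])).mean() for x in grid]

# Interventional counterpart E[Y | do(X = x)]: set X = x in the SCM and average
# Y over its remaining randomness. The slope is 0.5 by construction.
do_curve = [np.mean(0.5 * x + 2 * Z + rng.normal(0, 0.5, n)) for x in grid]

print("grid:        ", np.round(grid, 2))
print("PDP(x):      ", np.round(pdp, 2))
print("E[Y|do(X=x)]:", np.round(do_curve, 2))
```

Because the confounder Z is not available to the model, the observational PDP exaggerates the effect of X, while the interventional curve recovers the structural coefficient; a causal variant of PDP based on a valid adjustment set would be expected to close this gap, and analogous comparisons can be made for VI and SHAP.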