Causal-Inference Emsley
Causal-Inference Emsley
Causal-Inference Emsley
Richard Emsley
Professor of Medical Statistics and Trials Methodology
Department of Biostatistics & Health Informatics
Institute of Psychiatry, Psychology & Neuroscience
Correlation and causation
3
If there is a significant correlation between two
variables R and Y, then either:
1. R causes Y R Y
2. Y causes R R Y
R Y
3. R and Y share a common cause X
R Y
4. R and Y are conditioned on a
common descendent Z
Z
4
The general principle of causal inference
• Statistical models can only tell us about association between
two variables (say R and Y)
5
A brief history of causal inference (1)
6
A brief history of causal inference (2)
7
A brief history of causal inference (3)
8
A brief history of causal inference (4)
9
A brief history of causal inference (5)
• There is a group who argue
against using the counterfactuals
or potential outcomes framework.
10
Is the terminology important?
11
Fundamental concept of causal inference
R Y
R Y
Types of variable in DAGs
• Confounding (X)
• Mediation (M)
• Partitions effects into direct and indirect effects
• Colliders (Z) – biases association between their parents
X
R M Y
Z
Overview
1. What is causal inference?
18
When can ATE be estimated?
• We wish to evaluate the effects of receiving a treatment compared
to a suitably defined control condition
• Thus we want to use a sample of subjects from a relevant target
population to compare outcomes between a treated group and a
control group
• When can we do this without running into problems?
– Randomised controlled trial (RCT): Participants are randomised to two
arms (experimental treatment and control)
• YES - provided participants adhere to their allocated treatment.
– Observational study: Compares subjects receiving the experimental
treatment with subjects under the control condition
• SOMETIMES - only if variables that drive treatment group selection
have been measured and accounted for appropriately
19
What are we estimating in trials?
• Interested in various measures of effect
– Effectiveness - the benefit of a treatment policy
– Efficacy - the benefit of actually receiving treatment
20
Target mechanisms
Mechanism
Intervention Symptoms
• An explanatory analysis of a trial would seek to establish that this is indeed the
case using mediation analysis; i.e. assess the mediated path.
21
Mediation analysis and causal inference…
22
The basic underlying problem:
estimating valid causal effects
U – the unmeasured confounders
error
U
Mediator
Random error
Outcomes
allocation
Covariates
23
Example: simple mediation analysis
e1
0.25 X
M 0.25 0.25
0.25
0.25
R Y e2
. regress y r
25
Example: regression approach stage 2
. regress m r
26
Example: regression approach stage 3
. regress y m r
27
Example: regression approach stage 3
. regress y m r x
28
Example: with measurement error
. regress y m_star r x
29
“Nuisance” mediators
• Variables measured post-randomisation that we may wish to
rule out having a mediated effect - essentially we want to
estimate the residual direct effects and find a small indirect
effect
Antidepressant
Use
Cognitive therapy +
antidepressant Depression
Vs. Score
Antidepressant only
30
Example: HIV and condom use
• Diaphragm and lubricant gel for prevention of HIV acquisition
in southern African women: a randomised controlled trial
Diaphragm + gel +
condoms
HIV infection
Vs
Condoms alone
31
Example: ITT effect on HIV incidence
32
Example: ITT effect on condom use
• Padian et al (2007)
33
Example: mediation question
• “Shelton and Stein also ask for a disentanglement of the
separate effects of condom and diaphragm use on incidence
of HIV infection in the trial, part of which necessarily involves
an estimation of the independent effect of condom use. We
agree that such analyses are important additions to basic
intention-to-treat results, and in fact, we prespecified
appropriate methods to address these issues in our analytical
plan and included results in the submitted paper. Regrettably,
we were directed not to include these findings by both a
referee and an editor of the original article.”
34
Example: mediation hypothesis
• A case for controlled direct effects…
36
Assumptions for identification
• Controlled direct effects require:
– A1: no unmeasured R-Y confounding (U1);
– A2: no unmeasured M-Y confounding (U2).
M U2
R Y
X
U1
37
M-bias graph
G B
X
D Y
38
M-bias graph
• What are all the paths from D to Y?
G B
X
• What is the consequence of adjusting
for X in estimating the effect of D on Y? D Y
39
A realistically complex DAG
• To estimate the effect of obesity on PE (Pearce and Lawlor, IJE, 2017)
• Unblocked paths:
• Obesity – Smoking – SEP - Age at pregnancy - PE
• Obesity – SEP – Age at pregnancy - PE
• Obesity – Smoking – Age at pregnancy - PE
• Obesity – Smoking – PE
B
X
R D 𝛽
Y
• R = randomisation
• D = treatment received
• Y = outcome Effect of R on Y
𝛽=
• X = confounder Effect of R on D
• B = prognostic variable
42
Mendelian randomisation
B
X
G D 𝛽
Y
• G = gene
• D = treatment received
• Y = outcome Effect of G on Y
𝛽=
• X = confounder Effect of G on D
• B = prognostic variable
43
Overview
1. What is causal inference?
44
Should we be careful with the C-Word?
• Miguel Hernán (2018), American Journal of Public Health
45
Observational studies and causality
• In observational studies, we want to know about variables
that can be modified or manipulated
46
When can we use the words “causal effect”?
• Title
• Introduction
• Methods
• Results
• Discussion 47
Hayes and Rockwood (2017)
“There are some hardliners who say that to claim the existence of
cause-effect relationships (and mediation is by definition a cause-
effect process), one must engage in experimental manipulation with
random assignment, collect data over time or, ideally, both.
“(see e.g., Emsley, Dunn, & White, 2010; Preacher, 2015, for a
discussion of many of these assumptions).” “
Hayes and Rockwood (2017)
“We feel that if these are taken as literal requirements rather than as
just ideals or recommendations, most research would not be done
because most researchers cannot meet these requirements (due to
resource constraints, ethics, and a myriad list of other reasons).”
“We would rather see more imperfect work conducted and published
than see research slow to a trickle because investigators don’t feel that
their work will satisfy all critics and pass every test for valid causal
inference.”
“You can do most anything you want with your data. Most any
statistical tool can provide some insight into the story you ultimately
end up telling with your data.”
51
Research Programme:
Efficacy and Mechanisms Evaluation
Joint work and slides prepared with Graham Dunn, Ian White, Andrew Pickles and Sabine Landau.
• Design and methods of explanatory (causal) analysis for randomised trials of complex interventions in
mental health (2006-2009)
– Graham Dunn (PI), Richard Emsley, et al
• Estimation of causal effects of complex interventions in longitudinal studies with intermediate variables
(2009-2012)
– Richard Emsley (PI), Graham Dunn.
• Designs and analysis for the evaluation and validation of social and psychological markers in randomised
trials of complex interventions in mental health (2010-12)
– Graham Dunn (PI), Richard Emsley, et al.
53
Thank you for your attention
Email: [email protected]
54