Skip to content

Commit 10a3df3

Browse files
committed
Several small copyedits and comma fixes
1 parent 812c23f commit 10a3df3

File tree

2 files changed

+22
-20
lines changed

2 files changed

+22
-20
lines changed

content/blog/2021-01-15-causal-effect-mobility.Rmd

Lines changed: 11 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -67,20 +67,20 @@ exposition of some key ideas here.
6767
To learn more,
6868
we recommend [the book
6969
by Hernan and Robins](https://www.hsph.harvard.edu/miguel-hernan/causal-inference-book/) and
70-
the paper:
70+
the paper
7171
"[Statistics and Causal Inference](https://www.jstor.org/stable/2289064)"
7272
by Paul Holland
73-
(Journal of the American Statistical Association 1986, p 945-960).
73+
(*Journal of the American Statistical Association* 1986, pp. 945-960).
7474
And one can find many tutorials
7575
on the web.
7676

7777
Suppose we have
7878
three variables
79-
$X$, $A$ and $Y$ where
79+
$X$, $A$ and $Y$, where
8080
$Y$ is the outcome of interest
81-
(such as deaths from COVID)
81+
(such as deaths from COVID),
8282
$A$ is a treatment or exposure
83-
(such as mobility level)
83+
(such as mobility level),
8484
and $X$ are confounding variables,
8585
which are variables that affect both
8686
$A$ and $Y$ (such as age).
@@ -115,7 +115,7 @@ where
115115
$\mu(x,a) = \mathbb{E}[Y|X=x,A=a]$
116116
is the mean of $Y$ given $X$ and $A$.
117117
This is not equal to
118-
the mean of $Y$ given $A=a$ which is
118+
the mean of $Y$ given $A=a$, which is
119119
$\mathbb{E}[Y|A=a] = \int \mu(x,a) p(x|a) dx$.
120120
This is the difference between causation
121121
and prediction (correlation).
@@ -154,7 +154,8 @@ The first plot below shows an example where we would predict
154154
higher values of $Y$ when $A$ is large. For pure prediction, this is
155155
the correct conclusion. The second plot shows that once we
156156
account for $X = \text{age}$ (corresponding to different colors) there is
157-
a negative relationship between $Y$ and $A$. In this case, age is a
157+
a negative relationship between $Y$ and $A$: for a given person with a fixed
158+
age, increasing $A$ would *decrease* $Y$. In this case, age is a
158159
confounder and the $g$-formula would correctly recover the negative
159160
relationship. For causal inference, this is the correct conclusion.
160161

@@ -239,12 +240,12 @@ They mean the same thing.
239240

240241
Having a formula for
241242
$\psi(\overline{a}_t)$
242-
is great but how do we estimate it?
243+
is great, but how do we estimate it?
243244
Again we could plug-in estimates
244245
of all the quantities in
245246
the $g$-formula.
246247
But there are
247-
better things we can do
248+
better things we can do,
248249
as we now explain.
249250

250251
## A Marginal Structural Model
@@ -495,7 +496,7 @@ $Y_t^\gamma$
495496
is the (counterfactual) number of deaths
496497
we would have observed of mobility had
497498
been increases by $\gamma$ at all times,
498-
were we take $\gamma = 10$
499+
where we take $\gamma = 10$
499500
(a ten percent increase in stay-at-home).
500501
The fact that the values
501502
are negative

content/blog/2021-01-15-causal-effect-mobility.html

Lines changed: 11 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -70,19 +70,19 @@ <h2>Causal Inference</h2>
7070
To learn more,
7171
we recommend <a href="https://www.hsph.harvard.edu/miguel-hernan/causal-inference-book/">the book
7272
by Hernan and Robins</a> and
73-
the paper:
73+
the paper
7474
<a href="https://www.jstor.org/stable/2289064">Statistics and Causal Inference</a>
7575
by Paul Holland
76-
(Journal of the American Statistical Association 1986, p 945-960).
76+
(<em>Journal of the American Statistical Association</em> 1986, pp. 945-960).
7777
And one can find many tutorials
7878
on the web.</p>
7979
<p>Suppose we have
8080
three variables
81-
<span class="math inline">\(X\)</span>, <span class="math inline">\(A\)</span> and <span class="math inline">\(Y\)</span> where
81+
<span class="math inline">\(X\)</span>, <span class="math inline">\(A\)</span> and <span class="math inline">\(Y\)</span>, where
8282
<span class="math inline">\(Y\)</span> is the outcome of interest
83-
(such as deaths from COVID)
83+
(such as deaths from COVID),
8484
<span class="math inline">\(A\)</span> is a treatment or exposure
85-
(such as mobility level)
85+
(such as mobility level),
8686
and <span class="math inline">\(X\)</span> are confounding variables,
8787
which are variables that affect both
8888
<span class="math inline">\(A\)</span> and <span class="math inline">\(Y\)</span> (such as age).
@@ -112,7 +112,7 @@ <h2>Causal Inference</h2>
112112
<span class="math inline">\(\mu(x,a) = \mathbb{E}[Y|X=x,A=a]\)</span>
113113
is the mean of <span class="math inline">\(Y\)</span> given <span class="math inline">\(X\)</span> and <span class="math inline">\(A\)</span>.
114114
This is not equal to
115-
the mean of <span class="math inline">\(Y\)</span> given <span class="math inline">\(A=a\)</span> which is
115+
the mean of <span class="math inline">\(Y\)</span> given <span class="math inline">\(A=a\)</span>, which is
116116
<span class="math inline">\(\mathbb{E}[Y|A=a] = \int \mu(x,a) p(x|a) dx\)</span>.
117117
This is the difference between causation
118118
and prediction (correlation).
@@ -147,7 +147,8 @@ <h2>Causal Inference</h2>
147147
higher values of <span class="math inline">\(Y\)</span> when <span class="math inline">\(A\)</span> is large. For pure prediction, this is
148148
the correct conclusion. The second plot shows that once we
149149
account for <span class="math inline">\(X = \text{age}\)</span> (corresponding to different colors) there is
150-
a negative relationship between <span class="math inline">\(Y\)</span> and <span class="math inline">\(A\)</span>. In this case, age is a
150+
a negative relationship between <span class="math inline">\(Y\)</span> and <span class="math inline">\(A\)</span>: for a given person with a fixed
151+
age, increasing <span class="math inline">\(A\)</span> would <em>decrease</em> <span class="math inline">\(Y\)</span>. In this case, age is a
151152
confounder and the <span class="math inline">\(g\)</span>-formula would correctly recover the negative
152153
relationship. For causal inference, this is the correct conclusion.</p>
153154
<p><img src="/blog/images/causal-simple-confounder.svg" /></p>
@@ -221,12 +222,12 @@ <h2>Causal Inference</h2>
221222
They mean the same thing.</p>
222223
<p>Having a formula for
223224
<span class="math inline">\(\psi(\overline{a}_t)\)</span>
224-
is great but how do we estimate it?
225+
is great, but how do we estimate it?
225226
Again we could plug-in estimates
226227
of all the quantities in
227228
the <span class="math inline">\(g\)</span>-formula.
228229
But there are
229-
better things we can do
230+
better things we can do,
230231
as we now explain.</p>
231232
</div>
232233
<div id="a-marginal-structural-model" class="section level2">
@@ -456,7 +457,7 @@ <h2>The Data and the Results</h2>
456457
is the (counterfactual) number of deaths
457458
we would have observed of mobility had
458459
been increases by <span class="math inline">\(\gamma\)</span> at all times,
459-
were we take <span class="math inline">\(\gamma = 10\)</span>
460+
where we take <span class="math inline">\(\gamma = 10\)</span>
460461
(a ten percent increase in stay-at-home).
461462
The fact that the values
462463
are negative

0 commit comments

Comments
 (0)