Commit 44ebcdb

20260117 - DAGs
1 parent e212fc6 commit 44ebcdb

1 file changed

+12
-6
lines changed

dags.qmd

Lines changed: 12 additions & 6 deletions
@@ -8,12 +8,18 @@ Causal diagrams depict the hypothesized causal processes that link two or more
88
Path diagrams are typically used after analysis to describe and report the findings in analysis (when using path analysis, [factor analysis](factorAnalysis.qmd), or [structural equation modeling](sem.qmd)).
99
By contrast, DAGs are particularly useful when designing a study or before analysis, because they can help specify which variables it is important to control for and—just as importantly—which variables it is important not to control for.
1010

11-
When drawing a DAG for your study, draw all the variables that link the hypothesized cause to the hypothesized effect, including confounders, mediators, and colliders.
12-
In your study, it is important to control for confounders.
13-
Moreover, it is important not to control for mediators when you are interested in the total effect of the predictor on the outcome.
14-
In addition, it is important not to control for descendants of the outcome variable.
15-
When there is a collision, it is important not to control for the collider when examining the association between the two causes of the collider.
16-
The only time when one should control for a collider is when the collider is also a cause (i.e., confound) of both the predictor and outcome variable rather than a common effect of both.
11+
When drawing a DAG for your study, draw all the variables that link the hypothesized cause to the hypothesized effect, including confounds, mediators, and colliders.
12+
13+
In your study, it is important to control for all confounds.
14+
In addition, it is okay to control for ancestors of the outcome variable that are not confounds or mediators.
15+
That is, it is okay to control for variables that influence `Y` that do not influence `X` and that are not influenced by `X`.
16+
When these variables are included as control variables in a model, they are called precision variables.
17+
You do not need to include precision variables in the model because the estimate of the association is already unbiased if you have controlled for all confounds.
18+
However, including precision variables in the model reduces residual variance in the outcome variable and can yield more precise estimates (i.e., smaller standard errors) of the association between the predictor variable and outcome variable.
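
The difference between confounds and precision variables can be illustrated with a small simulation. This is a minimal sketch rather than an analysis from the text: the variable names (`c`, `p`, `x`, `y`), the coefficient values, and the helper `ols` are all hypothetical, and the data are simulated under an assumed true effect of `X` on `Y` of 0.5.

```python
import numpy as np

def ols(y, *predictors):
    """Fit OLS with an intercept; return (coefficients, standard errors)."""
    X = np.column_stack([np.ones(len(y)), *predictors])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    sigma2 = resid @ resid / (len(y) - X.shape[1])  # residual variance
    se = np.sqrt(np.diag(sigma2 * np.linalg.inv(X.T @ X)))
    return beta, se

rng = np.random.default_rng(0)
n = 20_000
true_effect = 0.5  # assumed causal effect of X on Y (hypothetical value)

c = rng.normal(size=n)             # confound: influences both X and Y
p = rng.normal(size=n)             # precision variable: influences Y only
x = 0.8 * c + rng.normal(size=n)
y = true_effect * x + 0.8 * c + 1.5 * p + rng.normal(size=n)

b_naive, se_naive = ols(y, x)      # confound omitted: biased
b_adj, se_adj = ols(y, x, c)       # confound controlled: unbiased
b_prec, se_prec = ols(y, x, c, p)  # plus precision variable: smaller SE

# Index 1 is the coefficient on x (index 0 is the intercept).
print(f"naive:            b = {b_naive[1]:.3f}, SE = {se_naive[1]:.4f}")
print(f"control C:        b = {b_adj[1]:.3f}, SE = {se_adj[1]:.4f}")
print(f"control C and P:  b = {b_prec[1]:.3f}, SE = {se_prec[1]:.4f}")
```

Under these assumptions, omitting `c` inflates the estimate, controlling for `c` recovers a value near 0.5, and adding `p` leaves the estimate essentially unchanged while shrinking its standard error.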
19+
20+
In addition, there are some variables that are important not to control for.
21+
It is important not to control for mediators of the association between the predictor variable and outcome variable when you want to estimate the total causal effect, unless you are interested in the direct causal effect of the predictor variable on the outcome variable above and beyond the mediator.
22+
In addition, it is important not to control for a) ancestors of the predictor variable that are not confounds, b) descendants of the outcome variable, and c) colliders (unless the collider is also a confound).
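
The consequences of controlling for a mediator or a collider can be sketched with simulated data. The names (`m`, `k`, `slope_on_x`) and all coefficient values here are hypothetical: the assumed direct effect of `X` on `Y` is 0.3, and the assumed total effect is 0.3 + 0.6 × 0.5 = 0.6.

```python
import numpy as np

def slope_on_x(y, x, *controls):
    """Return the OLS coefficient on x, adjusting for any control variables."""
    X = np.column_stack([np.ones(len(y)), x, *controls])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta[1]  # index 0 is the intercept

rng = np.random.default_rng(1)
n = 20_000

x = rng.normal(size=n)
m = 0.6 * x + rng.normal(size=n)            # mediator: X -> M -> Y
y = 0.3 * x + 0.5 * m + rng.normal(size=n)  # direct effect 0.3, total 0.6
k = x + y + rng.normal(size=n)              # collider: X -> K <- Y

total = slope_on_x(y, x)            # no controls: estimates the total effect
direct = slope_on_x(y, x, m)        # control mediator: direct effect only
collided = slope_on_x(y, x, k)      # control collider: biased estimate

print(f"total effect (no controls):   {total:.3f}")
print(f"direct effect (control M):    {direct:.3f}")
print(f"controlling for collider K:   {collided:.3f}")
```

In this setup, controlling for the mediator removes the indirect path and leaves only the direct effect, whereas controlling for the collider distorts the estimate so strongly that its sign reverses.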
1723

1824
For more information on DAGs, including ancestors, descendants, confounders, and colliders, see here: <https://isaactpetersen.github.io/Fantasy-Football-Analytics-Textbook/causal-inference.html#sec-causalDiagrams>.
1925
