| Literature DB >> 32154892 |
Kellyn F Arnold1,2, Laurie Berrie1,2, Peter W G Tennant1,2,3, Mark S Gilthorpe1,2,3.
Abstract
BACKGROUND: Compositional data comprise the parts of some whole, for which all parts sum to that whole. They are prevalent in many epidemiological contexts. Although many of the challenges associated with analysing compositional data have been discussed previously, we do so within a formal causal framework by utilizing directed acyclic graphs (DAGs).Entities:
Keywords: Compositional data; causal inference; collider bias; directed acyclic graphs; joint effects; relative effects
Mesh:
Year: 2020 PMID: 32154892 PMCID: PMC7660155 DOI: 10.1093/ije/dyaa021
Source DB: PubMed Journal: Int J Epidemiol ISSN: 0300-5771 Impact factor: 7.196
Figure 1Directed acyclic graphs (DAGs) depicting two random variables A and B. (A) A causes B probabilistically; this is indicated by a single-lined arrow. (B) A causes B deterministically; this is indicated by a double-lined arrow and double-outlined rectangle.
Figure 2Directed acyclic graphs (DAGs) depicting three random variables X, Y and Z, for which X + Y = Z. Deterministic relationships are indicated by double-lined arrows, and fully determined nodes are indicated by double-outlined rectangles. A dashed box around variables indicates that those variables occur at an instantaneous point in time. (A) X and Y are unconditionally independent. (B) X and Y are unconditionally independent, and may affect a subsequent outcome O via their influence on Z. We note that, due to the deterministic nature of X, Y and Z, it is not possible to parameterize all arrows simultaneously.
Figure 3Directed acyclic graph (DAG) depicting total population in relation to gross domestic product (GDP), in which total population is subdivided into economic activity and inactivity (i.e. total population = economically active population + economically inactive population). Deterministic relationships are indicated by double-lined arrows, and fully determined nodes are indicated by double-outlined rectangles. A dashed box around variables indicates that those variables occur at an instantaneous point in time.
Figure 4Directed acyclic graph (DAG) depicting total energy intake in relation to body weight, in which total energy intake is subdivided into macronutrient consumption (i.e. total energy intake = fat consumption + protein consumption + carbohydrate consumption). Deterministic relationships are indicated by double-lined arrows, and fully determined nodes are indicated by double-outlined rectangles. A dashed box around variables indicates that those variables occur at an instantaneous point in time.
Figure 5Directed acyclic graph (DAG) depicting total hours in relation to body weight, in which total hours is subdivided into activity category (i.e. total hours = time spent sedentary + time spent physically active). Deterministic relationships are indicated by double-lined arrows, and fully determined nodes are indicated by double-outlined rectangles. A dashed box around variables indicates that those variables occur at an instantaneous point in time. Total hours is inherently constrained (i.e. total hours = 24) and thus has no identifiable causal effect on body weight.