| Literature DB >> 35873671 |
Samira Asgari1,2,3,4,5,6, Yang Luo1,2,3,4,5, Chuan-Chin Huang7,8, Zibiao Zhang7,9,8, Roger Calderon10, Judith Jimenez10, Rosa Yataco10, Carmen Contreras10, Jerome T Galea7,11, Leonid Lecca7,10, David Jones7,12, D Branch Moody2, Megan B Murray7,9,8, Soumya Raychaudhuri1,2,3,4,5,13,14.
Abstract
We investigated whether ancestry-specific genetic factors affect tuberculosis (TB) progression risk in a cohort of admixed Peruvians. We genotyped 2,105 patients with TB and 1,320 household contacts (HHCs) who were infected with Mycobacterium tuberculosis (M. tb) but did not develop TB and inferred each individual's proportion of native Peruvian genetic ancestry. Our HHC study design and our data on potential confounders allowed us to demonstrate increased risk independent of socioeconomic factors. A 10% increase in individual-level native Peruvian genetic ancestry proportion corresponded to a 25% increased TB progression risk. This corresponds to a 3-fold increased risk for individuals in the highest decile of native Peruvian genetic ancestry versus the lowest decile, making native Peruvian genetic ancestry comparable in effect to clinical factors such as diabetes. Our results suggest that genetic ancestry is a major contributor to TB progression risk and highlight the value of including diverse populations in host genetic studies.Entities:
Year: 2022 PMID: 35873671 PMCID: PMC9306274 DOI: 10.1016/j.xgen.2022.100151
Source DB: PubMed Journal: Cell Genom ISSN: 2666-979X
Figure 1.Our household-contact study design
We recruited patients in a large catchment area that included 20 urban districts and ~3.3 million residents. Within 14 days of enrollment of index cases, we contacted their household contacts (HHCs). HHCs with pulmonary TB were recruited as cases (baseline cases). HHCs that were TST positive but did not have active TB were recruited as controls. All individuals were followed up with for 1 year, and all HHCs were evaluated for signs and symptoms of pulmonary and extra-pulmonary TB disease at 2, 6, and 12 months after enrollment and were recruited as cases if they developed active TB during follow up (secondary cases). HHCs that remained or became TST positive but did not develop active TB were recruited as controls. The final cohort included 2,105 TB cases and 1,320 TST-positive HHCs.
Cohort’s demographic information
| Mean (SD) | p value | ||||
|---|---|---|---|---|---|
| Native Peruvian genetic ancestry | |||||
| NAT tertile 1 | 0.64 (0.13) | <2.2 × 10−308 | |||
| NAT tertile 2 | 0.84 (0.03) | ||||
| NAT tertile 3 | 0.94 (0.03) | ||||
| European genetic ancestry | |||||
| NAT tertile 1 | 0.26 (0.10) | <2.2 × 10−308 | |||
| NAT tertile 2 | 0.14 (0.03) | ||||
| NAT tertile 3 | 0.06 (0.03) | ||||
| West African genetic ancestry | |||||
| NAT tertile 1 | 0.08 (0.10) | 3.6 × 10−185 | |||
| NAT tertile 2 | 0.015 (0.02) | ||||
| NAT tertile 3 | 0.005 (0.006) | ||||
| East Asian genetic ancestry | |||||
| NAT tertile 1 | 0.02 (0.05) | 1.4 × 10−59 | |||
| NAT tertile 2 | 0.007 (0.01) | ||||
| NAT tertile 3 | 0.003 (0.006) | ||||
| Age | |||||
| NAT tertile 1 | 33.26 (15.59) | 2.0 × 10−8 | |||
| NAT tertile 2 | 29.02 (13.12) | ||||
| NAT tertile 3 | 34.90 (17.05) | ||||
|
|
|
| |||
| NAT tertile 1 | 0.16 | 0.17 | 2.3 × 10−16 | ||
| NAT tertile 2 | 0.12 | 0.22 | |||
| NAT tertile 3 | 0.11 | 0.23 | |||
|
|
|
| |||
| NAT tertile 1 | 0.13 | 0.20 | 4.7 × 10−6 | ||
| NAT tertile 2 | 0.14 | 0.19 | |||
| NAT tertile 3 | 0.16 | 0.17 | |||
|
|
|
|
|
| |
| NAT tertile 1 | 0.02 | 0.02 | 0.29 | 0.008 | 2.2 × 10−26 |
| NAT tertile 2 | 0.004 | 0.01 | 0.31 | 0.007 | |
| NAT tertile 3 | 0.002 | 0.004 | 0.32 | 0.005 | |
|
|
|
|
|
| |
| NAT tertile 1 | 0.04 | 0.11 | 0.17 | 0.02 | 1.5 × 10−12 |
| NAT tertile 2 | 0.03 | 0.09 | 0.2 | 0.01 | |
| NAT tertile 3 | 0.02 | 0.09 | 0.22 | 0.01 | |
|
|
|
|
|
| |
| NAT tertile 1 | 0.007 | 0.04 | 0.28 | 0.004 | 0.12 |
| NAT tertile 2 | 0.01 | 0.04 | 0.28 | 0.004 | |
| NAT tertile 3 | 0.007 | 0.03 | 0.29 | 0.004 | |
|
|
|
| |||
| NAT tertile 1 | 0.31 | 0.03 | 0.09 | ||
| NAT tertile 2 | 0.30 | 0.04 | |||
| NAT tertile 3 | 0.30 | 0.04 | |||
|
|
|
| |||
| NAT tertile 1 | 0.1 | 0.23 | 4.7 × 10−7 | ||
| NAT tertile 2 | 0.09 | 0.24 | |||
| NAT tertile 3 | 0.07 | 0.26 | |||
|
|
|
| |||
| NAT tertile 1 | 0.03 | 0.3 | 5.6 × 10−04 | ||
| NAT tertile 2 | 0.04 | 0.29 | |||
| NAT tertile 3 | 0.05 | 0.28 | |||
|
|
|
|
|
| |
| NAT tertile 1 | 0.10 | 0.13 | 0.08 | 0.01 | 0.79 |
| NAT tertile 2 | 0.11 | 0.13 | 0.08 | 0.01 | |
| NAT tertile 3 | 0.11 | 0.14 | 0.08 | 0.01 | |
The cohort includes 2,105 individuals with active TB (cases) and 1,320 infected HHCs (controls). We divided the cohort into tertiles based on native Peruvian genetic ancestry and tested the association of these tertiles with each covariate individually. Two-sided p values are calculated using the ANOVA for quantitative variables and using the chi-square test for categorical variables. For quantitative variables, mean (SD), and for categorical variables, percentages are shown. NA, not available. The socioeconomic group is a household-level variable; all other variables are individual-level variables. Numbers are rounded to two or three decimal places.
Figure 2.Native Peruvian genetic ancestry is associated with TB progression risk
(A) ADMIXTURE analysis results using K = 4 clusters. The average proportions of native Peruvian (red), European (yellow), West African (green), and East Asian (pink) genetic ancestry were 0.8 (standard deviation [SD] = 0.15), 0.16 (0.11), 0.03 (0.07), and 0.01 (0.03), respectively. X axis: individuals (axis ticks not shown). Y axis: genomic proportion. Displayed populations from left to right: Utah residents with Northern and Western European ancestry (CEU from the 1000 Genomes Project), Yoruba (YRI from the 1000 Genomes Project), Southern Han Chinese (CHS from the 1000 Genomes Project), Andean (from Reich et al.’s study), Northern American-Indians (North Amerind, from Reich et al.’s study), Central American-Indians (Central Amerind, from Reich et al.’s study), Altaic (Siberians speaking Altaic languages, from Reich et al.’s study), Puerto Ricans from Puerto Rico (PUR from the 1000 Genomes Project), Colombians from Medellin, Colombia (CLM from the 1000 Genomes Project), Mexican Ancestry from Los Angeles USA (MXL from the 1000 Genomes Project), Peruvians from Lima, Peru (PEL from the 1000 Genomes Project), TB cases form this study (n = 2,106), and TB controls from this study (n = 1,320).
(B) Probability density distribution of native Peruvian genetic ancestry proportion in TB cases and controls. TB cases have a higher proportion of native Peruvian genetic ancestry than infected HHCs (two-sided t test p = 8.8 × 10−19). Y axis: density. X axis: native Peruvian genetic ancestry (NAT) proportion.
(C) Individuals in the higher native Peruvian genetic ancestry decile have a higher TB risk. Individuals with the highest level of native Peruvian genetic ancestry (top decile, average native Peruvian genetic ancestry proportion = 0.97 [0.01], n = 232 cases, 110 controls) are three times more likely to progress to active TB (OR = 2.90, 95% CI = 1.99–4.26, p = 2.8 × 10−8) compared with the individuals in the bottom decile (average native Peruvian genetic ancestry proportion = 0.48 [0.13], n = 149 cases, 194 controls). X axis: average native Peruvian genetic ancestry proportion for each decile; error bars show standard error. Y axis: TB odds ratio (OR) after correction for age, sex, socioeconomic status, household, and genetic relatedness. ORs are shown relative to the first decile. Error bars show 95% CI. Each decile includes 398 individuals.
(D) Native Peruvian genetic ancestry remained significantly associated with TB progression risk after controlling for eight additional individual covariates including West African and East Asian genetic ancestry proportion, smoking, drinking, previous TB status, education level, BMI, and BCG vaccination. Similarly, native Peruvian genetic ancestry remained significantly associated with TB progression risk when we restricted our cohort to secondary or secondary clustered cases and their HHCs. Circles show OR for 0.1 increase in native Peruvian genetic ancestry; error bars show 95% CI. NAT, native Peruvian genetic ancestry.
Self-reported race and ethnicity in our cohort (n = 3,425)
| Self-reported race | Count |
|---|---|
| American Indian + White | 2,538 |
| American Indian | 515 |
| White | 289 |
| Black | 42 |
| American Indian + Black | 15 |
| Asian | 6 |
| Black + White | 6 |
| American Indian + White + Asian | 1 |
| NA | 13 |
| Self-reported ethnicity | Count |
| Latino | 3,410 |
| Not Latino | 9 |
| NA | 6 |
Figure 3.Genetic ancestry proportions among different self-reported race categories
Native Peruvian (NAT) is the dominant genetic ancestry in all categories of self-reported race. The proportion of NAT, European (EUR), West African (AFR), and East Asian (ASI) genetic ancestries was significantly different between self-reported race categories (ANOVA, degrees of freedom = 7, two-sided p = 1.7 × 10−48, 1.6 × 10−32, 1.2 × 10−118, and 8.2 × 10−32, respectively). Box plots show median and interquartile range (IQR), and whiskers show 1.5× IQR.
Association of genetic ancestry and TB progression risk
| Model | Genetic ancestry | ORNAT0.1 (CI) | p value |
|---|---|---|---|
| NAT + age + sex + SES + (1|HH) + GRM | NAT | 1.25 (1.18–1.33) | 1.1 × 10−13 |
| EUR + age + sex + SES + (1|HH) + GRM | EUR | 0.76 (0.70–0.83) | 1.3 × 10−10 |
| AFR + age + sex + SES + (1|HH) + GRM | AFR | 0.70 (0.60–0.80) | 8.5 × 10−7 |
| ASI + age + sex + SES + (1|HH) + GRM | ASI | 0.77 (0.59–1.00) | 0.05 |
| NAT + EUR + age + sex + SES + (1|HH) + GRM | NAT | 1.24 (1.11–1.39) | 2.5 × 10−4 |
| EUR | 0.99 (0.84–1.16) | 0.89 | |
| NAT + AFR + age + sex + SES + (1|HH) + GRM | NAT | 1.24 (1.15–1.34) | 8.5 × 10−8 |
| AFR | 0.97 (0.81–1.16) | 0.73 | |
| NAT + ASI + age + sex + SES + (1|HH) + GRM | NAT | 1.26 (1.18–1.34) | 6.5 × 10−13 |
| ASI | 1.11 (0.85–1.45) | 0.44 |
Native Peruvian genetic ancestry (NAT) is associated with TB progression risk while European (EUR), West African (AFR), and East Asian (ASI) genetic ancestries were associated with reduced TB progression risk. NAT remained significantly associated with increased TB progression risk conditioned on non-NAT ancestries, but none of the non-NAT ancestries showed association with TB progression risk after conditioning on NAT. Odds ratios and 95% confidence interval (CI) correspond to 10% increase in genetic ancestry (ORNAT0.1). p, two-sided Wald test p value; SES, socioeconomic status; HH, household.
KEY RESOURCES TABLE
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Deposited data | ||
| Individual-level TB status and genotyping data | Luo et al., 2019 | phs002025.v1.p1 |
| Software and algorithms | ||
| Plink | Chang et al., 2015 |
|
| PCAdmix | Brisbin et al., 2012 |
|
| ADMIXTURE | Alexander et al., 2009 |
|
| GENESIS R package | Conomos et al., 2016 |
|
| lme4qtl | Ziyatdinov et al., 2018 |
|
| Other | ||
| The 1000 Genomes Project | Consortium, 2015 |
|
| Genotyping data from Native Siberian and Native Americans | Reich et al., 2012 | Direct communication with authors |