| Literature DB >> 34302177 |
Desiré Casares-Marfil1, Beatriz Guillen-Guio2, Jose M Lorenzo-Salazar3, Héctor Rodríguez-Pérez2, Martin Kerick1, Mayra A Jaimes-Campos4, Martha L Díaz4, Elkyn Estupiñán1,4, Luis E Echeverría5, Clara I González4, Javier Martín1, Carlos Flores2,3,6, Marialbert Acosta-Herrera1.
Abstract
Chagas disease is an infection caused by the parasite Trypanosoma cruzi, endemic in Latino America. Leveraging the three-way admixture between Native American (AMR), European (EUR) and African (AFR) populations in Latin Americans, we aimed to better understand the genetic basis of Chagas disease by performing an admixture mapping study in a Colombian population. A two-stage study was conducted, and subjects were classified as seropositive and seronegative for T. cruzi. In stage 1, global and local ancestries were estimated using reference data from the 1000 Genomes Project (1KGP), and local ancestry associations were performed by logistic regression models. The AMR ancestry showed a protective association with Chagas disease within the major histocompatibility complex region [Odds ratio (OR) = 0.74, 95% confidence interval (CI) = 0.66-0.83, lowest P-value = 4.53 × 10-8]. The fine mapping assessment on imputed genotypes combining data from stage 1 and 2 from an independent Colombian cohort, revealed nominally associated variants in high linkage disequilibrium with the top signal (rs2032134, OR = 0.93, 95% CI = 0.90-0.97, P-value = 3.54 × 10-4) in the previously associated locus. To assess ancestry-specific adaptive signals, a selective sweep scan in an AMR reference population from 1KGP together with an in silico functional analysis highlighted the Tripartite Motif family and the human leukocyte antigen genes, with crucial role in the immune response against pathogens. Furthermore, these analyses emphasized the macrophages, neutrophils and eosinophils, as key players in the defense against T. cruzi. This first admixture mapping study in Chagas disease provided novel insights underlying the host immune response in the pathogenesis of this neglected disease.Entities:
Mesh:
Year: 2021 PMID: 34302177 PMCID: PMC8643504 DOI: 10.1093/hmg/ddab213
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Demographical characteristics and sample size of the Colombian collections
| Stage 1 | Stage 2 | |||
|---|---|---|---|---|
| Seropositive | Seronegative | Seropositive | Seronegative | |
| Pre QC sample size | 998 | 659 | 122 | 532 |
| Post QC sample size | 913 | 592 | 122 | 512 |
| Sex, females (%) | 503 (55) | 357 (60) | 65 (53) | 315 (62) |
| Age (mean ± SD) | 62.7 ± 16.2 | 49.0 ± 17.6 | 64.7 ± 11.2 | 50.0 ± 15.6 |
QC, quality controls.
Figure 3Selective sweep scan results for Native Americans (AMR) from the 1000 Genomes Project. Y and X axes represent the iSAFE scores and hg19 genomic positions, respectively. (A) iSAFE scores within the MHC region (chr6:28477895-33 389 603). Top ranking variants included in the significant admixture mapping region are highlighted in red (chr6:30079993-30 332 160). (B) Zoom in of the admixture mapping significant region, where the top 25 variants according to their iSAFE score are represented. Genetic variants are depicted in colored dots to reflect its LD with the variant with the highest iSAFE score based on pairwise r values in AMR population. Their dot size correlates to their composite variant to genes score (V2G) from Open Targets Genetics (https://genetics.opentargets.org/).
Joint single nucleotide polymorphism-ancestry analysis in the discovery stage
| Factor | OR (95% CI) |
|
|---|---|---|
| AMR ancestry | 0.74 (0.66–0.83) | 4.53 × 10−08 |
| Allele dosage rs115833233 | 1.00 (0.89–1.13) | 0.99 |
| AMR ancestry (conditioned on rs115833233) | 0.73 (0.65–0.82) | 2.15 × 10−08 |
AMR, Native American ancestry; CI, confidence interval; OR, odds ratio.
Association testing results and allele frequencies on imputed data for the admixture mapping associated region
| Stage 1 | Stage 2 | Meta-analysis | |||||||
|---|---|---|---|---|---|---|---|---|---|
| SNP_ID(*EA) | CHR:BP | EA frequency (cases/controls) | OR (95% CI) |
| EA frequency (cases/controls) | OR (95% CI) |
| OR (95% CI) |
|
| rs2032134*C | 6:30360509 | 0.81/0.86 | 0.95 (0.90–0.99) | 2.57 × 10−2 | 0.75/0.84 | 0.92 (0.87–0.97) | 3.67 × 10−3 | 0.93 (0.90–0.97) | 3.54 × 10−4 |
EA, effect allele; BP, base pair; CHR, chromosome; CI, confidence interval; OR, odds ratio; SNP, single nucleotide polymorphism.
Figure 2Regional plots of the results from admixture (top) and fine mapping (bottom) analyses, showing the −log10 transformed P-value in the y-axis and the hg19 genomic position in the x-axis. Top: Region centered on the significant variants for the AMR ancestry association. The horizontal line indicates the significance threshold (P-value = 3.07 × 10−6). Estimated recombination rates (light blue line) are plotted on the right y-axis. Bottom: Meta-analysis results from stages 1 and 2 within the admixture mapping significant region (chr6:30079993-30 332 160) and its proxies (r > 0.8) indicating the associated variant with the lowest significance in the region (rs2032134). The results for the remaining single nucleotide polymorphisms (SNPs) are color coded to reflect their degree of LD with the indicated SNP based on pairwise r values in AMR. The horizontal lines indicate the significant (solid line, P-value = 3.23 × 10−4) and suggestive thresholds (broken line, P-value = 6.46 × 10−3).
In silico functional assessment of the fine mapping and selective sweep analyses top variants
| Chr | SNP | Function | Nearest gene | eQTLs | C-HiC genes | PheWAS |
|---|---|---|---|---|---|---|
| 6 | rs2032134 | Intergenic |
|
|
| - |
| 6 | rs9261440 | Intergenic |
|
|
| Hematological measurement, eosinphil count |
eGENES in whole blood.
Lymphoblastoid cell line from Mifsud et al.
Macrophages and neutrophil from Javierre et al.
Queried databases were Open Targets Genetics, eQTLGen, HaploReg and Capture HiC Plotter.
Chr, chromosome; C-HiC, capture Hi-C; eQTL, expression quantitative trait loci; PheWAS, phenome-wide association study.