| Literature DB >> 29755620 |
Gerald Mboowa1,2, Ivan Sserwadda1, Marion Amujal2, Norah Namatovu2.
Abstract
HIV/AIDS, tuberculosis (TB), and malaria are 3 major global public health threats that undermine development in many resource-poor settings. Recently, the notion that positive selection during epidemics or longer periods of exposure to common infectious diseases may have had a major effect in modifying the constitution of the human genome is being interrogated at a large scale in many populations around the world. This positive selection from infectious diseases increases power to detect associations in genome-wide association studies (GWASs). High-throughput sequencing (HTS) has transformed both the management of infectious diseases and continues to enable large-scale functional characterization of host resistance/susceptibility alleles and loci; a paradigm shift from single candidate gene studies. Application of genome sequencing technologies and genomics has enabled us to interrogate the host-pathogen interface for improving human health. Human populations are constantly locked in evolutionary arms races with pathogens; therefore, identification of common infectious disease-associated genomic variants/markers is important in therapeutic, vaccine development, and screening susceptible individuals in a population. This review describes a range of host-pathogen genomic loci that have been associated with disease susceptibility and resistant patterns in the era of HTS. We further highlight potential opportunities for these genetic markers.Entities:
Year: 2018 PMID: 29755620 PMCID: PMC5884297 DOI: 10.1155/2018/1875217
Source DB: PubMed Journal: Can J Infect Dis Med Microbiol ISSN: 1712-9532 Impact factor: 2.471
Host loci strongly associated with the three common infectious diseases.
| Disease | Population | Variant | Annotation | Genome-wide significance ( | Association | References |
|---|---|---|---|---|---|---|
| HIV-1/AIDS | African Americans | rs2523608 |
| 5.6 × 10−10 | Viral load set point | [ |
| European ancestry | rs2395029 |
| 9.7 × 10−26 | Long-term nonprogression and viral load set point | [ | |
| European ancestry | rs9264942 |
| 2.8 × 10−35 | Long-term nonprogression and viral load set point | [ | |
| European ancestry | rs4418214 |
| 1.4 × 10−34 | Long-term nonprogression | [ | |
| European ancestry | rs3131018 |
| 4.2 × 10−16 | Long-term nonprogression | ||
| African American | rs2255221 |
| 3.5 × 10−14 | Long-term nonprogression | ||
| rs2523590 |
| 1.7 × 10−13 | Long-term nonprogression | |||
| rs9262632 |
| 1.0 × 10−8 | Long-term nonprogression | |||
| European ancestry | rs9261174 |
| 1.8 × 10–8 | Disease progression | [ | |
| rs11884476 |
| 3.4 × 10–9 | Disease progression | [ | ||
| rs2234358 |
| 9.7 × 10–10 | Long-term nonprogression | [ | ||
| Malaria | Ghanaian | rs10900585 |
| 6.1 × 10−9 | Protective | [ |
| Ghanaian | rs2334880 |
| 3.9 × 10−8 | Protective | [ | |
| Ghanaian | rs8176719 |
| 2.9 × 10−13 | Protective | [ | |
| Gambian | rs11036238 |
| 3.7 × 10–11 | Susceptibility | [ | |
| Gambian | rs334 |
| 4 × 10−14 | Protective | [ | |
| Tuberculosis | Moroccan | rs17590261 |
| 2 × 10−6 | Age-at-onset of TB | [ |
| Ghanaian and Gambian | rs4331426 | 18q11.2 | 6.8 × 10−9 | Susceptibility | [ |
Figure 1(a) Pipeline for interrogation of pathogen genomes using high-throughput sequencing and computational approaches. DNA extraction for HTS can be done from either direct clinical specimen of individuals who are suspected to be infected with the disease or from enriched/isolated cultures. Quality control and read preprocessing are critical steps in the analysis of datasets generated from high-throughput sequencing technologies. FASTQC is an example of a tool for general quality assessment of HTS data from all technologies. Genomes can be recreated with no prior knowledge using de novo sequence assembly as well as recreating the genome using prior knowledge based on a reference genome—alignment/mapping. The former is necessary for novel genomes and where the sequenced genome differs from reference. Sequence data analysis is important in infectious disease outbreak investigations, molecular typing, antimicrobial drug resistance, transmission, surveillance, and microbial evolution. (b) Pipeline for interrogation of host genomes using high-throughput sequencing and computational approaches. For a given infectious disease in a population, an appropriate study design is determined and host DNA is collected from cases (exposed to pathogen and infected) and controls (exposed to pathogen and uninfected). HTS of DNA from both cases and control is performed. Quality control (QC) procedures vary in different pipelines. These include QC on individuals for missingness, gender checks, duplicates and cryptic relatedness, population outliers, heterozygosity and inbreeding, QC on SNPs for missingness, minor allele frequency, and Hardy–Weinberg equilibrium. Many of these are computationally intensive, operationally challenging, and constantly evolving. Genome-wide association studies (GWASs) involving case-control studies compare the frequencies of common genetic variants, assume an appropriate statistical model, and account for multiple testing correction threshold to identify susceptibility and protective polymorphisms in the population.
Figure 2Comparison of selected communicable and noncommunicable disease GWASs since 2005.