Literature DB >> 28951201

Genetic susceptibility to infectious diseases: Current status and future perspectives from genome-wide approaches.

Alessandra Mozzi1, Chiara Pontremoli1, Manuela Sironi2.   

Abstract

Genome-wide association studies (GWASs) have been widely applied to identify genetic factors that affect complex diseases or traits. Presently, the GWAS Catalog includes >2800 human studies. Of these, only a minority have investigated the susceptibility to infectious diseases or the response to therapies for the treatment or prevention of infections. Despite their limited application in the field, GWASs have provided valuable insights by pinpointing associations to both innate and adaptive immune response loci, as well as novel unexpected risk factors for infection susceptibility. Herein, we discuss some issues and caveats of GWASs for infectious diseases, we review the most recent findings ensuing from these studies, and we provide a brief summary of selected GWASs for infections in non-human mammals. We conclude that, although the general trend in the field of complex traits is to shift from GWAS to next-generation sequencing, important knowledge on infectious disease-related traits can be still gained by GWASs, especially for those conditions that have never been investigated using this approach. We suggest that future studies will benefit from the leveraging of information from the host's and pathogen's genomes, as well as from the exploration of models that incorporate heterogeneity across populations and phenotypes. Interactions within HLA genes or among HLA variants and polymorphisms located outside the major histocompatibility complex may also play an important role in shaping the susceptibility and response to invading pathogens.
Copyright © 2017 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  GWAS; Infectious disease; Response to treatment or vaccine

Mesh:

Year:  2017        PMID: 28951201      PMCID: PMC7106304          DOI: 10.1016/j.meegid.2017.09.028

Source DB:  PubMed          Journal:  Infect Genet Evol        ISSN: 1567-1348            Impact factor:   3.342


General overview

Genome-wide association studies (GWASs) are based on the screening of many genomes to find genetic variants associated with a trait or disease. Both dichotomous and quantitative traits can be analyzed in GWASs, and the genomes can be those of unrelated subjects or derive from samples with familiar structure (e.g., parent-affected child trios). Because GWASs analyze variants distributed throughout the genome, they are unbiased with respect to prior biological knowledge, offering the potential to identify novel variants and genes associated to the trait of interest. The first small-scale GWAS results were performed in 2005–2006 and investigated age-related macular degeneration, a common disease in elderly populations worldwide (Dewan et al., 2006; Klein et al., 2005). After those early efforts, the Wellcome Trust Case Control Consortium large-scale study in 2007 (Wellcome Trust Case Control Consortium, 2007) gave way to the GWAS era. In recent years, hundreds of GWASs have been published. Their results are stored in a dedicated database, the GWAS Catalog, presently hosted by the EMBL-EBI (https://www.ebi.ac.uk/gwas/) (MacArthur et al., 2017). As of April 24th 2017, the catalog comprised 2880 studies and > 9700 unique variant-trait associations. This figure refers to associations with a p value lower than or equal to 5 × 10− 8, which is usually considered as the threshold for genome-wide significance. We mined the catalog to obtain a global overview of the application of GWAS in the field of infectious diseases. We classified studies and associations in broad categories depending on the disease or trait they analyzed. The category “infectious disease” was intended in a very broad sense: for instance, it included virus-associated cancers, specific phenotypes associated with infections (e.g., mood disorders in prion disease), and hematological measurements related to infectious agents (e.g., antibody levels for common infections, cytokine production during septic shock). We also defined categories for response to therapy, by classifying GWAS that analyzed the response to treatments for infectious diseases, non-communicable conditions, and vaccines. Overall, 4%, 0.66%, and 0.31% of all studies in the catalog were classified in the categories “infectious disease”, “response to treatment for infectious disease”, and “response to vaccine”, respectively (Fig. 1 ). In terms of unique variant-trait associations, percentages were as follows: 2.5% for “infectious disease”, 0.18% for “response to treatment for infectious disease”, and 0.66% for “response to vaccine” (Fig. 1). Based on the relative number of studies and associations, it is clear that some traits/diseases have attracted huge efforts that provided consequent reward (e.g., inflammatory/autoimmune disorders, metabolic traits and phenotypes), whereas others have been deeply investigated with comparatively modest success (e.g., neurologic/neuropsychiatric/cognitive diseases and traits, cancers). The reasons for these differences are manifold and outside the scope of this Review, which focuses on infections and related traits. Based on the percentages given above and on the comparison with other traits, we conclude that, despite the huge burden that infections pose for human health, relatively few GWASs for infectious diseases have been performed and these have attained a moderate success. Apparently, the best performance was obtained by GWASs for response to vaccination, with 9 studies that reported 64 associations. However, most of these associations will require validation, as they derive from a relatively small-scale study with no replication (Kennedy et al., 2012).
Fig. 1

Bar-plot representation of studies and associations recorded in the GWAS Catalog (as of April 24th, 2017). Bars represent the number of studies (red) or the number of unique SNP-trait associations (gray). Studies were manually categorized based on the diseases or trait they investigated. Only a subset of traits/diseases are shown. Associations were included only if they displayed an association p value ≤ 5 × 10− 8.

Bar-plot representation of studies and associations recorded in the GWAS Catalog (as of April 24th, 2017). Bars represent the number of studies (red) or the number of unique SNP-trait associations (gray). Studies were manually categorized based on the diseases or trait they investigated. Only a subset of traits/diseases are shown. Associations were included only if they displayed an association p value ≤ 5 × 10− 8. Significant associations with infectious diseases (or related traits) in genome-wide association studies. Articles not in the GWAS catalog (as of April 24th, 2017). Sheer numbers, though, are not the only indicator of success. Important results were obtained by GWASs, both by identifying or refining associations in genes that were a priori expected to modulate infection susceptibility and by pointing out unexpected genes and pathways. Herein we discuss some issues associated with GWASs for infectious diseases, we review the most recent findings ensuing from these studies, and we provide a brief summary of selected GWASs in non-human mammals. Table 1 summarizes the results of human studies that detected at least one associated variant at genome-wide significance, whereas a list of all studies is available as Supplementary Table S1.
Table 1

Significant associations with infectious diseases (or related traits) in genome-wide association studies.

Pathogen/diseaseYearPopulationAssociated lociTrait descriptionReferences
GWAS for infectious diseases
 Viruses
 Dengue virus2011Asian ancestryVietnamMICB, PLCE1Dengue shock syndrome (DSS) susceptibility(Khor et al., 2011)
 Diarrhoeal disease2016European ancestryDiscovery: Netherlands, UK, Spain, GermanyReplication: Norway, UK, U.S., Germany, Netherland, SpainNTN5, SEC1P, FUT2Susceptibility to diarrhoeal disease in young children (1 and 2 years)(Bustamante et al., 2016)
 Epstein-Barr virus (EBV)2013Hispanic ancestryDiscovery: MexicoReplication: MexicoHLA-DRB1, HLA-DQB1EBV seroreactivity measured by Epstein-Barr virus nuclear antigen 1 (anti-EBNA-1) titer(Rubicz et al., 2013)
 Hepatitis B virus (HBV)2009Asian ancestryDiscovery: JapanReplication: Japan and ThailandHLA-DPB1Chronic hepatitis B (CHB) susceptibility(Kamatani et al., 2009)
2010Asian ancestryDiscovery: ChinaReplication: ChinaUBE4B, KIF1B, PGDHBV-related hepatocellular carcinoma (HCC) susceptibility(Zhang et al., 2010)
2011Asian ancestryDiscovery: JapanReplication: JapanHLA-DPA1, HLA-DPB1, HLA-DQB1, HLA-DQB2CHB susceptibility(Mbarek et al., 2011)
2011Asian ancestryChinaGRIN2AHBV progression(Liu et al., 2011)
2012Asian ancestryDiscovery: JapanReplication: Japan and KoreaHLA-DPA1, HLA-DPB1HBV clearance(Nishida et al., 2012)
2012Asian ancestryDiscovery: ChinaReplication: ChinaGRIK1, HLA-DRB1, HLA-DQA1HBV-related HCC(Li et al., 2012)
2013Asian ancestryDiscovery: ChinaReplication: ChinaSTAT4, HLA-DQHBV-related HCC(Jiang et al., 2013)
2013Asian ancestryDiscovery: KoreaReplication: KoreaHLA-DP, HLA-DQ, EHMT2, TCF19CHB susceptibility(Kim et al., 2013)
2013Asian ancestryDiscovery: ChinaReplication: ChinaHLA-DQB2, HLA-C, UBE2L3CHB susceptibility(Hu et al., 2013)
2014Asian ancestryDiscovery: ChinaReplication: ChinaHLA-DQA2, HLA-DQB2, HLA-DPB1, HLA-DPA3CHB in males(Chang et al., 2014)
2015Asian ancestryDiscovery: ChinaReplication: ChinaHLA-C, CFB, NOTCH4, HLA-DOA, CD40, HLA-DQB1, HLA-DQB2, HLA-DPA1, HLA-DPB1CHB susceptibility(Jiang et al., 2015)
2016Asian ancestryDiscovery: ChinaReplication: Chinars7000921 (locus 8p21.3)CHB susceptibility(Li et al., 2016)
2017aAsian ancestryDiscovery: ChinaReplication: ChinaHLA-DRHCV-related acute-on-chronic liver failure (ACLF) susceptibility(Tan et al., 2017)
 Hepatitis C virus (HCV)2010European ancestryGermany, SwitzerlandIFNL2 (previously known as IL28A), IFNL3/IFNL4, IFNL1 (previously known as IL29)Chronic hepatitis C (CHC) susceptibility(Rauch et al., 2010)
2011Asian ancestryDiscovery: JapanReplication: JapanHLA-DQ, HLA-DR, MICAHCV-related HCC(Kumar et al., 2011)
2011Asian ancestryDiscovery: JapanReplication: JapanDEPDC5HCV-related HCC(Miki et al., 2011)
2012European ancestryDiscovery: France, SwitzerlandReplication: France, Germany, Italy, UK, U.S., AustraliaRNF7, MERTKLiver fibrosis progression related to HCV infection(Patin et al., 2012)
2013Asian ancestryDiscovery: JapanReplication: JapanC6orf10, BTNL2, BTNL2-HLA-DRACHC-induced liver cirrhosis (LC)(Urabe et al., 2013)
2013European, African and Afro-Caribbean ancestryDiscovery: Austria, France, Germany, Greece, UK, U.S.Replication: U.S., EgyptIFNL4, HLA-DQSpontaneous resolution of HCV infection(Duggal et al., 2013)
2013Asian ancestryDiscovery: JapanReplication: JapanHLA-DQB1, HLA-DQA1CHC susceptibility(Miki et al., 2013)
2014European ancestryDiscovery: U.S., France, UK, ItalyReplication: U.S., France, ItalyHLA-DQA1, HLA-DRB1, NOTCH4Susceptibility to mixed cryoglobulinemia related to CHC(Zignego et al., 2014)
2017Asian ancestryDiscovery: JapanReplication: JapanTLL1HCC development after HCV eradication(Matsuura et al., 2017)
 HCV and HIV co-infection2016European ancestryFranceOXTR, MAP1LC3BP1Liver fibrosis progression(Ulveling et al., 2016)
 Varicella Zooster virus (VZV)2015European, African, Hispanic ancestryU.S.HCP5, HLA-B, DHFRHerpes zoster susceptibility(Crosslin et al., 2015)
 Human Immunodeficiency Virus (HIV)2007European ancestryDiscovery: Denmark, Italy, Spain, Switzerland, AustraliaReplication: Denmark, Italy, Spain, Switzerland, AustraliaHCP5, HLA-CHIV-1 viral load at set pointHIV-1 viral load at set point(Fellay et al., 2007)
2009European ancestryDiscovery: FranceReplication: Denmark, Italy, Spain, Switzerland, AustraliaHCP5Progression to AIDS(Limou et al., 2009)
2009European ancestryU.S.HCP5, HLA-B, HLA-CHIV-1 viral load at set point(Fellay et al., 2009)
2010European ancestryU.S.RYR3Susceptibility to atherosclerosis(Shrestha et al., 2010)
2010European, African (African American or Afro-Caribbean), and Hispanic ancestryCanada, U.S., AustraliaHLA-C, MICA, HLA-B, HCP5, PSORS1C3HIV-1 viral load control(International HIV Controllers Study et al., 2010)
2011European ancestryU.S.PARD3BProgression to AIDS(Troyer et al., 2011)
2015European, African, Hispanic ancestry (NR)ACKR1 for Neutrophil count in HIV-infection; UGT1A, UGT1A1, UGT1A3, UGT1A4, UGT1A5, UGT1A6, UGT1A7, UGT1A8, UGT1A9, UGT1A10, MROH2A for Total bilirubin levels in HIV-infectionMultiple phenotypes measured by standard clinical laboratory practice(Moore et al., 2015)
2015European ancestryU.S., Australia, France, NetherlandsMICA, CCRL2, HLA-B, HLA-CHIV-1 viral load at set point(McLaren et al., 2015)
 Human Papillomavirus (HPV)2011European and Hispanic ancestryDiscovery: Czech Republic, Hungary, Poland, Romania,Russian Federation, SlovakiaReplication: Argentina, Brazil, CubaHLA-DQB1HPV seroconversion(Chen et al., 2011)
2013Asian ancestryDiscovery: ChinaReplication: ChinaEXOC1, HLA-DPB2, GSDMB, rs9277952 (locus:6p21.32)HPV-related cervical carcer(Shi et al., 2013)
2013European ancestryDiscovery: SwedenReplication: SwedenMICA, HLA-DRB1, HLA-DQA1, HLASusceptibility to cervical cancer.(Chen et al., 2013)
2016European ancestryDiscovery: SwedenReplication: Swedenrs73730372 (locus:6p21.32)Susceptibility to cervical cancer.(Chen et al., 2016)
 Influenza A virus (H1N1)2015European ancestrySpainSee Supplementary Table S1Susceptibility to influenza A (H1N1) infection and disease severity(Garcia-Etxebarria et al., 2015)
 Bacteria
 Helicobacter pylori2013European ancestryGermany and NetherlandsTLR-1, HSPA6/FCGR2A/BH. pylori seroprevalence defined using anti–H. pylori serum IgG antibody titers(Mayerle et al., 2013)
 Mycobacterium tuberculosis2010African ancestryDiscovery: Ghana, The GambiaReplication: Ghana and Malawirs4331426 (locus 18q11.2)Tuberculosis susceptibility(Thye et al., 2010)
2012African, Asian and European ancestryDiscovery: GhanaReplication: The Gambia, Indonesia and RussiaWT1Tuberculosis susceptibility(Thye et al., 2012)
2014African ancestry and otherThe Gambia, Ghana, Malawi, South AfricaSMOC1, WT1Tuberculosis susceptibility(Chimusa et al., 2014)
2015European and African ancestryDiscovery: RussiaReplication: Russia, The Gambia and GhanaASAP1Tuberculosis susceptibility(Curtis et al., 2015)
2016aAfrican ancestryUganda and TanzaniaIL-12Tuberculosis resistance in HIV-positive individuals(Sobota et al., 2016)
 Subgengival bacterial biofilm (e.g.: Porphyromonas gingivalis)2010European ancestryDiscovery: GermanyReplication: NetherlandsGLT6D1Periodontitis susceptibility(Schaefer et al., 2010)
2016European ancestryDiscovery: U.S.Replication: U.S. and GermanyCLEC19A, TRA, GGTA2P, TM9SF2, IFI16, RBMS3, C1QTNF7, TSNARE, HPVC1, SLC15A4, PKP2, SNRPNPeriodontal disease-related phenotypes(Offenbacher et al., 2016)
2017Hispanic ancestry, European and African (African-American and Afro-Caribbean) ancestryDiscovery: Hispanics/Latinos AmericaReplication: U.Srs149133391 (locus 1q42.2)Periodontitis susceptibility(Sanders et al., 2017)
 Mouth carbohydrate-fermenting bacteria (e.g.: Streptococcus mutans)2013European ancestry (NR)AJAP1, LYZL2Dental caries-related phenotypes(Shaffer et al., 2013)
2014European ancestryU.S.KPNA4 for Pit-and-Fissure cariesDental caries susceptibility(Zeng et al., 2014)
2016Hispanic/Latino ancestryU.S.NAMPT, BMP7Dental caries susceptibility(Morrison et al., 2016)
 Mycobacterium leprae2009Asian ancestryDiscovery: ChinaReplication: ChinaRIPK2, TNFSF15, LACC1, NOD2, HLA-DR, CCDC122Leprosy susceptiility and progression(Zhang et al., 2009)
2011Asian ancestryDiscovery: ChinaReplication: ChinaIL23R, RAB32, CYLDLeprosy susceptiility(Zhang et al., 2011)
2015Asian ancestryDiscovery: ChinaReplication: ChinaIL23R, IL1RL1, IL12B, HLA-DRB1, RAB32, TNFSF15, NOD2, BATF3, CDH18, DEC1, EGR2, CCDC88B, CIITA, SIGLEC5, RIPK2, LACC1, rs16948876 (locus 16q12.1)Leprosy susceptiility(H. Liu et al., 2015)
2016Asian ancestryDiscovery: ChinaReplication: ChinaSYN2, BBS9, CTSB, MED30Leprosy susceptiility(Wang et al., 2016)
 Neisseria meningitidis2010European ancestryDiscovery: UKReplication: Austria, Netherlands, SpainCFH, CFHR3Susceptibility to meningococcal disease(Davila et al., 2010)
2016European ancestryDiscovery: SpainReplication: Spain, UKCFHSusceptibility to meningococcal disease(Martinon-Torres et al., 2016)
 Propionibacterium acnes2014Asian ancestryDiscovery: ChinaReplication: ChinaDDB2, gene cluster including SELL, SELP and SELESevere acne susceptibility(He et al., 2014)
2014European ancestryDiscovery: UKReplication: UKFST, TGFB2, OVOL1Severe acne susceptibility(Navarini et al., 2014)
 Salmonella typhi2014Asian ancestryDiscovery: VietnamReplication: Vietnam and NepaleseHLA-DRB1Enteric fever susceptibility(Dunstan et al., 2014)
 Staphylococcus aureus2015Hispanic ancestryMexicoKAT2B for intermittent carriageS. aureus nasal carriage(Brown et al., 2015)
2016European ancestryU.S.HLA-DRB1 for all S. aureus infectionsSusceptibility to S. aureus and to skin and soft tissue S. aureus infections (SSTIs)(DeLorenze et al., 2016)
 Streptococcus pneumoniae2016African ancestryDiscovery: KeniaReplication: KeniaAC00600.5 and AC011288.2 (two overlapping intergenic non-coding RNA genes)Pneumococcal bacteremia(Kenyan Bacteraemia Study Group et al., 2016)
 Parasites
Leishmania donovani/Leishmania infantum chagasi2013Asian and Hispanic ancestryDiscovery: India, BrazilReplication: IndiaHLA-DRB1-HLA-DQA1Visceral leishmaniasis susceptibility(LeishGEN Consortium et al., 2013)
 Plasmodium falciparum2009African ancestryDiscovery: The GambiaReplication: The GambiaHBBSevere malaria susceptibility(Jallow et al., 2009)
2012African ancestryDiscovery: The GambiaReplication: The Gambia and GhanaHBB, ABO, ATP2B4, MARVELD3Severe malaria susceptibility(Timmann et al., 2012)
2013African ancestryThe Gambia, Kenya, MalawiHBB, ABOSevere malaria susceptibility(Band et al., 2013)
2015African ancestryDiscovery: Kenya, Malawi, The GambiaReplication: Malawi, United Republic of Tanzania, Cameroon, Burkina Faso, The Gambia, Ghana, MaliHBB, ABO, ATP2B4, FREM3/GYPE regionSevere malaria susceptibility(Malaria Genomic Epidemiology Network et al., 2015)
 Prion
 Prion (Creutzfeldt-Jakob disease)2009European and Oceanian ancestryDiscovery: UKReplication: UK and Papua New GuineaPRNPSusceptibility to variant CJD(Mead et al., 2009)
2011European ancestryDiscovery: UKReplication: France, UKPRNP, MTMR7, NPAS2Susceptibility to variant CJD(Sanchez-Juan et al., 2012)
2015European ancestryDiscovery: Germany, Netherlands, UKReplication: Australia, Austria, France, Germany, Netherlands, Italy, SpainPRNP, GRM8Susceptibility to sporadic CJD(Sanchez-Juan et al., 2015)



GWAS for response to treatment for infectious diseases
 Viruses
 Hepatitis C virus (HCV)2009European, African (African American or Afro-Caribbean) and Hispanic ancestry (NR)IFNL3/IFNL4 (European Ancestry)Response to antiviral treatment with pegylated interferon-alpha combined with ribavirin (pegIFNα/ribavirin)(Ge et al., 2009)
2009Asian ancestryDiscovery: JapanReplication: JapanIFNL3/IFNL4Response to antiviral treatment with (pegIFNα/ribavirin)(Tanaka et al., 2009)
2009European ancestryDiscovery: AustraliaReplication: Germany, Italy, UK, AustraliaIFNL3, IFNL3/IFNL4Response to antiviral treatment with (pegIFNα/ribavirin)(Suppiah et al., 2009)
2010European ancestryGermany, SwitzerlandIFNL2, IFNL3/IFNL4, IFNL1Response to antiviral treatment with (pegIFNα/ribavirin)(Rauch et al., 2010)
2010European, African American and Hispanic ancestryU.S.ITPASusceptibility to treatment-related anaemia(Fellay et al., 2010)
2010Asian ancestryDiscovery: JapanReplication: JapanITPASusceptibility to treatment-related anaemia(Ochi et al., 2010)
2011Asian ancestryTaiwan, JapanIFNL3/IFNL4HCV treatment response(Ochi et al., 2011)
2011Asian ancestryDiscovery: JapanReplication: NRDDRGK1, ITPA for Hb level and platelets countSusceptibility to pegIFNα-induced thrombocytopenia and to RBV-induced anaemia(Tanaka et al., 2011)
2012European and African (African-American and Afro-Caribbean) and Hispanic ancestryU.S.ITPASusceptibility to thrombocytopenia, neutropenia, and leukopenia related to pegIFNα treatment(Thompson et al., 2012)
2012Hispanic, European, and African (African American or Afro-Caribbean) (NR)IFNL3/IFNL4Lipid levels during and after HCV infection treatment and the interactions with sustained viral response (SVR)(Clark et al., 2012)
 Human Immunodeficiency Virus (HIV)2011Asian ancestryDiscovery: ThailandReplication: ThailandCCHCR1Susceptibility to Nevirapine-induced rash(Chantarangsu et al., 2011)
2014European, Hispanic, African (African American or Afro-Caribbean) ancestryU.S.RSPO4 (hispanic - Grade ≥ 2); IL2RA, CXCL12, rs9501753 (locus:6p25.3), PTPRB (European ancestry – Grade ≥ 3); TANK, HDAC9, rs801350 (locus:2q32.3), rs801378 (locus:2q32.3) (Hispanic - Grade ≥ 3)Susceptibility to peripheral neuropathy related to d4T/ddI-containing treatment(Leger et al., 2014)
2015aEuropean, African, and Hispanic ancestry (NR)SLC17A1Plasma Tenofovir and creatinine clearance after TDF/emtricitabine containing regimens(Wanga et al., 2015)



GWAS for response to vaccine
 Viruses
 Measles, Mumps and Rubella vaccine2014European ancestryU.S.ACO1, PTPRDImmune responses to rubella vaccination(Kennedy et al., 2014)
2014European ancestryDiscovery: DenmarkReplication: DenmarkIFI44L, rs1318653 (locus:1q32.2), CD46, ANO3, SCN1A, rs11105468 (locus:12q21.33), SCN2A for febrile seizures; IFI44L, SCN1A, ANO3, CD46, CD34 for febrile seizures MMR vaccine-related; ANO3, SCN1A for febrile seizures MMR vaccine-unrelatedSusceptibility to general and measles, mumps and rubella (MMR) vaccine-related febrile seizures(Feenstra et al., 2014)
 Hepatitis B vaccine2011Asian ancestryDiscovery: IndonesiaReplication: IndonesiaHLA-DR, HLA, HLA-DPB1Immune response to HBV vaccine(Png et al., 2011)
2014Asian ancestryDiscovery: ChinaReplication: ChinaHLA-DRA, BTNL2, HLA-DRB1, C6orf10Immune response to HBV vaccine(Pan et al., 2014)
 Smallpox vaccine2012Hispanic, European and African (African-American and Afro-Caribbean) ancestryU.S.MKX, rs10503951 (locus: 8p12), GPR158, ZHX2, SPIRE1 (African ancestry); PCDH15, PRKCQ (Hispanic ancestry)Immune response to smallpox vaccine(Ovsyannikova et al., 2012)
2012European and African (African-American and Afro-Caribbean) ancestryU.S.See Supplementary Table S1Cytokine responses to smallpox vaccine(Kennedy et al., 2012)

Articles not in the GWAS catalog (as of April 24th, 2017).

GWAS evolution

GWASs rely on linkage disequilibrium between genotyped SNPs and causal variants (often ungenotyped). The number of variants on the SNP chips that are used for GWASs has increased sensibly since the development of this technology (from ~ 500,000 in the initial studies to > 4,000,000 in the latest platforms). Moreover, the availability of an increasing number of fully sequenced genomes, together with the development of efficient imputation methods, has definitely expanded the power of GWASs to detect association with variants in the low range of the frequency spectrum (reviewed in (Visscher et al., 2017)). However, the overall power of a GWAS is also critically dependent on the sample size (reviewed in (Visscher et al., 2017) and see below) and in most cases power remains low for variants with a frequency below 1% (reviewed in (Visscher et al., 2017)). Notably, though, important developments have been achieved in terms of imputation of variants within the Major Histocompatibility Complex (MHC), with clear relevance for the investigation of infectious (and immuno-mediated) diseases. As recently reviewed elsewhere (Matzaraki et al., 2017), the development of large reference panels (at least for Europeans and Asian populations) and the use of advanced imputation methods allowed the fine mapping of human leukocyte antigen (HLA) variants as susceptibility factors for some infectious diseases (e.g., HIV-1, HCV, and HBV infection) and many more autoimmune conditions (Matzaraki et al., 2017). Recently, an imputation algorithm for typing KIR (Killer-cell immunoglobulin-like receptor) gene copy number has also been developed and applied to data from European populations (Vukcevic et al., 2015). However, KIR variability extends beyond copy number variation and next-generation sequencing technologies were proposed to allow high-resolution genotyping of KIR loci (Maniangou et al., 2017; Norman et al., 2016). Finally, we mention that in 2009 a SNP chip (Immunochip) providing high coverage of immune-related genes, including HLA and KIR loci, came into use to fine-map and validate association signals for immune-mediated diseases (Cortes and Brown, 2011). The application of the Immunochip in the field of infectious diseases has been very limited, with the notable exception of a study that investigated the genetic susceptibility to candidaemia and identified three risk variants (Kumar et al., 2014).

Issues and caveats of GWASs for infectious diseases and related traits

Some considerations apply to GWASs, whether they investigate infectious diseases or other conditions/traits. Although the sample size that is necessary to achieve an adequate statistical power depends on the genetic architecture of the analyzed trait, GWASs typically require large samples of individuals. This is because thousands (millions) of variants are simultaneously analyzed, making it difficult to disentangle true associations from false positives. Ideally, GWASs should enroll subject of shared and homogeneous ethnical ancestry, to avoid issues of population stratification, which can in turn produce spurious associations if not properly corrected. Moreover, phenotype heterogeneity (Bennett et al., 2011), environmental confounders (Pearson and Manolio, 2008), as well as misclassification of cases and controls (Pearson and Manolio, 2008) have been shown to affect GWAS reliability and power. Some of these issues may be particularly relevant in the field of infectious diseases.

Sample size, population structure, and environmental confounders

Large populations samples are often difficult to recruit in developing countries, due to limited resources, inadequate health facilities, and inefficient screening or diagnostic procedures. In these countries, however, the burden imposed by infectious diseases is highest (Jones et al., 2008; Mabey et al., 2004). Also, some populations in developing areas have complex demographic histories. This is the case of African populations, that are genetically highly diverse and display a limited extension of linkage disequilibrium (LD) compared to non-Africans (International HapMap Consortium et al., 2007). For instance, one of the first GWASs for infectious diseases genotyped individuals in West Africa to identify susceptibility variants for severe malaria (Jallow et al., 2009). The authors found considerable population stratification, which was corrected for using principal components analysis (PCA); signals of association at known malaria resistance loci (e.g. HBB) were difficult to retrieve due to weak LD between causal variants (e.g., HbS locus) and tag SNPs (Jallow et al., 2009). Populations from South and Central America also display relevant population structure due to recent admixture (Bryc et al., 2010). In a GWAS for visceral Leishmaniasis that included families from Brazil, ancestry differences and close relationships were efficiently corrected for using a linear mixed model (LeishGEN Consortium et al., 2013). These models are an alternative to those based on PCA and rely on the incorporation of genetic relatedness between individuals directly in the statistical model (see (Eu-Ahsunthornwattana et al., 2014; Hayes, 2013; Yang et al., 2014) for reviews and comparison among different linear mixed model methods). Another issue often related to population structure is that of environmental confounders (Vilhjalmsson and Nordborg, 2013). A well-known example in the field of non-communicable diseases relates to type 2 diabetes (T2D) susceptibility. Several studies for T2D indicated that the proportion of Native American genetic ancestry is associated with lower socioeconomic status in admixed Latino populations from North and South America (Chakraborty et al., 1986; Florez et al., 2009; Martinez-Marignac et al., 2007; Parra et al., 2004). This effect is partially responsible for the generally higher incidence of T2D in Latinos. This exemplifies how the combined effect of genetic admixture and environmental factors has the potential to affect genetic associations. Indeed, poor socioeconomic conditions represent a risk factor for several infectious diseases (Baker et al., 2012; Braveman, 2011; Franco-Paredes et al., 2007; May, 2007; Semenza, 2010). Thus, possible environmental confounders must be accounted for to avoid spurious associations or loss of statistical power. However, even in the presence of admixture and environmental confounders, associations can be retrieved with confidence. In a GWAS for tuberculosis (TB), Chimusa and coworkers analyzed admixed South African Coloured case-control cohorts (Chimusa et al., 2014). After accounting for population stratification and hidden relatedness, they replicated a previously reported association at the WT1 gene (Thye et al., 2012) and performed trans-ethnic fine mapping of the association signals (Table 1). The authors also found a positive correlation between San (an African ethnic group) ancestry proportion and TB status. Although complete information was available for a minority of individuals, no correlation was detected between socioeconomic status and ancestry components, leading the authors to suggest that the association between San ancestry and TB is not merely explained by differences in socioeconomic conditions (Chimusa et al., 2014).

Phenotype heterogeneity and case/control misclassification

Perhaps, the two most important points that characterize GWAS for infectious diseases compared to studies of non-communicable conditions are phenotypic heterogeneity and case-control misclassification. An important source of heterogeneity in GWASs for infectious diseases derives from genetic variation in the pathogen. One of the best known examples of how the genotype of the infectious agent can interact with the host genome refers to hepatitis C virus. Among the most significant findings of GWASs for infectious diseases was the identification in 2009–2010 of variants at the IFNL3/IFNL4 loci (previously known as IL28B) that associate with spontaneous clearance of HCV infection and with response to interferon-α/ribavirin therapy (Ge et al., 2009; Rauch et al., 2010; Suppiah et al., 2009; Tanaka et al., 2009) (Table 1). This finding set the basis for a personalized treatment of HCV infection (Matsuura et al., 2014). However, the effect of the IFNL3/IFNL4 variants on response to therapy is much stronger for patients infected with HCV genotypes 1 and 4 than for those infected with genotypes 2, 3, and 6 (Rauch et al., 2010; Akkarathamrongsin et al., 2014). Interestingly, interferon-α/ribavirin therapy is less effective for patients infected with genotypes 1 and 4 compared to the other genotypes (European Association for the Study of the Liver, 2011). Thus, the prognostic value of the IFNL3/IFNL4 variants is particularly relevant for patients infected with HCV genotypes that are poorly responsive to this treatment (Rauch et al., 2010). These observations well exemplify how important the interaction of the host and pathogen genomes can be. Very recently, a genome-to-genome GWAS for HCV infection was performed, shedding more light into the interaction between human IFNL4/IFNL3 variants and HCV genetic diversity (Ansari et al., 2017) (see Conclusions and perspectives section). Another notable example of how pathogen heterogeneity can interact with the host genotype refers to TB susceptibility. The human Mycobacterium tuberculosis complex (MTBC) consists of several major phylogenetic lineages whose names reflect their association with geographic areas (Brites and Gagneux, 2015). Candidate gene approaches have shown that host genetic variants can modulate susceptibility to TB caused by specific MTBC strains (Herb et al., 2008; Caws et al., 2008; Intemann et al., 2009; van Crevel et al., 2009; Salie et al., 2014). For instance, a variant in TLR2 was specifically associated with an increased risk of TB caused by the Beijing strain in a Vietnamese population, whereas two SLC11A1 polymorphisms were found to be significantly more common in patients having tuberculosis caused by M. tuberculosis Beijing genotype strains than in patients carrying other genotypes (Caws et al., 2008; van Crevel et al., 2009). In populations from Ghana, an IRGM polymorphism was found to protect against disease caused by the Euro-American lineage (Intemann et al., 2009), and polymorphisms in ALOX5 were more strongly associated with TB caused by M. africanum West African 2 strain (Herb et al., 2008). Furthermore, associations of specific HLA class I alleles and disease caused by the Beijing, LAM, LCC and Quebec strains, as well as by the Euro-American or East Asian lineages, were found in a South African population (Salie et al., 2014). Therefore, the genetic heterogeneity of MTBC may limit the reproducibility of GWASs, as different cohorts may indeed include patients with distinct TB epidemiologies and with TB caused by different M. tuberculosis strains. Finally, misclassification of cases or controls is an issue for several association studies. In the case of infections, a major problem is the difficulty to assess exposure in the control population. For instance, it is sometimes impossible to determine whether individuals recruited as uninfected controls (e.g., based on serology) are seronegative because they were exposed but did not acquire the infection or because they were never or rarely exposed to the pathogen. Cases are also sometimes problematic to define. For instance, seropositivity can result from different routes of exposure, in turn associated with distinct probabilities of infection, and possibly modulated by diverse genetic factors. Moreover, the probability of infection may depend on several factors, including the genetic characteristics of the pathogen, the infectiousness of the transmitting individual, the general health status at the moment of exposure (this is clearly important for nosocomial infections, for instance). Some of these issues are evident in a GWAS for susceptibility to HIV-1 infection. McLaren and coworkers combined information from multiple studies to obtain case-control cohorts of > 6300 HIV-1 positive cases and 7200 general population HIV-1 negative controls (McLaren et al., 2013). Cases derived from different studies and were infected via diverse routes (i.e., parenteral and sexual). No information was available for controls on HIV-1 exposure (e.g., at risk behavior or possible parenteral exposure). The study failed to detect association with HIV infection susceptibility other than the known CCR5Δ32 variant. An association signal at the HLA-B/HLA-C genes was not confirmed after correcting for frailty (survival) bias (McLaren et al., 2013). These results led to the conclusion that genetic variants that modulate HIV-1 acquisition are either rare or have small effects. However, as the authors note, the study design has a high potential for misclassification of both cases and controls (McLaren et al., 2013). Indeed, cases may include subjects with low HIV-1 susceptibility who were infected due to very high exposure (e.g. via multiple blood transfusions), whereas controls may comprise subjects who never exposed themselves to the virus. Because studies on high-risk populations indicated that the proportion of subjects who are naturally resistant to HIV-1 is around 20% (Plummer et al., 1999; Fowke et al., 1996), it can be hypothesized that as many as 80% of controls would seroconvert if exposed. McLaren and coworkers estimated that the sample size of their GWAS provided 80% power for variants with 0.1 allele frequency and genotype relative risk (GRR) of 1.3. Their calculations also show that if ~ 30%, ~ 60%, or ~ 80% of cases were misclassified as controls, GRRs of 2, 3, or 4 would be respectively necessary to achieve the same power (McLaren et al., 2013). In fact, the authors did detect association for the CCR5Δ32 variant, indicating that polymorphisms with strong effect could be identified. Another situation that may potentially lead to classification biases is reinfection. For instance, HCV reinfection is relatively common in high-risk groups (e.g., drug users) (Grebely et al., 2009; Grebely et al., 2012; Midgard et al., 2016). If individuals are classified as persistently infected (chronic hepatitis C, CHC) based on viral loads measurements with relatively long intervals between tests, cases of clearance and reinfection can potentially be classified as persistent infections (Grebely et al., 2009; Grebely et al., 2012).

GWAS for viral infections

HIV-1

In 2007, Fellay and coworkers performed the first GWAS for an infectious disease by investigating viral set-point in HIV-1 infection (Fellay et al., 2007). This pilot study paved the way for several other GWASs. Indeed, the two signals detected within the HLA class I gene region were since confirmed by several studies (Fellay et al., 2009; Limou et al., 2009; International HIV Controllers Study et al., 2010) (Table 1, Fig. 2 ). Additional GWASs focused on HIV-1 viral load, progression to AIDS, and susceptibility to infection (Table 1). Their results have been extensively reviewed elsewhere (van Manen et al., 2012).
Fig. 2

A. Schematic representation of adaptive and innate immunity pathways derived by the Reactome database (http://www.reactome.org/), version (v61) (Fabregat et al., 2016; Milacic et al., 2012) (StableIDs: R-HSA-1280218 and R-HSA-168249).

Black boxes show sub-pathway names and contributing genes (as derived from Reactome): only genes that carry genetic variants associated in GWASs are reported (see Table 1). Color codes are shown in the legend. B. Schematic view of HLA class I and II loci. SNPs reported are associated to different traits related to pathogen (see Table 1). Asterisk denote association to multiple pathogens. Color codes and genomic coordinates (hg19) are reported.

A. Schematic representation of adaptive and innate immunity pathways derived by the Reactome database (http://www.reactome.org/), version (v61) (Fabregat et al., 2016; Milacic et al., 2012) (StableIDs: R-HSA-1280218 and R-HSA-168249). Black boxes show sub-pathway names and contributing genes (as derived from Reactome): only genes that carry genetic variants associated in GWASs are reported (see Table 1). Color codes are shown in the legend. B. Schematic view of HLA class I and II loci. SNPs reported are associated to different traits related to pathogen (see Table 1). Asterisk denote association to multiple pathogens. Color codes and genomic coordinates (hg19) are reported. In recent years, GWASs for HIV-1 infection have also focused on understanding the host determinants of response to standard and experimental therapies, as well as on detecting variants responsible for susceptibility to treatment-related adverse effects (Table 1 and Supplementary Table S1) (Chantarangsu et al., 2011; Leger et al., 2014; Wanga et al., 2015) (Table 1 and Supplementary Table S1).

HBV and HCV

Susceptibility to Hepatitis B and Hepatitits C viruses (HBV and HCV) was also extensively investigated in GWASs. Worldwide, chronic HBV or HCV infections represent the major causes of progressive liver disease, including liver cirrhosis (LC) and hepatocellular carcinoma (HCC) (Shirvani-Dastgerdi et al., 2016). HBV predominantly affects Southeast Asia and the East Pacific, where it reaches the highest prevalence in the world (Custer et al., 2004). It is thus not surprising that most GWASs performed to date genotyped individuals of Asian ancestry (Table 1). Genetic variations in HLA class II loci (HLA-DP and HLA-DQ) were significantly associated with susceptibility to chronic HBV infection and with HBV-related HCC (Mbarek et al., 2011; Kamatani et al., 2009; Nishida et al., 2012; Jiang et al., 2013; Jiang et al., 2015; Li et al., 2012; Kim et al., 2013; Hu et al., 2013; Chang et al., 2014; Tan et al., 2017) (Table 1, Fig. 2). Interestingly, host genetic variants in HLA class II genes were also found in association with hepatitis B vaccine response (Png et al., 2011; Pan et al., 2014) (Table 1). In 2015, Jiang et al. (2015) performed the largest GWAS on a Chinese population including about 18,400 among cases (chronically-infected) and controls (seronegative). The study confirmed most of the previously reported associations in HLA class II genes and detected a new variant, rs1883832 in CD40, strongly associated with HBV chronic infection (Table 1, Fig. 2). rs1883832 influences CD40 expression by affecting its translational efficiency (Jacobson et al., 2005). This association was confirmed, although not at genome-wide significant level, by a recent GWAS that analyzed chronically HBV-infected and spontaneously recovered subjects (Li et al., 2016). In this study, the authors also identified an intergenic variant at 8p21.3 which represents an expression quantitative trait locus (eQTL) for the INTS10 (integrator complex subunit 10) gene (Table 1). Interestingly, the authors demonstrated that INTS10 suppresses HBV replication in liver cells via IRF3, and confirmed decreased INTS10 protein levels in plasma samples of chronically HBV-infected patients compared to subjects who spontaneously cleared infection (Li et al., 2016). Although these studies have opened up new horizons of research, analyses in populations of different ancestries are strictly required. As mentioned above, a landmark discovery of GWASs for HCV infection was the identification of the IFNL3/IFNL4 association. In addition to this locus, variants in HLA class II loci were also found to affect viral clearance and disease progression in chronic HCV infection (Table 1, Fig. 2). In addition, polymorphisms in apoptosis-related genes such as RNF7, TULP1, and MERTK were associated with liver fibrosis, whereas variants in DEPDC5 and MICA were described as modulators of HCV-related hepatocellular carcinoma risk (Table 1, Fig. 2).

Other widespread viral infections

The GWA strategy was also exploited to investigate other infections caused by viral pathogens with worldwide diffusion. These include Human Papillomavirus (HPV), Varicella Zooster virus (VZV), and Influenza A virus (IAV). Overall, these works unveiled a limited number of associations, sometimes as a consequence of small sample sizes and, consequently, modest statistical power. In the case of HPV, most GWASs for susceptibility to infection and progression to cervical carcinoma confirmed associations to HLA class II genes (Table 1, Fig. 2). It should be noted, however, that some of these studies did not directly test HPV infection status, nor HPV type, making it difficult to determine which trait these associations refer to. An interesting observation is that polymorphisms in the EXCO1 and GSDMB genes were associated with progression to cervical carcinoma (Shi et al., 2013) (Table 1). EXOC1 encodes a subunit of the exocyst complex involved in host innate immune response against DNA antigens (Ishikawa et al., 2009), whereas GSDMB codes for the cancer-associated gasdermin-like protein, which is highly expressed in cervical cancer cells (Sun et al., 2008). These data suggest that both immunological and carcinogenic factors contribute to the risk of cervical cancer development. With respect to VZV, a GWAS was performed to investigate risk factors for herpes zoster (Crosslin et al., 2015). Although the study had limited power and potential for control misclassification (e.g., individuals who never suffered from primary infection with VZV are not at risk of virus reactivation), a significant association was detected in the MHC region where the non-coding HCP5 (HLA Complex P5) gene maps (Table 1). Intriguingly, variants in this region were also associated with delayed progression to AIDS (Table 1, Fig. 2). HCP5 derives from an endogenous retroviral element with sequence homology to the HIV-1 pol gene (Kulski and Dawkins, 1999) and is primarily expressed in immune system cells (Liu et al., 2008). The function of this noncoding gene and its role (if any) in restricting viral infection remain to be determined. In fact, a functional study showed that HCP5 does not restrict HIV-1 infection in vitro (Yoon et al., 2010), and refinement of the association signals within the HLA region indicated that the HCP5 variant associated with HIV-1 control is indeed a marker for the HLA-B*5701 allele (Fellay et al., 2009; Trachtenberg et al., 2009). Likewise, HCP5 variants that reached genome-wide significance in the VZV GWAS are in strong linkage disequilibrium with polymorphisms in the HLA-B gene region (Crosslin et al., 2015). It is thus possible that HCP5 has no effect on viral infection per se, but its variants tag specific HLA class I alleles/haplotypes. As for IAV, a single study analyzed susceptibility to H1N1 virus infection (Garcia-Etxebarria et al., 2015). The authors found no genome-wide significant associations when severe influenza cases were compared to mild cases. Conversely, several SNPs reached the significance threshold when severe and mild cases were compared to the general population (Table 1 and Supplementary Table S1). However, these associations were obtained on very small samples of patients and controls, and no replication cohort was analyzed. Finally, we draw attention to a recent GWAS that searched for genetic variants associated with diarrhoeal episodes in young children (1–2 years old) (Bustamante et al., 2016). All subjects were recruited in developed countries, where the majority of diarrhoeal episodes are caused by viral infections (Wiegering et al., 2011). Despite the potential issues of phenotype heterogeneity, a significant association was found for variants in FUT2, a gene encoding an alpha (1,2)-fucosyltransferase which participates in the production of histo-blood group antigens (Table 1). In particular, the activity of FUT2 determines the expression of the ABO histo-blood group antigens on the gastrointestinal mucosa and in bodily secretions. Genetic diversity at the FUT2 gene is maintained in human populations by balancing selection (Koda et al., 2001; Fumagalli et al., 2009) and common FUT2 null alleles are present in many populations (Kelly et al., 1995; Koda et al., 1996; Liu et al., 1998). In homozygotes, these alleles determine the “non-secretor” phenotype. In the GWAS for diarrhoeal episodes, non-secretors were found to be at lower risk compared to secretors, a result in line with candidate-gene studies that associated the non-secretor status with protection from Rotavirus and Norovirus infections (which represent the leading causes of diarrhoeal disease in developed regions) (Wiegering et al., 2011; Lindesmith et al., 2003; Carlsson et al., 2009; Imbert-Marcille et al., 2014; Thorven et al., 2005). In line with the balancing selection scenario, however, candidate-gene studies associated the non-secretor status with higher risk for infection by different pathogens, including Neisseria meningitidis, Streptococcus pneumoniae, and Candida spp. (Blackwell et al., 1986a; Blackwell et al., 1986b; Chaim et al., 1997). More recently, non-secretors were also found to be at higher risk to develop Crohn's disease (Franke et al., 2010; McGovern et al., 2010). The molecular mechanism underlying the different predisposition to infection (and to Crohn's disease) of secretors and non-secretors most likely stems from the fact that both pathogenic and commensal microorganisms exploit oligosaccharides on the gastrointestinal mucosa for cytoadherence.

GWASs for bacterial infections

Studies on the genetic susceptibility to bacterial infections have investigated a wide variety of conditions, either resulting from specific infections (e.g., tuberculosis, pneumococcal bacteremia) or deriving from more heterogenous situations (e.g., periodontitis).

Specific infections

GWASs for infections caused by Neisseria meningitidis (Davila et al., 2010; Martinon-Torres et al., 2016), Helicobacter pylori (Mayerle et al., 2013), and Mycobacterium leprae (Zhang et al., 2009), strongly pointed to a role for risk variants in genes of the innate immune system. For H. pylori, variants in TLR1, known to be essential for innate immunity against bacterial infection, were identified as strongly associated with seroprevalence (Table 1, Fig. 2). For N. meningitidis, two GWASs for meningitis and septicaemia identified variants in the CFH gene region (Davila et al., 2010; Martinon-Torres et al., 2016) (Table 1, Fig. 2). CFH encodes complement factor H, a regulator of the complement system, which is bound by N. meningitidis to escape complement mediated killing (Schneider et al., 2009). In the case of leprosy, a single GWAS identified candidate susceptibility genes encoding proteins involved in innate immune response (NOD2, TNFSF15, and RIPK2) (Table 1, Fig. 2). Furthermore, in a GWAS for leprosy in the Chinese population, H. Liu et al. (2015) found an overlap between their hits and those for autoimmunity/inflammatory diseases. For example, association with CCDC88B, identified in the leprosy GWAS (Table 1), was also reported for primary biliary cirrhosis, sarcoidosis, and inflammatory bowel disease. Likewise, SNPs in TNFSF15, LRRK2, IL18RAP/IL1RL1, and LACC1/CCDC122 represent susceptibility loci for both leprosy and Crohn's disease (Jostins et al., 2012; J.Z. Liu et al., 2015). This observation points to a shared immunological basis for infectious and inflammatory conditions (H. Liu et al., 2015). However, the study also showed that, whereas some shared loci (RIPK2 and LACC1/CCDC122) had concordant effects on the risk for leprosy and for inflammatory/autoimmune diseases, some others (e.g., IL18RAP/IL1RL1) showed opposite or discordant risk effects. These findings suggest a complex scenario and a delicate balance between response to infection and autoimmunity/chronic inflammation (Sironi and Clerici, 2010). As expected, genes of the adaptive immune system also turned out to harbor polymorphisms associated with susceptibility to infections. Variants in the MHC region were identified as risk factors for leprosy (Zhang et al., 2009; H. Liu et al., 2015), Staphylococcus aureus infection (DeLorenze et al., 2016), enteric fever caused by Salmonella typhi (Dunstan et al., 2014), and TB (Sveinbjornsson et al., 2016) (Table 1, Fig. 2). For this latter infection, several GWASs have been performed (Table 1 and Supplementary Table 1). The WT1 association mentioned above was detected in two independent studies (Chimusa et al., 2014; Thye et al., 2012) and recently confirmed, although with a p value that did not reach genome-wide significance, in a family-based Moroccan discovery sample (Grant et al., 2016). The functional role played by WT1, a zinc-finger transcription factor, in TB susceptibility remains to be elucidated. An interesting possibility is that WT1 regulates cytokine expression or activation of the vitamin D receptor (Maurer et al., 2001). Instead, the role played by ASAP1, a gene associated to TB in a Russian discovery cohort and replicated in African populations (Table 1), is somehow clearer, as its protein product regulates dendritic cell migration (Curtis et al., 2015). However, the signals in these genes showed very small effects (WT1) or failed to replicate (ASAP1) in a recent large TB study in the Icelandic population (Sveinbjornsson et al., 2016). This GWAS is a nice example of the power of imputation to enormously extend the number of analyzed variants. In fact, the authors used whole-genome sequencing of 2636 Icelanders to impute 28.3 million SNPs and insertions/deletions for > 100,000 chip-typed individuals. These included subjects with pulmonary TB or infected with M. tuberculosis. This large-scale study detected association signals at MHC class II loci, which were validated in Russian and Croatian samples (Sveinbjornsson et al., 2016). As discussed above, the low reproducibility of associations across studies and populations may in part reflect local differences in circulating M. tuberculosis strains (Sveinbjornsson et al., 2016). Finally, we mention that variants nearby the IL12B gene, which encodes a subunit of interleukin 12, were recently associated to TB in African populations (from Tanzania and Uganda) (Sobota et al., 2016). In this GWAS, the authors used an interesting approach by including in the study only HIV-1 positive individuals. The underlying rationale was that HIV-positive individuals who do not develop TB despite living in endemic areas should be genetically resistant and ideally suited to detect variants with strong effect. Indeed, the identified variant has a strong protective effect and may modulate IL12B expression (Sobota et al., 2016). Although this signal has not been replicated yet, it is worth mentioning that loss-of-function mutations in IL12B cause mendelian susceptibility to mycobacterial disease, a rare condition characterized by the development of clinical symptoms following infection with weakly virulent mycobacteria (Picard et al., 2002). A nice example of the ability of GWASs to identify unexpected associations was provided by a recent study on bacteremia caused by Streptococcus pneumoniae (Kenyan Bacteraemia Study Group et al., 2016). By investigating a population of Kenyan children, Rautanen and coworkers identified an association peak located in the introns of two separate intergenic long non-coding RNAs (lncRNAs) annotated as AC011288.2 and AC006000.5 (Table 1). Interestingly, the risk allele is relatively rare and restricted to populations of African ancestry, underscoring the relevance of population-specific variants in disease susceptibility. Assessing AC011288.2 RNA expression in leukocyte cell subsets, Rautanen and coworkers observed expression in neutrophils only. These cells are known to play a major role in pneumococcal clearance (Gingles et al., 2001; Brinkmann et al., 2004) and their count is an independent predictor of pneumococcal bacteremia in febrile children (Kuppermann et al., 1998). Thousands of lncRNA genes have been identified in the human genome (Atianand and Fitzgerald, 2014) and growing evidence suggests that these molecules are involved in gene regulation in different cell-types and tissues, including the immune system (Atianand and Fitzgerald, 2014). In 2013, using inter-crosses of mouse strains, Gomez and coworkers provided the first direct demonstration that a lncRNA (termed NeST) can modulate the outcome of infection (Gomez et al., 2013). In particular, NeST was shown to be responsible for the persistence of Theiler's virus in the central nervous system, as well as for the clearance of Salmonella infection (Gomez et al., 2013). These data suggest that additional lncRNA await identification as important players in immune response and infectious disease susceptibility.

Caries, periodontitis, and acne

GWASs have also been applied to identify genetic determinants for very common human condition characterized by a strong interaction between environmental factors and the infecting/commensal bacteria. These include caries, which is caused by a wide array of carbohydrate-fermenting bacteria, (Larsen and Fiehn, 2017), periodontitis, where an anaerobic bacterial biofilm participates in disease onset and progression (Larsen and Fiehn, 2017), as well as acne, which is characterized by overgrowth of Propionibacterium acnes. For dental caries a variety of loci were identified via GWASs. Studies in adults have yielded significant and “suggestive” associations within or near genes with roles in tooth development and host defense. Among these we mention LYZL2, encoding a bacteriolytic agent (Shaffer et al., 2013), NAMPT, which is involved in periodontal healing (Morrison et al., 2016), and BMP7, a tooth development gene (Morrison et al., 2016) (Table 1). To date, none of the dental caries loci identified in GWASs in adults have been followed up in fine-mapping or replication studies. Importantly, given the complexity of caries etiology and the spectrum of contributing environmental factors, some authors speculated that the effects of some genetic variants may vary across different populations (Morrison et al., 2016). Although several GWASs were conducted for periodontitis, only few of them successfully identified implicated loci (Table 1 and Supplementary Table S1). Among them, the first GWAS for periodontitis (Schaefer et al., 2010) studied the aggressive type of disease and identified an association with a marker in the GLT6D1 gene (encoding a glycosyltransferase) (Table 1). Functional experiments suggested that reduced GATA3 binding affinity to the GLT6D1 locus could be a component of the pathophysiology of periodontitis. Subsequently, Offenbacher et al. (2016) used a promising approach that combined clinical phenotypes, biological intermediates of microbial burden, and measures of local inflammation to derive periodontal complex traits (PCTs). PCTs were carried forward to GWAS to identify PCT-associated loci among European American adult participants (Table 1). The authors found several significant signals in loci that included genes involved in immune response and epithelial barrier function. However, candidate loci did not associate with current clinically determined periodontal disease categories upon replication. Finally, GWASs were also used to find associations between genetic variants and severe acne susceptibility. The top signals identified in a Chinese population included a SNP within the promoter region of DDB2 (a novel androgen receptor-interacting protein), as well as variants in SELL/SELP/SELE gene cluster (Table 1). These selectins have important roles in regulating cutaneous inflammation (He et al., 2014) (Fig. 2). Instead, a GWAS conducted in a UK population identified genes linked to the TGFβ cell signaling pathway and to skin homeostasis, namely OVOL1, FST and TGFB2 (Navarini et al., 2014) (Table 1).

GWASs for parasitic and prion diseases

Parasitic infections, either caused by protozoa or by helminths, affect millions of people worldwide. For many parasitic diseases a heritable component has been demonstrated (Verra et al., 2009; Williams-Blangero et al., 2011; Dold and Holland, 2011; Choi et al., 2003). However very few GWAS were conducted; their results are summarized in Table 1 and were reviewed elsewhere (Mangano and Modiano, 2014). We add herein that a recent GWAS explored the genetic susceptibility to malaria (Malaria Genomic Epidemiology Network et al., 2015). Beyond confirming loci previously associated with severe malaria (HBB, ABO, ATP2B4) (Jallow et al., 2009; Timmann et al., 2012; Band et al., 2013), this study identified additional variants on chromosome 4, between FREM3 and a cluster of three glycophorin genes (GYPE, GYPB, and GYPA) (Table 1). Glycophorins are sialoglycoproteins abundantly expressed on the erythrocyte membrane, where they play a functional role in invasion by Plasmodium falciparum. In a recent follow-up study, the authors identified an array of large copy number variants (CNVs) affecting the GYPA and GYPB genes (Leffler et al., 2017). One of the identified CNVs (DUP4) is associated with resistance to severe malaria and explains the GWAS association signal (Malaria Genomic Epidemiology Network et al., 2015). The most common form of human prion disease in humans is Creutzfeldt-Jacob disease (CJD), which is classified as sporadic, acquired (iatrogenic or variant) or familial (Iwasaki, 2017). Variants in PRPN, encoding the prion protein (PrP), are the best known genetic factor for susceptibility to CJD (Owen et al., 1990; Mead et al., 2009) (Table 1). Nevertheless, associations to other genomic loci (MTMR7 and NPAS2 for variant CJD (Sanchez-Juan et al., 2012), GRM8 for sporadic CJD (Sanchez-Juan et al., 2015)) were recently reported (Table 1). MTMR7 is specifically expressed in the central nervous system (CNS) and is involved in the phosphatidylinositol pathway, whereas NPAS2 encodes a transcription factor implicated in neuronal function. Interestingly, the CJD susceptibility variant in NPAS2 is in strong linkage disequilibrium with an SNP that regulates PLCD3 in trans (Sanchez-Juan et al., 2012). Because PLCD3 encodes a catabolic enzyme of the phosphatidylinositol pathway, these data implicate this pathway in the susceptibility to variant CJD. As for GRM8, it encodes a member of the metabotropic glutamate receptor family. Other members of this family are involved in cellular signal transduction triggered by PrP (Um et al., 2013). Thus, these data highlight the ability of GWASs to pinpoint molecular disease pathways.

GWAS in non-human mammals

In comparison to human GWASs, association studies in domestic and laboratory animals offer advantages and disadvantages. Model animals, for instance, can be infected with a known dose of a genetically homogeneous pathogen, removing part of the variability associated to exposure routes and load in humans, as well as to the genetic heterogeneity of the infecting pathogen. Due to their peculiar demographic histories, both model and domestic animals often display extensive linkage disequilibrium and genetic homogeneity, at least within breeds or strains (Alhaddad et al., 2013; Badke et al., 2012; Boyko et al., 2010; Flint and Eskin, 2012; Lindblad-Toh et al., 2005). This implies that GWASs can be carried out using much fewer markers and samples than required in human studies. The other side of the coin is that the often complex breeding strategies pose challenges related to population structure, cryptic relatedness, and extensive selective sweeps resulting from artificial selection. These factors need to be accurately corrected for, to avoid spurious associations and loss of statistical power. Popular methods to account for these effects include the use of PCA to explicitly model ancestry contributions (Price et al., 2006) and the application of mixed model association methods (see (Hayes, 2013; Yang et al., 2014; Flint and Eskin, 2012) for review).

Mouse studies

In the field of mouse genetics, an important contribution to the mapping of complex traits came from the Collaborative Cross (CC), a collaborative effort aimed at providing the scientific community with a large panel of recombinant inbred mouse strains derived from genetically diverse founders (Churchill et al., 2004; Maurizio and Ferris, 2017). As founders are both classic inbred strains and wild-derived strains, the resulting mouse panel has higher genetic diversity and less population structure than other mouse-based resources. Within the CC framework, an ancestry-based approach was shown to be superior to marker-based methods for mapping QTLs (Aylor et al., 2011). The utility of this approach in the field of infectious diseases was initially demonstrated by a genome-wide scan of incipient CC lines for susceptibility to Aspergillus fumigatus, a pathogen that recapitulates in mice the signs of human aspergillosis (Durrant et al., 2011). The authors found that CC lines were heterogenous in terms of susceptibility to A. fumigatus infection, measured as survival days after infection, and the broad-sense heritability of this phenotype was 0.78. The genome-wide scan identified several QTLs, including one on chromosome 8 that contained the Irf2 gene and another one on chromosome 10 covering cytokine (Il20ra, Il22ra) and interferon (Ifngr1) receptors (Durrant et al., 2011). Incipient CC lines were then used to investigate the genetic susceptibility to IAV infection by recording several IAV-induced phenotypes, including virus replication, airway inflammation, weight loss, and pulmonary edema. Interestingly, a highly significant QTL on chromosome 16 (HrI1, Host response to Influenza), was identified (Ferris et al., 2013). HrI1 explained a considerable proportion of variation in several phenotypes and encompassed a genomic region where the Mx1 gene maps. Because Mx1 is a well-known IAV resistance gene (Staeheli et al., 1988), these findings provide a nice validation of the CC approach. Additional QTLs were mapped in the study and suggested that variants in Il16 and/or Nox4 (HrI2) modulate IAV-induced weight loss (Ferris et al., 2013). More recently, incipient CC lines were used to identify variants in Trim55 associated with vascular cuffing after infection with a mouse adapted SARS-CoV strain (Gralinski et al., 2015) and QTLs for survival to Klebsiella pneumoniae infection (Vered et al., 2014). Importantly, CC mice have been used to create improved models of EBOV (Ebola virus) and WNV (West Nile virus) infection (Graham et al., 2016; Graham et al., 2015; Rasmussen et al., 2014) and mouse populations showing diverse susceptibility to M. tuberculosis and Pseudomonas aeruginosa (Lore et al., 2015; Smith et al., 2016). These models await genome-wide scans to unveil the genetic determinants of infection susceptibility and severity.

Associations in dogs

Dogs have also proven useful models for the mapping of variants associated with infectious disease phenotypes. Dog breeds represent genetic isolates deriving from a few founder individuals and subject to strong artificial selection. Thus, breeds differ in several phenotypes. One of these is granulomatous colitis, which is caused by mucosally invasive Escherichia coli (Simpson et al., 2006) and is described in boxers and bulldogs only (Craven et al., 2011; Manchester et al., 2013). Within a larger project that assessed several traits and diseases, Hayward and coworkers performed a within-breed GWAS for granulomatous colitis and identified a strong association signal within a genomic region where several members of the SLAM (signaling lymphocyte activation molecule) gene family map (Hayward et al., 2016). Interestingly, variants in the corresponding human genomic region have been associated with the susceptibility to Crohn's disease and ulcerative colitis (Franke et al., 2010; J.Z. Liu et al., 2015; Barrett et al., 2008). These data indicate that, both in dogs and in humans, genes within this region contribute to the maintenance of intestinal immune homeostasis in the presence of commensal or pathogenic bacteria.

GWASs in cattle and swine

In cows, GWASs have been widely applied to the field of infectious disease susceptibility. This is clearly motivated by the fact that infections cause a major economic burden in the cattle industry worldwide. In this context, GWASs and other genomic approaches are regarded as powerful strategies to develop breeding programs aimed at the generation of less susceptible livestock (Raszek et al., 2016). One of the most investigated traits was Johne's disease, a chronic gastrointestinal tract disease caused by Mycobacterium avium subspecies paratuberculosis. Several GWASs have been performed, mostly on naturally infected Holsteins and Jerseys (common breeds of dairy cattle): associated variants with small effect were identified in several analyses, but reproducibility among studies was extremely low, resulting in no confidently associated marker (Alpay et al., 2014; Kirkpatrick et al., 2011; Minozzi et al., 2010; Neibergs et al., 2010; Pant et al., 2010; Settles et al., 2009; van Hulzen et al., 2012; Zare et al., 2014). Although the reasons for low across-study consistency are likely manifold and include different criteria to classify phenotypes and different statistical procedures, these results suggest that loci with a major effect on Johne's disease do not exist, at least in these breeds. Likewise, GWASs for susceptibility to bovine tuberculosis (caused by Mycobacterium bovis) yielded inconsistent results (Bermingham et al., 2014; Finlay et al., 2012; Kassahun et al., 2015; Richardson et al., 2016). Efforts to identify susceptibility alleles for other infectious diseases detected small-effect loci with poor replicability (reviewed in (Raszek et al., 2016)). To date, the greatest success was obtained for mastitis, which is commonly caused by bacterial infections; regions spanning the DCK, SLC4A4, and EDN3 genes were detected in at least two studies (Kanazawa et al., 1989; Sahana et al., 2013; Sodeland et al., 2011; Wu et al., 2015). The functional role of these genes in mastitis and response to invading bacteria remains to be evaluated. As is the case for cattle, infections represent major economic problems for the swine industry, with one viral pathogen, porcine reproductive and respiratory syndrome virus (PRRSV) accounting for substantial burden (Dekkers et al., 2017). Several GWASs for PRRSV susceptibility converged to identify a major locus on chromosome 4 (reviewed in (Dekkers et al., 2017)). Fine mapping of the QTL region and a further functional genomics work identified a variant in the GBP5 gene as likely causal (Dekkers et al., 2017). GPB5 encodes an interferon-inducible guanosine triphosphatase which plays central roles in cell-intrinsic immunity. In humans, GBP5 acts as a restriction factor for HIV (Krapp et al., 2016). For the purpose of swine breeding, this result is relevant as a variation at GBP5 can be used for marker-assisted selection (Dekkers et al., 2017).

Conclusions and perspectives

As detailed above, GWASs for infectious diseases have provided important biological insight and, in some cases, the results have opened the way to personalized therapy (Booth et al., 2012). Huge gaps however remain, with a number of infections still not addressed by GWAS or similar approaches. For instance, hundreds of millions of people are infected by helminths (Hotez et al., 2008), but no GWAS has investigated the genetic susceptibility to these parasites even though linkage studies provided evidence for several QTLs (reviewed in (Mangano and Modiano, 2014)). Diarrhoea and meningitis also account for a heavy health burden, especially among children in developing countries (World Health Organization, http://apps.who.int/gho/data/node.home). Nonetheless, only two studies provided genome-wide scans for genetic associations to these conditions (Bustamante et al., 2016; Davila et al., 2010). The general trend in the field of complex traits is to shift from GWAS to next-generation sequencing approaches (exome or genome sequencing). These latter provide several advantages over GWASs, which mainly derive from the possibility to identify rare or private risk variants. However, next-generation approaches require sample sizes even larger than GWASs (Auer et al., 2016; Zuk et al., 2014) and the amount of phenotypic variance that is explained by rare penetrant variants may largely differ among traits. We consider that important knowledge on infectious disease susceptibility (and related traits) can be still gained by GWASs, especially for those conditions that have never been investigated using this approach. A promising strategy for future GWASs will be to leverage information from the host's and pathogen's genomes. An interesting step forward in this direction came from a study that generated human genome-wide genotyping data of antiretroviral naive patients and almost complete HIV-1 genome sequences to systematically search for associations between host and virus variants (genome-to-genome scan) (Bartha et al., 2013). The authors found strong associations between SNPs that tag HLA class I alleles and viral mutations in CTL (cytotoxic T lymphocyte) epitopes. These results clearly highlight the selective pressure imposed by the host immune system on the viral genome. No signals were detected outside the HLA class I loci and, on the viral genome, most selected sites were located in Gag and Nef. However, the majority of host-associated HIV-1 mutations were found to have no or little effect on viral load, suggesting that the virus can compensate for selective pressure with little fitness cost (Bartha et al., 2013). More recently, Ansari et al. (2017) performed a genome-to-genome scans in patients chronically infected with HCV. As in the HIV-1 study, the authors found that the adaptive immune system exerts a selective pressure on the viral genome and drives the evolution of several positions across the HCV genome. Importantly, Ansari and coworkers also showed that the host genotype for a functional IFNL4 variant modulates viral load only when the individuals are infected by a virus that carries a specific amino acid residue (serine) at position 2414 in the NS5A protein. This result clearly indicates that the host and viral genomes interact to determine the control of infection (Ansari et al., 2017). The majority of patients recruited in the study were infected with genotype 3 and HCV is genetically heterogeneous, suggesting that other and possibly different interactions exist in patients infected by distinct HCV genotypes. The functional polymorphism in IFNL4 is a dinucleotide variant (ss469415590, TT/ΔG) and the effect of the “favorable” TT allele is recessive (Prokunina-Olsson et al., 2013). This finding highlights the possible relevance of non-additive models in genetic associations. In particular, overodominant models may be particularly worth exploring in the field of infectious diseases. Remarkable examples of heterozygote advantage include those described for HBB and G6PD in relation to malaria. Heterozygosity at HLA genes has also been shown to protect against different infections (reviewed in (Quintana-Murci, 2016)), and the same is true for a common variant in TIRAP, with heterozygotes protected from malaria, invasive pneumococcal disease, bacteremia, and TB (Khor et al., 2007). Heterozygosity for a common polymorphism in PRNP also confers relative resistance to prion diseases (Mead et al., 2003). Overdominance causes multiple alleles to be maintained at a locus via balancing selection (reviewed in (Quintana-Murci, 2016)). A well-known example of this phenomenon is described above for the FUT2 gene. The maintenance of multiple ABO histo-blood groups, which most likely resulted from the selective pressure exerted by different pathogens, is also due to balancing selection (reviewed in (Quintana-Murci, 2016)). Several works have indicated that targets of balancing selection in primate genomes have often evolved in response to pathogen-driven selective pressures (Leffler et al., 2013; Ferrer-Admetlla et al., 2008; Azevedo et al., 2015; Fumagalli and Sironi, 2014), making overdominance an appealing model to test for resistance against infection. It should however be noted that the action of balancing selection does not necessarily imply that heterozygotes are protected against a specific diseases. Epitomal in this respect is the case of FUT2 described above, as well as that of ABO: individuals with O histo-blood group are protected against severe malaria and cerebral malaria, but at higher risk of developing highly symptomatic cholera infection (Malaria Genomic Epidemiology Network and Malaria Genomic Epidemiology Network, 2014; Cooling, 2015). Interestingly, large-scale studies that focused on specific variants involved in resistance to malaria (Malaria Genomic Epidemiology Network and Malaria Genomic Epidemiology Network, 2014; Clarke et al., 2017) indicated that G6PD deficiency also results in different susceptibility phenotypes. Thus, the level of G6PD activity is associated with decreased risk of cerebral malaria, but with increased risk of severe malarial anaemia (Malaria Genomic Epidemiology Network and Malaria Genomic Epidemiology Network, 2014; Clarke et al., 2017). Moreover, by using a Bayesian statistical framework that allows for heterogeneity of effects across populations and phenotypes, the authors showed that homozygotes for the derived allele of a variant in the 5′ upstream region of CD40LG have significantly reduced risk of severe malaria in The Gambia, but significantly increased risk in Kenya (Malaria Genomic Epidemiology Network and Malaria Genomic Epidemiology Network, 2014). Finally, a significant epistatic interaction was noted between the HbC variant and an SNP in ATP2B4, this latter encoding a major erythrocyte calcium channel (Malaria Genomic Epidemiology Network and Malaria Genomic Epidemiology Network, 2014). Overall, these results indicate that models of genetic resistance/susceptibility are often complex and that statistical methods that incorporate heterogeneity across populations and phenotypes may improve the power and reliability of GWASs. For infectious diseases, a particular role may be played by interactions within HLA genes or among HLA variants and polymorphisms located outside the major histocompatibility complex. One example of this latter scenario relates to polymorphic variants in HLA-C and in the MIR148A gene (for microRNA-148a, mir-148a). Kulkarni et al. (2011) identified a polymorphism in the 3'UTR of HLA-C which affects a miR-148a binding site and associates with HLA-C expression levels, as well as with HIV-1 control. More recently, the same authors showed that an insertion/deletion polymorphism flanking MIR148A modulates the expression of this microRNA and the level of HIV-1 control only in individuals carrying HLA-C alleles with an intact miR-148a binding site. The MIR148A variant has no effect on HIV-1 control among subjects who carry HLA-C alleles that do not bind miR-148A (Kulkarni et al., 2013). Situations similar to the one described by Kulkarni and coworkers may be common and may be particularly important for HLA class I and KIR loci, as they biologically interact as binding partners. For instance, epistatic interactions among specific KIR and HLA loci have been shown to modulate the progression to AIDS (Hancock et al., 2008; Gaudieri et al., 2005; Martin et al., 2007), as well as the spontaneous (Thons et al., 2017; Khakoo et al., 2004) and therapy-induced clearance of HCV infection (Ahlenstiel et al., 2008). Likewise, interactions within the MHC may be common and relevant in modulating infection-related phenotypes. Indeed, it was recently shown that non-additive and interaction effects within HLA loci are widespread and modulate the risk of different autoimmune diseases (Lenz et al., 2015). Taking these possibilities into account would certainly benefit the discovery of novel genetic effects modulating infection susceptibility or progression. Finally, we note that, as recently highlighted, association studies have been dramatically skewed in terms of population inclusion: a 2016 survey of GWASs in the Catalog indicated that 81% of analyzed samples were of European ancestry and 14% of Asian ancestry (Popejoy and Fullerton, 2016). All other populations remained severely underrepresented. This trend is somehow attenuated in the studies we reviewed herein, as populations of Asian origin were included in > 33% of studies (most of them on HBV, HCV, and mycobacterial infections), whereas 14% and 17% of GWASs analyzed at least one cohort of African or Hispanic/Latino ancestry, respectively. However, only two studies, both of them on prion diseases, included Pacific Islanders (from Papua New Guinea) and no GWAS recruited Australian aborigines or other native peoples. These percentages clearly highlight the need to extend GWASs analysis for infectious diseases to under-represented populations. This is required to ensure that the benefits of research are equally distributed, especially in light of the recent evidence of limited portability of GWAS results across populations (Martin et al., 2017). The following is the supplementary data related to this article.

Supplementary Table S1

GWAS in infectious diseases.
  256 in total

1.  Two new susceptibility loci 1q24.2 and 11p11.2 confer risk to severe acne.

Authors:  Li He; Wen-Juan Wu; Jian-Kang Yang; Hui Cheng; Xian-Bo Zuo; Wei Lai; Tian-Wen Gao; Cui-Lin Ma; Na Luo; Jian-Qing Huang; Feng-Yan Lu; Ye-Qiang Liu; Yi-Jin Huang; Qian-Jin Lu; Huai-Liang Zhang; Lin Wang; Wei-Zhen Wang; Mei-Mei Wang; Sheng-Xiang Xiao; Qing Sun; Chun-Yang Li; Yan-Ping Bai; Hui Li; Zhan-Chao Zhou; Fu-Sheng Zhou; Gang Chen; Bo Liang; Jue Qi; Xiao-Yan Yang; Ting Yang; Xing Zheng; Liang-Dan Sun; Xue-Jun Zhang; Ya-Ping Zhang
Journal:  Nat Commun       Date:  2014       Impact factor: 14.919

2.  HCV reinfection studies and the door to vaccine development.

Authors:  Jason Grebely; David L Thomas; Gregory J Dore
Journal:  J Hepatol       Date:  2009-07-17       Impact factor: 25.083

3.  Non-secretion of ABO antigens predisposing to infection by Neisseria meningitidis and Streptococcus pneumoniae.

Authors:  C C Blackwell; K Jónsdóttir; M Hanson; W T Todd; A K Chaudhuri; B Mathew; R P Brettle; D M Weir
Journal:  Lancet       Date:  1986-08-02       Impact factor: 79.321

4.  GWAS identifies novel susceptibility loci on 6p21.32 and 21q21.3 for hepatocellular carcinoma in chronic hepatitis B virus carriers.

Authors:  Shengping Li; Ji Qian; Yuan Yang; Wanting Zhao; Juncheng Dai; Jin-Xin Bei; Jia Nee Foo; Paul J McLaren; Zhiqiang Li; Jingmin Yang; Feng Shen; Li Liu; Jiamei Yang; Shuhong Li; Shandong Pan; Yi Wang; Wenjin Li; Xiangjun Zhai; Boping Zhou; Lehua Shi; Xinchun Chen; Minjie Chu; Yiqun Yan; Jun Wang; Shuqun Cheng; Jiawei Shen; Weihua Jia; Jibin Liu; Jiahe Yang; Zujia Wen; Aijun Li; Ying Zhang; Guoliang Zhang; Xianrong Luo; Hongbo Qin; Minshan Chen; Hua Wang; Li Jin; Dongxin Lin; Hongbing Shen; Lin He; Paul I W de Bakker; Hongyang Wang; Yi-Xin Zeng; Mengchao Wu; Zhibin Hu; Yongyong Shi; Jianjun Liu; Weiping Zhou
Journal:  PLoS Genet       Date:  2012-07-12       Impact factor: 5.917

5.  Killer Immunoglobulin-Like Receptor Allele Determination Using Next-Generation Sequencing Technology.

Authors:  Bercelin Maniangou; Nolwenn Legrand; Mehdi Alizadeh; Ulysse Guyet; Catherine Willem; Gaëlle David; Eric Charpentier; Alexandre Walencik; Christelle Retière; Katia Gagne
Journal:  Front Immunol       Date:  2017-05-19       Impact factor: 7.561

6.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease.

Authors:  Luke Jostins; Stephan Ripke; Rinse K Weersma; Richard H Duerr; Dermot P McGovern; Ken Y Hui; James C Lee; L Philip Schumm; Yashoda Sharma; Carl A Anderson; Jonah Essers; Mitja Mitrovic; Kaida Ning; Isabelle Cleynen; Emilie Theatre; Sarah L Spain; Soumya Raychaudhuri; Philippe Goyette; Zhi Wei; Clara Abraham; Jean-Paul Achkar; Tariq Ahmad; Leila Amininejad; Ashwin N Ananthakrishnan; Vibeke Andersen; Jane M Andrews; Leonard Baidoo; Tobias Balschun; Peter A Bampton; Alain Bitton; Gabrielle Boucher; Stephan Brand; Carsten Büning; Ariella Cohain; Sven Cichon; Mauro D'Amato; Dirk De Jong; Kathy L Devaney; Marla Dubinsky; Cathryn Edwards; David Ellinghaus; Lynnette R Ferguson; Denis Franchimont; Karin Fransen; Richard Gearry; Michel Georges; Christian Gieger; Jürgen Glas; Talin Haritunians; Ailsa Hart; Chris Hawkey; Matija Hedl; Xinli Hu; Tom H Karlsen; Limas Kupcinskas; Subra Kugathasan; Anna Latiano; Debby Laukens; Ian C Lawrance; Charlie W Lees; Edouard Louis; Gillian Mahy; John Mansfield; Angharad R Morgan; Craig Mowat; William Newman; Orazio Palmieri; Cyriel Y Ponsioen; Uros Potocnik; Natalie J Prescott; Miguel Regueiro; Jerome I Rotter; Richard K Russell; Jeremy D Sanderson; Miquel Sans; Jack Satsangi; Stefan Schreiber; Lisa A Simms; Jurgita Sventoraityte; Stephan R Targan; Kent D Taylor; Mark Tremelling; Hein W Verspaget; Martine De Vos; Cisca Wijmenga; David C Wilson; Juliane Winkelmann; Ramnik J Xavier; Sebastian Zeissig; Bin Zhang; Clarence K Zhang; Hongyu Zhao; Mark S Silverberg; Vito Annese; Hakon Hakonarson; Steven R Brant; Graham Radford-Smith; Christopher G Mathew; John D Rioux; Eric E Schadt; Mark J Daly; Andre Franke; Miles Parkes; Severine Vermeire; Jeffrey C Barrett; Judy H Cho
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

7.  Imputation-based meta-analysis of severe malaria in three African populations.

Authors:  Gavin Band; Quang Si Le; Luke Jostins; Matti Pirinen; Katja Kivinen; Muminatou Jallow; Fatoumatta Sisay-Joof; Kalifa Bojang; Margaret Pinder; Giorgio Sirugo; David J Conway; Vysaul Nyirongo; David Kachala; Malcolm Molyneux; Terrie Taylor; Carolyne Ndila; Norbert Peshu; Kevin Marsh; Thomas N Williams; Daniel Alcock; Robert Andrews; Sarah Edkins; Emma Gray; Christina Hubbart; Anna Jeffreys; Kate Rowlands; Kathrin Schuldt; Taane G Clark; Kerrin S Small; Yik Ying Teo; Dominic P Kwiatkowski; Kirk A Rockett; Jeffrey C Barrett; Chris C A Spencer
Journal:  PLoS Genet       Date:  2013-05-23       Impact factor: 5.917

8.  Adaptations to climate in candidate genes for common metabolic disorders.

Authors:  Angela M Hancock; David B Witonsky; Adam S Gordon; Gidon Eshel; Jonathan K Pritchard; Graham Coop; Anna Di Rienzo
Journal:  PLoS Genet       Date:  2008-02       Impact factor: 5.917

9.  Widespread non-additive and interaction effects within HLA loci modulate the risk of autoimmune diseases.

Authors:  Tobias L Lenz; Aaron J Deutsch; Buhm Han; Xinli Hu; Yukinori Okada; Stephen Eyre; Michael Knapp; Alexandra Zhernakova; Tom W J Huizinga; Gonçalo Abecasis; Jessica Becker; Guy E Boeckxstaens; Wei-Min Chen; Andre Franke; Dafna D Gladman; Ines Gockel; Javier Gutierrez-Achury; Javier Martin; Rajan P Nair; Markus M Nöthen; Suna Onengut-Gumuscu; Proton Rahman; Solbritt Rantapää-Dahlqvist; Philip E Stuart; Lam C Tsoi; David A van Heel; Jane Worthington; Mira M Wouters; Lars Klareskog; James T Elder; Peter K Gregersen; Johannes Schumacher; Stephen S Rich; Cisca Wijmenga; Shamil R Sunyaev; Paul I W de Bakker; Soumya Raychaudhuri
Journal:  Nat Genet       Date:  2015-08-10       Impact factor: 38.330

10.  Genome-Wide Association Study of Staphylococcus aureus Carriage in a Community-Based Sample of Mexican-Americans in Starr County, Texas.

Authors:  Eric L Brown; Jennifer E Below; Rebecca S B Fischer; Heather T Essigmann; Hao Hu; Chad Huff; D Ashley Robinson; Lauren E Petty; David Aguilar; Graeme I Bell; Craig L Hanis
Journal:  PLoS One       Date:  2015-11-16       Impact factor: 3.240

View more
  15 in total

1.  Genetic variability in COVID-19-related genes in the Brazilian population.

Authors:  Rodrigo Secolin; Tânia K de Araujo; Marina C Gonsales; Cristiane S Rocha; Michel Naslavsky; Luiz De Marco; Maria A C Bicalho; Vinicius L Vazquez; Mayana Zatz; Wilson A Silva; Iscia Lopes-Cendes
Journal:  Hum Genome Var       Date:  2021-04-02

Review 2.  These Are the Genes You're Looking For: Finding Host Resistance Genes.

Authors:  Jeffrey S Bourgeois; Clare M Smith; Dennis C Ko
Journal:  Trends Microbiol       Date:  2020-09-29       Impact factor: 17.079

3.  Dynamic linkage of COVID-19 test results between Public Health England's Second Generation Surveillance System and UK Biobank.

Authors:  Jacob Armstrong; Justine K Rudkin; Naomi Allen; Derrick W Crook; Daniel J Wilson; David H Wyllie; Anne Marie O'Connell
Journal:  Microb Genom       Date:  2020-07

4.  Control and prevention of infectious diseases from a One Health perspective.

Authors:  Joel Henrique Ellwanger; Ana Beatriz Gorini da Veiga; Valéria de Lima Kaminski; Jacqueline María Valverde-Villegas; Abner Willian Quintino de Freitas; José Artur Bogo Chies
Journal:  Genet Mol Biol       Date:  2021-01-29       Impact factor: 1.771

5.  Statistical power in COVID-19 case-control host genomic study design.

Authors:  Yu-Chung Lin; Jennifer D Brooks; Shelley B Bull; France Gagnon; Celia M T Greenwood; Rayjean J Hung; Jerald Lawless; Andrew D Paterson; Lei Sun; Lisa J Strug
Journal:  Genome Med       Date:  2020-12-28       Impact factor: 11.117

6.  In-Depth Analysis of Genetic Variation Associated with Severe West Nile Viral Disease.

Authors:  Megan E Cahill; Mark Loeb; Andrew T Dewan; Ruth R Montgomery
Journal:  Vaccines (Basel)       Date:  2020-12-08

7.  Genetic variability in COVID-19-related genes in the Brazilian population.

Authors:  Rodrigo Secolin; Tânia K de Araujo; Marina C Gonsales; Cristiane S Rocha; Michel Naslavsky; Luiz De Marco; Maria A C Bicalho; Vinicius L Vazquez; Mayana Zatz; Wilson A Silva; Iscia Lopes-Cendes
Journal:  Hum Genome Var       Date:  2021-04-02

Review 8.  Current Challenges in Vaccinology.

Authors:  Richard B Kennedy; Inna G Ovsyannikova; Peter Palese; Gregory A Poland
Journal:  Front Immunol       Date:  2020-06-25       Impact factor: 7.561

Review 9.  Extracellular Vesicles Could Carry an Evolutionary Footprint in Interkingdom Communication.

Authors:  Ricardo Correa; Zuleima Caballero; Luis F De León; Carmenza Spadafora
Journal:  Front Cell Infect Microbiol       Date:  2020-03-03       Impact factor: 5.293

Review 10.  An Overview on ERAP Roles in Infectious Diseases.

Authors:  Irma Saulle; Chiara Vicentini; Mario Clerici; Mara Biasin
Journal:  Cells       Date:  2020-03-14       Impact factor: 6.600

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.