Literature DB >> 32558485

Genomewide Association Study of Severe Covid-19 with Respiratory Failure.

David Ellinghaus1, Frauke Degenhardt1, Luis Bujanda1, Maria Buti1, Agustín Albillos1, Pietro Invernizzi1, Javier Fernández1, Daniele Prati1, Guido Baselli1, Rosanna Asselta1, Marit M Grimsrud1, Chiara Milani1, Fátima Aziz1, Jan Kässens1, Sandra May1, Mareike Wendorff1, Lars Wienbrandt1, Florian Uellendahl-Werth1, Tenghao Zheng1, Xiaoli Yi1, Raúl de Pablo1, Adolfo G Chercoles1, Adriana Palom1, Alba-Estela Garcia-Fernandez1, Francisco Rodriguez-Frias1, Alberto Zanella1, Alessandra Bandera1, Alessandro Protti1, Alessio Aghemo1, Ana Lleo1, Andrea Biondi1, Andrea Caballero-Garralda1, Andrea Gori1, Anja Tanck1, Anna Carreras Nolla1, Anna Latiano1, Anna Ludovica Fracanzani1, Anna Peschuck1, Antonio Julià1, Antonio Pesenti1, Antonio Voza1, David Jiménez1, Beatriz Mateos1, Beatriz Nafria Jimenez1, Carmen Quereda1, Cinzia Paccapelo1, Christoph Gassner1, Claudio Angelini1, Cristina Cea1, Aurora Solier1, David Pestaña1, Eduardo Muñiz-Diaz1, Elena Sandoval1, Elvezia M Paraboschi1, Enrique Navas1, Félix García Sánchez1, Ferruccio Ceriotti1, Filippo Martinelli-Boneschi1, Flora Peyvandi1, Francesco Blasi1, Luis Téllez1, Albert Blanco-Grau1, Georg Hemmrich-Stanisak1, Giacomo Grasselli1, Giorgio Costantino1, Giulia Cardamone1, Giuseppe Foti1, Serena Aneli1, Hayato Kurihara1, Hesham ElAbd1, Ilaria My1, Iván Galván-Femenia1, Javier Martín1, Jeanette Erdmann1, Jose Ferrusquía-Acosta1, Koldo Garcia-Etxebarria1, Laura Izquierdo-Sanchez1, Laura R Bettini1, Lauro Sumoy1, Leonardo Terranova1, Leticia Moreira1, Luigi Santoro1, Luigia Scudeller1, Francisco Mesonero1, Luisa Roade1, Malte C Rühlemann1, Marco Schaefer1, Maria Carrabba1, Mar Riveiro-Barciela1, Maria E Figuera Basso1, Maria G Valsecchi1, María Hernandez-Tejero1, Marialbert Acosta-Herrera1, Mariella D'Angiò1, Marina Baldini1, Marina Cazzaniga1, Martin Schulzky1, Maurizio Cecconi1, Michael Wittig1, Michele Ciccarelli1, Miguel Rodríguez-Gandía1, Monica Bocciolone1, Monica Miozzo1, Nicola Montano1, Nicole Braun1, Nicoletta Sacchi1, Nilda Martínez1, Onur Özer1, Orazio Palmieri1, Paola Faverio1, Paoletta Preatoni1, Paolo Bonfanti1, Paolo Omodei1, Paolo Tentorio1, Pedro Castro1, Pedro M Rodrigues1, Aaron Blandino Ortiz1, Rafael de Cid1, Ricard Ferrer1, Roberta Gualtierotti1, Rosa Nieto1, Siegfried Goerg1, Salvatore Badalamenti1, Sara Marsal1, Giuseppe Matullo1, Serena Pelusi1, Simonas Juzenas1, Stefano Aliberti1, Valter Monzani1, Victor Moreno1, Tanja Wesse1, Tobias L Lenz1, Tomas Pumarola1, Valeria Rimoldi1, Silvano Bosari1, Wolfgang Albrecht1, Wolfgang Peter1, Manuel Romero-Gómez1, Mauro D'Amato1, Stefano Duga1, Jesus M Banales1, Johannes R Hov1, Trine Folseraas1, Luca Valenti1, Andre Franke1, Tom H Karlsen1.   

Abstract

BACKGROUND: There is considerable variation in disease behavior among patients infected with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus that causes coronavirus disease 2019 (Covid-19). Genomewide association analysis may allow for the identification of potential genetic factors involved in the development of Covid-19.
METHODS: We conducted a genomewide association study involving 1980 patients with Covid-19 and severe disease (defined as respiratory failure) at seven hospitals in the Italian and Spanish epicenters of the SARS-CoV-2 pandemic in Europe. After quality control and the exclusion of population outliers, 835 patients and 1255 control participants from Italy and 775 patients and 950 control participants from Spain were included in the final analysis. In total, we analyzed 8,582,968 single-nucleotide polymorphisms and conducted a meta-analysis of the two case-control panels.
RESULTS: We detected cross-replicating associations with rs11385942 at locus 3p21.31 and with rs657152 at locus 9q34.2, which were significant at the genomewide level (P<5×10-8) in the meta-analysis of the two case-control panels (odds ratio, 1.77; 95% confidence interval [CI], 1.48 to 2.11; P = 1.15×10-10; and odds ratio, 1.32; 95% CI, 1.20 to 1.47; P = 4.95×10-8, respectively). At locus 3p21.31, the association signal spanned the genes SLC6A20, LZTFL1, CCR9, FYCO1, CXCR6 and XCR1. The association signal at locus 9q34.2 coincided with the ABO blood group locus; in this cohort, a blood-group-specific analysis showed a higher risk in blood group A than in other blood groups (odds ratio, 1.45; 95% CI, 1.20 to 1.75; P = 1.48×10-4) and a protective effect in blood group O as compared with other blood groups (odds ratio, 0.65; 95% CI, 0.53 to 0.79; P = 1.06×10-5).
CONCLUSIONS: We identified a 3p21.31 gene cluster as a genetic susceptibility locus in patients with Covid-19 with respiratory failure and confirmed a potential involvement of the ABO blood-group system. (Funded by Stein Erik Hagen and others.).
Copyright © 2020 Massachusetts Medical Society.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32558485      PMCID: PMC7315890          DOI: 10.1056/NEJMoa2020283

Source DB:  PubMed          Journal:  N Engl J Med        ISSN: 0028-4793            Impact factor:   91.245


Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was discovered in Wuhan, China, in late 2019, and coronavirus disease 2019 (Covid-19), the disease caused by SARS-CoV-2, rapidly evolved into a global pandemic.[1] As of June 15, 2020, there were more than 8.03 million confirmed cases worldwide, with total deaths exceeding 436,900.[2] In Europe, Italy and Spain were severely affected early on, with epidemic peaks starting in the second half of February 2020 (Figure 1) and 61,507 deaths reported by June 15, 2020. Covid-19 has varied manifestations,[3] with the large majority of infected persons having only mild symptoms or even no symptoms.[4] Mortality rates are driven predominantly by the subgroup of patients who have severe respiratory failure related to interstitial pneumonia in both lungs and acute respiratory distress syndrome.[5] Severe Covid-19 with respiratory failure requires early and prolonged support by mechanical ventilation.[6]
Figure 1

Timeline of Rapid Covid-19 Genomewide Association Study (GWAS).

The main events and milestones of the study are summarized in the plot. Samples from patients in three Italian hospitals (hospital A: Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, Milan; hospital B: Humanitas Clinical and Research Center, IRCCS, Milan; and hospital C: UNIMIB School of Medicine, San Gerardo Hospital, Monza) and four Spanish hospitals (hospital A: Hospital Clínic and IDIBAPS, Barcelona; hospital B: Hospital Universitario Vall d’Hebron, Barcelona; hospital C: Hospital Universitario Ramón y Cajal, Madrid; and hospital D: Donostia University Hospital, San Sebastian) were obtained around the peak of the local epidemics, and ethics applications were quickly obtained by means of fast-track procedures (i.e., every local ethics review board supported studies of coronavirus disease 2019 [Covid-19] studies by providing rapid turn-around times, thus facilitating this fast de novo data generation). All the obtained blood samples were centrally isolated, genotyped, and analyzed within 8 weeks. Control data were obtained from control participants and from historical control data in Italy and Spain. The rapid workflow from patients to target identification shows the usefulness of GWAS, a standardized research tool that often relies on international and interdisciplinary cooperation. One center alone could not have completed this study, not to mention the increase in statistical power that was available because of the contribution of patients from multiple centers. The speed of data production depended heavily on laboratory automation, and the speed of analyses reflects existing analytic pipelines and the support of public so-called imputation servers (here, the Michigan imputation server of the G. Abecasis group). QC denotes quality control.

The pathogenesis of severe Covid-19 and the associated respiratory failure is poorly understood, but higher mortality is consistently associated with older age and male sex.[7,8] Clinical associations have also been reported for hypertension, diabetes, and other obesity-related and cardiovascular disease traits, but the relative role of clinical risk factors in determining the severity of Covid-19 has not yet been clarified.[7-11] Observational data on lymphocytic endotheliitis and diffuse microvascular and macrovascular thromboembolic complications suggest that Covid-19 is a systemic disease that involves injury to the vascular endothelium but provide little insight into the underlying pathogenesis.[12-14] At the peak of the epidemic in Italy and Spain in early 2020, we performed a genomewide association study (GWAS) in an attempt to delineate host genetic factors contributing to severe Covid-19 with respiratory failure. The relatively low disease burden of Covid-19 in Norway and Germany allowed for a complementary team to be set up, whereby genotyping and analysis could occur in parallel with the rapid recruitment of patients in the heavily affected Italian and Spanish epicenters.

Methods

Study Participants and Recruitment

We recruited 1980 patients with severe Covid-19, which was defined as hospitalization with respiratory failure and a confirmed SARS-CoV-2 viral RNA polymerase-chain-reaction (PCR) test from nasopharyngeal swabs or other relevant biologic fluids, cross sectionally, from intensive care units and general wards at seven hospitals in four cities in the pandemic epicenters in Italy and Spain (Table S1A in Supplementary Appendix 1, available with the full text of this article at NEJM.org). The hospitals in Italy were the following: Fondazione IRCCS Cá Granda Ospedale Maggiore Policlinico in Milan (597 patients); Humanitas Clinical and Research Center, IRCCS, in Milan (154 patients); and UNIMIB (Università degli Studi di Milano–Bicocca) School of Medicine, San Gerardo Hospital, in Monza (a suburb of Milan) (200 patients). The hospitals in Spain were the following: Hospital Clínic and IDIBAPS (Instituto de Investigaciones Biomédicas August Pi i Sunyer) in Barcelona (56 patients), Hospital Universitario Vall d’Hebron in Barcelona (337 patients), Hospital Universitario Ramón y Cajal in Madrid (298 patients), and Donostia University Hospital in San Sebastian (338 patients). Respiratory failure was defined in the simplest possible manner in order to ensure feasibility: the use of oxygen supplementation or mechanical ventilation, with severity graded according to the maximum respiratory support received at any point during hospitalization (supplemental oxygen therapy only, noninvasive ventilatory support, invasive ventilatory support, or extracorporeal membrane oxygenation). For severity assessments, severity was also dichotomized as no mechanical ventilation or mechanical ventilation. Whole-blood samples or buffy coats from diagnostic venipuncture were obtained for DNA extraction. For comparison, we included 2381 control participants from Italy and Spain (Table S1B in Supplementary Appendix 1). We recruited 998 randomly selected blood donors at Fondazione IRCCS Cá Granda Ospedale Maggiore Policlinico, Milan, who underwent genotyping for the purpose of the present study. A total of 40 of these participants had evidence of the development of anti–SARS-CoV-2 antibodies, all of whom had mild or no Covid-19 symptoms. We also included two control panels with genotype data derived from previous studies and from persons with unknown SARS-CoV-2 infection status using the same genotyping array. The panels included 396 healthy volunteers, blood donors, and outpatients of gastroenterology departments in Italy[15] and 987 healthy blood donors in San Sebastian, Spain.

Ethics Committee Approval

The project protocol involved the rapid recruitment of patient-participants and no additional project-related procedures (we primarily used material from clinically indicated venipunctures) and afforded anonymity, owing to the minimal data set collected. Differences in recruitment and consent procedures among the centers arose because some centers integrated the project into larger Covid-19 biobanking efforts, whereas other centers did not, and because there were differences in how local ethics committees provided guidance on the handling of anonymization or deidentification of data as well as consent procedures. Written informed consent was obtained, sometimes in a delayed fashion, from the study patients at each center when possible. In some instances, informed consent was provided verbally or by the next of kin, depending on local ethics committee regulations and special policies issued for Covid-19 research. For some severely ill patients, an exemption from informed consent was obtained from a local ethics committee or according to local regulations in order to allow the use of completely anonymized surplus material from diagnostic venipuncture. The following approvals of the project were obtained from the relevant ethics committees: Germany: Kiel (reference number, D464/20); Italy: Fondazione IRCCS Cá Granda Ospedale Maggiore Policlinico (reference numbers, 342_2020 for patients and 334-2020 for control participants), Humanitas Clinical and Research Center, IRCCS (reference number, 316/20), the University of Milano–Bicocca School of Medicine, San Gerardo Hospital, Monza (the ethics committee of the National Institute of Infectious Diseases Lazzarro Spallanzani reference number, 84/2020); Norway: Regional Committee for Medical and Health Research Ethics in South-Eastern Norway (reference number, 132550); Spain: Hospital Clínic, Barcelona (reference number, HCB/2020/0405), Hospital Universitario Vall d’Hebron, Barcelona (reference number, PR[AG]244/2020), Hospital Universitario Ramón y Cajal, Madrid (reference number, 093/20) and Donostia University Hospital, San Sebastian (reference number, PI2020064).

Sample Processing, Genotyping, and Imputation

We performed DNA extraction using a Chemagic 360 (PerkinElmer) with the use of the low-volume kit CMG-1491 and the buffy-coat kit CMG-714 (Chemagen), respectively. For genotyping, we used the Global Screening Array (GSA), version 2.0 (Illumina), which contains 712,189 variants before quality control. Details on genotyping and quality-control procedures are provided in the Supplementary Methods section in Supplementary Appendix 1. To maximize genetic coverage, we performed single-nucleotide polymorphism (SNP) imputation on genome build GRCh38 using the Michigan Imputation Server and 194,512 haplotypes generated by the Trans-Omics for Precision Medicine (TOPMed) program (freeze 5).[16] After the exclusion of samples during quality control (the majority of which were due to population outliers; see the Supplementary Methods section and Table S1B and S1C), the final case–control data sets comprised 835 patients and 1255 control participants from Italy and 775 patients and 950 control participants from Spain. A total of 8,965,091 SNPs were included in the Italian cohort and 9,140,716 SNPs in the Spanish cohort.

Statistical Analysis

To take imputation uncertainty into account, we tested for phenotypic associations with allele dosage data separately for both the Italian and Spanish case–control panels with the use of the PLINK logistic-regression framework for dosage data (PLINK, version 1.9).[17] We carried out two genomewide tests of association that included covariates from principal-component analyses, with adjustments to control for potential population stratification (main analysis) and potential population stratification and age and sex bias (analysis corrected for age and sex). A fixed-effects meta-analysis was conducted with the use of the meta-analysis tool METAL[18] on 8,582,968 variants that were common to both the Italian and Spanish data sets with the use of effect-size estimates and their standard errors from the study-specific association analyses. For the genomewide meta-analysis, we used the commonly accepted threshold of 5×10−8 for joint P values to determine statistical significance. Bayesian fine-mapping analysis was performed for loci reaching genomewide significance (see the Supplementary Methods section). Genomewide summary statistics of our analyses are publicly available through our web browser (www.c19-genetics.eu) and have been submitted to the European Bioinformatics Institute (www.ebi.ac.uk/gwas; accession numbers, GCST90000255 and GCST90000256). On the basis of the results from the TOPMed genotype imputation, we selected three ABO SNPs (rs8176747, rs41302905, and rs8176719)[19,20] to infer the ABO blood type and calculated odds ratios according to blood type (A vs. B, AB, or O; B vs. A, AB, or O; AB vs. A, B, or O; and O vs. A, AB, or B) (see the Supplementary Methods). To assess in detail the HLA complex at locus 6p21, we performed sequencing-based HLA typing of seven classical HLA loci (HLA-A, -C, -B, -DRB1, -DQA1, -DQB1, and -DPB1) in a subgroup of 835 patients and 891 control participants from Italy and 773 patients from Spain (see the Supplementary Methods). We also assessed allelic distribution according to no mechanical ventilation (supplemental oxygen only) as compared with mechanical ventilation of any type. A similar assessment was made for lead SNPs rs11385942 and rs657152 at loci 3p21.31 and 9q34.2, respectively.

Results

Patients, Genotyping, and Quality Control

The milestones of the study in the context of the peak outbreaks in Italy and Spain are shown in Figure 1. Data on the age, sex, maximum respiratory support at any point during hospitalization, and relevant coexisting conditions (type 2 diabetes, hypertension, and coronary heart disease) in the patients who were included in the final analysis are shown in Table 1 and in Table S2 in Supplementary Appendix 1. Because we used the same genotyping platform (GSA) to obtain both data sets, we were able to perform a uniform quality control of the merged Italian and Spanish SNP data sets, thus reducing technical confounders to a minimum. A quantile–quantile (Q-Q) plot of the two meta-analyses (the main analysis and the analysis corrected for age and sex) showed significant associations in the tail of the distribution with minimal genomic inflation (λGC=1.015 for main analysis and λGC=1.006 for analysis corrected for age and sex) (Fig. S2 in Supplementary Appendix 1). We also carried out separate association analyses for the Italian and Spanish data sets (see the Supplementary Methods section and Fig. S3).
Table 1

Overview of Patients Included in the Final Analysis.*

CharacteristicItalian HospitalsSpanish Hospitals
A(N=503)B(N=140)C(N=192)A(N=45)B(N=228)C(N=201)D(N=301)
Median age (IQR) — yr64 (54–76)67 (57–75)66 (56–74)69 (59–75)65 (56–72)69 (60–79)67 (57–75)
Female sex — no. (%)159 (32)39 (28)51 (27)13 (29)78 (34)50 (25)124 (41)
Respiratory support — no. (%)
Supplemental oxygen only070 (50)67 (35)7 (16)105 (46)106 (53)255 (85)
Noninvasive ventilation399 (79)25 (18)89 (46)6 (13)7 (3)16 (8)0
Ventilator104 (21)45 (32)33 (17)31 (69)116 (51)77 (38)46 (15)
ECMO003 (2)1 (2)02 (1)0
Hypertension — no./total no. (%)166/503 (33)71/140 (51)109/192 (57)26/45 (58)113/228 (50)112/199 (56)114/301 (38)
Coronary artery disease — no./total no. (%)21/503 (4)25/140 (18)25/192 (13)4/45 (9)14/228 (6)35/199 (18)15/301 (5)
Diabetes — no./total no. (%)63/503 (13)18/140 (13)34/192 (18)10/45 (22)50/228 (22)57/199 (29)65/301 (22)

In Italy, hospital A was Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, Milan; hospital B Humanitas Clinical and Research Center, IRCCS, Milan; and hospital C UNIMIB School of Medicine, San Gerardo Hospital, Monza. In Spain, hospital A was Hospital Clínic and IDIBAPS, Barcelona; hospital B Hospital Universitario Vall d’Hebron, Barcelona; hospital C Hospital Universitario Ramón y Cajal, Madrid; and hospital D Donostia University Hospital, San Sebastian. The predominance of men among the patients and the advanced age (median, >63 years) were consistent across all the centers. The sample numbers provided are after quality control was conducted for the genomewide association study (Table S1C in Supplementary Appendix 1). Allele distributions for detected risk variants at loci 3p21.31 and 9q34.2 in clinical subsets are shown in Table S2. ECMO denotes extracorporeal membrane oxygenation, and IQR interquartile range.

Genomewide Association Analysis

We found two loci to be associated with Covid-19–induced respiratory failure with genomewide significance (P<5×10−8) in the main meta-analysis: the rs11385942 insertion–deletion GA or G variant at locus 3p21.31 (odds ratio for the GA allele, 1.77; 95% confidence interval [CI], 1.48 to 2.11; P=1.15×10−10) and the rs657152 A or C SNP at locus 9q34.2 (odds ratio for the A allele, 1.32; 95% CI, 1.20 to 1.47; P=4.95×10−8) (Figure 2 and Table 2 and Supplementary Appendix 2, available at NEJM.org). Both loci showed nominally significant association in both the Spanish and Italian subanalyses (Table 2). The meta-analysis association results for recessive and heterozygous genetic models for the two meta-analyses (main analysis and the analysis corrected for age and sex) are provided in Supplementary Appendix 3, available at NEJM.org. The imputation quality of the associated markers was good (Table 2 and Supplementary Appendix 2), and manual inspection of genotype cluster plots of genotyped SNPs in these regions showed distinct genotype clouds for homozygous and heterozygous calls (Fig. S4 in Supplementary Appendix 1). Furthermore, the analyses that were corrected for age and sex corroborated the observations at both rs11385942 (meta-analysis odds ratio, 2.11; 95% CI, 1.70 to 2.61; P=9.46×10−12) and rs657152 (meta-analysis odds ratio, 1.39; 95% CI, 1.22 to 1.59; P=5.35×10−7) (Table 2 and Fig. S5 in Supplementary Appendix 1).
Figure 2

GWAS Summary (Manhattan) Plot of the Meta-analysis Association Statistics Highlighting Two Susceptibility Loci with Genomewide Significance for Severe Covid-19 with Respiratory Failure.

Shown is a Manhattan plot of the association statistics from the main meta-analysis (controlled for potential population stratification). The red dashed line indicates the genomewide significance threshold of a P value less than 5×10−8. Figure S6 in Supplementary Appendix 1 shows Manhattan plots that include hits passing a suggestive significance threshold of a P value less than 1×10−5 (total of 24 additional suggestive genomic loci) (see the Supplementary Methods section and Supplementary Appendix 4).

Table 2

Susceptibility Loci Associated with Severe Covid-19 with Respiratory Failure.*

Chromosome and AnalysisMeta-analysisItalian PanelSpanish Panel
P ValueOdds Ratio(95% CI)P ValueOdds Ratio(95% CI)Allele FrequencyP ValueOdds Ratio(95% CI)Allele Frequency
patientcontrolpatientcontrol
3p21.31
Main analysis1.15×10−101.77(1.48–2.11)1.98×10−71.74(1.27–2.38)0.140.091.32×10−41.85 (1.50–2.28)0.090.05
Analysis corrected for age and sex9.46×10−122.11(1.70–2.61)7.02×10−81.95(1.53–2.48)0.140.091.17×10−52.79(1.76–4.42)0.090.05
9q34.2
Main analysis4.95×10−81.32(1.20–1.47)2.90×10−61.37(1.20–1.57)0.420.353.55×10−31.26(1.08–1.48)0.420.35
Analysis corrected for age and sex5.35×10−71.39(1.22–1.59)5.31×10−51.37(1.17–1.60)0.420.352.81×10−31.45(1.13–1.84)0.420.35

The meta-analysis included 1610 patients and 2205 control participants; the Italian analysis, 835 and 1255, respectively; and the Spanish analysis, 775 and 950, respectively. Allele frequencies of the minor or risk allele (see below) are shown among the patients and the control participants. All the association test statistics were adjusted for the top 10 principal components from the principal-component analysis. Two analyses were performed: a main analysis, which was corrected for 10 principal components, and an analysis that was corrected for age and sex in addition to 10 principal components. In the analyses that were corrected for age and sex, 25 control participants were excluded from the Spanish analysis and the meta-analysis because of missing covariate data. The P values and corresponding odds ratios and 95% confidence intervals (CIs) are shown with respect to the minor allele. Association results for the recessive and heterozygous models for both meta-analyses (main and corrected for age and sex) are shown in Supplementary Appendix 3. Covid-19 denotes coronavirus disease 2019.

For chromosome 3p21.31, the association boundaries for each index single-nucleotide polymorphism (SNP; see the Supplementary Methods section), with the genomic positions retrieved from genome build hg38, were chr3:45800446 through 46135604. The Single Nucleotide Polymorphism database (dbSNP) identifier was rs11385942 (the rs identifier from the National Center for Biotechnology Information, rs11385942, is annotated as chr3:45834968 through 45834969:AAA:AA in dbSNP, version 153, and as chr3:45834967:GA:G in the Trans-Omics for Precision Medicine [TOPMed] imputation reference panel). The SNP rs11385942 was imputed according to TOPMed with high confidence (TOPMed estimated imputation accuracy, R2=0.94 and R2=0.95 for the Italian and Spanish panels, respectively) (Supplementary Appendix 2).The minor or risk allele was GA, and the major allele was G. The key genes (i.e., the candidate genes in the region) were SLC6A20, LZTFL1, FYCO1, CXCR6, XCR1, and CCR9.

For chromosome 9q34.2, the association boundaries for each index SNP, with the genomic positions retrieved from genome build hg38, were chr9:133257521 through 133279871. The SNP rs657152 was genotyped according to the Global Screening Array (GSA) in the Italian and Spanish panels (Supplementary Appendix 2). The minor or risk allele was A, and the major allele was C. The key gene was ABO.

The allele frequencies in Spanish and Italian control data sets from previously published studies[21-27] are consistent with those we report here (Supplementary Appendix 2). A further 24 different genomic loci showed suggestive evidence (P<1×10−5) for association with Covid-19–induced respiratory failure in the main analysis (Supplementary Appendix 4, available at NEJM.org, and Fig. S6 in Supplementary Appendix 1). Association signals at loci 3p21.31 and 9q34.2 were fine-mapped to 22 and 38 variants, respectively, with greater than 95% certainty (Figure 3A and 3B and Supplementary Appendix 5, available at NEJM.org).
Figure 3

Regional Association Plots of Susceptibility Loci Associated with Severe Covid-19 with Respiratory Failure.

Bayesian fine-mapping analysis prioritized 22 and 38 variants for loci 3p21.31 (Panel A) and 9q34.2 (Panel B), respectively, with greater than 95% certainty. The linkage disequilibrium values were calculated on the basis of genotypes of the merged Italian and Spanish data sets derived from TOPMed (Trans-Omics for Precision Medicine) imputation. The positions in the genome assembly hg38 are plotted. The recombination rate is shown in centimorgans (cM) per million base pairs (Mb). The plot shows the names and locations of the genes; the transcribed strand is indicated with an arrow. Genes are represented with intronic and exonic regions. The purple diamond in each panel represents the variant most strongly associated with severe Covid-19 and respiratory failure.

Chromosome 3p21.31

The association signal at locus 3p21.31 comprised six genes (SLC6A20, LZTFL1, CCR9, FYCO1, CXCR6, and XCR1) (Figure 3A). The risk allele GA of rs11385942 is associated with reduced expression of CXCR6 and increased expression of SLC6A20, and LZTFL1 is strongly expressed in human lung cells (Fig. S7 and Supplementary Appendix 6, available at NEJM.org). We found that the frequency of the risk allele of the lead variant at 3p21.31 (rs11385942) was higher among patients who received mechanical ventilation than among those who received oxygen supplementation only in both the main meta-analysis (odds ratio, 1.70; 95% CI, 1.27 to 2.26; P=3.30×10−4) and the meta-analysis corrected for age and sex (odds ratio, 1.56; 95% CI, 1.17 to 2.01; P=0.003) (Supplementary Appendix 7, available at NEJM.org). Furthermore, the 19 patients who were homozygous for the rs11385942 risk allele were younger than 1591 patients who were heterozygous or homozygous for the nonrisk allele (median age, 59 years [interquartile range, 49 to 68] vs. 66 years [interquartile range, 56 to 75]; P=0.005). Available variant database entries suggest that the frequency of this risk allele varies among populations worldwide (Fig. S8 in Supplementary Appendix 1).

ABO Locus

At locus 9q34.2 the association signal coincided with the ABO blood group locus (Figure 3B and Fig. S9 in Supplementary Appendix 1). Accordingly, the distribution of ABO blood groups (predicted from combinations of genotypes of three different SNPs) was skewed among patients with Covid-19 who had respiratory failure, as compared with the distribution among control participants. In the meta-analysis corrected for age and sex, we found a higher risk among persons with blood group A than among patients with other blood groups (odds ratio, 1.45; 95% CI, 1.20 to 1.75; P=1.48×10−4) and a protective effect for blood group O as compared with the other blood groups (odds ratio, 0.65; 95% CI, 0.53 to 0.79; P=1.06×10−5). Details are provided in Supplementary Appendix 8, available at NEJM.org. Both associations and effect directions were consistent in the separate Spanish and Italian case–control analyses. We found no significant difference in blood-group distribution between patients receiving supplemental oxygen only and those receiving mechanical ventilation of any kind. The ABO blood-group frequency distributions in public registries are provided for comparison in Supplementary Appendix 8, along with details of the results presented here, and corroborate our observations.

HLA Analysis

Given its important role in several viral infections, we scrutinized the extended HLA region (chromosome 6, 25 through 34 Mb). There were no SNP association signals at the HLA complex that met even the significance threshold of suggestive association: P<1×10−5 (Fig. S10 in Supplementary Appendix 1). Dedicated analysis of the classical HLA loci showed no significant allele associations with either Covid-19 or disease severity (oxygen supplementation only or mechanical ventilation of any kind), and further analysis of heterozygote and divergent allele advantage or predicted number of HLA-bound SARS-CoV-2 peptides did not show significant associations with Covid-19 in this data set (see the HLA Analyses section in Supplementary Appendix 1 and Supplementary Appendix 9, available at NEJM.org).

Discussion

Using a pragmatic approach with simplified inclusion criteria and a complementary team of clinicians at the European Covid-19 epicenters in Italy and Spain and scientists in the less-burdened countries of Germany and Norway, we performed a GWAS that included de novo genotyping for Covid-19 with respiratory failure in approximately 2 months. We detected a novel susceptibility locus at a chromosome 3p21.31 gene cluster and confirmed a potential involvement of the ABO blood-group system in Covid-19. On chromosome 3p21.31, the peak association signal covered a cluster of six genes (SLC6A20, LZTFL1, CCR9, FYCO1, CXCR6, and XCR1), several of which have functions that are potentially relevant to Covid-19. A causative gene cannot be reliably implicated by the present data. One candidate is SLC6A20, which encodes the sodium–imino acid (proline) transporter 1 (SIT1) and which functionally interacts with angiotensin-converting enzyme 2, the SARS-CoV-2 cell-surface receptor.[28,29] However, the locus also contains genes encoding chemokine receptors, including the CC motif chemokine receptor 9 (CCR9) and the C-X-C motif chemokine receptor 6 (CXCR6), the latter of which regulates the specific location of lung-resident memory CD8 T cells throughout the sustained immune response to airway pathogens, including influenza viruses.[30] Flanking genes (e.g., CCR1 and CCR2) also have relevant functions,[31] and further studies will be needed to delineate the functional consequences of detected associations. The preliminary results from the Covid-19 Host Genetics Consortium[32] include suggestive associations within the same locus at chromosome 3p21.31, which lend considerable support to our findings (Fig. S11 in Supplementary Appendix 1). The consortium analysis also used population-based controls, but the patients included persons with mild Covid-19 and those with severe Covid-19. The parallel findings nevertheless underscore an important point about the ascertainment of patients and controls in genetic studies of Covid-19. Because the majority of patients with SARS-CoV-2 infection are asymptomatic, any sample involving patients with a positive nasopharyngeal RNA test is likely to hold a bias toward some degree of symptomatic burden. Two of the identifiers for inclusion in the current study were a positive result for the presence of SARS-CoV-2 according to PCR testing and receipt of respiratory support (an extreme Covid-19 phenotype). As such, it seems reasonable to conclude that the chromosome 3p21.31 locus is involved in Covid-19 susceptibility per se, with a possible enrichment in patients with severe disease. This latter interpretation is supported by the significantly higher frequency of the risk allele among patients who received mechanical ventilation than among those who received supplemental oxygen only as well as by the finding of younger age among patients who were homozygous for the risk allele than among patients who were heterozygous or homozygous for the nonrisk allele. Nongenetic studies that were reported as preprints[33,34] have previously implicated the involvement of ABO blood groups in Covid-19 susceptibility, and ABO blood groups have also been implicated in susceptibility to SARS-CoV-1 infection.[35] Our genetic data confirm that blood group O is associated with a risk of acquiring Covid-19 that was lower than that in non-O blood groups, whereas blood group A was associated with a higher risk than non-A blood groups.[33,34] The biologic mechanisms undergirding these findings may have to do with the ABO group per se (e.g., with the development of neutralizing antibodies against protein-linked N-glycans)[36] or with other biologic effects of the identified variant,[37-39] including the stabilization of von Willebrand factor.[40,41] The ABO locus holds considerable risk for population stratification,[42] which is increased by the inclusion of randomly selected blood donors in the current study (for which there is an inherent risk of blood group O enrichment). Alignment of the allele frequencies at the ABO locus in our control population with those in several non–blood-donor control populations would suggest that this is not a major bias, and at least one study[34] that tested for association with blood type used disease controls with no affiliation to blood donors. The pragmatic aspects leading to the feasibility of this massive undertaking in a very short period of time during the extreme clinical circumstances of the pandemic imposed limitations that will be important to explore in follow-up studies. For example, to enable the recruitment of study participants, a bare minimum of clinical metadata was requested. For this reason, extensive genotype–phenotype elaboration of current findings could not be conducted, and adjustments for all potential sources of bias (e.g., underlying cardiovascular and metabolic factors relevant to Covid-19) could not be performed. Furthermore, we have limited information about the SARS-CoV-2 infection status in the control participants; this concern is mitigated by the fact that the presence of susceptible persons in the control group would only bias the tests toward the null. In addition, few restrictions were imposed during inclusion, which led to genotyped samples having to be excluded owing to differing ethnic groups (population outliers). Further exploration of current findings, both as to their usefulness in clinical risk profiling of patients with Covid-19 and toward a mechanistic understanding of the underlying pathophysiology, is warranted.
  39 in total

Review 1.  Severe Covid-19.

Authors:  David A Berlin; Roy M Gulick; Fernando J Martinez
Journal:  N Engl J Med       Date:  2020-05-15       Impact factor: 91.245

2.  CCR1 deficiency increases susceptibility to fatal coronavirus infection of the central nervous system.

Authors:  Michelle J Hickey; Katherine S Held; Elizabeth Baum; Ji-Liang Gao; Philip M Murphy; Thomas E Lane
Journal:  Viral Immunol       Date:  2007-12       Impact factor: 2.257

3.  Multiple common variants for celiac disease influencing immune gene expression.

Authors:  Patrick C A Dubois; Gosia Trynka; Lude Franke; Karen A Hunt; Jihane Romanos; Alessandra Curtotti; Alexandra Zhernakova; Graham A R Heap; Róza Adány; Arpo Aromaa; Maria Teresa Bardella; Leonard H van den Berg; Nicholas A Bockett; Emilio G de la Concha; Bárbara Dema; Rudolf S N Fehrmann; Miguel Fernández-Arquero; Szilvia Fiatal; Elvira Grandone; Peter M Green; Harry J M Groen; Rhian Gwilliam; Roderick H J Houwen; Sarah E Hunt; Katri Kaukinen; Dermot Kelleher; Ilma Korponay-Szabo; Kalle Kurppa; Padraic MacMathuna; Markku Mäki; Maria Cristina Mazzilli; Owen T McCann; M Luisa Mearin; Charles A Mein; Muddassar M Mirza; Vanisha Mistry; Barbara Mora; Katherine I Morley; Chris J Mulder; Joseph A Murray; Concepción Núñez; Elvira Oosterom; Roel A Ophoff; Isabel Polanco; Leena Peltonen; Mathieu Platteel; Anna Rybak; Veikko Salomaa; Joachim J Schweizer; Maria Pia Sperandeo; Greetje J Tack; Graham Turner; Jan H Veldink; Wieke H M Verbeek; Rinse K Weersma; Victorien M Wolters; Elena Urcelay; Bozena Cukrowska; Luigi Greco; Susan L Neuhausen; Ross McManus; Donatella Barisani; Panos Deloukas; Jeffrey C Barrett; Paivi Saavalainen; Cisca Wijmenga; David A van Heel
Journal:  Nat Genet       Date:  2010-02-28       Impact factor: 38.330

4.  METAL: fast and efficient meta-analysis of genomewide association scans.

Authors:  Cristen J Willer; Yun Li; Gonçalo R Abecasis
Journal:  Bioinformatics       Date:  2010-07-08       Impact factor: 6.937

5.  Second-generation PLINK: rising to the challenge of larger and richer datasets.

Authors:  Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee
Journal:  Gigascience       Date:  2015-02-25       Impact factor: 6.524

Review 6.  Trilogy of ACE2: a peptidase in the renin-angiotensin system, a SARS receptor, and a partner for amino acid transporters.

Authors:  Keiji Kuba; Yumiko Imai; Takayo Ohto-Nakanishi; Josef M Penninger
Journal:  Pharmacol Ther       Date:  2010-07-03       Impact factor: 12.310

7.  A Novel Coronavirus from Patients with Pneumonia in China, 2019.

Authors:  Na Zhu; Dingyu Zhang; Wenling Wang; Xingwang Li; Bo Yang; Jingdong Song; Xiang Zhao; Baoying Huang; Weifeng Shi; Roujian Lu; Peihua Niu; Faxian Zhan; Xuejun Ma; Dayan Wang; Wenbo Xu; Guizhen Wu; George F Gao; Wenjie Tan
Journal:  N Engl J Med       Date:  2020-01-24       Impact factor: 91.245

8.  ABO blood group is a determinant of von Willebrand factor protein levels in human pulmonary endothelial cells.

Authors:  Glenn P Murray; Steven R Post; Ginell R Post
Journal:  J Clin Pathol       Date:  2019-10-29       Impact factor: 3.411

9.  IPD-IMGT/HLA Database.

Authors:  James Robinson; Dominic J Barker; Xenia Georgiou; Michael A Cooper; Paul Flicek; Steven G E Marsh
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

10.  An interactive web-based dashboard to track COVID-19 in real time.

Authors:  Ensheng Dong; Hongru Du; Lauren Gardner
Journal:  Lancet Infect Dis       Date:  2020-02-19       Impact factor: 25.071

View more
  564 in total

1.  Vitamin D and COVID-19.

Authors:  Tom D Thacher
Journal:  Mayo Clin Proc       Date:  2021-02-25       Impact factor: 7.616

2.  Neanderthal DNA highlights complexity of COVID risk factors.

Authors:  Yang Luo
Journal:  Nature       Date:  2020-11       Impact factor: 49.962

Review 3.  ABO blood group and COVID-19: an updated systematic literature review and meta-analysis.

Authors:  Massimo Franchini; Mario Cruciani; Carlo Mengoli; Giuseppe Marano; Fabio Candura; Nadia Lopez; Ilaria Pati; Simonetta Pupella; Vincenzo De Angelis
Journal:  Blood Transfus       Date:  2021-05-12       Impact factor: 3.443

Review 4.  Global pandemics interconnected - obesity, impaired metabolic health and COVID-19.

Authors:  Norbert Stefan; Andreas L Birkenfeld; Matthias B Schulze
Journal:  Nat Rev Endocrinol       Date:  2021-01-21       Impact factor: 43.330

Review 5.  Response to the Novel Corona Virus (COVID-19) Pandemic Across Africa: Successes, Challenges, and Implications for the Future.

Authors:  Olayinka O Ogunleye; Debashis Basu; Debjani Mueller; Jacqueline Sneddon; R Andrew Seaton; Adesola F Yinka-Ogunleye; Joshua Wamboga; Nenad Miljković; Julius C Mwita; Godfrey Mutashambara Rwegerera; Amos Massele; Okwen Patrick; Loveline Lum Niba; Melaine Nsaikila; Wafaa M Rashed; Mohamed Ali Hussein; Rehab Hegazy; Adefolarin A Amu; Baffour Boaten Boahen-Boaten; Zinhle Matsebula; Prudence Gwebu; Bongani Chirigo; Nongabisa Mkhabela; Tenelisiwe Dlamini; Siphiwe Sithole; Sandile Malaza; Sikhumbuzo Dlamini; Daniel Afriyie; George Awuku Asare; Seth Kwabena Amponsah; Israel Sefah; Margaret Oluka; Anastasia N Guantai; Sylvia A Opanga; Tebello Violet Sarele; Refeletse Keabetsoe Mafisa; Ibrahim Chikowe; Felix Khuluza; Dan Kibuule; Francis Kalemeera; Mwangana Mubita; Joseph Fadare; Laurien Sibomana; Gwendoline Malegwale Ramokgopa; Carmen Whyte; Tshegofatso Maimela; Johannes Hugo; Johanna C Meyer; Natalie Schellack; Enos M Rampamba; Adel Visser; Abubakr Alfadl; Elfatih M Malik; Oliver Ombeva Malande; Aubrey C Kalungia; Chiluba Mwila; Trust Zaranyika; Blessmore Vimbai Chaibva; Ioana D Olaru; Nyasha Masuka; Janney Wale; Lenias Hwenda; Regina Kamoga; Ruaraidh Hill; Corrado Barbui; Tomasz Bochenek; Amanj Kurdi; Stephen Campbell; Antony P Martin; Thuy Nguyen Thi Phuong; Binh Nguyen Thanh; Brian Godman
Journal:  Front Pharmacol       Date:  2020-09-11       Impact factor: 5.810

6.  Homozygosity for rs17775810 Minor Allele Associated With Reduced Mortality of COVID-19 in the UK Biobank Cohort.

Authors:  Steven Lehrer; Peter H Rheinstein
Journal:  In Vivo       Date:  2021 Mar-Apr       Impact factor: 2.155

7.  Human genetic variants identified that affect COVID susceptibility and severity.

Authors:  Samira Asgari; Lionel A Pousaz
Journal:  Nature       Date:  2021-12       Impact factor: 49.962

8.  Multilevel proteomics reveals host perturbations by SARS-CoV-2 and SARS-CoV.

Authors:  Alexey Stukalov; Virginie Girault; Vincent Grass; Ozge Karayel; Valter Bergant; Christian Urban; Darya A Haas; Yiqi Huang; Lila Oubraham; Anqi Wang; M Sabri Hamad; Antonio Piras; Fynn M Hansen; Maria C Tanzer; Igor Paron; Luca Zinzula; Thomas Engleitner; Maria Reinecke; Teresa M Lavacca; Rosina Ehmann; Roman Wölfel; Jörg Jores; Bernhard Kuster; Ulrike Protzer; Roland Rad; John Ziebuhr; Volker Thiel; Pietro Scaturro; Matthias Mann; Andreas Pichlmair
Journal:  Nature       Date:  2021-04-12       Impact factor: 49.962

9.  BIGwas: Single-command quality control and association testing for multi-cohort and biobank-scale GWAS/PheWAS data.

Authors:  Jan Christian Kässens; Lars Wienbrandt; David Ellinghaus
Journal:  Gigascience       Date:  2021-06-29       Impact factor: 6.524

10.  CCR5Delta32 deletion as a protective factor in Czech first-wave COVID-19 subjects.

Authors:  J A Hubacek; L Dusek; O Majek; V Adamek; T Cervinkova; D Dlouha; J Pavel; V Adamkova
Journal:  Physiol Res       Date:  2021-03-17       Impact factor: 1.881

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.