Literature DB >> 32917282

Genetic variants of the human host influencing the coronavirus-associated phenotypes (SARS, MERS and COVID-19): rapid systematic review and field synopsis.

Emilio Di Maria1,2, Andrea Latini3, Paola Borgiani3, Giuseppe Novelli3,4,5.   

Abstract

The COVID-19 pandemic has strengthened the interest in the biological mechanisms underlying the complex interplay between infectious agents and the human host. The spectrum of phenotypes associated with the SARS-CoV-2 infection, ranging from the absence of symptoms to severe systemic complications, raised the question as to what extent the variable response to coronaviruses (CoVs) is influenced by the variability of the hosts' genetic background.To explore the current knowledge about this question, we designed a systematic review encompassing the scientific literature published from Jan. 2003 to June 2020, to include studies on the contemporary outbreaks caused by SARS-CoV-1, MERS-CoV and SARS-CoV-2 (namely SARS, MERS and COVID-19 diseases). Studies were eligible if human genetic variants were tested as predictors of clinical phenotypes.An ad hoc protocol for the rapid review process was designed according to the PRISMA paradigm and registered at the PROSPERO database (ID: CRD42020180860). The systematic workflow provided 32 articles eligible for data abstraction (28 on SARS, 1 on MERS, 3 on COVID-19) reporting data on 26 discovery cohorts. Most studies considered the definite clinical diagnosis as the primary outcome, variably coupled with other outcomes (severity was the most frequently analysed). Ten studies analysed HLA haplotypes (1 in patients with COVID-19) and did not provide consistent signals of association with disease-associated phenotypes. Out of 22 eligible articles that investigated candidate genes (2 as associated with COVID-19), the top-ranked genes in the number of studies were ACE2, CLEC4M (L-SIGN), MBL, MxA (n = 3), ACE, CD209, FCER2, OAS-1, TLR4, TNF-α (n = 2). Only variants in MBL and MxA were found as possibly implicated in CoV-associated phenotypes in at least two studies. The number of studies for each predictor was insufficient to conduct meta-analyses.Studies collecting large cohorts from different ancestries are needed to further elucidate the role of host genetic variants in determining the response to CoVs infection. Rigorous design and robust statistical methods are warranted.

Entities:  

Keywords:  COVID-19; Coronavirus; Genetic association; Genetic susceptibility; Genomic biomarker; Genotype; Human host; Polymorphism

Mesh:

Year:  2020        PMID: 32917282      PMCID: PMC7484929          DOI: 10.1186/s40246-020-00280-6

Source DB:  PubMed          Journal:  Hum Genomics        ISSN: 1473-9542            Impact factor:   4.639


Introduction

Infectious diseases are known to accompany human evolution through a complex interaction between the host and the infection [1]. Infectious diseases, by definition, are caused by a single infectious agent. However, even prior to the current molecular genetics era, heritability studies provided the first line of evidence that part of inter-individual differences is attributable to the host genetics profile [2, 3]. The global impact of tuberculosis and HIV infection, the interest in the understanding of the genetic background of infectious disease, coupled with the emerging molecular technologies, lead to an increase of investigations on the role of the human host genetics profile [4]. Most candidate-gene studies were concentrating on respiratory infections. However, these studies provided conflicting results [5]. Several human-tropic coronaviruses (CoVs) constantly circulate in the human population and usually cause mild respiratory disease. Two of these RNA viruses—SARS-CoV-1 and MERS-CoV—had been discovered in 2002 and 2012, respectively, as the cause of severe acute respiratory syndromes. The former emerged in Guangdong province, China, and its spread in China is known as the SARS epidemic. The latter caused an epidemic that began in Saudi Arabia and was limited in the Middle East (thus named Middle East Respiratory Syndrome, MERS) [6]. In December 2019, a new infectious respiratory disease emerged in Wuhan, Hubei province, China. The disease, termed coronavirus disease 19 (COVID-19), rapidly spread from China as a global devastating pandemic [7-9]. At the time we were drafting this manuscript, the impact of the outbreak was still growing worldwide. According to the World Health Organisation, confirmed cases had exceeded 14 million and the number of deaths 600,000 (covid19.who.int—last access July 20, 2020). As the knowledge of the COVID-19 risk factors has progressed, age has become recognised as one of the predominant determinants of severe outcomes [10, 11], in association with cardiovascular disease and metabolic disorders as major comorbidity factors [12]. The latest findings of seroprevalence in COVID-19 were allowed to estimate a higher prevalence of infection than previously inferred [13]. This is in line with the hypothesis that the high rate of severe clinical manifestations experienced in the initial spread was attributable to vulnerable individuals. The extent to which the susceptibility to the severe clinical course is due to comorbidity or is determined by constitutional factors needs to be elucidated. Soon after the SARS outbreak in 2003, researchers tried to respond to the question as to whether germline genetic variants influence the occurrence of the clinical syndrome caused by CoVs as well as its variable outcome. At present, this question has begun a priority for human genomics. The first extensive systematic review addressing host genetic factors implicated in common respiratory tract infectious diseases is dated after the SARS and MERS outbreaks. Pooled analysis of respiratory infections revealed a significant association with the rs2070874 of the IL4 gene and additional genetic risk factors for tuberculosis. The study implemented a formal assessment of the risk of bias and concluded that almost 95% of eligible studies were affected by a strong risk of bias or confounding [14]. Immune response is the primary arm which should be explored in infectious diseases. Seminal studies on human Mendelian disorders causing deficiencies of the immune response to viruses (reviewed in Casanova JL et al. 2020 [15]) provided the background to investigate the influence of germline genetic variations in other, much more common, sporadic infectious diseases. According to the current paradigm for the complex disorder, the model to explain the genetic susceptibility to infection is not Mendelian—but may be either monogenic or polygenic—and postulates that any single genetic biomarker may be associated with a risk with low to very low effect. CoVs bind their cellular receptors using the homotrimeric spike glycoprotein (S1 subunit and S2 subunit in each spike monomer) on the envelope. Such binding triggers a cascade of events leading to the fusion between the cell and viral membranes for cell entry. X-ray crystallography modelling elucidated the structure of the SARS-CoV-2 receptor-binding motif which binds the human Angiotensin 1 Converting Enzyme 2 (ACE2), demonstrating that the binding mode of the SARS-CoV-2 is nearly identical to that observed in previously determined CoV-receptor complex structure [16]. Since it has been demonstrated that the SARS-CoV-2 spike protein interacts with human ACE2 [16, 17], a large number of studies focused on the ACE2 gene to investigate the hypothesis that variability in ACE2 structure and expression is related to different susceptibility to COVID-19 (recently reviewed by Devaux CA et al. [18]). The variability of ACE2, as well as of TMPRSS2 which promotes SARS-CoV-2 cellular entry, was already explored comparing public data from the population of different ancestries, providing the rationale for investigations on patients [19-21]. COVID-19 is a new disease and the current pandemic is incommensurable with other CoV-related clinical conditions. Remarkably, the SARS-CoV-2 infection occurs on a global scale with no influence from acquired immunity, either from vaccines or previous exposures. In turn, this tragic experience is an unprecedented opportunity to explore the genetic determinants of an emerging infection. Therefore, the human genetics community should endorse an extraordinary scientific effort to comprehensively investigate the influence of human genetic factors involved in the variability of susceptibility to viral infection and, in particular, in the variability of the clinical manifestations caused by SARS-CoV-2 [15, 22]. We embarked on the present systematic search of genetic association analyses in order to detect all possible hints suggesting the role of the host genome in determining the susceptibility to CoVs infections. We provide here a synthesis of evidence that could be helpful to design and conduct effective studies and, in turn, to find possible healthcare strategies aimed at facing the current COVID-19 pandemic.

Methods

Design and registration

The protocol was published in the PROSPERO repository of systematic reviews (registration number: CRD42020180860). We designed a systematic review process to summarise the results and provide a field synopsis about the host genomic biomarkers associated with COVID-19 and other CoV-associated human syndromes caused by CoVs, and able to explain part of the variability in the relevant phenotype. Secondary key questions, if applicable, regarded the effect size of each associated genomic biomarker and the predictive value of such genomic biomarkers. The protocol was developed according to the guidance of the PRISMA-P extension [23]. The PRISMA framework was applied to design reporting of results [24]. All records retrieved from databases were downloaded locally and managed by using the EndNote™ X8 software facilities. An ad hoc form for data abstraction and synthesis was defined on a worksheet.

Eligibility criteria

Population: patients affected with COVID-19 and with other severe acute respiratory syndromes sustained by CoVs, with no restriction in a clinical setting. The COVID-19 phenotype was defined extensively to include infection, development of any clinical symptom or sign, grade of severity, specific organ involvement, sequelae or complications. Population studies on healthy individuals were not eligible. Exposure: genomic biomarkers defined according to the European Medicine Agency definition (document EMEA/CHMP/ICH/437986/2006), that is “a measurable DNA and/or RNA characteristic that is an indicator of normal biologic processes, pathogenic processes, and/or response to therapeutic or other interventions”. This definition includes DNA variants such as single-nucleotide polymorphisms (SNPs), variability of short sequence repeats, haplotype, etc. Outcomes: clinical diagnosis of SARS or associated syndromes; evidence of infection; atypical manifestation of illness; measurements of clinical outcomes (mortality, recurrence, severity, quality of life, etc.). Main outcome measure: odds ratio. Ancestry and sub-phenotypes were pre-defined as possible stratification parameters in data synthesis. Study design: all study types showing effect size and significance were eligible (RCTs, cohort, case-control and cross-sectional studies). Type of article: original peer-reviewed articles reporting quantitative results of the association between the CoVs-related phenotypes and any germline genomic variant. Exclusion criteria: editorials and opinion papers were not considered. Pertinent narrative and systematic reviews were tagged in the screening phase and inspected for relevant references. Preprint articles that had not peer-reviewed were not eligible, though were considered for discussion.

Strategy for literature search, data collection and data synthesis

The following databases were inspected: PubMed and Scopus for indexed published articles; medRxive and bioRxive for pre-print articles. The search strategy was developed by two expert reviewers following an iterative process to ensure sensitivity. Two domains (“CoV” and “Genetics”) were defined combining MeSH terms and keywords using the Boolean operator “or”; the two domains were combined with “and”. Keywords for the dimension “genetics” were searched in the title field only. Several test runs showed that sensitivity was not affected, despite the remarkable reduction in the number of records retrieved [data not shown]. After interim results, a domain was added (“CoV-associated Biomarkers”) which targeted specific genes and genomic biomarkers found during the preliminary inclusion phase; the domain was combined with the “CoV” dimension with “and”; the resulting records were checked for duplicates and screened, then added to the final list of included articles. Search strings are reported in Supplementary Table S1. Records were filtered for language (English) and date of publication (Jan 2003 to Apr 2020). The publication year field was restricted to the period from 2003 onwards, to include all articles published after the SARS outbreak dated 2003. The distribution of relevant publications indexed in PubMed confirmed a sharp rise since 2003 and a massive increase in 2020 (not shown). Given the rapid evolution of the field, the literature search was replicated on June 17, in order to retrieve the latest articles and published versions of pre-print documents. To this purpose, PubMed and Scopus were interrogated. Each identified study was indexed and manually abstracted. A total of 37 fields were pre-set for data extraction. Findings for all individual genetic variants under investigation were recorded, whether significantly associated or not. Some studies provided more than one data point (corresponding to one variant—one outcome measure entry) and were consequently represented with more than one record. The consistency of results between studies was analysed by gene and by the individual genomic biomarker. The study protocol did not envisage clinical recommendations after data synthesis. Owing to the need for a rapid appraisal, the following modifications of the standard protocol for systematic reviews were applied, according to the current recommendations for rapid reviews [25, 26]. Grey literature was not systematically searched. Only articles in English were included in the screening phase and considered as eligible. One reviewer completed the screening phase according to an expedited PICO-based method previously described [27] and applying the rule of thumb “if in doubt keep it in” [24]. A second reviewer screened a sample of records. Eligible articles were inspected in-full by one reviewer and verified on title and abstract by a second independent reviewer. In case of doubts, the two reviewers discussed the accordance with the inclusion criteria. Data abstraction was drafted by one reviewer and verified independently by a second reviewer. Data synthesis was drafted by one reviewer and verified by all authors. Raw data were not requested. A formal assessment of the risk of bias was omitted. We relied on peer-review to ensure that included studies were methodologically sound, including proper and reliable statistical analysis (e.g. threshold for significance, correction for multiple testing, etc.). Therefore, unrefereed preprints were not included. It is noteworthy that most genetic association studies are based on cross-sectional or case-control design and as such are intrinsically affected by a high risk of bias.

Results

Systematic search

The systematic literature search in PubMed, Scopus, medRxive and bioRxive provided 1567 unique records and additional 362 records found in the last update (June 17, 2020). After the screening phase and full-text inspection, 32 articles fulfilled the criteria and were eligible for data abstraction. The detailed flow of literature assessment according to the PRISMA statement is depicted in Supplementary Figure 1. Considering all variants tested as a predictor and the different outcome measures modelled as a dependent variable, >500 data points were included. In fact, this count should be further multiplied by the models tested in each study, i.e. allele and genotype association, dominant, recessive, codominant, and by the number of tests. However, we abstracted for each variant the statistics showing the larger effect size or the most significant result, if applicable.

Populations

The 32 articles included for data abstraction analysed 26 independent cohorts—i.e. 3 cohorts were described in 4, 3 and 2 studies, respectively (Table 1).
Table 1

Articles included as eligible for data abstraction, ordered by year of publication and first author. For each study, main features, genes/loci examined and summary of the main findings are reported

Author, yearPopulation descriptionCountryDiseasePrimary outcomeOther outcomesSample NCases NControls NNotes on cohortsGene/locusConclusionsRef.
Lin M, 2003SARS cases admitted to Taipei Hospital, TaiwanTaiwanSARSManifest diseaseSeverity13433101HLA-A, HLA-B, HLA-DRB1HLA-B*4601 nominally associated with severity (vs larger control group), not with infection after P correction[28]
Chiu RWK, 2004SARS patients admitted at Hong Kong Chinese University HospitalHong KongSARSManifest diseaseSeverity496168328ACE2Negative results[29]
Itoyama S, 2004Vietnamese patients with SARSVietnamSARSManifest diseaseHypoxemia14744103#ACEACE nominally associated with severity[30]
Ng MHL, 2004SARS patients admitted at Hong Kong Chinese University HospitalHong KongSARSManifest diseaseSeverity188649018774HLA-A, HLA-B, HLA-DR, HLA-DQHLA-B*0703 and -B*0301 nominally associated with the disease; no association with severity[31]
Chan KC, 2005SARS patients previously admitted at Hong Kong HospitalsHong KongSARSManifest diseaseSeverity466140326ACENegative results[32]
Hamano E, 2005SARS patientsVietnamSARSManifest diseaseSeverity (oxygen therapy)14744103OAS-1, MxA, PKRNominal association of OAS-1 with disease but not severity; nominal association of MxA with severity only[33]
Ip WK, 2005SARS patients previously admitted at 5 Hong Kong HospitalsHong KongSARSManifest diseaseSerum MBL level; severity (death)17575691188MBLAssociation of -221 Y allele with disease and MBL level; no association with mortality[34]
Itoyama S, 2005Vietnamese patients with SARSVietnamSARSManifest disease14744103#ACE2Negative results[35]
Yuan FF, 2005SARS patients admitted at Prince of Wales Hospital, Hong KongHong KongSARSManifest diseaseSeverity (ICU or death)380180200§FcγRIIA, MBLNo association with MBL; FcγRIIA nominally associated with severity[36]
Zhang H, 2005SARS patients from Bejing, ChinaChinaSARSManifest diseaseSerum MBL level744352392MBLAssociation of nt 54 B allele with disease and MBL level[37]
Chan VS, 2006SARS patients admitted at four hospitals in Hong KongHong KongSARSManifest disease1127285842$CLEC4M (L-SIGN)

CLEC4M nominally associated with homozigosity (protective).

Functional analysis consistent with genetic risk.

[38]
Chong WP, 2006SARS cases retrospecivelly selected in Hong Kong and Bejing, ChinaHong KongSARSManifest diseaseSeverity (death)925476449IFN-γ, IL-10, TNF-αNo association with IL-10 and TNFalfa; IFNγ nominally associated with disease, not associated with severity[39]
He J, 2006SARS patients from Bejing, ChinaChinaSARSManifest disease666664Cohort assessed for risk factorsOAS-1, MxA

Nominal association of OAS-1 and MxA with disease.

For both variants association found in dominant model only.

[40]
Chen WJ, 2006SARS cases retrospecivelly selected from the National Taiwan databaseTaiwanSARSPositive NPSManifest disease1889494ACE2, ACP5, ADAR, AIP, ANPEP, B2M, CAT, CCL5/RANTES, CD209, CIITA, CXCL9, CXCL10, CYP17A1, EIF2AK3, EIF2S1, EIF4G1, ESR1, FGL2, FN1, G6PD, GNB3, GPX1, GSS, HMOX1, IFNAR1, IFNAR2, IFNG, IFNGR1, IFNGR2, IL1A, IL1B, IL1RN, IL4, IL6, IL10, IL10RB, IL12A, IL15, IL18, IRF1, IRF3, IRF7, MBL2, MX1, NFRKB, OAS1, PRDX2, PRKRA, PTGS2, RelB, RFX5, RNASEL, SERPINB3, SH2DIA, SLAMF1, SOCS1, SOCS3, SOD1, TBF, TFRC, TGFB1, TLR3, TLR4, TRAF6, WSX1Nominal association of detectable NPS with 4 genes (IL1A, IL18, FGL2, RelB) in the patients’ cohort; no association with infection in the case-control study[41]
Chen YM, 2006Employees of the Municipal Hoping Hospital, TaipeiTaiwanSARSSeropositivity1002080HLA-A, HLA-B, HLA-Cw, HLA-DQB1, HLA-DRB1HLA-Cw0801 nominally associated with SARS infection[42]
Chan KYK, 2007SARS patients previously admitted at 6 Hong Kong HospitalsHong KongSARSManifest diseaseLDH level, WBC1723817906$ICAM3, FCER2, CD209 (DC-SIGN), CLEC4M (L-SIGN)ICAM3 possibly associated with severity, via LDH level as a proxy[43]
Ng MW, 2007SARS cases retrospectively selected in Hong Kong and Bejing, ChinaHong KongSARSManifest diseaseSeverity (ICU or death)1073495578CCL5/RANTES, IP-10, MigNo association with IL-10 and Mig; CCL5/RANTES -28C > G is associated with disease (discovery cohort only) and severity (both discovery and validation cohorts). Association between CCL5/RANTES and severity was validated in an independent cohort of Chinese patients[44]
Yuan FF, 2007SARS patients admitted at Prince of Wales Hospital, Hong KongHong KongSARSManifest diseaseSeverity (ICU)350152198§TLR2, TLR4, CD14No informative variant in TLR2 and TLR4; CD14 nominally associated with severity[45]
Li H, 2008SARS patients retrospectively collectedHong KongSARSManifest diseaseSeverity (ICU or death)353181172FCER2, CLEC4G, CD209, CLEC4M (L-SIGN)Negative results (4 genes in the C-type lectin genes cluster, chr. 19).[46]
Wang S, 2008SARS patients admitted at Tianjin Hospital, ChinaChinaSARSManifest diseaseInterstitial lung fibrosis; femoral head necrosis20875133TNF-αNo association with infection and interstitial lung fibrosis; nominal association with femoral head necrosis[47]
Xiong P, 2008SARS patients from Guangdong province, ChinaChinaSARSManifest diseaseSeverity49895403HLA-A, HLA-B, HLA-DRB1Negative results[48]
Keicho N, 2009Vietnamese patients with SARSVietnamSARSManifest diseaseSeropositivity14544101#HLA-A, HLA-B, HLA-C, HLA-DRB1, HLA-DQB1HLA-DRB1*12 possibly associated with disease[49]
Wang Y, 2009SARS patients from Bejing and Guangzhou, ChinaChinaSARSManifest disease899376523MASP2Negative results (MASP is a downstream protein of MBL).[50]
Chan KYK, 2010SARS patients previously admitted at Hong Kong HospitalsHong KongSARSLDH level681681n/a$CD209 (DC-SIGN); combined CD209 and ICAM3CD209 and CD209 + ICAM3 possibly associated with severity, via LDH level as proxy[51]
Ching JCY, 2010SARS patients previously admitted at 6 Hong Kong HospitalsHong KongSARSManifest diseaseSeverity (assisted ventilation and other not reported measures)1210792418$MxANominal association of MxA with disease, not with severity; OAS-1 + MxA dyplotype also analysed[52]
Ng MH, 2010SARS patients recruited from the Hong Kong Department of Health databaseHong KongSARSManifest disease210102108Controls were contact of patients; families also enrolledHLA-A, HLA-B, HLA-Cw, HLA-DQ, HLA-DR

Negative results: nominal association with disease (increase in DRB4*01010101 frequency, decrease in HLA-B*1502 and HLA-DRB3*030101 frequency in patients) not surviving to correction.

Replication of previous study failed. Analysis of family data was done.

[53]
Wang SF, 2011Affected health care workers followed-up at the Dept. of Health, TaipeiTaiwanSARSManifest diseaseProlonged neutralising antibody expression975641HLA-B, HLA-Cw, HLA-DRHLA-Cw1502 and DR0301 possibly associated with resistance to SARS infection[54]
Yuan FF, 2014SARS patients admitted at Hong Kong Chinese University HospitalHong KongSARSManifest diseaseSeverity189501761877448 cases classified as severeHLA-A, HLA-B, HLA-DRNegative results[55]
Hajeer AH, 2016Patients with laboratory-confirmed MERS-CoV infection admitted at King Abdulaziz Medical City, RiyadhSaudi ArabiaMERSManifest diseaseSeverity18423161HLA-DRB1, HLA-DQB1HLA-DRB1*11:01 nominally associated with severe MERS[56]
Kuo CL, 2020COVID-19 patients from the UN BiobankCOVID-19COVID-19 positivity status323570622322948General population as controlAPOEAPOE ε4 homozygous genotype associated, independent of preexisting dementia[57]
Wang W, 2020Donors of convalescent plasma who recovered from COVID-19 in Zhejiang, ChinaChinaCOVID-19Manifest disease3630823548No severe case; 242 controls for HLA-DP1 locusHLA-A, HLA-B, HLA-C, HLA-DQ, HLA-DR, HLA-DPHLA haplotype C*07:29 and B*15:27 nominally associated with SARS-CoV-2 infection[58]
Zhang Y, 2020Patients with COVID-19 admitted to Beijing Youan Hospital, ChinaChinaCOVID-19Disease severity808024 severe cases vs 56 mildIFITM3Association with severity[59]

Notes and legend: gene and polymorphism names were reported as described in articles; the dbSNP nomenclature was added in brackets; statistics was reported if the significant association was found, not significant was annotated otherwise.

ICU intensive care unit, LDH lactate dehydrogenase, WBC white blood counts, LR logistic regression, ns not significant, n/r not reported, n/a not applicable, #: Same cohort used by Itoyama S 2004, 2005, Keicho N 2009; § Same cohort used by Yuan FF 2005, 2007; $: Same cohort used by Chan VS 2006, Chan KYK 2007, 2010, Ching JCY 2010.

Articles included as eligible for data abstraction, ordered by year of publication and first author. For each study, main features, genes/loci examined and summary of the main findings are reported CLEC4M nominally associated with homozigosity (protective). Functional analysis consistent with genetic risk. Nominal association of OAS-1 and MxA with disease. For both variants association found in dominant model only. Negative results: nominal association with disease (increase in DRB4*01010101 frequency, decrease in HLA-B*1502 and HLA-DRB3*030101 frequency in patients) not surviving to correction. Replication of previous study failed. Analysis of family data was done. Notes and legend: gene and polymorphism names were reported as described in articles; the dbSNP nomenclature was added in brackets; statistics was reported if the significant association was found, not significant was annotated otherwise. ICU intensive care unit, LDH lactate dehydrogenase, WBC white blood counts, LR logistic regression, ns not significant, n/r not reported, n/a not applicable, #: Same cohort used by Itoyama S 2004, 2005, Keicho N 2009; § Same cohort used by Yuan FF 2005, 2007; $: Same cohort used by Chan VS 2006, Chan KYK 2007, 2010, Ching JCY 2010. Consistently with the outbreaks experienced in the last decades and the period considered for inclusion, SARS infection was by far the most frequent endpoint. Therefore, all studies investigated patients of Asian ancestry—Chinese Han and Vietnamese—except the only study which focused on MERS in Saudi Arabia and one recent analysis on individuals of European ancestry based in the UK Biobank. Patient series were ascertained according to the current diagnostic criteria and recruited retrospectively. No randomisation procedure was found in eligible studies. The healthcare setting—i.e. whether in-patient or out-patient clinic, follow-up, etc.—was seldom reported. The control series resulted to be consistent for ancestry. No paired case-control enrolment was applied.The discovery samples were heterogeneous in size across studies, ranging from 44 to 323,570, while the patient series count was from 20 to 817 (Table 1).

Outcomes

Out of the 32 studies, 27 considered the disease status as the primary outcome measure, 3 seropositivity or positive nasopharyngeal shedding, 1 used disease-associated biomarkers (serum LDH level, white-blood count) and 1 measured COVID-19 severity (Table 1). A large proportion of association studies considered the severity of the disease course as a secondary outcome, as measured by admittance to intensive care unit, administration of oxygen therapy, or death. The clinical variables analysed as secondary outcomes were ascertained on a subset of the patient series and modelled as binary variables (Table 1).

Designs

All studies implemented a case-control design. If the association with clinical outcome was tested, a subset of patients was analysed and classified in a binary variable. Genotype and allele frequencies were compared by using univariate parametric statistics. Several studies applied also multivariate statistics including age and sex as covariate; one considered lifestyle-related risk factors; one corrected for putatively associated variants. All studies assumed 0.05 as the threshold of significance (as reported or inferred from article reading). Correction for multiple testing was not common in included studies, though hypothesis testing in genetic association studies implies multiple testing (see Table 3). One attempt to validate findings on an independent cohort was found [44].
Table 3

Significant data points ordered by gene and by variant, according to the positive signals of association found across studies (not significant data points were omitted). Sample size and effect size were reported as abstracted from the source article. Genes and HLA loci were listed separately

Author, yearDiseaseAncestryOutcomeSample NCases NControls NNotes on sampleGene/locusVariantRisk alleleEffect size, OR95% CIP valueNotes on statisticsMain findingRef.
Itoyama S, 2004SARSVIEHypoxemia4444n/a#ACEI/D (rs4646994)D3.041.12–8.250.02922 severe vs 22 non-severe casesNo association with disease[30]
Kuo CL, 2020COVID-19EURCOVID-19 positivity status323570622322948General population as controlAPOErs429358 + rs7412ε42.311.71–3.35<0.001mStatistics from the whole cohort; strata availableAssociation with the ε4 homozygous genotype[57]
Ng MW, 2007SARSCHIManifest disease1073495578CCL5rs2107538G2.82.11–3.71<0001cmAllele-associated OR reportedAssociation with disease[44]
Ng MW, 2007SARSCHISeverity (death)495495n/aCCL5rs2107538G2.11.30–3.390.002cm57 deaths vs438 survivals; allele-associated OR reportedAssociation with death (57 deceased vs 438 survived patients)[44]
Ng MW, 2007SARSCHISeverity (ICU)356356n/aCCL5rs2107538G2.781.37–5.630.005cm20 severe cases vs 336; allele-associated OR reportedAssociation with severity[44]
Yuan FF, 2007SARSCHIManifest disease152152n/a§CD14159C/T (rs2569190)Cn/rn/r0.02726 severe cases vs 136 mildCD14 associated with severity[45]
Chan KYK, 2010SARSCHILDH level681681n/a$CD209 (DC-SIGN)-366AG, promoter regionG0.40.19–0.850.014LDH modelled as binary variable; univariate statistics reportedAssociation with lower LDH level[51]
Chan VS, 2006SARSCHIManifest disease1127285842$CLEC4M (L-SIGN)exon 4 tandem repeathomozigosity0.6910.523–0.9130.009.Homozigosity associated with resistance to infection[38]
Chan KYK, 2010SARSCHILDH level677677n/a$combined CD209 and ICAM3-366AG, promoter region + rs2304237A; A4.341.34–14.120.021LDH level modelled as binary variable; univariate statistics reportedAssociation with higher LDH level[51]
Yuan FF, 2005SARSCHISeverity (ICU or death)38026200§FcγRIIAR131H (rs1801274)R3.21.1–9.10.0326 patients vs controlsNominal association with severity[36]
Chen WJ, 2006SARSCHIPositive NPS18894940FGL2+158 (rs2075761)A41.39–11.480.031GA genotype; outcome modelled as binary variableNominal association of NPS with 4 genes (IL1A, IL18, FGL2, RelB)[41]
Chan KYK, 2007SARSCHILDH level677677n/a$ICAM3rs2304237C (Gly)4.311.37–13.560.0067LDH modelled as binary variable; P corrected for 2 testsICAM3 associated with LDH level[43]
Zhang Y, 2020COVID-19CHIDisease severity8080n/a24 severe vs 56 mildIFITM3rs12252C6.37n/r<0.001CC vs other genotypes; CI n/rAssociation with severity[59]
Chong WP, 2006SARSCHIManifest disease925476449IFN-γrs2430561A5.192.78–9.68<0.001mRecessive model; multivariate LR n/rAssociation with one SNP[39]
Chen WJ, 2006SARSCHIPositive NPS1889494IL18-607 (rs1946518)T10.62.03–55.00.014Outcome modelled as binary variableNominal association of NPS with 4 genes (IL1A, IL18, FGL2, RelB)[41]
Chen WJ, 2006SARSCHIPositive NPS1889494IL1A−889 (rs1800587)T10.21.82–56.80.008Outcome modelled as binary variableNominal association of NPS with 4 genes (IL1A, IL18, FGL2, RelB)[41]
Ip WK, 2005SARSCHIManifest disease15413531188MBL−221X/Y (rs7096206); ex1 54A/B (rs1800450)YB1.551.21–1.99<0.001mStatistics n/rAssociation of YB haplotype with disease[34]
Ip WK, 2005SARSCHISerum MBL level353353n/aMBL−221X/Y (rs7096206); ex1 54A/B (rs1800450)YBn/rn/rn/rmStatistics n/rAssociation of YB haplotype with low serum MBL level[34]
Zhang H, 2005SARSCHIManifest disease744352392MBL230[codon 54]A/B (rs1800450)B1.731.25–2.390.00086cAssociation of B allele with disease[37]
Zhang H, 2005SARSCHISerum MBL level744352392MBL230[codon 54]A/B (rs1800450)B1.671.21–2.290.00187cAssociation of B allele with MBL level[37]
Hamano E, 2005SARSVIESeverity (oxygen therapy)4444n/aMxArs2071430 − 88)G3.751.08–10.70.0346Association with severity[33]
He J, 2006SARSCHIManifest disease666664Cohort assessed for risk factorsMxArs2071430T3.221.13–9.180.029mAssociation with AG genotype, dominant G protective[40]
Ching JCY, 2010SARSCHIManifest disease1210792418$MxArs17000900 (−123)A0.580.41–0.820.002cmAssociation with resistance to infection[52]
Hamano E, 2005SARSVIESeverity (oxygen therapy)4444n/a0OAS-1rs3741981G2.681.17–6.150.0178Association with disease[33]
He J, 2006SARSCHIManifest disease666664Cohort assessed for risk factorsOAS-1rs2660G0.380.14–0.980.047mAssociation with GT genotype, dominant T[40]
Chen WJ, 2006SARSCHIPositive NPS1889494RelB+23692 (rs2288918)T7.21.47–35.30.034Nominal association of NPS with 4 genes (IL1A, IL18, FGL2, RelB)[41]
Wang S, 2008SARSCHIManifest disease5454n/aTNF-α−1031C61.60–22.550.0424 severe cases vs 30 mildNominal association with femoral head necrosis[47]
Wang S, 2008SARSCHIManifest disease5454n/aTNF-α−863A8.41.76–40.020.0124 severe cases vs 30 mildNominal association with femoral head necrosis[47]
Chen YM, 2006SARSCHISeropositivity1002080HLA-Cw,Cw08013.41.50–7.580.003cNumber of tests not shownAssociation found in recessive model[42]
Lin M, 2003SARSn/rManifest disease13433101HLA-BB*460110.622.80–40.260.0279c6 severe cases vs 101 health care workersAssociation with both infection and severity[28]
Ng MHL, 2004SARSCHIManifest disease188649018774HLA-BB*03010.060.01–0.470.0042c.Association with resistance to infection[31]
Ng MHL, 2004SARSCHIManifest disease188649018774HLA-BB*07034.082.03–8.180.0022c.Association with infection[31]
Wang W, 2020COVID-19CHIManifest disease3630823548HLA-BB*15:273.591.72–7.500.03cAssociation with infection[58]
Wang W, 2020COVID-19CHIManifest disease3630823548HLA-CC*07:29130.25.28–32110.025cAssociation with infection[58]
Wang SF, 2011SARSn/rManifest disease975641HLA-CwCw 15020.170.04–0.810.01.Association with resistance to infection[54]
Hajeer AH, 2016MERSSAUManifest disease18423161HLA-DRB1DRB1*11:016.111.36–24.760.022c.Association with the disease[56]

Notes and legend: gene and polymorphism names were reported as described in articles; the dbSNP nomenclature was added in brackets.

CHI Chinese, EUR European, SAU Saudi Arabia, VIE Vietnamese, ICU intensive care unit, LDH lactate dehydrogenase, WBC white blood counts, NPS nasopharyngeal shedding

#: Same cohort used by Itoyama S 2004, 2005, Keicho N 2009; § Same cohort used by Yuan FF 2005, 2007; $: Same cohort used by Chan VS 2006, Chan KYK 2007, 2010, Ching JCY 2010; LR logistic regression, ns: not significant, n/r not reported, n/a not applicable, c corrected for multiple testing, m multivariate analysis

Genetic association

Table 1 reports all studies included. For each study, we reported genes/loci examined and a summary of main findings. Twenty-two articles reported the association analysis of CoV-related phenotypes with single-gene variants, either in coding or regulatory region. Ten genes were analysed in more than one study (Table 2), namely: ACE2, CLEC4M (L-SIGN), MBL, MxA (3 studies); ACE, CD209, FCER2, OAS-1, TLR4, TNF-α (2 studies). Out of these, only MBL and MxA provided positive signal of association in at least two studies. The CCL5/RANTES gene was reported in one article (Table 1) which investigated also an independent cohort of the same ancestry (i.e. Chinese) as a replication study; the second cohort provided a significant signal of association with severity but not with the manifest disease [44] (Table 3).
Table 2

Genes/loci examined in included studies, ordered by the number of relevant studies. Genes and HLA loci were listed separately

N of studiesGene
3ACE2
3CLEC4M (L-SIGN)
3MBL
3MxA
2ACE
2CD209
2FCER2
2OAS-1
2TLR4
2TNF-α
1ACP5, ADAR, AIP, ANPEP, B2M, CAT, CCL5, CD14, CIITA, CLEC4G, CXCL10, CXCL9, CYP17A1, EIF2AK3, EIF2S1, EIF4G1, ESR1, FcγRIIA, FGL2, FN1, G6PD, GNB3, GPX1, GSS, HMOX1, IFNAR1, IFNAR2, IFNG, IFNGR1, IFNGR2, IL10, IL10RB, IL12A, IL15, IL18, IL1A, IL1B, IL1RN, IL4, IL6, IP-10, IRF1, IRF3, IRF7, MASP2, MBL2, Mig, MX1, NFRKB, OAS1, PRDX2, PRKRA, PTGS2, RANTES, RelB, RFX5, RNASEL, SERPINB3, SH2DIA, SLAMF1, SOCS1, SOCS3, SOD1, TBF, TFRC, TGFB1, TLR2, TLR3, TRAF6, WSX1
HLA haplotype
10HLA-DR
9HLA-B
8HLA-A
5HLA-Cw
5HLA-DQ
Genes/loci examined in included studies, ordered by the number of relevant studies. Genes and HLA loci were listed separately Significant data points ordered by gene and by variant, according to the positive signals of association found across studies (not significant data points were omitted). Sample size and effect size were reported as abstracted from the source article. Genes and HLA loci were listed separately Notes and legend: gene and polymorphism names were reported as described in articles; the dbSNP nomenclature was added in brackets. CHI Chinese, EUR European, SAU Saudi Arabia, VIE Vietnamese, ICU intensive care unit, LDH lactate dehydrogenase, WBC white blood counts, NPS nasopharyngeal shedding #: Same cohort used by Itoyama S 2004, 2005, Keicho N 2009; § Same cohort used by Yuan FF 2005, 2007; $: Same cohort used by Chan VS 2006, Chan KYK 2007, 2010, Ching JCY 2010; LR logistic regression, ns: not significant, n/r not reported, n/a not applicable, c corrected for multiple testing, m multivariate analysis Ten studies investigated the hypothesis of association with HLA loci (Tables 1 and 2). Three of these reported negative results: two independent studies analysed Chinese patients with SARS [48, 55]; one study [53] failed to replicate the association with HLA-B*0703 and HLA-DRB1*0301 previously reported [31]. Positive signals were found in 7 studies (5 on SARS, 1 in MERS and COVID-19). The following HLA haplotypes were found differentially distributed in patients with respect to the reference population: B*4601; B*0703; B*0301; Cw0801: DRB1*1202; Cw1502; DR0301; DRB1*11:01; C*07:29; B*15:27. None of these provided positive signals in more than one cohort (Table 3).

Discussion

This is the first comprehensive review which systematically collected all studies examining the involvement of human genetic variants in severe CoV infections, including COVID-19. Patarčić and co-workers had published the first extensive review including all studies investigating five common respiratory tract infectious diseasespneumonia, tuberculosis, influenza, respiratory syncytial virus and SARS-CoV. At that time (last update: August 2015), they could cover the SARS-CoV-1 outbreak, though with some limitations such as the exclusion of HLA haplotypes [14]. A narrative review published well before the first SARS outbreak had pointed to CoV infection with a focus on animal studies [60]. By developing this comprehensive review, we recapitulated the body of knowledge about the influence of host germline genetic variants on the clinical phenotypes associated with CoVs infection. In total, we examined 32 articles that met the criteria and were eligible for data extraction. HLA haplotypes were examined in ten studies spanning the entire period from the SARS-CoV-1 outbreak in 2003 up to the SARS-CoV-2 pandemic in 2020. The remaining 22 studies tested the hypothesis of association with single-gene variants. A summary of the findings is reported below.

Association signals from single-gene studies

Two different studies reported an association between mannose-binding lectin (MBL) polymorphisms and susceptibility to SARS-CoV infection [34, 37]. MBL is a serum protein of the collectin family and plays a critical role in the innate immune response. MBL binds, by its multiple carbohydrate recognition domains, to repeating mannose and N-acetylglucosamine sugar motifs frequently present on the microbial surfaces of bacteria, viruses and protozoa [61]. After binding to a pathogen, MBL activates the complement system via MBL-associated serine protease or promotes phagocytosis by acting as an opsonin [62]. Polymorphisms in the promoter and coding regions of the MBL gene seem to have functional effects on MBL serum levels. The variant alleles causing low plasma concentrations of functional MBL were described as associated with an increased risk of developing infections [63]. Since both included studies the variants associated with lower MBL levels were observed more frequently in patients with SARS, it might be speculated that MBL is able to bind the spike protein of SARS-CoV and contributes to the defence of the host cell. Abnormalities in the production levels of several cytokines such as IL-1, IL-2, IL-4, IL-6, IL-8, IL-10, IFN-γ, TNF-α and TGF -β1 were detected in patients with SARS-CoV infection [64]. Cytokines are key mediators of the inflammatory response and are important for host defence against a wide variety of viruses, by participating in the regulation of both innate immunity and inflammatory processes. The individual level of cytokines is extremely variable with an important contribution of the genetic component since it was demonstrated that polymorphisms located in genes coding for cytokines can influence their transcriptional activity [65]. For these reasons, the genetic variants of inflammatory cytokines genes were extensively studied in patients with CoVs infection. However, we found one study reporting a significant association with SARS susceptibility, namely the minor allele of rs2430561 polymorphism in IFN-γ gene in a Chinese cohort [39]. Moreover, variants in Il1A and IL18 showed nominal association with nasopharyngeal shedding [41], and variants in the TNF-α promoter region were found as associated with femoral head necrosis [47]. None of these findings was replicated to date. Type 1 interferon (IFN-a/b) plays an important role in the innate immune response against viral infections. Among the antiviral proteins induced by IFN1, 2'-5' oligoadenylate synthetase 1 (OAS1) and myxovirus resistance 1 (MxA) harbour genetic variants that might affect susceptibility to the SARS-CoV infection and progression [66]. OAS1 can bind double-stranded RNA and activate a latent ribonuclease, RNase L, which cleaves cellular and viral RNAs. Associations between SARS and OAS1 genetic variants located in exon 3, exon 6 [33] and in region 3’ UTR [40] were found in our study. In particular, the rs1131454 polymorphism in exon 3, located near the dsRNA binding domain, causes the aminoacid substitution Gly162Ser and could affect its activity [33]. MxA is a cytoplasmic protein that belongs to the dynamin family and shows activity against several viruses. The variant alleles of rs2071430 and rs17000900 SNPs, located in the promoter region, were shown to be associated with increased promoter activity and could influence the binding affinity with nuclear protein [67]. However, the precise mechanism of antiviral action of MxA was not elucidated so far. The genetic association studies that were found eligible in this study are discordant. In fact, while two studies reported that the minor allele of rs2071430 SNP is associated with SARS-CoV [40] and hypoxia caused by the infection [33], Ching at al. claim that this allele confers resistance to the virus [52]. Yet, the formers were conducted on small cohorts and found positive signal with different outcomes. Despite the promising background, the association of MxA with CoVs-related phenotypes needs to be investigated in large cohorts by applying a robust study design. One article reported the association between SARS susceptibility and a genetic variant in the RANTES (Regulated upon Activation, Normal T cell-Expressed and Secreted) gene, also known as CCL5 [44]. RANTES codes for a chemokine responsible for the recruitment of eosinophils, lymphocytes, monocytes and basophils at the site of inflammation [68]. The study showed that the minor allele -28G is associated with severe clinical outcome in SARS Chinese patients, and increases RANTES expression and enhances NF-κB binding in vitro, thus suggesting its role in promoting inflammation [44]. This was the only study included in the eligible series which provided a functional hint linking the association signal with a putative biological relevance—yet, the promising findings warrant independent replications. After the spread of the COVID-19 pandemic, it has been speculated on the possible role of ACE2 variants, as well as other factors implicated along the pathway of SARS-CoV2 infection. In fact, biochemical interaction studies and crystal structure analyses revealed that the SARS-CoV-2 spike protein has a strong binding affinity with the human ACE2 extracellular domain [16, 17, 69, 70], and that SARS-CoV-2 and the original SARS-CoV display a high degree of homology [16, 71, 72], demonstrating that both viruses recognise the ACE2 peptidase domain as a receptor. Several studies compared the allele frequency of relevant ACE2 polymorphisms obtained from reference data with the prevalence of COVID-19 in different countries [19, 73, 74]. These dry lab investigations produced interesting hypotheses. However, the articles did not satisfy our inclusion criteria as the analyses did not entail patients’ genetic data. Although the authors correctly pointed to possible confounders and adjusted the statistical models through a multivariate approach, these studies assumed that (i) patients affected with COVID-19 were a random sample of the reference population, and (ii) patients did not differ in any determinant of either disease or severity, other than the polymorphisms under investigation—two assumptions that cannot be postulated. Although the role of the host immune system in the development and clinical course of the severe respiratory syndrome caused by SARS-CoV1 and SARS-CoV-2 has been increasingly apparent, it is also worth to underline that some promising candidate genes, namely IL4 and IL6, were not sufficiently investigated so far. One out of the 32 eligible studies included variants of both genes in a panel of 65 loci, with negative results [41]. In the systematic review and meta-analysis of studies addressing host genetic factors implicated in five common respiratory tract infectious diseases—i.e tuberculosis, influenza, respiratory syncytial virus (RSV) and SARS-CoV—published in 2015 [14], Patarčić and co-workers reported IL4 as the single result that was significant in pooled analyses, and marginally significant in disease-specific meta-analyses. IL4 promotes both T-cell and B-cell differentiation and provides a balance between Th1 and Th2 response [75]. Given its pivotal role in shaping the nature of the immune response, it is conceivable that even subtle modifications of IL4 function and expression may substantially affect the immune response and influence the course of CoVs infection. IL-6 is one of the pro-inflammatory cytokines whose levels appear to increase sharply in patients with COVID19 [76]. In particular, the highest circulating levels of IL-6 were observed in patients with respiratory dysfunction, suggesting that SARS-CoV-2 infection triggers a cytokine-mediated lung damage mechanism [77]. Recent meta-analyses confirmed that elevated IL-6 levels are closely associated with the severity of COVID-19 [78-80], further suggesting that IL6 genetic variants should be considered as a potential determinant of the host response against CoVs [81]. Based on these data, COVID19 patients with pneumonia were also treated with drugs directed against IL-6 receptor (IL6R), such as tocilizumab, with promising results [82]. The association of IL6R gene polymorphisms with the effectiveness of tocilizumab was found in rheumatoid arthritis [83] and was recently investigated in patients affected with COVID-19 [84]. Conversely, our systematic search did not find any genetic association analysis targeting IL6R in CoV-related diseases. We did not focus on pharmacogenetics of treatments against CoV infections, as a response to drugs is a distinct phenotype which deserves a specific focus. Preliminary data pointed to germline genetic variants influencing the efficacy of specific therapies administered in patients with COVID-19 [84]. This line of research may disclose basic mechanisms linked to the disease pathophysiology and provide additional hints on the genetic determinants of the host response [85, 86].

Association signals from HLA haplotypes

As the involvement of immune response in the clinical course of CoV-related diseases had been apparent since the first severe cases with SARS occurred in 2003, several efforts have been made to identify susceptibility or resistance factors associated with the variability of HLA alleles in the human population. This review included ten studies that investigated the hypothesis of association with HLA. In summary, no HLA haplotype was found to be significantly associated with CoV-related phenotypes in more than one study. Moreover, most studies appear rather weak in terms of study design and robustness of statistics (see also below on the methodological issues). Our findings are in line with the recent review by Sanchez-Mazas [87], who critically resumed both case-control studies and structure analyses (bioinformatics and in vitro), concluding that rigorous case-control studies should be combined with experimental HLA binding and T cell response assays in order to elucidate the role of the huge HLA variability in response to CoVs.

Methodological issues

In general, the sample size of eligible studies was limited by the absolute number of cases available for clinical and genetic ascertainment, and supposedly by the logistic constraints of an outbreak. A few studies, however, used a very small case-control sample—i.e. <100 cases, see Tables 1 and 3—thus implying a very high prior risk of both type I and type II errors. The limited sample size was prevented from applying an unbiased, agnostic approach, such as genome-wide association analyses. In fact, all studies considered in the present review were based on the candidate gene approach, sensibly due to the limited sample size. As long as the candidate approach is used, a relative increase in the prior likelihood of association could reduce the risk of type I error. To this aim, the latest studies on SARS-CoV-2 infection could inform on the biological relevance of gene products. Gene ontology, in turn, may allow to extend the focus from a single gene to a network of genes related to the same pathway. To accomplish this approach, statistical modelling and analyses should be more sophisticated with respect to the simple statistics applied in most genetic association studies. The use of polygenic models might overcome the intrinsic constraint due to low effect size attributable to single-gene variants and allows to explain part of the missing heritability in complex disorders. This is the case, for instance, of Alzheimer’s disease, for which complex trait analysis has been proved to successfully interpret a large proportion of the genetic variance [88], as well as breast cancer, in which multifactorial models incorporating multiple genetic variants are under validation [89, 90]. By the way, family studies should be warranted to estimate the heritability of the COVID-19 phenotypes through a formal assessment and provide the grounded basis for subsequent hypotheses. Many nominally significant signals of association listed in tables would not survive to rigorous corrections for multiple testing (see Table 3). Unfortunately, the actual set of data is not suitable for post hoc corrections, as the number of tests was not declared, and statistical models were not reported in most instances. A few attempts to replicate and validate findings were found. Ng MW and co-workers [44] found a positive association signal of rs2107538 in the RANTES/CCL5 gene with manifest disease and severity of the clinical course. The analysis on an independent cohort succeeded in replicating only the association with severity as measured by admission to the intensive care unit, by using a sample of 20 severe vs 336 mild patients [44]. The original findings [31] were matched in an independent study which applied a conservative correction for multiple testing, failing to replicate the association of SARS with HLA-B*0703 and HLA-DRB1*0301 [53]. Study design is another limitation of all investigations evaluated in this systematic search. Patient and control series were retrospectively collected based on the clinical diagnosis with no definite procedure for enrolment, thus inflating the risk of selection bias. Most studies used the general population as control, with no matched case-control design. Control series were matched for ancestry but not paired with patients for other relevant variables. Most important, in the majority of studies the exposure to CoV had not been established—therefore, control individuals could be defined as “non-cases”, but their susceptibility to the disease (i.e. the phenotype under study) had not been ascertained. Studies on HLA used reference data from the general population as the control series (see Table 3).Reference HLA data provide the best estimates of haplotype frequencies for a given ancestry and geographical area. However, these data sets do not convey information on the phenotype under study, namely the susceptibility to CoV infection and the associated illness. Whether these data set can be assumed as phenotype-negative control population is arguable. Individuals who acquired the infection and developed a severe form of COVID-19 can be presumed to lie in the extreme of the curve corresponding to the most susceptible individuals. On the opposite extreme, there might be located persons sharing the basic risk factors (such as spouses) who were ascertained as either seronegative or mildly affected. The latter individuals could be reasonably classified as resistant—either to infection or severe manifestations—and could serve as matched controls with respect to selected cases. Our recognition found one study that implemented a procedure to enrol control individuals [53]. To deploy this approach, clinical registries and linked biobanks are a helpful source of information on COVID-19 phenotype variability. The UK Biobank (UKB) has already provided pieces of evidence on COVID-19 [91], and one study matched the inclusion criteria of the present review. By interrogating the UKB, Kuo and co-workers found that ApoE ε4 homozygous individuals are more likely to be COVID-19 test positives compared with ε3 homozygotes, independently from dementia [57].

Limitations of the study

This systematic review and summary of findings should be weighted for some limitations. The rapid protocol may have limited the sensitivity of the literature search; thus, some relevant articles might have been missed. To reduce this risk, we performed an extensive hand-search of reference lists and updated the search in order to include the latest studies and unrefereed articles previously found. The present study also suffers from the inability to adjust significance and effect size for multiple testing and basic covariates, such as age, sex and comorbidity that were found to account for most of the variability in the clinical manifestations of CoV-associated diseases, particularly COVID-19 [10, 11]. The paucity of such information is an intrinsic limit of the examined literature. In fact, the lack of multiple data sets for each genetic risk factor, as well as the heterogeneity of samples and outcome measures, has prevented any post hoc analyses. As a consequence, publication bias towards nominally significant results may be supposed but could not be estimated. The short term of our rapid protocol implied that raw data and unpublished data could not be collected and analysed. The choice of rapid systematic review protocol implied also to renounce a formal quality assessment of eligible articles. Therefore, we included only accepted articles, relying on the ability of the peer-review process of rejecting papers that did not meet the minimal criteria for publication. Since no meta-analysis was possible on the set of extracted data, the lack of a structured assessment of the risk of bias did not affect the final synthesis. Somatic variants in the target tissues of CoVs were not comprised in the inclusion criteria of the present systematic search. The variable expression of virus receptors implies a complex network which controls expression level and processing in response to the virus, encompassing both coding and non-coding RNA molecules. The post-transcriptional regulation of the interaction between CoVs and the human host is a fascinating field of investigation—however, it was beyond the scope of our effort. We weighted these limitations against the utility of a synopsis which summarises a heterogeneous and promising line of research. We are also aware that the body of evidence on the host genetic determinants of CoV-related phenotypes is increasing—our synthesis provides a snapshot of the current situation but is far from being conclusive. Due to the short time-lapse from the discovery of COVID-19, the majority of studies that were appraised in the present review are related to the SARS outbreak caused by SARS-CoV-1. This finding underlines the importance of reliable genetic association studies targeting the human host response to SARS-CoV-2 infection. In a rapidly changing scenario which has a major impact on public health, such as the current pandemic, it is mandatory to maintain the knowledge with the most recent evidence and discuss the research priorities. In adherence to the recommendations recently reminded by the Cochrane’s initiative [92], the review process outlined herein can be easily replicated in order to keep the scientific community abreast of the latest evidence.

Lessons learned and perspectives

The considerations outlined above highlights the need for consortia aimed at coordinating multidisciplinary efforts to elucidate the genetic determinants of COVID-19 [15, 93]. To date, several dedicated programmes were launched within the European National Health Systems. The UK Biobank—which collects samples and detailed health data from about 500,000 volunteers—has started to analyse data collected from COVID-19 patients (https://www.ukbiobank.ac.uk/2020/04/covid/). In Finland, the COVID-19 Host Genetics Initiative intends to integrate data with major biobanks (https://www.covid19hg.org/). The Greek public initiative, COVID-19-GR, will genotype COVID-19 patients and their SARS-CoV-2 genome, linking data with the Greek COVID-19 registry (http://www.gsrt.gr/central.aspx?sId=119I428I1089I646I488772). In the UK, the GenOMICC Consortium is working in the project ‘GenOMICC study on COVID-19 patients’ aimed at achieving whole genome sequencing of up to 20,000 patients who were severely affected by COVID-19 and 15,000 who had mild symptoms (https://genomicc.org/). In Iceland, the company deCODE Genetics has partnered with Iceland’s government to sequence the genomes of patients who were previously infected with COVID-19 (https://www.decode.com/). In the USA, researchers from Harvard’s Wyss Institute and the Personal Genome Project at Harvard University are launching a multi-omics project to compare genomes, microbiomes, viromes, and immune systems of individuals with extreme susceptibility to SARS-CoV-2 infection and individuals who have shown resistance (https://wyss.harvard.edu). In collaboration with Yale University, the Yale SARS-CoV-2 Genomic Surveillance Initiative is using a variety of tools, including viral sequencing, case surveillance, test development, and wastewater surveillance, in order to monitor the spreading of the virus (https://covidtrackerct.com/). International initiatives, such as the COVID Human Genetic Effort [15] (www.covidhge.com), are having a pivotal role in facilitating the implementation of standardised phenotyping, collection of reliable clinical data, as well as biological samples, according to the best practices in adherence to shared ethical and legal standards. The GEFACOVID consortium, made up of 29 participating institutions, is focused on virus’ genetic polymorphisms and pathogenesis mechanisms as well as genetic, genomic, metabolomic, epidemiological and clinical data in order to identify biomarkers that confer susceptibility to virus infection and increase the risk of life-threatening complications (https://davincidtx.com/gefacovid-project/). Registries and biobanks are invaluable sources of demographic and clinical assessments, including imaging and biomarkers. These large data sets may be used to explore different models of response to infection and generate hypotheses encompassing multiple genetic risk factors and interactions with other predictors. Promising hypotheses could be subsequently tested in clinical studies. Assuming that ethnicity does not merely reflect the genetic background, as it is a complex construct made of genetic inheritance, cultural heritage and shared social and behavioural profiles, ancestry is a mandatory ascertainment in any clinical data set. Most genetic association studies included in the present review were targeted on patients of Chinese Han ancestry. In a global pandemic, the genetic diversity of populations across the world implies the need for inclusive investigations recruiting participants from the different genealogical and geographical origin. We endorse the recommendations recently invoked [94] and followed in an exploratory study [95], strongly suggesting the implementation of ethnicity forms and the analysis of data disaggregated by attributed ancestry.

Conclusions

After the global spread of COVID-19, the knowledge about the interplay between CoVs and the human host has been growing at an impressive rate. However, as compared to the number of clinical studies that were delivered so far, we found a quite limited number of investigations focused on the effect of human genetic variability on clinical phenotypes. A handful of candidate genes were tested in more than one study, and no single variant was confirmed to be associated with the clinical outcome in independent cohorts. The paucity of studies converging on common genetic determinants hampered any quantitative synthesis. In the light of the present synopsis, the discovery of a single major gene which determines a substantial variability of the CoV-associated phenotype cannot be sensibly anticipated. Since all genetic association studies eligible for the present review were presumably prone to both type I and type II errors, their findings should be regarded as exploratory. Plenty of recent studies addressing the COVID-19 biological mechanisms and response to therapy might provide the rationale for novel candidate genes. Genetic association analyses warrant rigorous design and robust statistical methods, in order to test precise models of interaction among multiple genetic variants and between genetic variants and exposure. To this purpose, the shared effort of the human genomics community will enable the development of robust methodological approaches through a close multidisciplinary interchange, as well as the assembly of large data sets. Prompt dissemination of genetic studies—including negative findings—is helpful to focus on the rapidly evolving research questions. Genetic variants are non-modifiable risk factors and cannot be claimed to be directly translated into the clinic as actionable markers in the present emergency. However, once a panel of associated genetic biomarkers is established, genetic biomarkers could be included in multidimensional models with age, comorbidities, socioeconomic status, etc., to prioritise public health interventions, such as active surveillance for the vulnerable population strata. From the speculative perspective, although association studies do not prove mechanisms per se, evidence from genetic variants might be crucial to elucidate the biological pathways underlying the severe clinical presentation associated with CovS infection and to identify promising therapeutic targets. Additional file 1: Supplementary table 1. Search strings used to interrogate publication databases. Supplementary figure 1. PRISMA workflow.
  88 in total

1.  Genetic polymorphism of the MxA gene promoter and interferon responsiveness of hepatitis C patients: revisited by analyzing two SNP sites (-123 and -88) in vivo and in vitro.

Authors:  M Hijikata; S Mishiro; C Miyamoto; Y Furuichi; M Hashimoto; Y Ohta
Journal:  Intervirology       Date:  2001       Impact factor: 1.763

2.  Human-leukocyte antigen class I Cw 1502 and class II DR 0301 genotypes are associated with resistance to severe acute respiratory syndrome (SARS) infection.

Authors:  Sheng-Fan Wang; Kuan-Hsuan Chen; Marcelo Chen; Wen-Yi Li; Yen-Ju Chen; Ching-Han Tsao; Muh-yong Yen; Jason C Huang; Yi-Ming Arthur Chen
Journal:  Viral Immunol       Date:  2011-09-29       Impact factor: 2.257

Review 3.  Genetic resistance to coronavirus infection. A review.

Authors:  E Buschman; E Skamene
Journal:  Adv Exp Med Biol       Date:  1995       Impact factor: 2.622

4.  Redefining rapid reviews: a flexible framework for restricted systematic reviews.

Authors:  Annette Plüddemann; Jeffrey K Aronson; Igho Onakpoya; Carl Heneghan; Kamal R Mahtani
Journal:  BMJ Evid Based Med       Date:  2018-06-27

Review 5.  Antiviral actions of interferons.

Authors:  C E Samuel
Journal:  Clin Microbiol Rev       Date:  2001-10       Impact factor: 26.132

Review 6.  Impact of mannose-binding lectin on susceptibility to infectious diseases.

Authors:  Damon P Eisen; Robyn M Minchinton
Journal:  Clin Infect Dis       Date:  2003-11-06       Impact factor: 9.079

7.  APOE e4 Genotype Predicts Severe COVID-19 in the UK Biobank Community Cohort.

Authors:  Chia-Ling Kuo; Luke C Pilling; Janice L Atkins; Jane A H Masoli; João Delgado; George A Kuchel; David Melzer
Journal:  J Gerontol A Biol Sci Med Sci       Date:  2020-10-15       Impact factor: 6.053

8.  Significance of the myxovirus resistance A (MxA) gene -123C>a single-nucleotide polymorphism in suppressed interferon beta induction of severe acute respiratory syndrome coronavirus infection.

Authors:  Johannes Chi-Yun Ching; Kelvin Yuen Kwong Chan; Eric Hing Leung Lee; Mei-Shu Xu; Campbell Kam Po Ting; Thomas M K So; Pak C Sham; Gabriel M Leung; Joseph S M Peiris; Ui-Soon Khoo
Journal:  J Infect Dis       Date:  2010-06-15       Impact factor: 5.226

9.  A novel coronavirus outbreak of global health concern.

Authors:  Chen Wang; Peter W Horby; Frederick G Hayden; George F Gao
Journal:  Lancet       Date:  2020-01-24       Impact factor: 79.321

10.  Association of human leukocyte antigen class II alleles with severe Middle East respiratory syndrome-coronavirus infection.

Authors:  Ali H Hajeer; Hanan Balkhy; Sameera Johani; Mohammed Z Yousef; Yaseen Arabi
Journal:  Ann Thorac Med       Date:  2016 Jul-Sep       Impact factor: 2.219

View more
  34 in total

1.  Age-dependent impact of the major common genetic risk factor for COVID-19 on severity and mortality.

Authors:  Tomoko Nakanishi; Sara Pigazzini; Frauke Degenhardt; Mattia Cordioli; Guillaume Butler-Laporte; Douglas Maya-Miles; Luis Bujanda; Youssef Bouysran; Mari Ek Niemi; Adriana Palom; David Ellinghaus; Atlas Khan; Manuel Martínez-Bueno; Selina Rolker; Sara Amitrano; Luisa Roade Tato; Francesca Fava; Christoph D Spinner; Daniele Prati; David Bernardo; Federico Garcia; Gilles Darcis; Israel Fernández-Cadenas; Jan Cato Holter; Jesus M Banales; Robert Frithiof; Krzysztof Kiryluk; Stefano Duga; Rosanna Asselta; Alexandre C Pereira; Manuel Romero-Gómez; Beatriz Nafría-Jiménez; Johannes R Hov; Isabelle Migeotte; Alessandra Renieri; Anna M Planas; Kerstin U Ludwig; Maria Buti; Souad Rahmouni; Marta E Alarcón-Riquelme; Eva C Schulte; Andre Franke; Tom H Karlsen; Luca Valenti; Hugo Zeberg; J Brent Richards; Andrea Ganna
Journal:  J Clin Invest       Date:  2021-12-01       Impact factor: 14.808

Review 2.  The Evolutionary Dance between Innate Host Antiviral Pathways and SARS-CoV-2.

Authors:  Saba R Aliyari; Natalie Quanquin; Olivier Pernet; Shilei Zhang; Lulan Wang; Genhong Cheng
Journal:  Pathogens       Date:  2022-05-03

3.  Mannose-Binding Lectin 2 Gene Polymorphism during Pandemic: COVID-19 Family.

Authors:  Tufan Tukek; Sacide Pehlivan; Yasemin Oyaci; Ummuhan Isoglu-Alkac
Journal:  Glob Med Genet       Date:  2022-06-13

Review 4.  Insights into COVID-19 vaccines development: Translation from benchside to bedside.

Authors:  Marwa Houssein; Aya El Asir Al Hossainy; Jana Al Soussi; Jana El Batch; Lana El-Samadi; Sherine El Imam; Rawan Fakih; Hoda Dakdouk; Mahmoud Khalil
Journal:  Health Sci Rev (Oxf)       Date:  2022-07-10

Review 5.  Antiviral performance of graphene-based materials with emphasis on COVID-19: A review.

Authors:  Tahereh Seifi; Ali Reza Kamali
Journal:  Med Drug Discov       Date:  2021-05-25

6.  Integrative genomics analysis reveals a 21q22.11 locus contributing risk to COVID-19.

Authors:  Yunlong Ma; Yukuan Huang; Sen Zhao; Yinghao Yao; Yaru Zhang; Jia Qu; Nan Wu; Jianzhong Su
Journal:  Hum Mol Genet       Date:  2021-06-17       Impact factor: 6.150

Review 7.  COVID-19 one year into the pandemic: from genetics and genomics to therapy, vaccination, and policy.

Authors:  Giuseppe Novelli; Michela Biancolella; Ruty Mehrian-Shai; Vito Luigi Colona; Anderson F Brito; Nathan D Grubaugh; Vasilis Vasiliou; Lucio Luzzatto; Juergen K V Reichardt
Journal:  Hum Genomics       Date:  2021-05-10       Impact factor: 4.639

8.  Characterization and Outcomes of SARS-CoV-2 Infection in Patients with Sarcoidosis.

Authors:  P Brito-Zerón; B Gracia-Tello; A Robles; A Alguacil; M Bonet; B De-Escalante; A Noblejas-Mosso; R Gómez-de-la-Torre; M Akasbi; M Pérez-de-Lis; R Pérez-Alvarez; M Ramos-Casals
Journal:  Viruses       Date:  2021-05-27       Impact factor: 5.048

9.  Heterozygous OAS1 gain-of-function variants cause an autoinflammatory immunodeficiency.

Authors:  Thomas Magg; Tsubasa Okano; Lars M Koenig; Daniel F R Boehmer; Samantha L Schwartz; Kento Inoue; Jennifer Heimall; Francesco Licciardi; Julia Ley-Zaporozhan; Ronald M Ferdman; Andrés Caballero-Oteyza; Esther N Park; Brenda M Calderon; Debayan Dey; Hirokazu Kanegane; Kazutoshi Cho; Davide Montin; Karl Reiter; Matthias Griese; Michael H Albert; Meino Rohlfs; Paul Gray; Christoph Walz; Graeme L Conn; Kathleen E Sullivan; Christoph Klein; Tomohiro Morio; Fabian Hauck
Journal:  Sci Immunol       Date:  2021-06-18

Review 10.  The Role of Immunogenetics in COVID-19.

Authors:  Fanny Pojero; Giuseppina Candore; Calogero Caruso; Danilo Di Bona; David A Groneberg; Mattia E Ligotti; Giulia Accardi; Anna Aiello
Journal:  Int J Mol Sci       Date:  2021-03-05       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.