Literature DB >> 35725860

Presence of rare potential pathogenic variants in subjects under 65 years old with very severe or fatal COVID-19.

Rosario López-Rodríguez1,2,3, Marta Del Pozo-Valero1,2, Marta Corton1,2, Pablo Minguez1,2,4, Javier Ruiz-Hornillos5,6,7, María Elena Pérez-Tomás8, María Barreda-Sánchez8,9, Esther Mancebo10,11, Cristina Villaverde1,2, Gonzalo Núñez-Moreno1,4, Raquel Romero1,2, Estela Paz-Artal10,11,12,13, Encarna Guillén-Navarro2,8,14, Berta Almoguera1,2, Carmen Ayuso15,16.   

Abstract

Rare variants affecting host defense against pathogens could be involved in COVID-19 severity and may help explain fatal outcomes in young and middle-aged patients. Our aim was to report the presence of rare genetic variants in certain genes, by using whole exome sequencing, in a selected group of COVID-19 patients under 65 years who required intubation or resulting in death (n = 44). To this end, different etiopathogenic mechanisms were explored using gene prioritization-based analysis in which genes involved in immune response, immunodeficiencies or blood coagulation were studied. We detected 44 different variants of interest, in 29 different patients (66%). Some of these variants were previously described as pathogenic and were located in genes mainly involved in immune response. A network analysis, including the 42 genes with candidate variants, showed three main components, consisting of 25 highly interconnected genes related to immune response and two additional networks composed by genes enriched in carbohydrate metabolism and in DNA metabolism and repair processes. In conclusion, we have detected candidate variants that may potentially influence COVID-19 outcome in our cohort of patients. Further studies are needed to confirm the ultimate role of the genetic variants described in the present study on COVID-19 severity.
© 2022. The Author(s).

Entities:  

Mesh:

Year:  2022        PMID: 35725860      PMCID: PMC9208539          DOI: 10.1038/s41598-022-14035-x

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.996


Introduction

Since the first outbreak of the Coronavirus disease in the 2019 (COVID-19) pandemic, over 243 million cases of COVID-19 and more than 6.1 million deaths have been confirmed (https://coronavirus.jhu.edu, last accessed in March 2022). SARS-CoV-2 infection displays high inter-individual clinical variability, ranging from asymptomatic to lethal outcomes[1]. The most important life-threatening factor is age, increasing the risk for critical illness for individuals over 65 years of age[2]. Other known risk factors are being male and having comorbidities such as hypertension, diabetes and cardiovascular, renal or respiratory diseases[3,4]. However, these risk factors do not explain completely why apparently healthy young and middle-aged adults present severe COVID-19 with acute respiratory distress syndrome (ARDS) that cause a fulminant disease in some cases. Genetic background has been proposed as a candidate factor to explain some of the inter-individual variability observed in COVID-19 severity. Recently, different genome-wide association studies (GWAS) have identified several loci associated with an increased susceptibility to SARS-CoV-2 infection and severe disease[5-7].These loci include genes involved in type I interferon (IFN) signaling pathway (IFNAR2, DPP9 or OAS1-3), autoimmunity (TYK2) or in lung function (FOXP4).However, top associated variants displayed low odd ratios to be considered predictive biomarkers of COVID-19 severity[5-7]. In addition to common variants detected in GWAS, rare variants affecting host defense against pathogens could be involved in COVID-19 severity and may help explain fatal outcomes in young and middle-aged patients. In fact, inborn errors of immunity producing increased infection susceptibility and/or infection recurrence (such as primary immunodeficiencies or N-glycosylation defects), may aggravate the course of SARS-CoV-2 infection[8-10]. Enrichment in loss-of-function (LoF) variants in 13 genes belonging to type I IFN signaling pathway has been reported in patients with life-threatening COVID-19 pneumonia[11], although this finding has not been replicated[12]. Moreover, LoF genetic variants in Toll-like receptor 7 (TLR7), which is critical in the recognition of single-stranded RNA viruses and fostering the antiviral responses, have been associated to more severe outcomes in young males without comorbidities[13-16]. Apart from ARDS, thrombosis and coagulopathy emerge as critical complications of SARS-CoV-2 infection[17-19]. In fact, elevation of the thrombotic related D-dimer is one of the most frequent laboratory findings, particularly in critically ill patients[20]. In addition, different studies have shown the influence of ABO blood groups on the risk of SARS-CoV-2 infection and/or the severity of the disease[21]. In this regard, the demonstrated impact of ABO phenotypes on vascular homeostasis and function[22], has been suggested as an explanation of the reported associations between COVID-19 severity and ABO blood groups. Thus, pathogenic genetic variants altering protein functionality of coagulation system may also impact on COVID-19 resolution. Based on this evidence, our objective was to identify rare genetic variants related to COVID-19 severity. To this end, we selected a group of patients under 65 years who experienced a very severe outcome defined as requiring intubation or resulting in death and were subjected to whole exome sequencing. Different etiopathogenic mechanisms were explored using gene prioritization-based analysis in which more than 800 genes involved in immune response, immunodeficiencies or blood coagulation were studied.

Results

Clinical and demographic characteristics

A total of 44 unrelated patients with very severe COVID-19) that required intubation, non-invasive ventilatory support or did not survive to SARS-CoV-2 infection were included in the present study. A total of 16 patients (36%) did not survive, most of them received invasive ventilatory support (intubation, 44%) and 31% received exclusively non-invasive ventilatory support (Table 1). The median age was 46 years, and patients were mainly males (70%, Table 1). Most patients were of European ancestry (86%), except for 7 Admixed Americans and one patient from another ethnicity. Main clinical characteristics of this cohort such as pre-existing diseases and COVID-19 management are summarized in Table 1. The most frequent comorbidities were obesity (defined as body mass index > 33 kg/m2; 39%), hypertension (18%), respiratory disease (16%) and oncohematological antecedents (14%). Detailed clinical data of each patient are provided in Supplementary Table 1.
Table 1

Clinical and demographic characteristics.

All(n = 44)Survivors(n = 28)Deceased(n = 16)
Males (n, %)31, 70%23, 82%8, 50%
Age (median, range)46, 24–6242, 24–5056, 45–62
Europeans (n, %)36, 82%20, 71%16, 100%
Pre-existing conditions (n, %)
Hypertension8, 18%5, 18%3, 19%
Hypercholesterolemia3, 7%1, 4%2, 13%
Type 2 diabetes5, 11%3, 11%2, 13%
Obesity17, 39%9, 32%8, 50%
Cardiovascular disease4, 9%2, 7%2, 13%
Neurological disease2, 4.5%1, 4%1, 6%
Respiratory disease7, 16%3, 11%4, 25%
Digestive/liver disease4, 9%3, 11%1, 6%
Oncohematological antecedents6, 14%1, 4%5, 31%
Kidney disease2, 4.5%02, 13%
Management of COVID-19 disease
ICU (n, %)39, 89%28, 100%11, 69%
Invasive ventilation (intubation, n, %)35, 80%28, 100%7, 44%
Non-invasive ventilation (CPAP/BiPAP/high flow nasal cannula)a (n, %)5, 11%%0, 0%5, 31%%
Without ventilatory support4, 9%0, 0%4, 25%
Hospitalization (mean ± SD of days)33.2 ± 24.838.8 ± 26.324.0 ± 19.3

aPatients that received exclusively non-invasive ventilatory support.

Clinical and demographic characteristics. aPatients that received exclusively non-invasive ventilatory support.

Identification of candidate variants

We have detected 44 different variants of interest located in 42 genes. These variants were identified in 29 patients, 11 of them carrying 2 or more candidate variants in different genes (Fig. 1A). Available data about pathogenic predictors, MAF and pathogenic annotation from public databases (ClinVar and HGMD) are summarized in Supplementary Table 2. A total of 12 (26%) variants were previously described as likely pathogenic or displayed strong evidence for being considered as likely pathogenic following ACMG criteria (Table 2).
Figure 1

Identification of variants of interest in very severe COVID-19 patients: frequency and functional pathways involved. (A) Number and percentage of patients with none, one, two or more variants of interest, (B) Number and percentage of variants detected in each of the functional pathways.

Table 2

List of filtered candidate variants identified in our cohort of COVID-19 patients.

PatientAgeSexOGeneIMGTHGVScHGVSpVarsome ClassaLP/P
FJD_000446MSPLAUADHTNM_002658.3:c.970 + 1G>Ap.?LPNo
FJD_001950MSNPHS1ARHTNM_004646.3:c.767G>Ap.Arg256GlnLPNo
FJD_004449MSG6PDX-LHMNM_000402.4:c.934G>Cp.Asp312HisLPYesb
NFATC1ADHTNM_006162.5: c.230C>Tp.Pro77LeuVUSNo
TLR8X-LHMNM_016610.3:c.2263A>Tp.Ser755CysVUSNo
FJD_008124MSCARD11ADHTNM_032415.7:c.572A>Gp.Asn191SerVUSNo
FJD_032557MSPDGFRAADHTNM_006206.6:c.2464C>Tp.Arg847CysVUSNo
SLC9A3ARHTNM_004174.2:c.1144C>Tp.Arg382TrpLPNo
FJD_037458MDFANCD2ARHTNM_033084.3:c.2444G>Ap.Arg815GlnLPYesb
FCN3ARHTNM_003665.2:c.232 + 1G>Ap.?LPNob
FKRPARHTNM_024301.4:c.265C>Tp.Pro89SerLPNo
MYO5BARHTNM_001080467.2:c.3843G>Cp.Ala1281AlaLPNo
FJD_041260FDPOLR3CADHTNM_001303456.1:c.706G>Cp.Asp236HisVUSNo
FJD_062659FDFN1ADHTNM_212482.2:c.4654C>Gp.Pro1552AlaVUSNo
MOGSARHTNM_006302.2:c.882delp.Glu295Asnfs*10LPNob
POLD1ARHTNM_002691.4:c.378_394delp.Ala127*LPNo
FJD_071444MSIL17RCARHTNM_153461.3:c.1132C>Tp.Arg378*LPNob
FJD_072862FDSRD5A3ARHTNM_024592.4:c.57G>Ap.Trp19*LPYesb
STAT3ADHTNM_139276.3:c.523A>Cp.Asn270HisVUSNo
FJD_141738MSPLCG2ADHTNM_002661.4:c.3022C>Tp.Gln1008*LPNo
FJD_145649FDG6PC3ARHTNM_138387.3:c.889delp.Leu297Trpfs*27LPNob
MPIARHTNM_002435.1:c.1123G>Cp.Gly375ArgLPNo
PIK3CDADHTNM_005026.5:c.2869C>Tp.Arg981TrpVUSNo
FJD_145856MDNFATC1ADHTNM_006162.5:c.2311C>Ap.Leu771IleVUSNo
FJD_146251FDIL17RCARHTNM_153461.3:c.1132C>Tp.Arg378 *LPNob
FJD_153246MSTMEM199ARHTNM_152464.1:c.92G>Cp.Arg31ProLPYesb
FJD_164559MDRECQL4ARHTNM_004260.3:c.2756G>A:p.Ala919ThrLPNo
FJD_214953MDMSH6ARHTNM_000179.2:c.3226C>Tp.Arg1076CysLPYesb
H12O_10430MSTLR7X-LHMNM_016562.3:c.2050A>Tp.Lys684*LPYesb
H12O_10738FSEP300ADHTNM_001429.3:c.7180G>Cp.Gly2394ArgVUSNo
SLC37A4ARHTNM_001164277.1:c.1016G>Tp.Gly339CysLPYesb
H12O_11027MSPOLR3CADHTNM_001303456.1:c.1597A>Tp.Ile533PheVUSNo
H12O_11145FDNFKB1ADHTNM_003998.4:c.233A>Gp.Asn86SerVUSNo
H12O_11242MSCFIARHTNM_000204.5:c.1643A>Gp.Glu556GlyLPNo
LDLRADHTNM_000527.4:c.1816G>Ap.Ala606ThrLPNo
H12O_11345MSRNASELADHTNM_021133.3:c.1450C>Tp.Gln484*VUSNo
H12O_23653MDFASLGADHTNM_000639.1:c.596G>Tp.Gly199ValVUSNo
HAVCR2ARHTNM_032782.4:c.291A>Gp.Ile97MetLPNo
HVAM_08348MSSLX4ARHTNM_032444.2:c.2340_2343delp.Glu781Serfs*38LPNob
TLR4ADHTNM_138554.4:c.1976T>Cp.Met659ThrVUSNo
HVAM_13741MSPTPN11ADHTNM_002834.3:c.369G>Tp.Glu123AspVUSNo
HVAM_14249MSSTING1ADHTNM_198282.2:c.65C>Ap.Ala22AspVUSNo
HVAM_21246MSCHD7ADHTNM_017780.3:c.8257A>Gp.Met2753ValVUSNo
TEKADHTNM_000459.3:c.2357A>Gp.Gln786ArgVUSNo
HVAM_25244MSITGB3ADHTNM_000212.2:c.1658_1660delp.Ser553delVUSNo

M male, F female, O Outcome, D deceased, S survivor, IM inheritance mode, AD autosomal dominant, AR autosomal recessive, X-L linked to X chromosome, GT genotype, HT heterozygous, HM Hemizygous, Chr chromosome, HGVSc HGVS coding sequence name, HGVSp HGVS protein sequence name, LP likely pathogenic, P pathogenic, VUS variant of unknown significance. aVarsome class followingACMG criteria; bvariants previously described as likely pathogenic or displaying strong evidence for being considered as likely pathogenic.

Identification of variants of interest in very severe COVID-19 patients: frequency and functional pathways involved. (A) Number and percentage of patients with none, one, two or more variants of interest, (B) Number and percentage of variants detected in each of the functional pathways. List of filtered candidate variants identified in our cohort of COVID-19 patients. M male, F female, O Outcome, D deceased, S survivor, IM inheritance mode, AD autosomal dominant, AR autosomal recessive, X-L linked to X chromosome, GT genotype, HT heterozygous, HM Hemizygous, Chr chromosome, HGVSc HGVS coding sequence name, HGVSp HGVS protein sequence name, LP likely pathogenic, P pathogenic, VUS variant of unknown significance. aVarsome class followingACMG criteria; bvariants previously described as likely pathogenic or displaying strong evidence for being considered as likely pathogenic. Thirty eight percent of the identified variants (n = 17, 15 different variants and 1 variant found in 2 cases) was located in 15 genes involved in immune response (Fig. 1B). Eight of these candidate variants were found in genes with an autosomal dominant inheritance mode (such as PLCG2, RNASEL, TLR4 or STAT3), six variants were detected in genes with an autosomal recessive inheritance mode (CFI, FCN3, HAVCR2 and IL17RC) and three variants were identified in X-linked genes (TLR7, TLR8 and G6PD). The detected LoF variant in TLR7 was found in a 30-year-old male who was included in a case-series recently reported[16]. The p.Arg378* variant on IL17RC, which was not described in public databases (GnomAD[23], ExAc[24] or 1000 genome project[25]), was detected in two unrelated patients from our cohort (MAF of 2.3%). The patient carrying the variant p.Ser755Cys located in the TLR8 gene was also hemizygous for a pathogenic variant in G6PD (both XL genes) and did not show pre-existing comorbidities or risk factors at the time of the SARS-CoV-2 infection (Supplementary Table 1). Besides, one of the deceased patients, clinically diagnosed with a primary immunodeficiency (FJD_0728); carried a variant on the STAT3 gene (p.Asn175His), in addition to one pathogenic variant in the recessive SRD5A3 gene (Table 2). A total of 6 candidate variants (14%) were found in genes related to congenital disorders of glycosylation (G6PC3, FKRP, MPI, MOGS, SDR5A3 and TMEM199) and another four variants (9%) were detected in damaged DNA binding genes (EP300, MSH6, POLD1 and RECQL4). These 10 variants, identified in recessive genes, were carried in heterozygosis by 8 patients (Table 2). Additionally, two variants were found in genes related to coagulation (PLAU) and cardiovascular risk (LDLR) in two different cases (Table 2). These two cases required intubation and developed severe complications during SARS-COV-2 infection (Supplementary Table 2). The patient carrying the LDLR variant was also heterozygous for a variant in CFI.

Network analysis of genes with candidate variants

A network analysis, including genes with candidate variants (Supplementary Table 3), was performed to detect functional interactions among them. The network (Fig. 2) shows three main components. First component consists of 25 highly interconnected genes, 15 involved in immune response and enriched in cell signaling compared to the rest of the network (Fig. 2). Two additional network components were identified, one composed by 6 genes and enriched in carbohydrate metabolism and a third component with 5 genes enriched in DNA metabolism and repair processes, both compared to the rest of the network.
Figure 2

Network analysis of the genes with candidate variants.

Network analysis of the genes with candidate variants.

Discussion

Understanding inter-individual clinical variability in COVID-19 has important implications for the identification of high-risk patients, clinical decision-making and the development of individualized treatments. In the present study, a group of young and middle-aged patients with very severe COVID-19 were selected for a genetic study, in which 44 different variants of interest have been detected. As expected, most of the detected variants (40%) were encoded by genes directly related to immune response, as the gene panel used in this exploratory study was enriched in immune-related pathways and included additional immune genes than those reported in previous studies[11,13]. Innate immunity is crucial for early antiviral response; thus, LoF variants in related genes could affect the onset of the immune response and also alter the appropriate clearance of the infection by adaptative response[9,26,27]. In this sense, pathogenic rare variants in 13 candidate genes involved in TLR3- and IRF7-dependent type I IFN pathways showed a higher risk to severe SARS-CoV-2 infection, which could explain up to 3.5% of severe cases[11]. However, these initial findings were not replicated in subsequent studies[12,28]. In our study, we did not find any LoF variant in those 13 genes, but we have detected seven likely pathogenic variants in other genes directly related to immune response (IFN pathways, mainly).We have confirmed the presence of a TLR7 variant in a male also participating in a recently published study[16], suffering from very severe COVID-19 and without relevant comorbidities or risk factors at the time of the infection. Therefore, our results support the genetic screening of TLR7 variants in young men in absence of pre-existing conditions as a preventing biomarker that may help clinical management of this subset of patients. Even more, we have found two variants in other Toll-like receptor (TLRs) genes, TLR4 (Chr9) and TLR8 (ChrX), in two males under 50 years of age requiring intubation. Of note, TLRs are crucial in innate response by recognizing pathogen-associated molecular patterns from different microorganisms[29], being TLR3, 7 and 8 key sensors of RNA viruses[30]. Furthermore, nearly 40% of the variants detected in the present study were located in immune response genes, some of them with a high probability of intolerance to heterozygous LoF variation (pLI ≥ 0.9)[31];thus, a single LoF variant may lead to a severe clinical phenotype due to haploinsufficiency in genes such as CARD11, STAT3 or NFKB1(pLI = 1, each). In addition to allelic dosage, subjects carrying the same genotypes can display variable expressivity and additional common or rare genetic variants may modify the penetrance of monogenic variants (polygenic risk)[32]. In this sense, 30% of our patients carried more than one variant of interest. Even more, other well-known COVID-19 risk factors, such as age, comorbidities, or environmental factors may affect monogenic variants penetrance to the final observed phenotype[33]. Additionally, we found 5 patients (11%) carrying heterozygous variants in genes related to glycosylation defects. Congenital defects of glycosylation (CDG) is a group of rare diseases caused mainly by recessive genes[34]. Clinical manifestations of CDG include neurological, cardiovascular, and hematologic involvement and recurrent infections, among others[35]. An increased risk of thrombotic events and bleeding complications have been related to abnormal glycosylation of coagulation factors[36] and thrombosis is one of the most important complications of COVID-19. Therefore, patients carrying a defective copy may experience a more severe course of SARS-CoV-2 infection due to the importance of glycosylation in immune response[35]. In contrast, ACE2 is a protein extensively glycosylated and previous studies showed that cellular SARS-CoV-2 entry is reduced by blocking the N-glycan and O-glycan formation[37]. Thus, it is difficult to conclude about the effect of these defective variants on the glycosylation status of the monoallelic carriers and the impact of those variants on SARS-CoV-2 clearance. Moreover, four variants were detected in damaged DNA binding genes and a cluster including three of these genes (POLD1, MSH6 and REQL4), in addition to SLX4 and FANCD2, was detected in the network analysis. There is evidence that senescence is in part caused by accumulated DNA damage[38] and severity of some pathologies, as COVID-19, has been related to cell senescence, particularly in the elderly[39]. In addition, premature cellular senescence could be induced by viral infections[40]; therefore, COVID-19 patients with pathogenic variants in damaged DNA binding genes may be more likely to develop cellular senescence and severe COVID-19. Interestingly, one of the candidate variants identified was in the canonical splice site of a key player of the coagulant pathway, PLAU, that has been previously related to bleeding disorders, tandem duplication of this gene is related to Quebec platelet disorder (MIM #601709) in a dominant model. Therefore, we could hypothesize that this variant may impair thrombosis resolution, as demonstrated previously in a knocked-out model[41] and inferred by the critical role of PLAU in the natural thrombus resolution by its fibrinolytic function[42]. Anticoagulant and fibrinolytic gene expression has been found dramatically down-regulated in the lung of COVID-19 patients compared to controls[43]. Thus, COVID-19 patients with loss of functions in the PLAU gene may be more likely to develop a thrombotic event. Moreover, we found a variant in the LDRL gene, classified as a variant of uncertain significance in relation to familial hypercholesterolemia[44]. Patients with impaired cholesterol metabolism could display a higher risk of COVID-19 severe outcomes due to the intimate relationship of hypercholesterolemia, metabolic syndrome, and heart disease[45]. Therefore, variants predisposing to hypercholesteremia such as LDLR pathogenic variants may confer a higher risk of suffering severe COVID-19 disease, even in the absence of other relevant comorbidities[46]. Our study has several limitations. First, we have a limited sample size. Despite recruiting more than 3500 in the Stop_Coronavirus cohort, the stringent cut-off for age (< 65 years) and outcome (only very severe COVID-19) led us to select those patients displaying an extreme phenotype in our cohort. Second, we have analyzed only the coding region; thus, we could have missed a second pathogenic allele (deep intronic regions or CNVs) in monoallelic patients that could help us explain the COVID-19 outcome. Besides, together with the effect of the detected genetic variants, it is necessary to consider the possible additional effect of pre-existing conditions related to COVID-19 severity in the patients on the outcome. In conclusion, our descriptive study in very severe COVID-19 patients has reported the presence of rare variants in certain biological pathways such as immune response. Moreover, two additional signaling pathways have been detected including genes involved in carbohydrate metabolisms and DNA repair. Further studies are needed to confirm the ultimate role of the variants described in the present study on COVID-19 severity.

Patients and methods

Subjects and clinical data

A case series study was performed by selecting a subgroup of patients (n = 44) from the Spanish STOP_Coronavirus[47] cohort, which comprises more than 3,500 COVID-19 patients, from 4 hospitals (three from Madrid and one from Murcia). Extreme phenotypes were selected from our STOP_Coronavirus cohort using a similar design to previous case series studies[13]. Inclusion criteria were young and middle-aged patients (age under 65 years) with a confirmatory test of SARS-CoV-2 infection that presented ARDS (survivors or deceased). More information about the Spanish STOP_Coronavirus cohort is provided in Supplementary methodology. Cases were retrospectively and prospectively enrolled from March to May 2020 and followed-up until December 2020. SARS-CoV-2 infection was confirmed by a positive PCR (n = 41) and/or serological test (n = 3, IgG and IgM both positives). COVID-19 patients were recruited from four hospitals in Spain: Hospital Universitario Fundación Jiménez Díaz (HUFJD), Hospital Universitario Infanta Elena (HUIE) and Hospital Universitario 12 de Octubre (H12O) in Madrid, and Hospital Clínico Universitario Virgen de la Arrixaca in Murcia (HVAM). Clinical data obtained in HUFJD and HUIE were extracted from the patients’ electronic medical records using batch-based complex queries and then reviewed and refined manually by two clinicians and two clinician researchers. At H12O and HVAM, clinical data were manually collected by researchers from electronic medical records. Clinical information included primary demographic data, comorbidities, COVID-19 symptoms, laboratory findings, treatments, related complications from COVID-19, ICU admissions, and outcomes (Supplementary Table 1). Descriptive statistics (mean and SD) were calculated for main clinical and demographic data (Table 1). This study was approved by the research ethics committees of HUFJD, HVAM and H12O. Wherever was possible, patients provided written or verbal informed consent to participate in this study. Due to the health emergency, the research ethics committees of each center waived the requirement for informed consent for the STOP_Coronavirus cohort. All samples were de-identified (pseudonymized) and clinical data were managed in accordance with national legislation and institutional requirements.

Ancestry inference

Principal component analysis (PCA) based on the variance-standardized relationship matrix was used to infer the ancestry of each patient and classify them as one of the selected ancestry groups (European, African, admixed American, and East Asian) using a set of 1000 genome samples (phase 3) as a reference population. For PCA, we used previously collected genetic data from our cohort (unpublished) obtained with the Applied Biosystems™ Axiom™ Spain Biobank Array (COL32017 1217, Thermo Fisher Scientific Inc.), which contains 758,740 variants. PCA was performed using Plink software version 1.9[48].

Kinship test

To assess kinship, we used previously collected genetic data from our cohort[2] obtained with the Applied Biosystems™ Axiom™ Spain Biobank Array (COL32017 1217, Thermo Fisher Scientific Inc.), which contains 758,740 variants. Autosomal SNPs (MAF > 5%) were pruned with PLINK[3] using a window of 1000 markers, a step size of 80 and a r2 of 0.1. A subset of 131,937 independent SNPs was used to evaluate kinship (IBD estimation) in PLINK[3]. Only one individual from each pair of individuals with PI_HAT > 0.25 (second-degree relatives) that showed a Z0, Z1, and Z2 coherent pattern (according to theoretically expected values for each relatedness level), was removed.

Whole exome sequencing analysis

DNA was isolated from EDTA-collected peripheral blood samples using an automated DNA extractor (BioRobot EZ1, QIAGEN GmbH). DNA samples were subjected to library construction using SureSelect Human All Exon V6 (Agilent Technologies, Santa Clara, CA, USA) and sequenced on a Novaseq 6000 instrument (Illumina, San Diego, CA, USA), following the manufacturer’s protocol. Paired-end reads of 2 × 150 bp were generated per sample to provide an on-target coverage of minimum 100 ×, with a total coverage of 12 GB/sample. For WES analysis we applied an in-house maintained bioinformatics pipeline using bwa v0.7.17[49] for mapping to the GRCh37/hg19 human genome assembly, gatk v4.2.0 HaplotypeCaller[50] for single nucleotide variants calling and hard filtering (SNP_filter: QD (Quality of Depth) < 2.0, MQ (Mapping Quality) < 40.0, MQRankSum <  − 12.5, and ReadPosRankSum <  − 8.0, and INDEL_filter: QD < 2.0, and ReadPosRankSum <  − 20.0). Annotations were performed using VEP r103[51]. More details can be seen at the github repository https://github.com/TBLabFJD/VariantCallingFJD and application of the same tool in Romero et al.[52].

Single variant analysis

To search for candidate variants involved in the pathophysiology of severe COVID-19, we used a candidate virtual gene panel summarized in Supplementary Table 4. Candidate gene panel included 330 genes mainly involved in type I IFN immunity, primary immunodeficiencies, and genes related to coagulation (panel 1). Moreover, 234 additional genes were selected by using the COVID-19 severity and susceptibility panel published in PanelApp[53], by selecting only green-labelled genes (panel 2). Besides, other functionally related genes were included by using our GLOWgenes prioritization method (www.glowgenes.org) using the 564 genes from panels 1 and 2 as a seed set. Top 300 prioritized genes were selected and included as panel 3 (Supplementary methodology). Thus, a total of 864 genes (564 candidates and 300 selected by GLOWgenes) were included in the final panel. The PriorR v.2.1 package (https://github.com/TBLabFJD/PriorR) was used for variant filtering and prioritization. Variants were filtered according to a minor allele frequency (MAF) < 0.01 in population databases [the 1000 genomes project[25], the Exome Aggregation Consortium (ExAc)[24], and the Genome Aggregation Database[23] (GnomAD)]. Synonymous, intronic and non-coding variants were excluded from the analysis. ClinVar (ncbi.nlm.nih.gov/clinvar/) and the Human Gene Mutation Database[54] (HGMD) were used to identify variants previously reported as pathogenic and those described as likely benign/benign variants were discarded. The impact of missense variants was assessed using several predictor tools (DANN[55], FATHMM[56], GERP++[57], LRT[58], M-CAP[59], CADD[60], MutationTaster[61], MutationAssessor[62], PhyloP[63], Polyphen2_HDIV[64], Polyphen2_HVAR[64], PROVEAN[65], RadiaISVM[66], SIFT[67], SiPhy[68], among others). Canonical and noncanonical splicing variants were assessed using 5 predictors (MaxEntScan[69], Human Splicing Finder[70], Splice Site Finder-like[71], NNSPLICE[72], and GeneSplicer[73]) using the Alamut software (Interactive Biosoftware, Rouen, France). The potential pathogenicity of prioritized variants was assessed using the Varsome tool[74] following ACGM criteria[75]. Genes carrying at least one of the candidate variants (Supplementary Table 3) were submitted to the STRING database v11.5[76] and interactions with a STRING combined score ≥ 400 were downloaded as a file (.tsv) in short tabular text output format from the Exports tab. Cytoscape[77] version 3.4.0 was used for visualization. Clusters were defined as subgraphs with any two nodes (genes) connected to each other by edges, and not connected to other nodes in the graph, this normally called network components and the most extreme version of a cluster. We applied BINGO[78] Cytoscape app for the enrichment analysis extracting over-represented Gene Ontology (GO) biological processes terms comparing their annotation in every cluster to the rest of the network including genes not grouped in clusters. In the network representation, the STRING combined score, which represents the interaction confidence, is used to characterize edges between genes. Functions enriched for every cluster were selected as having an FDR < 0.05. Supplementary Legends. Supplementary Table 1. Supplementary Table 2. Supplementary Table 3. Supplementary Table 4. Supplementary Information.
  75 in total

Review 1.  Age to survive: DNA damage and aging.

Authors:  Björn Schumacher; George A Garinis; Jan H J Hoeijmakers
Journal:  Trends Genet       Date:  2008-01-14       Impact factor: 11.639

2.  Identification of deleterious mutations within three human genomes.

Authors:  Sung Chun; Justin C Fay
Journal:  Genome Res       Date:  2009-07-14       Impact factor: 9.043

3.  MutationTaster2: mutation prediction for the deep-sequencing age.

Authors:  Jana Marie Schwarz; David N Cooper; Markus Schuelke; Dominik Seelow
Journal:  Nat Methods       Date:  2014-04       Impact factor: 28.547

4.  Genetic diagnosis of familial hypercholesterolemia using a DNA-array based platform.

Authors:  Rodrigo Alonso; Joep C Defesche; Diego Tejedor; Sergio Castillo; Marianne Stef; Nelva Mata; Pilar Gomez-Enterria; Ceferino Martinez-Faedo; Lluis Forga; Pedro Mata
Journal:  Clin Biochem       Date:  2009-02-06       Impact factor: 3.281

5.  Risk Factors Associated With Mortality Among Patients With COVID-19 in Intensive Care Units in Lombardy, Italy.

Authors:  Giacomo Grasselli; Massimiliano Greco; Alberto Zanella; Giovanni Albano; Massimo Antonelli; Giacomo Bellani; Ezio Bonanomi; Luca Cabrini; Eleonora Carlesso; Gianpaolo Castelli; Sergio Cattaneo; Danilo Cereda; Sergio Colombo; Antonio Coluccello; Giuseppe Crescini; Andrea Forastieri Molinari; Giuseppe Foti; Roberto Fumagalli; Giorgio Antonio Iotti; Thomas Langer; Nicola Latronico; Ferdinando Luca Lorini; Francesco Mojoli; Giuseppe Natalini; Carla Maria Pessina; Vito Marco Ranieri; Roberto Rech; Luigia Scudeller; Antonio Rosano; Enrico Storti; B Taylor Thompson; Marcello Tirani; Pier Giorgio Villani; Antonio Pesenti; Maurizio Cecconi
Journal:  JAMA Intern Med       Date:  2020-10-01       Impact factor: 21.873

Review 6.  COVID-19 and pneumonia: a role for the uPA/uPAR system.

Authors:  Daniele D'Alonzo; Maria De Fenza; Vincenzo Pavone
Journal:  Drug Discov Today       Date:  2020-06-18       Impact factor: 7.851

7.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

8.  Pan-ancestry exome-wide association analyses of COVID-19 outcomes in 586,157 individuals.

Authors:  Jack A Kosmicki; Julie E Horowitz; Nilanjana Banerjee; Rouel Lanche; Anthony Marcketta; Evan Maxwell; Xiaodong Bai; Dylan Sun; Joshua D Backman; Deepika Sharma; Fabricio S P Kury; Hyun M Kang; Colm O'Dushlaine; Ashish Yadav; Adam J Mansfield; Alexander H Li; Kyoko Watanabe; Lauren Gurski; Shane E McCarthy; Adam E Locke; Shareef Khalid; Sean O'Keeffe; Joelle Mbatchou; Olympe Chazara; Yunfeng Huang; Erika Kvikstad; Amanda O'Neill; Paul Nioi; Meg M Parker; Slavé Petrovski; Heiko Runz; Joseph D Szustakowski; Quanli Wang; Emily Wong; Aldo Cordova-Palomera; Erin N Smith; Sandor Szalma; Xiuwen Zheng; Sahar Esmaeeli; Justin W Davis; Yi-Pin Lai; Xing Chen; Anne E Justice; Joseph B Leader; Tooraj Mirshahi; David J Carey; Anurag Verma; Giorgio Sirugo; Marylyn D Ritchie; Daniel J Rader; Gundula Povysil; David B Goldstein; Krzysztof Kiryluk; Erola Pairo-Castineira; Konrad Rawlik; Dorota Pasko; Susan Walker; Alison Meynert; Athanasios Kousathanas; Loukas Moutsianas; Albert Tenesa; Mark Caulfield; Richard Scott; James F Wilson; J Kenneth Baillie; Guillaume Butler-Laporte; Tomoko Nakanishi; Mark Lathrop; J Brent Richards; Marcus Jones; Suganthi Balasubramanian; William Salerno; Alan R Shuldiner; Jonathan Marchini; John D Overton; Lukas Habegger; Michael N Cantor; Jeffrey G Reid; Aris Baras; Goncalo R Abecasis; Manuel A R Ferreira
Journal:  Am J Hum Genet       Date:  2021-06-03       Impact factor: 11.043

9.  Determinants of penetrance and variable expressivity in monogenic metabolic conditions across 77,184 exomes.

Authors:  Julia K Goodrich; Moriel Singer-Berk; Rachel Son; Abigail Sveden; Jordan Wood; Eleina England; Joanne B Cole; Ben Weisburd; Nick Watts; Lizz Caulkins; Peter Dornbos; Ryan Koesterer; Zachary Zappala; Haichen Zhang; Kristin A Maloney; Andy Dahl; Carlos A Aguilar-Salinas; Gil Atzmon; Francisco Barajas-Olmos; Nir Barzilai; John Blangero; Eric Boerwinkle; Lori L Bonnycastle; Erwin Bottinger; Donald W Bowden; Federico Centeno-Cruz; John C Chambers; Nathalie Chami; Edmund Chan; Juliana Chan; Ching-Yu Cheng; Yoon Shin Cho; Cecilia Contreras-Cubas; Emilio Córdova; Adolfo Correa; Ralph A DeFronzo; Ravindranath Duggirala; Josée Dupuis; Ma Eugenia Garay-Sevilla; Humberto García-Ortiz; Christian Gieger; Benjamin Glaser; Clicerio González-Villalpando; Ma Elena Gonzalez; Niels Grarup; Leif Groop; Myron Gross; Christopher Haiman; Sohee Han; Craig L Hanis; Torben Hansen; Nancy L Heard-Costa; Brian E Henderson; Juan Manuel Malacara Hernandez; Mi Yeong Hwang; Sergio Islas-Andrade; Marit E Jørgensen; Hyun Min Kang; Bong-Jo Kim; Young Jin Kim; Heikki A Koistinen; Jaspal Singh Kooner; Johanna Kuusisto; Soo-Heon Kwak; Markku Laakso; Leslie Lange; Jong-Young Lee; Juyoung Lee; Donna M Lehman; Allan Linneberg; Jianjun Liu; Ruth J F Loos; Valeriya Lyssenko; Ronald C W Ma; Angélica Martínez-Hernández; James B Meigs; Thomas Meitinger; Elvia Mendoza-Caamal; Karen L Mohlke; Andrew D Morris; Alanna C Morrison; Maggie C Y Ng; Peter M Nilsson; Christopher J O'Donnell; Lorena Orozco; Colin N A Palmer; Kyong Soo Park; Wendy S Post; Oluf Pedersen; Michael Preuss; Bruce M Psaty; Alexander P Reiner; Cristina Revilla-Monsalve; Stephen S Rich; Jerome I Rotter; Danish Saleheen; Claudia Schurmann; Xueling Sim; Rob Sladek; Kerrin S Small; Wing Yee So; Timothy D Spector; Konstantin Strauch; Tim M Strom; E Shyong Tai; Claudia H T Tam; Yik Ying Teo; Farook Thameem; Brian Tomlinson; Russell P Tracy; Tiinamaija Tuomi; Jaakko Tuomilehto; Teresa Tusié-Luna; Rob M van Dam; Ramachandran S Vasan; James G Wilson; Daniel R Witte; Tien-Yin Wong; Noël P Burtt; Noah Zaitlen; Mark I McCarthy; Michael Boehnke; Toni I Pollin; Jason Flannick; Josep M Mercader; Anne O'Donnell-Luria; Samantha Baxter; Jose C Florez; Daniel G MacArthur; Miriam S Udler
Journal:  Nat Commun       Date:  2021-06-09       Impact factor: 17.694

10.  Persistence of viral RNA, pneumocyte syncytia and thrombosis are hallmarks of advanced COVID-19 pathology.

Authors:  Rossana Bussani; Edoardo Schneider; Lorena Zentilin; Chiara Collesi; Hashim Ali; Luca Braga; Maria Concetta Volpe; Andrea Colliva; Fabrizio Zanconati; Giorgio Berlot; Furio Silvestri; Serena Zacchigna; Mauro Giacca
Journal:  EBioMedicine       Date:  2020-11-03       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.