| Literature DB >> 33318519 |
Ana I Hernández Cordero1, Xuan Li2, Chen Xi Yang2, Stephen Milne2,3,4, Yohan Bossé5, Philippe Joubert5, Wim Timens6, Maarten van den Berge7, David Nickle8, Ke Hao9, Don D Sin2,3.
Abstract
Cell entry of SARS-CoV-2, the novel coronavirus causing COVID-19, is facilitated by host cell angiotensin-converting enzyme 2 (ACE2) and transmembrane serine protease 2 (TMPRSS2). We aimed to identify and characterize genes that are co-expressed with ACE2 and TMPRSS2, and to further explore their biological functions and potential as druggable targets. Using the gene expression profiles of 1,038 lung tissue samples, we performed a weighted gene correlation network analysis (WGCNA) to identify modules of co-expressed genes. We explored the biology of co-expressed genes using bioinformatics databases, and identified known drug-gene interactions. ACE2 was in a module of 681 co-expressed genes; 10 genes with moderate-high correlation with ACE2 (r > 0.3, FDR < 0.05) had known interactions with existing drug compounds. TMPRSS2 was in a module of 1,086 co-expressed genes; 31 of these genes were enriched in the gene ontology biologic process 'receptor-mediated endocytosis', and 52 TMPRSS2-correlated genes had known interactions with drug compounds. Dozens of genes are co-expressed with ACE2 and TMPRSS2, many of which have plausible links to COVID-19 pathophysiology. Many of the co-expressed genes are potentially targetable with existing drugs, which may accelerate the development of COVID-19 therapeutics.Entities:
Year: 2020 PMID: 33318519 PMCID: PMC7736291 DOI: 10.1038/s41598-020-78818-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Study cohort demographics.
| Centre | Lung eQTL consortium cohort | ||
|---|---|---|---|
| Groningen | Laval | UBC | |
| n | 342 | 409 | 287 |
| Age, years† | 54 (44–62) | 64 (57–71) | 63 (55–71) |
| Females, n (%) | 160 (46.78) | 180 (44.01) | 132 (45.99) |
| BMI, kg/m2† | 22.65 (20.00–25.42) | 26.10 (22.90–29.00) | 24.90 (22.20–28.93) |
| COPD‡, n (%) | 289 (86.27) | 127 (31.13) | 10 (100.00) |
| Asthma, n (%) | 0 (0.00) | 15 (3.68) | 22 (10.78) |
| Cardiac disease, n (%) | 26 (7.60) | 120 (29.41) | 46 (21.90) |
| Hypertension, n (%) | 2 (22.22) | 107 (26.23) | 33 (100.00) |
| Diabetes, n (%) | 27 (7.89) | 41 (10.05) | 13 (28.26) |
| Never smokers, n (%) | 100 (29.24) | 36 (8.80) | 26 (9.06) |
| Former smokers, n (%) | 185 (54.09) | 283 (69.19) | 163 (56.79) |
| Current smokers, n (%) | 57 (16.67) | 90 (22.00) | 98 (34.15) |
†Median (interquartile range). ‡Chronic obstructive pulmonary disease. Denominators for the percentages are based on the number of non-missing records.
Figure 1ACE2 and TMPRSS2 coexpression modules. The center of each graph represents ACE2 (a) or TMPRSS2 (b), the circles at the edges represent the top 50 genes with the highest connectivity to ACE2 or TMPRSS2 based on the WGCNA analysis. The circle size represents the size of each gene node in their respective modules. The arm thickness represents the relative strength of the connection to the ACE2 or TMPRSS2 expression. Figure was created using R 3.6 (https://www.r-project.org/)[11].
Drug-gene interactions of ACE2-correlated genes.
| Gene | Druggability score† | No. of known drug-gene interactions‡ | r ( | FDR | |
|---|---|---|---|---|---|
| Tier 2 | 9 | 0.37 | 5.94 × 10–34 | ||
| Tier 2 | 7 | 0.36 | 3.04 × 10–32 | ||
| Tier 1 | 3 | 0.36 | 4.31 × 10–33 | 1.85 × 10–31 | |
| Tier 2 | 6 | 0.35 | 3.73 × 10–30 | ||
| Tier 2 | 5 | 0.33 | 1.20 × 10–27 | ||
| Tier 1 | 20 | 0.33 | 1.79 × 10–27 | ||
| Tier 1 | 58 | 0.33 | 8.03 × 10–27 | ||
| Tier 1 | 2 | 0.32 | 5.83 × 10–25 | ||
| Tier 1 | 48 | 0.32 | 1.30 × 10–24 | ||
| Tier 2 | 2 | 0.31 | 4.40 × 10–24 | 3.86 × 10–23 |
†From Finan et al.[12]‡From Drug-Gene Interaction Database (DGIdb)[13]. r(ACE2): Pearson correlation coefficient between gene and ACE2 expression (adjusted for sex age and centre). p: significance of the Pearson correlation coefficient (corresponds to Fisher’s Z score value), adjusted for false discovery rate (FDR).
Drug-gene interactions of TMPRSS2-correlated genes.
| Gene | No. of drug-gene interactions‡ | r ( | FDR | |
|---|---|---|---|---|
| 1 | 0.61 | 1.77 × 10–116 | 1.11 × 10–114 | |
| 4 | 0.61 | 9.72 × 10–113 | 5.66 × 10–111 | |
| 75 | 0.57 | 4.39 × 10–95 | 1.25 × 10–93 | |
| 25 | 0.56 | 2.22 × 10–93 | 5.92 × 10–92 | |
| 2 | 0.55 | 4.64 × 10–89 | 1.11 × 10–87 | |
| 28 | 0.55 | 5.85 × 10–86 | 1.28 × 10–84 | |
| 1 | 0.54 | 1.90 × 10–83 | 3.93 × 10–82 | |
| 1 | 0.53 | 9.85 × 10–82 | 1.91 × 10–80 | |
| 2 | 0.52 | 4.12 × 10–77 | 6.81 × 10–76 | |
| 3 | 0.51 | 6.88 × 10–72 | 1.00 × 10–70 | |
| 1 | 0.47 | 6.68 × 10–61 | 7.62 × 10–60 | |
| 8 | 0.47 | 2.64 × 10–60 | 2.99 × 10–59 | |
| 44 | 0.45 | 6.43 × 10–55 | 6.02 × 10–54 | |
| 75 | 0.45 | 1.25 × 10–54 | 1.15 × 10–53 | |
| 164 | 0.45 | 6.33 × 10–54 | 5.69 × 10–53 | |
| 55 | 0.45 | 1.35 × 10–53 | 1.21 × 10–52 | |
| 118 | 0.44 | 1.11 × 10–52 | 9.48 × 10–52 | |
| 3 | 0.44 | 3.86 × 10–51 | 3.05 × 10–50 | |
| 1 | 0.43 | 5.27 × 10–48 | 3.73 × 10–47 | |
| 4 | 0.40 | 1.95 × 10–41 | 1.08 × 10–40 | |
| 30 | 0.40 | 3.30 × 10–41 | 1.80 × 10–40 | |
| 29 | 0.38 | 4.28 × 10–38 | 2.16 × 10–37 | |
| 5 | 0.38 | 1.51 × 10–37 | 7.44 × 10–37 | |
| 4 | 0.37 | 6.24 × 10–35 | 2.77 × 10–34 | |
| 10 | 0.36 | 4.38 × 10–33 | 1.81 × 10–32 | |
| 15 | 0.36 | 5.65 × 10–33 | 2.32 × 10–32 | |
| 2 | 0.34 | 1.08 × 10–29 | 4.00 × 10–29 | |
| 12 | 0.33 | 2.57 × 10–28 | 9.10 × 10–28 | |
| 16 | 0.33 | 3.55 × 10–28 | 1.24 × 10–27 | |
| 17 | 0.33 | 5.03 × 10–28 | 1.74 × 10–27 | |
| 48 | 0.30 | 5.53 × 10–24 | 1.67 × 10–23 | |
| 33 | 0.30 | 8.11 × 10–24 | 2.42 × 10–23 |
‡from Drug-Gene Interaction Database (DGIdb)[13]. r(TMPRSS2): Pearson correlation coefficient between gene and TMPRSS2 expression (adjusted for sex, age and centre). p: significance of the Pearson correlation coefficient (corresponds to Fisher’s Z score value), adjusted for false discovery rate (FDR).
Figure 2Correlation level and annotation of TMPRSS2-correlated genes. Each bar represents a single gene (all with druggability scores Tier 1–3[12]), and Pearson correlation coefficient (r) between the gene and TMPRSS2 within the module is shown on the y axis. Colours of bars represent combined biological information: aqua (group A0) represents those genes with only druggability information; orange (group A1) represents genes that are related to human diseases based Online Mendelian Inheritance in Man (OMIM) and ClinVar databases; purple (group A2) represents genes with phenotypic information on knockdown or knockout mouse models based on Mouse Genome Informatics (MGI) database; pink (group A3) represents genes with phenotypic information on knockdown or knockout mouse models based on MGI database, and that are related to human diseases based OMIM and ClinVar data bases; green (group A4) represents genes related to human diseases based OMIM and ClinVar databases and with genetic variants associated to lung function traits[14]; yellow (group A5) represents genes with phenotypic information on knockdown or knockout mouse models based on MGI database, and with genetic variants associated to lung function traits[14]; brown (group A6) represents genes with phenotypic information on knockdown or knockout mouse models based on MGI database, and that are related to human diseases based OMIM and ClinVar data bases, and with genetic variants associated to lung function traits[14]. Figure was created using R 3.6 (https://www.r-project.org/)[11].
Figure 3Effects of COVID-19 risk factors on lung tissue gene expression. y axes represent the microarray gene expression level in lung tissue for ACE2-correlated genes (GART [a], ADK [c], DPP4 [e]) and TMPRSS2-correlated genes (LRRK2 [b],CD55 [d], MET [f]). Boxes are median expression ± interquartile range respectively. Numbers at the top of each box plot are FDR obtained from the robust linear regressions. Figure was created using R 3.6 (https://www.r-project.org/)[11].