| Literature DB >> 32344918 |
Jun Gao1,2, John Collyer3, Maochun Wang4, Fengping Sun2, Fuyi Xu1.
Abstract
Hypertrophic cardiomyopathy (HCM) is an inherited disorder of the myocardium, and pathogenic mutations in the sarcomere genes myosin heavy chain 7 (MYH7) and myosin-binding protein C (MYBPC3) explain 60%-70% of observed clinical cases. The heterogeneity of phenotypes observed in HCM patients, however, suggests that novel causative genes or genetic modifiers likely exist. Here, we systemically evaluated RNA-seq data from 28 HCM patients and nine healthy controls with pathogenic variant identification, differential expression analysis, and gene co-expression and protein-protein interaction network analyses. We identified 43 potential pathogenic variants in 19 genes in 24 HCM patients. Genes with more than one variant included the following: MYBPC3, TTN, MYH7, PSEN2, and LDB3. A total of 2538 protein-coding genes, six microRNAs (miRNAs), and 1617 long noncoding RNAs (lncRNAs) were identified differentially expressed between the groups, including several well-characterized cardiomyopathy-related genes (ANKRD1, FHL2, TGFB3, miR-30d, and miR-154). Gene enrichment analysis revealed that those genes are significantly involved in heart development and physiology. Furthermore, we highlighted four subnetworks: mtDNA-subnetwork, DSP-subnetwork, MYH7-subnetwork, and MYBPC3-subnetwork, which could play significant roles in the progression of HCM. Our findings further illustrate that HCM is a complex disease, which results from mutations in multiple protein-coding genes, modulation by non-coding RNAs and perturbations in gene networks.Entities:
Keywords: RNA-seq; differential gene expression; gene network; hypertrophic cardiomyopathy; pathogenic variants
Mesh:
Substances:
Year: 2020 PMID: 32344918 PMCID: PMC7246737 DOI: 10.3390/ijms21093040
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Clinical characteristics of the studied samples [12].
| Sample ID | Group# | Sex | Age | Smoking | LAD (mm)a | LVST (mm)b | LVEDD (mm)c | LVEF (%)d | Maxi LVWT (mm)e | Maxi LVOTG (mmHg)f |
|---|---|---|---|---|---|---|---|---|---|---|
| HCM420 | GENETUN | male | 32 | NA | 41 | 14 | 44 | 76 | 18 | 55 |
| HCM405 | GENETUN | female | 31 | NO | 42 | 14 | 47 | 78 | 16 | 126 |
| HCM541 | GENETUN | male | 38 | NA | 47 | 22 | 46 | 71 | 25 | 96 |
| HCM493 | GENETUN | male | 40 | NA | 40 | 15 | 45 | 75 | 17 | 95 |
| HCM273 | GENETUN | male | 30 | NO | 38 | 15 | 33 | 50 | 21 | 30 |
| HCM269 | GENETUN | male | 25 | NO | 53 | 29 | 51 | 63 | 38 | 56 |
| HCM395 | GENETUN | male | 20 | NO | 49 | 19 | 43 | 60 | 22 | 78 |
| HCM282 | GENETUN | male | 48 | YES | 49 | 21 | 49 | 65.3 | 23 | 75 |
| HCM591 | GENETUN | male | 42 | NA | 39 | 21 | 45 | 70 | 32 | 53 |
| HCM552 | GENETUN | male | 32 | NA | NA | NA | NA | 75 | 21 | 100 |
| HCM515 |
| male | 21 | YES | 39 | 18 | 50 | 75 | 25 | 84 |
| HCM504 |
| male | 25 | YES | 39 | 25 | 39 | 75 | 28 | 66 |
| HCM498 |
| male | 39 | YES | 38 | 20 | 42 | 72 | 21 | 55 |
| HCM486 |
| female | 36 | NA | 45 | 16 | 35 | 75 | 20 | 100 |
| HCM460 |
| male | 43 | YES | 45 | 16 | 45 | 65 | 19 | 61 |
| HCM439 |
| female | 47 | NA | 36 | 15 | 38 | 80 | 18 | 103 |
| HCM437 |
| male | 30 | YES | 54 | 19 | 47 | 78 | 26 | 54 |
| HCM429 |
| female | 36 | NA | 48 | 17 | 42 | 78 | 18 | 70 |
| HCM533 |
| male | 47 | YES | 62 | 24 | 50 | 75 | 29 | 118 |
| HCM518 |
| female | 31 | NA | 37 | 22 | 38 | 68 | 26 | 105 |
| HCM506 |
| male | 19 | YES | 48 | 20 | 43 | 75 | 21 | 59 |
| HCM562 |
| female | 36 | YES | 36 | 0 | 36 | 66 | 21 | 70 |
| HCM490 |
| female | 41 | NA | 44 | 15 | 39 | 58 | 21 | 63 |
| HCM491 |
| female | 30 | NA | 44 | 23 | 32 | 75 | 28 | 64 |
| HCM456 |
| male | 28 | NA | 51 | 15 | 45 | 69 | 20 | 90 |
| HCM483 |
| male | 37 | YES | 57 | 23 | 40 | 72 | 26 | 70 |
| HCM431 |
| male | 24 | NA | 57 | 18 | 49 | 80 | 21 | 80 |
| HCM443 |
| female | 28 | NA | 46 | 17 | 44 | 69 | 18 | 68 |
| sc5-LV | NORMAL | female | NA | NA | NA | NA | NA | NA | NA | NA |
| sc2-LV | NORMAL | male | NA | NA | NA | NA | NA | NA | NA | NA |
| sc6-LV | NORMAL | male | NA | NA | NA | NA | NA | NA | NA | NA |
| N105-LV | NORMAL | male | NA | NA | NA | NA | NA | NA | NA | NA |
| N104-LV | NORMAL | male | NA | NA | NA | NA | NA | NA | NA | NA |
| ND2 | NORMAL | male | NA | NA | NA | NA | NA | NA | NA | NA |
| ND1-LV | NORMAL | male | NA | NA | NA | NA | NA | NA | NA | NA |
| N102-LV | NORMAL | male | NA | NA | NA | NA | NA | NA | NA | NA |
| N103-LV | NORMAL | male | NA | NA | NA | NA | NA | NA | NA | NA |
#: GENETUN, genetically undiagnosed HCM patient; MYBPC3, HCM patient with mutation in MYBPC3; MYH7, HCM patient with mutation in MYH7; NORMAL, normal heart; a: left atrial diameter; b: left ventricular septal thickness; c: left ventricular end-diastolic; d: left ventricular ejection fraction; e: maximum left ventricular wall thickness; f: maximum left ventricular outflow track gradient at rest or after exercise; NA: not available.
Summary statistics of RNA-seq data.
| Sample ID | Raw Reads | HQ Reads | HQ Reads (%) | Mapping Rate (%) # | Total Variant | Exonic Variant |
|---|---|---|---|---|---|---|
| HCM420 | 117,254,002 | 110,409,284 | 94.16 | 96.87 | 742,007 | 34,208 |
| HCM405 | 96,660,588 | 90,958,152 | 94.10 | 96.72 | 697,916 | 32,501 |
| HCM541 | 118,364,902 | 112,199,122 | 94.79 | 96.56 | 748,918 | 34,438 |
| HCM493 | 135,595,138 | 127,235,398 | 93.83 | 96.67 | 802,057 | 35,285 |
| HCM273 | 100,383,614 | 94,562,944 | 94.20 | 96.10 | 674,997 | 32,320 |
| HCM269 | 104,573,612 | 98,483,828 | 94.18 | 96.69 | 657,403 | 30,937 |
| HCM395 | 101,463,898 | 94,778,784 | 93.41 | 96.50 | 718,773 | 32,792 |
| HCM282 | 135,385,378 | 128,496,976 | 94.91 | 96.41 | 835,560 | 36,065 |
| HCM591 | 117,087,250 | 110,633,048 | 94.49 | 96.33 | 742,357 | 33,694 |
| HCM552 | 110,633,048 | 97,644,508 | 88.26 | 96.57 | 714,914 | 32,157 |
| HCM515 | 126,385,206 | 120,132,036 | 95.05 | 96.53 | 797,070 | 35,707 |
| HCM504 | 126,614,812 | 120,041,188 | 94.81 | 96.46 | 747,293 | 34,481 |
| HCM498 | 118,114,640 | 110,661,262 | 93.69 | 96.18 | 745,414 | 33,633 |
| HCM486 | 102,005,260 | 95,821,344 | 93.94 | 96.76 | 688,159 | 32,517 |
| HCM460 | 83,321,826 | 78,346,916 | 94.03 | 97.01 | 641,381 | 30,658 |
| HCM439 | 115,966,430 | 108,531,416 | 93.59 | 96.48 | 777,052 | 34,950 |
| HCM437 | 114,409,150 | 107,630,394 | 94.07 | 96.59 | 718,353 | 33,216 |
| HCM429 | 107,327,856 | 100,460,638 | 93.60 | 96.51 | 781,013 | 34,514 |
| HCM533 | 113,801,296 | 107,068,190 | 94.08 | 96.47 | 765,001 | 34,584 |
| HCM518 | 123,140,430 | 116,174,372 | 94.34 | 96.54 | 821,103 | 36,229 |
| HCM506 | 104,464,396 | 98,442,104 | 94.24 | 96.88 | 721,500 | 33,453 |
| HCM562 | 119,176,332 | 113,518,840 | 95.25 | 96.63 | 782,848 | 34,382 |
| HCM490 | 109,178,274 | 102,325,212 | 93.72 | 96.79 | 728,218 | 33,873 |
| HCM491 | 123,213,328 | 115,698,990 | 93.90 | 95.97 | 756,623 | 34,040 |
| HCM456 | 124,188,740 | 116,826,920 | 94.07 | 96.65 | 793,088 | 35,368 |
| HCM483 | 132,375,212 | 125,845,538 | 95.07 | 96.52 | 838,862 | 35,806 |
| HCM431 | 95,467,544 | 91,611,504 | 95.96 | 96.60 | 701,266 | 33,263 |
| HCM443 | 103,385,170 | 96,969,874 | 93.79 | 96.18 | 707,524 | 33,016 |
| sc5-LV | 115,206,460 | 109,427,776 | 94.98 | 94.80 | 731,586 | 35,144 |
| sc2-LV | 119,628,518 | 113,668,538 | 95.02 | 96.52 | 747,079 | 34,629 |
| sc6-LV | 133,156,688 | 126,807,678 | 95.23 | 95.59 | 789,274 | 37,289 |
| N105-LV | 125,064,692 | 118,533,240 | 94.78 | 96.39 | 830,057 | 35,975 |
| N104-LV | 151,765,118 | 143,413,882 | 94.50 | 96.50 | 930,969 | 37,875 |
| ND2 | 115,981,584 | 110,192,388 | 95.01 | 94.90 | 649,196 | 33,173 |
| ND1-LV | 111,029,746 | 105,848,092 | 95.33 | 95.66 | 715,284 | 33,920 |
| N102-LV | 111,141,904 | 102,446,288 | 92.18 | 96.59 | 754,272 | 33,343 |
| N103-LV | 146,673,012 | 138,682,818 | 94.55 | 96.50 | 864,176 | 35,856 |
: the percentage of uniquely mapped reads to the total high-quality (HQ) reads.
Figure 1Workflow for functional variants and gene prioritization.
Figure 2Heatmap of the pathogenic genes across hypertrophic cardiomyopathy (HCM) patients. Red rectangles represent HCM patients (rows) who carry pathogenic variant(s) in the corresponding gene(s) (columns).
Lists of candidate variants prioritized from HCM patients.
| Group | Sample ID | Gene | dbSNP | Variant Type | AAChange# |
|---|---|---|---|---|---|
| GENETUN | HCM541 |
| rs763968252 | nonsynonymous | NM_005691:exon24:c.T2935C:p.W979R |
| HCM493 |
| rs397516338 | nonsynonymous | NM_001128209:exon8:c.A845G:p.Q282R | |
|
| rs146400809 | nonsynonymous | NM_133378:exon126:c.C28730T:p.P9577L | ||
| HCM395 |
| NA | stop loss | NM_003476:exon7:c.A585C:p.X195C | |
| HCM282 |
| rs146245480 | nonsynonymous | NM_001172309:exon7:c.C643T:p.R215C | |
| HCM591 |
| rs116888866 | nonsynonymous | NM_001008844:exon24:c.G6119A:p.R2040Q | |
|
| rs574125890 | nonsynonymous | NM_000447:exon8:c.G640T:p.V214L | ||
| HCM552 |
| rs75301590 | nonsynonymous | NM_006393:exon6:c.G561C:p.Q187H | |
| MYBPC3 | HCM515 |
| rs869025465 | frameshift deletion | NM_000256:exon13:c.1153_1168del:p.V385Mfs*15 |
|
| rs146245480 | nonsynonymous | NM_001172309:exon7:c.C643T:p.R215C | ||
|
| rs566463138 | nonsynonymous | NM_001080114:exon10:c.T1367G:p.M456R | ||
|
| rs141121678 | nonsynonymous | NM_001001432:exon14:c.G839A:p.R280H | ||
| HCM504 |
| NA | frameshift deletion | NM_000256:exon28:c.3018delC:p.W1007Gfs*12 | |
|
| rs118161093 | nonsynonymous | NM_003319:exon27:c.G5602A:p.A1868T | ||
|
| rs139517732 | nonsynonymous | NM_001256850:exon3:c.G160A:p.V54M | ||
| HCM498 |
| NA | stop gain | NM_000256:exon5:c.G587A:p.W196X | |
|
| rs376144003 | nonsynonymous | NM_001103:exon11:c.T1162A:p.W388R | ||
|
| NA | nonsynonymous | NM_000447:exon10:c.C902A:p.T301K | ||
| HCM486 |
| NA | nonsynonymous | NM_002667:exon2:c.T106C:p.C36R | |
| HCM460 |
| rs730880576 | stop gain | NM_000256:exon26:c.G2748A:p.W916X | |
|
| rs397517221 | nonsynonymous | NM_001080114:exon2:c.C236T:p.T79I | ||
|
| rs199932621 | nonsynonymous | NM_003319:exon186:c.G75632A:p.R25211Q | ||
| HCM439 |
| rs397516073 | splicing | NA | |
|
| rs3752579 | nonsynonymous | NM_001105206:exon12:c.T1475A:p.L492H | ||
|
| NA | nonsynonymous | NM_000447:exon13:c.G1234A:p.A412T | ||
| HCM437 |
| NA | frameshift insertion | NM_000256:exon13:c.1201dupC:p.Q401Pfs*12 | |
|
| rs727503278 | nonsynonymous | NM_000257:exon5:c.C427T:p.R143W | ||
|
| rs769275933 | nonsynonymous | NM_001320976:exon9:c.C584T:p.T195M | ||
| HCM429 |
| NA | frameshift deletion | NM_000256:exon22:c.2237delA:p.E746Gfs*6 | |
|
| rs146245480 | nonsynonymous | NM_001172309:exon7:c.C643T:p.R215C | ||
| HCM533 |
| NA | stop gain | NM_000256:exon24:c.C2526G:p.Y842X | |
|
| rs549841864 | nonsynonymous | NM_003319:exon167:c.C66059T:p.P22020L | ||
| HCM518 |
| NA | frameshift deletion | NM_000256:exon4:c.480delG:p.P161Hfs*5 | |
| MYH7 | HCM506 |
| rs121913627 | nonsynonymous | NM_000257:exon16:c.G1816A:p.V606M |
|
| rs372923744 | nonsynonymous | NM_001134363:exon9:c.G2201A:p.R734Q | ||
| HCM562 |
| rs727503278 | nonsynonymous | NM_000257:exon5:c.C427T:p.R143W | |
|
| rs116888866 | nonsynonymous | NM_001008844:exon24:c.G6119A:p.R2040Q | ||
| HCM490 |
| rs397516201 | nonsynonymous | NM_000257:exon30:c.C4130T:p.T1377M | |
|
| rs368057764 | nonsynonymous | NM_003319:exon79:c.C19652T:p.T6551M | ||
|
|
| nonsynonymous | NM_003319:exon154:c.G56641A:p.D18881N | ||
|
| rs567446185 | nonsynonymous | NM_003319:exon154:c.G46322A:p.G15441D | ||
| HCM491 |
| rs397516127 | nonsynonymous | NM_000257:exon18:c.C1987T:p.R663C | |
|
| NA | splicing | NA | ||
| HCM456 |
| rs3218714 | nonsynonymous | NM_000257:exon13:c.C1207T:p.R403W | |
| HCM483 |
| rs121913627 | nonsynonymous | NM_000257:exon16:c.G1816A:p.V606M | |
| HCM431 |
| rs727503246 | nonsynonymous | NM_000257:exon30:c.G4066A:p.E1356K | |
|
| NA | splicing | NA | ||
| HCM443 |
| rs121913632 | nonsynonymous | NM_000257:exon20:c.G2221T:p.G741W | |
|
| rs75301590 | nonsynonymous | NM_006393:exon6:c.G561C:p.Q187H |
#: Gene may have multiple transcript annotations, but only the first is shown here; the full list of annotations can be found in Supplementary Data 2; NA: not available.
Figure 3Differentially expressed protein-coding genes between HCM patients and normal controls. (A) Volcano plot of the differentially expressed genes (DEGs). The x-axes show the log2 transformed fold change, and the y-axes show the log10 transformed false discovery rate (FDR). Red dots are DEGs with an FDR < 0.05 and fold change > 1.5. (B–D) Dot plots of the expression levels of ANKRD1, TGFB3, and FHL2 in normal controls (NC) and HCM patients, ***FDR < 0.001. (E) Bar plots of the significantly enriched cardiovascular-related Gene Ontology (GO) terms for the differentially expressed protein-coding genes.
Figure 4Correlation networks for the cardiovascular-related differentially expressed protein-coding genes. The Pearson correlation coefficient analysis was calculated across the 192 cardiovascular-related genes. Nodes represent genes, and edges represent correlations between two genes (FDR < 0.01).
Figure 5Differentially expressed microRNAs (miRNAs) between HCM patients and normal controls. (A–F) Dot plots of miR-487a, miR-654, miR-30d, miR-154, miR-3193, and miR-3671 expression levels in NC and HCM patients, *FDR < 0.05, **FDR < 0.01, ***FDR < 0.001. (G) Predicted miRNA-gene pairs with miRWalk2, which implements four miRNA-gene prediction algorithms. Target pairs for which more than two algorithms had signals are included. Seventy-five cardiovascular-related DEGs were targeted by the above miRNAs, with these genes demonstrating inverse expression patterns compared to their corresponding miRNAs.
GO enrichment of long noncoding RNA (lncRNA)-annotated protein-coding genes.
| GO Term | Description | Gene Count | FDR | |
|---|---|---|---|---|
| GO:0072358 | cardiovascular system development | 95 | 6.21 × 10−14 | 8.38 × 10−11 |
| GO:0007507 | heart development | 75 | 2.71 × 10−10 | 1.10 × 10−7 |
| GO:0003007 | heart morphogenesis | 42 | 7.29 × 10−9 | 1.89 × 10−6 |
| GO:0003231 | cardiac ventricle development | 26 | 1.18 × 10−7 | 1.61 × 10−5 |
| GO:0003206 | cardiac chamber morphogenesis | 25 | 6.39 × 10−7 | 6.43 × 10−5 |
| GO:0003205 | cardiac chamber development | 29 | 1.09 × 10−6 | 9.88 × 10−5 |
| GO:0048738 | cardiac muscle tissue development | 31 | 4.73 × 10−6 | 3.00 × 10−4 |
| GO:0060411 | cardiac septum morphogenesis | 16 | 5.09 × 10−6 | 3.11 × 10−4 |
| GO:0003208 | cardiac ventricle morphogenesis | 17 | 7.56 × 10−6 | 4.43 × 10−4 |
| GO:0035051 | cardiocyte differentiation | 22 | 2.22 × 10−5 | 1.12 × 10−3 |
| GO:0003279 | cardiac septum development | 19 | 3.35 × 10−5 | 1.59 × 10−3 |
| GO:0003215 | cardiac right ventricle morphogenesis | 8 | 3.44 × 10−5 | 1.62 × 10−3 |
| GO:0003197 | endocardial cushion development | 11 | 8.84 × 10−5 | 3.62 × 10−3 |
| GO:1905207 | regulation of cardiocyte differentiation | 10 | 2.19 × 10−4 | 7.33 × 10−3 |
| GO:0003203 | endocardial cushion morphogenesis | 9 | 2.62 × 10−4 | 8.54 × 10−3 |
| GO:0055007 | cardiac muscle cell differentiation | 17 | 3.08 × 10−4 | 9.57 × 10−3 |
| GO:0055008 | cardiac muscle tissue morphogenesis | 13 | 3.15 × 10−4 | 9.72 × 10−3 |
| GO:0060047 | heart contraction | 33 | 5.04 × 10−4 | 1.46 × 10−2 |
| GO:0003015 | heart process | 33 | 6.21 × 10−4 | 1.72 × 10−2 |
| GO:0035050 | embryonic heart tube development | 14 | 6.58 × 10 −4 | 1.80 × 10−2 |
| GO:0055017 | cardiac muscle tissue growth | 13 | 6.92 × 10−4 | 1.88 × 10−2 |
| GO:0061323 | cell proliferation involved in heart morphogenesis | 6 | 7.55 × 10−4 | 1.98 × 10−2 |
| GO:0055012 | ventricular cardiac muscle cell differentiation | 6 | 7.55 × 10−4 | 1.98 × 10−2 |
| GO:1905209 | positive regulation of cardiocyte differentiation | 7 | 1.00 × 10−3 | 2.44 × 10−2 |
| GO:0003170 | heart valve development | 9 | 1.27 × 10−3 | 2.97 × 10−2 |
| GO:0003272 | endocardial cushion formation | 7 | 1.31 × 10−3 | 3.02 × 10−2 |
| GO:0010002 | cardioblast differentiation | 6 | 1.46 × 10−3 | 3.32 × 10−2 |
| GO:0060419 | heart growth | 13 | 1.59 × 10−3 | 3.52 × 10−2 |
| GO:0003143 | embryonic heart tube morphogenesis | 12 | 1.73 × 10−3 | 3.78 × 10−2 |
| GO:2000725 | regulation of cardiac muscle cell differentiation | 7 | 2.14 × 10−3 | 4.40 × 10−2 |
| GO:0001947 | heart looping | 11 | 2.44 × 10−3 | 4.85 × 10−2 |
Figure 6Co-expression network analysis. (A) The soft thresholding index R2 (y-axis) as a function of the different thresholding power β (x-axis). (B) Mean connectivity (y-axis) as a function of the power β (x-axis). (C) Sixteen co-expression modules identified from the myocardial RNA-seq dataset. Each cell represents the correlation coefficient p-value computed from correlating the module eigengenes to the clinical traits (columns). Cells filled with red represent significant associations between modules and traits (p < 0.05).
Figure 7MtDNA-, DSP-, MYH7-, and MYBPC3-subnetworks constructed by integrating co-expression and protein–protein interaction (PPI) network analyses. MHY7- and MYBPC3-involved co-expression modules (M14 and M16) identified by co-expression analysis were used to construct PPI-subnetworks from the STRING database (https://string-db.org/) with the Markov Cluster (MCL) clustering method, for which the inflation parameter was set to 3.