| Literature DB >> 31197155 |
Xuanyu Liu1, Yi Ma1, Kunlun Yin1, Wenke Li1, Wen Chen1, Yujing Zhang1, Changsheng Zhu2, Tianjiao Li1, Bianmei Han1, Xuewen Liu1, Shuiyun Wang3, Zhou Zhou4.
Abstract
Hypertrophic cardiomyopathy (HCM) represents one of the most common heritable heart diseases. However, the signalling pathways and regulatory networks underlying the pathogenesis of HCM remain largely unknown. Here, we present a strand-specific RNA-seq dataset for both coding and lncRNA profiling in myocardial tissues from 28 HCM patients and 9 healthy donors. This dataset constitutes a valuable resource for the community to examine the dysregulated coding and lncRNA genes in HCM versus normal conditions.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31197155 PMCID: PMC6565738 DOI: 10.1038/s41597-019-0094-6
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Overview of the experimental procedure. (a) Schematic representation of the experimental workflow. The sampling position is indicated by a black rectangular. RNA isolation and library preparation for all samples were performed in the same batch. HCM: hypertrophic cardiomyopathy; GENETUN: Genetically undiagnosed HCM; MYBPC3: HCM patient with mutation in MYBPC3; MYH7: HCM patient with mutation in MYH7; NORMAL: Normal heart. (b) Bioinformatic analysis workflow.
Summary statistics for the sequencing data.
| Sample | Group | #read pairs | #bases (G) | Q20 | Q30 | overall alignment rate |
|---|---|---|---|---|---|---|
| HCM269 | GENETUN | 51,524,526 | 14.35 | 97.89% | 94.49% | 97.79% |
| HCM273 | GENETUN | 49,447,308 | 13.77 | 97.88% | 94.50% | 97.93% |
| HCM282 | GENETUN | 66,813,232 | 18.61 | 98.30% | 95.44% | 98.00% |
| HCM395 | GENETUN | 49,855,174 | 13.88 | 97.77% | 94.27% | 97.27% |
| HCM405 | GENETUN | 47,625,057 | 13.27 | 97.85% | 94.42% | 97.70% |
| HCM420 | GENETUN | 57,763,729 | 16.08 | 97.86% | 94.42% | 97.76% |
| HCM493 | GENETUN | 66,750,733 | 18.55 | 97.73% | 94.15% | 97.54% |
| HCM541 | GENETUN | 58,387,469 | 16.29 | 98.27% | 95.33% | 98.13% |
| HCM552 | GENETUN | 51,288,461 | 14.29 | 97.77% | 94.25% | 97.48% |
| HCM591 | GENETUN | 57,727,967 | 16.07 | 98.06% | 94.92% | 97.74% |
| HCM439 | MYBPC3 | 57,023,162 | 15.87 | 97.76% | 94.20% | 97.57% |
| HCM460 | MYBPC3 | 41,034,383 | 11.42 | 97.82% | 94.32% | 97.71% |
| HCM486 | MYBPC3 | 50,228,447 | 13.99 | 97.80% | 94.27% | 97.79% |
| HCM498 | MYBPC3 | 58,077,468 | 16.15 | 97.84% | 94.43% | 97.40% |
| HCM504 | MYBPC3 | 62,460,847 | 17.41 | 98.27% | 95.34% | 98.11% |
| HCM515 | MYBPC3 | 62,410,260 | 17.38 | 98.31% | 95.45% | 98.00% |
| HCM518 | MYBPC3 | 60,673,907 | 16.89 | 98.01% | 94.81% | 97.62% |
| HCM533 | MYBPC3 | 56,061,058 | 15.60 | 97.83% | 94.42% | 97.63% |
| HCM429 | MYBPC3 | 52,807,037 | 14.71 | 97.64% | 93.95% | 97.60% |
| HCM437 | MYBPC3 | 56,355,435 | 15.71 | 97.83% | 94.39% | 97.66% |
| HCM431 | MYH7 | 47,270,654 | 13.15 | 98.55% | 95.97% | 97.87% |
| HCM443 | MYH7 | 50,878,232 | 14.16 | 97.80% | 94.28% | 97.49% |
| HCM456 | MYH7 | 61,153,662 | 17.01 | 97.85% | 94.39% | 97.78% |
| HCM483 | MYH7 | 65,366,081 | 18.19 | 98.32% | 95.47% | 97.97% |
| HCM490 | MYH7 | 53,694,284 | 14.94 | 97.78% | 94.29% | 97.65% |
| HCM491 | MYH7 | 60,649,986 | 16.87 | 97.80% | 94.33% | 97.58% |
| HCM506 | MYH7 | 51,473,866 | 14.31 | 97.86% | 94.39% | 97.74% |
| HCM562 | MYH7 | 58,882,347 | 16.37 | 98.37% | 95.59% | 98.03% |
| N102-LV | NORMAL | 54,725,491 | 15.25 | 97.43% | 93.70% | 97.59% |
| N103-LV | NORMAL | 72,263,796 | 20.14 | 98.26% | 95.33% | 97.99% |
| N104-LV | NORMAL | 74,732,382 | 20.77 | 98.26% | 95.33% | 98.06% |
| N105-LV | NORMAL | 61,657,432 | 17.15 | 98.30% | 95.39% | 98.06% |
| ND1-LV | NORMAL | 54,854,093 | 15.25 | 98.41% | 95.63% | 98.13% |
| ND2 | NORMAL | 57,230,198 | 15.90 | 98.34% | 95.49% | 97.86% |
| sc2-LV | NORMAL | 59,025,988 | 16.41 | 98.34% | 95.49% | 98.16% |
| sc5-LV | NORMAL | 56,871,247 | 15.84 | 98.32% | 95.48% | 97.67% |
| sc6-LV | NORMAL | 65,688,425 | 18.27 | 98.46% | 95.76% | 97.87% |
GENETUN: Genetically undiagnosed HCM patient; MYBPC3: HCM patient with mutation in MYBPC3; MYH7: HCM patient with mutation in MYH7; NORMAL: Normal heart.
Fig. 2Expression profiles of coding and lncRNA genes. (a) Hierarchical clustering of the samples from the three HCM groups and the normal group based on the expression of coding genes. (b) Hierarchical clustering of the samples from the three HCM groups and the normal group based on the expression of lncRNA genes. In a and b, each row represents a gene, and each column represents a sample. For better visualization, only the expression of 1,000 randomly selected genes are displayed on the heatmap. (c) Volcano plot showing the differentially expressed coding genes between HCM and normal groups. (d) Volcano plot showing the differentially expressed lncRNA genes between HCM and normal groups. In c and d, dots coloured in light red or light blue denote statistically and biologically significant genes being up-regulated or down-regulated, respectively. The dot size reflects the absolute fold change. Only the top 30 DEGs were labelled with gene symbols.
Fig. 3Quality assessment of the RNA-seq dataset. (a) Distribution curve of estimated insert size for each sample. (b) Gene body coverage profile for each sample. Only the genes in the upper-middle quartile by read-count are considered. (c) Read mapping rates for different location categories in each sample. Unique Gene: exons of only one gene; Unique Gene UTR: UTRs of only one gene; Ambig Gene: exons of more than one gene; No Gene: a region without annotated genes; No Gene, Intronic: a region bridged by an annotated splice junction; No gene, 1 kb from gene: 1 kilobase from the nearest annotated gene; No gene, 10 kb from gene: 10 kilobases from the nearest annotated gene; No gene, middle of nowhere: more than 10 kilobases from the nearest annotated gene (d) Number of splice junctions for different categories of each sample. “1–3 reads” means the junction locus is covered by 1–3 read-pairs. (e) Nucleotide rate by cycle for aligned bases in each sample. Nucleotide types are differentiated by colour. Sample groups are differentiated by shape. (f) Alignment soft clipping rate by cycle in each sample. (a–f) Plots are generated by QoRTs. (g) PCA for visualizing the high-dimensional expression datasets.
| Design Type(s) | transcription profiling design • disease state design • sequence analysis objective |
| Measurement Type(s) | transcription profiling assay |
| Technology Type(s) | RNA sequencing |
| Factor Type(s) | experimental condition |
| Sample Characteristic(s) | Homo sapiens • heart |