| Literature DB >> 31779203 |
Xiaoming Ma1,2, Congjun Jia1,2, Min Chu1,2, Donghai Fu1,2, Qinhui Lei1,2, Xuezhi Ding1,2, Xiaoyun Wu1,2, Xian Guo1,2, Jie Pei1,2, Pengjia Bao1,2, Ping Yan1, Chunnian Liang2.
Abstract
DNA methylation modifications are implicated in many biological processes. As the most common epigenetic mechanism DNA methylation also affects muscle growth and development. The majority of previous studies have focused on different varieties of yak, but little is known about the epigenetic regulation mechanisms in different age groups of animals. The development of muscles in the different stages of yak growth remains unclear. In this study, we selected the longissimus dorsi muscle tissue at three different growth stages of the yak, namely, 90-day-old fetuses (group E), six months old (group M), and three years old (group A). Using RNA-Seq transcriptome sequencing and methyl-RAD whole-genome methylation sequencing technology, changes in gene expression levels and DNA methylation status throughout the genome were investigated during the stages of yak development. Each group was represented by three biological replicates. The intersections of expression patterns of 7694 differentially expressed genes (DEGs) were identified (padj < 0.01, |log2FC| > 1.2) at each of the three developmental periods. Time-series expression profile clustering analysis indicated that the DEGs were significantly arranged into eight clusters which could be divided into two classes (padj < 0.05), class I profiles that were downregulated and class II profiles that were upregulated. Based on this cluster analysis, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis revealed that DEGs from class I profiles were significantly (padj < 0.05) enriched in 21 pathways, the most enriched pathway being the Axon guidance signaling pathway. DEGs from the class II profile were significantly enriched in 58 pathways, the pathway most strongly enriched being Metabolic pathway. After establishing the methylation profiles of the whole genomes, and using two groups of comparisons, the three combinations of groups (M-vs.-E, M-vs.-A, A-vs.-E) were found to have 1344, 822, and 420 genes, respectively, that were differentially methylated at CCGG sites and 2282, 3056, and 537 genes, respectively, at CCWGG sites. The two sets of data were integrated and the negative correlations between DEGs and differentially methylated promoters (DMPs) analyzed, which confirmed that TMEM8C, IGF2, CACNA1S and MUSTN1 were methylated in the promoter region and that expression of the modified genes was negatively correlated. Interestingly, these four genes, from what was mentioned above, perform vital roles in yak muscle growth and represent a reference for future genomic and epigenomic studies in muscle development, in addition to enabling marker-assisted selection of growth traits.Entities:
Keywords: Longissimus dorsi muscle; Polled yak; methylation; transcriptome
Mesh:
Year: 2019 PMID: 31779203 PMCID: PMC6947547 DOI: 10.3390/genes10120970
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Sequencing data quality preprocessing results.
| Sample | Raw_Reads | Raw_Bases | Clean_Reads | Clean_Bases | Valid_Bases | Q30 1 | GC |
|---|---|---|---|---|---|---|---|
| E6 | 98.03M | 14.70G | 93.46M | 13.67G | 92.96% | 94.20% | 46.73% |
| E7 | 98.27M | 14.74G | 94.05M | 13.77G | 93.45% | 94.57% | 47.06% |
| E8 | 99.04M | 14.86G | 93.91M | 13.73G | 92.43% | 93.74% | 47.61% |
| M2 | 97.85M | 14.68G | 95.03M | 13.67G | 93.16% | 94.91% | 49.89% |
| M3 | 98.12M | 14.72G | 95.08M | 13.78G | 93.64% | 95.01% | 48.90% |
| M4 | 99.61M | 14.94G | 96.08M | 13.86G | 92.75% | 94.70% | 51.32% |
| A6 | 98.22M | 14.73G | 93.56M | 13.68G | 92.87% | 94.33% | 48.93% |
| A7 | 99.56M | 14.93G | 94.64M | 13.86G | 92.83% | 94.00% | 47.78% |
| A10 | 99.28M | 14.89G | 94.57M | 13.86G | 93.07% | 94.27% | 48.58% |
1 Q30: Bases with a Phred values greater than 30 in Raw_Bases as a percentage of the total number of bases.
Figure 1Fragments per kilobase million (FPKM) expression distribution map. The different colors represent different ranges of FPKM values. The abscissa represents different samples and the ordinate is the number of protein-coding genes.
Figure 2Differential expression volcano map. Differences in gene expression among the 3 combinations of groups, expressed as volcano maps. A black pixel represents a gene where the difference in expression is not significantly different, red and green represent those that are significant; the X-axis displays log2 fold change, the Y-axis displays the -log10 p-value. (A–C) represents three comparison groups of A-vs.-E, M-vs.-A and M-vs.-E, respectively.
Figure 3Differentially expressed genes among the three combinations of groups. (A) Numbers of expressed and differentially expressed genes: Results of RNA-Seq differentially expressed genes among three comparisons, namely, M-vs.-E, M-vs.-A and A-vs.-E. M, A, and E represent groups of yaks at different stages of development, namely 6-month-old, adult, and fetal, respectively. (B) Heat map of differentially expressed genes in pathways related to yak growth for the different gene onology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Rows indicate genes with significant differences in expression among the three stages; columns represent individual samples from three stages (E, M, and A represent muscle samples from 6 months old, 3 years old and 90-day-old fetuses, respectively). Colors in the figure from red to blue indicate the level of gene expression from high to low.
Figure 4Short Time-Series Expression Miner (STEM) analysis of DEG expression profiles. (A) Each box corresponds to a type expression profile and only colored profiles are statistically significant. The upper-left and upper-right numbers in each box indicate the order of profiles and p-values, respectively. (B–D) Three significant clusters of DEG profiles across all three stages.
Figure 5Bubble plot of the top 20 significantly enriched pathways for (A) Class I profiles (profiles 0 and 1) and (B) Class II profile (profile 6). Colors represent minus logarithms of adjusted padj values. Lengths of columns represent numbers of genes enriched in a pathway.
Figure 6Expression levels of nine DEGs detected by RNA-Seq and validated by qPCR. (A–I) Results from RNA-Seq are shown by bar graphs. Values on the right Y-axis represent FPKM. Results of qPCR are shown by line graphs with values on the left Y-axis as relative expression levels. Data represent means ± SE.
Sample sequencing data volume and comparison rate.
| Sample | Raw Reads | Enzyme Reads | Mapping Reads | Ratio |
|---|---|---|---|---|
| E6 | 139,680,224 | 64,643,466 | 30,210,658 | 46.73% |
| E7 | 142,659,082 | 65,566,251 | 38,487,733 | 58.70% |
| E8 | 142,659,082 | 75,344,977 | 35,403,055 | 46.99% |
| M2 | 36,447,959 | 21,083,527 | 9,529,418 | 45.20% |
| M3 | 36,447,959 | 22,576,370 | 10,604,071 | 46.97% |
| M4 | 36,447,959 | 23,206,057 | 11,126,778 | 47.95% |
| A6 | 139,680,224 | 57,187,481 | 30,914,358 | 54.06% |
| A7 | 139,680,224 | 65,584,779 | 30,544,305 | 46.57% |
| A10 | 139,680,224 | 65,816,098 | 32,854,859 | 49.92% |
| Average | 105,931,437 | 51,223,223 | 25,519,471 | 49.23% |
Figure 7DNA methylation sites, methylation levels, and distribution of functional components. (A) Change in DNA methylation levels. In the line chart, red represents the methylation level of the A group; green represents that of the E group and blue represents that of the M group. (B) Histogram of distribution of DNA methylation sites in different gene functional components. Exon: exon; Body region: gene area; Intergenic: intergenic region; Intron: intron; Upstream: gene starting sites in a region 2000 bp upstream; Utr3prime: 3’ untranslated region; Utr5prime: 5’ untranslated region. (C) Gene ontology (GO) function is classified by histograms of differences in methylation sites. (D) Numbers in parenthesis indicate the number of genes affected by differentially methylated regions (DMRs) in a comparison of different groups. Colors represent minus logarithms of adjusted p-values. Lengths of columns represent numbers of genes enriched in a pathway.
Figure 8Pearson correlations of fold-change in gene expression and differential methylation rates for DEGs in muscle tissues and Venn diagrams of numbers of differentially methylated promoters (DMPs) among different comparisons. Scatter plots for (A) A-E, (B) M-E and (C,D) M-A. The X-axis represents mean difference in methylation ratio between M and E groups; the Y-axis represents the logarithm of fold-change (log 2 FC). The blue dots represent negatively correlated genes and red dots represent positively correlated genes. The trend lines and formula in each scatter plot represent the correlation coefficients.
Statistics of two sets of data overlap.
| Profile | Profile Number | DMP Group | Two Set Data Overlap Number of DMPs |
|---|---|---|---|
| profile1_0 | 1811 | A-vs.-E | 274 |
| profile1_0 | 1811 | M-vs.-A | 27 |
| profile1_0 | 1811 | M-vs.-E | 574 |
| profile6 | 3702 | A-vs.-E | 214 |
| profile6 | 3702 | M-vs.-A | 18 |
| profile6 | 3702 | M-vs.-E | 323 |
Muscle developmental genes shared between different groups according to DMPs and DEGs.
| Gene | Description | Group |
|---|---|---|
|
| insulin-like growth factor 2 | M-E; A-E; profile0_1 |
|
| transmembrane protein 8c | M-E; A-E; profile0_1 |
|
| musculoskeletal, embryonic nuclear protein I | M-E; A-E; profile6 |
|
| calcium voltage-gated channel subunit alpha1 S | M-E; A-E; profile6 |