| Literature DB >> 29730206 |
Yuan Zhou1, Qinghua Cui2.
Abstract
The m6A modification has been implicated as an important epitranscriptomic marker, which plays extensive roles in the regulation of transcript stability, splicing, translation, and localization. Nevertheless, only some genes are repeatedly modified across various conditions and the principle of m6A regulation remains elusive. In this study, we performed a systems-level analysis of human genes frequently regulated by m6A modification (m6Afreq genes) and those occasionally regulated by m6A modification (m6Aocca genes). Compared to the m6Aocca genes, the m6Afreq genes exhibit gene importance-related features, such as lower dN/dS ratio, higher protein-protein interaction network degree, and reduced tissue expression specificity. Signaling network analysis indicates that the m6Afreq genes are associated with downstream components of signaling cascades, high-linked signaling adaptors, and specific network motifs like incoherent feed forward loops. Moreover, functional enrichment analysis indicates significant overlaps between the m6Afreq genes and genes involved in various layers of gene expression, such as being the microRNA targets and the regulators of RNA processing. Therefore, our findings suggest the potential interplay between m6A epitranscriptomic regulation and other gene expression regulatory machineries.Entities:
Keywords: Epitranscriptome; Gene expression regulation; Gene importance; Signaling network; m(6)A
Mesh:
Substances:
Year: 2018 PMID: 29730206 PMCID: PMC6112303 DOI: 10.1016/j.gpb.2018.01.001
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Figure 1The overall distribution of the number of m
A. The raw count of the number of m6A regulated conditions in our comprehensive m6A dataset. Intuitively, an m6A regulated condition is counted if there is any m6A peak identified in a particular gene under a specified condition. B. The corrected number of m6A regulated conditions in the comprehensive m6A dataset. The corrected number of m6A regulated conditions was obtained by dividing the number of m6A regulated conditions against that of cell types (covered by m6A profiles) where the gene shows baseline expression. A gene is considered to show baseline expression in a cell type, if TPM is greater than 0.5 in the corresponding cell type according to the Human Protein Atlas database. TPM, transcripts per kilobase million.
Figure 2The correlation between the corrected number of m
The correlation curves between the corrected number of m6A regulated conditions and various gene features are plotted by using the LOESS smoothing technique. The line indicates the local average estimated by LOESS smoothing and the shade indicates the confidence interval. Outlier genes (0.5%) with extremely high corrected number of m6A regulated conditions are omitted due to their high variation in gene feature values, which could result in badly skewed regression lines. A. Correlation of corrected number of m6A regulated conditions with dN/dS ratio. B. Correlation of corrected number of m6A regulated conditions with number of orthologous genes. C. Correlation of corrected number of m6A regulated conditions with PPI network degree. D. Correlation of corrected number of m6A regulated conditions with tissue expression specificity. E. Correlation of corrected number of m6A regulated conditions with number of targeting microRNAs. F. The summary of Spearman’s correlation coefficient and P values for panels A–E. PPI, protein–protein interaction.
Figure 3The network feature of the m
A. Boxplot comparing the distributions of relative level in the signaling network, between the m6Afreq genes and m6Aocca genes. The relative level in the signaling network shown here was calculated as the shortest distance to any upstream receptor divided by the sum of the shortest distance to any upstream receptor and the shortest distance to any downstream transcriptional factors. B. Cumulative distribution plot comparing the PPI-only degree distribution of the m6Afreq genes and that of m6Aocca genes. The PPI-only degree only considers PPI edges in the signaling network but omits the activating and repressing edges. C. Cumulative distribution plot comparing the PPI-only degree distribution of the interacting partners of m6Afreq genes and that of the interacting partners of m6Aocca genes. D. The overrepresented network motifs of m6Afreq genes. In a network motif, if there are more m6Afreq genes than 9500 out of 10,000 sets of randomly picked genes (corresponding to an empirical P value < 0.05), this motif is considered as an overrepresented motif. The respective motifs are explicitly depicted by the schemas on top. The activating and repressing edges are indicated using lines with arrowhead and circle, respectively. The name of the motif is composed of the motif type and the code describing the edge topology in the motif. For example, IFF3a1i2abbc indicates an incoherent feedforward loop with three nodes (a, b, and c) that form one activating edge and two repressing edges. Among the three edges, the major class of the edges (in this motif, the major class is repressing edge) comprises the edge between a and b, and the edge between b and c. IFF, incoherent feedforward loop; CFF, coherent feedforward loop; NFB, negative feedback loop; PFB, positive feedback loop.