| Literature DB >> 23434047 |
Yuli Li1, Shuhui Song, Cuiping Li, Jun Yu.
Abstract
RNA modifications, especially methylation of the N(6) position of adenosine (A)-m(6)A, represent an emerging research frontier in RNA biology. With the rapid development of high-throughput sequencing technology, in-depth study of m(6)A distribution and function relevance becomes feasible. However, a robust method to effectively identify m(6)A-modified regions has not been available yet. Here, we present a novel high-efficiency and user-friendly analysis pipeline called MeRIP-PF for the signal identification of MeRIP-Seq data in reference to controls. MeRIP-PF provides a statistical P-value for each identified m(6)A region based on the difference of read distribution when compared to the controls and also calculates false discovery rate (FDR) as a cut off to differentiate reliable m(6)A regions from the background. Furthermore, MeRIP-PF also achieves gene annotation of m(6)A signals or peaks and produce outputs in both XLS and graphical format, which are useful for further study. MeRIP-PF is implemented in Perl and is freely available at http://software.big.ac.cn/MeRIP-PF.html.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23434047 PMCID: PMC4357668 DOI: 10.1016/j.gpb.2013.01.002
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Figure 1MeRIP-Seq, MeRIP-PF and result processing A. Schematic of MeRIP-Seq and MeRIP-PF pipelines (red dots indicate m6A sites). B. Distribution of m6A peaks in different regions of transcripts. Pie chart showing the percentage of m6A peaks within distinct regions of RNA. NP, non-protein-coding genes; PR, protein-coding genes. C. Distribution of m6A peaks along mRNAs. 5′ UTRs, CDSs and 3′ UTRs of every transcript are separately binned into regions spanning 1% of their total lengths. Y-coordinates represent percentage of m6A peaks located in every bin. D. An example of transcripts in wig plot. Y-coordinates show the read coverage of every position in transcripts. NM_009484 was taken as an example. Different regions of transcripts are color coded with UTR indicated in blue and CDS in pink, and intronic regions are indicated with blank space. Red triangle indicates the position of m6A peaks.
Peak-finding performance of MeRIP-PF
| Methods | No. of peaks | Group A | Group B | Group C | Group D |
|---|---|---|---|---|---|
| Identical peaks (overlap ⩾ 75%) | Similar peaks (50% ⩽ overlap < 75%) | Different peaks (overlap < 50%) | New peaks (overlap = 0%) | ||
| Ref. | 13,471 | 9555 | 3719 | 197 | |
| MeRIP-PF | 20,199 | 9555 | 3719 | 1772 | 5153 |
Note: The “overlap” we mean is reciprocal for peak M and peak N. In other words, if “overlap” is more than 75%, it requires that peak N overlaps at least 75% of peak M and that peak M also overlaps at least 75% of peak N.
Time spent on processing GSM854223/GSM854224 datasets
| Process | Timing (min) |
|---|---|
| Mapping | ∼200 |
| Statistical test | ∼15 |
| Annotation | ∼10 |
| Plotting | ∼10 |