| Literature DB >> 30042402 |
Shahid Y Khan1, Muhammad Ali1, Firoz Kabir1, Ruiqiang Chen2, Chan Hyun Na2, Mei-Chong W Lee3, Nader Pourmand3, Sean F Hackett1, S Amer Riazuddin4.
Abstract
We previously investigated the transcriptome and proteome profiles of the murine ocular lens at six developmental time points including two embryonic (E15 and E18) and four postnatal time points (P0, P3, P6, and P9). Here, we extend our analyses to identify novel transcripts and peptides in developing mouse lens. We identified a total of 9,707 novel transcripts and 325 novel fusion genes in developing mouse lens. Additionally, we identified 13,281 novel alternative splicing (AS) events in mouse lens including 6,990 exon skipping (ES), 2,447 alternative 3' splice site (A3SS), 1,900 alternative 5' splice site (A5SS), 1,771 mutually exclusive exons (MXE), and 173 intron retention (IR). Finally, we integrated our OMIC (Transcriptome and Proteome) datasets identifying 20 novel peptides in mouse lens. All 20 peptides were validated through matching MS/MS spectra of synthetic peptides. To the best of our knowledge, this is the first report integrating OMIC datasets to identify novel peptides in developing murine lens.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30042402 PMCID: PMC6057992 DOI: 10.1038/s41598-018-28727-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Illustration of the workflow to identify novel transcripts and peptides in developing mouse lens. Briefly, HISAT2, a splice aligner tool was used for the alignment of mouse lens RNA-Seq reads (FASTQ) to the mouse genome, followed by the transcripts annotation and expression quantification using the StringTie algorithm. In parallel, the RNA-Seq aligned data was further processed using JAFFA, and rMATS algorithms to detect fusion genes, and alternative splicing events, respectively, expressing in mouse lens. The novel transcripts (≥1.0TPM) were analyzed using a proteogenomics approach to identify novel peptides. The novel transcripts were translated into potential open reading frames (ORFs) to generate a reference database. The mouse lens proteome data (MS/MS spectra) was searched against this reference database to identify novel peptides. Finally, the novel peptides were validated through matching MS/MS spectra of synthetic peptides.
Transcripts identified in developing mouse lens.
| Developmental Stage | Transcripts expression (≥1.0 TPM) | |||
|---|---|---|---|---|
| Annotated | Novel | Semi-novel | Fusion Genes | |
| E15 | 18,087 | 6,645 | 11,025 | 195 |
| E18 | 17,500 | 7,061 | 10,335 | 161 |
| P0 | 17,257 | 7,569 | 11,194 | 275 |
| P3 | 17,257 | 7,513 | 11,199 | 218 |
| P6 | 16,656 | 7,293 | 10,831 | 280 |
| P9 | 16,674 | 7,084 | 10,605 | 223 |
| Total | 21,265 | 9,707 | 14,113 | 325 |
Note: E15 and E18 are embryonic days 15, and 18, and P0, P3, P6, and P9 are postnatal days 0, 3, 6, and 9, respectively; TPM: transcripts per million.
Alternative splicing events identified in developing mouse lens.
| Developmental Stage | Types of AS Events | Total AS Events | Significant Novel AS Events* |
|---|---|---|---|
| E15 vs. E18 | ES | 22160 | 416 |
| MXE | 3069 | 93 | |
| A5SS | 9753 | 162 | |
| A3SS | 13755 | 306 | |
| IR | 575 | 23 | |
| E15 vs. P0 | ES | 26560 | 521 |
| MXE | 4379 | 124 | |
| A5SS | 10743 | 97 | |
| A3SS | 14818 | 93 | |
| IR | 646 | 11 | |
| E15 vs. P3 | ES | 23261 | 336 |
| MXE | 3881 | 107 | |
| A5SS | 10247 | 106 | |
| A3SS | 14215 | 107 | |
| IR | 605 | 8 | |
| E15 vs. P6 | ES | 22898 | 459 |
| MXE | 3827 | 129 | |
| A5SS | 10119 | 110 | |
| A3SS | 14093 | 119 | |
| IR | 617 | 8 | |
| E15 vs. P9 | ES | 21982 | 444 |
| MXE | 3872 | 121 | |
| A5SS | 9859 | 97 | |
| A3SS | 13623 | 94 | |
| IR | 559 | 13 | |
| E18 vs. P0 | ES | 23579 | 1107 |
| MXE | 3470 | 194 | |
| A5SS | 9295 | 259 | |
| A3SS | 12849 | 309 | |
| IR | 557 | 26 | |
| E18 vs. P3 | ES | 20212 | 810 |
| MXE | 3321 | 219 | |
| A5SS | 8888 | 233 | |
| A3SS | 12271 | 287 | |
| IR | 515 | 19 | |
| E18 vs. P6 | ES | 19789 | 931 |
| MXE | 3109 | 198 | |
| A5SS | 80692 | 238 | |
| A3SS | 12123 | 300 | |
| IR | 503 | 21 | |
| E18 vs. P9 | ES | 18776 | 921 |
| MXE | 2938 | 188 | |
| A5SS | 8384 | 241 | |
| A3SS | 11645 | 290 | |
| IR | 452 | 15 | |
| P0 vs. P3 | ES | 24298 | 281 |
| MXE | 4296 | 131 | |
| A5SS | 9669 | 82 | |
| A3SS | 13227 | 108 | |
| IR | 586 | 5 | |
| P0 vs. P6 | ES | 23843 | 194 |
| MXE | 4265 | 33 | |
| A5SS | 9581 | 46 | |
| A3SS | 13119 | 88 | |
| IR | 583 | 6 | |
| P0 vs. P9 | ES | 22985 | 214 |
| MXE | 4137 | 49 | |
| A5SS | 9293 | 67 | |
| A3SS | 12681 | 114 | |
| IR | 533 | 9 | |
| P3 vs. P6 | ES | 20632 | 149 |
| MXE | 4121 | 108 | |
| A5SS | 9142 | 62 | |
| A3SS | 12526 | 89 | |
| IR | 542 | 3 | |
| P3 vs. P9 | ES | 19721 | 143 |
| MXE | 3816 | 60 | |
| A5SS | 8832 | 54 | |
| A3SS | 12003 | 78 | |
| IR | 499 | 4 | |
| P6 vs. P9 | ES | 19208 | 64 |
| MXE | 4083 | 17 | |
| A5SS | 8672 | 46 | |
| A3SS | 11934 | 65 | |
| IR | 492 | 2 |
Note: AS: alternative splicing; ES: exon skipping; MXE: mutually exclusive exon; A5SS: alternative 5′ splice site; A3SS: alternative 3′ splice site; and IR: intron retention. *Significant events based on false discovery rate (FDR) <0.01. E15 and E18 are embryonic days 15, and 18, and P0, P3, P6, and P9 are postnatal days 0, 3, 6, and 9, respectively.
Novel peptides identified in mouse lens through integration of OMIC (Transcriptome and Proteome) datasets[22,24].
| No. | Novel Peptide | Amino Acid Length | Number of PSMs | Novel Transcript ID | Genomic Coordinates (mm10) |
|---|---|---|---|---|---|
| 1 | AAESDLSTARPAPPEPR | 17 | 1 | MSTRG.4120.3 | Chr11: 117620218–117620288 |
| 2 | AFAHAEPR | 8 | 2 | MSTRG.6156.3 | Chr13: 120244503–120244596 |
| 3 | ATEDCFQER | 9 | 2 | MSTRG.5109.18 | Chr12: 113023646–113023713 |
| 4 | AVGVDCSAPEPR | 12 | 2 | MSTRG.3713.5 | Chr11: 97361423–97363714 |
| 5 | DLGGVESASPSAAR | 14 | 1 | MSTRG.19717.4 | Chr7: 137903048–137903154 |
| 6 | DREIWLNR | 8 | 4 | MSTRG.12904.4 | Chr2: 177493993–177494080 |
| 7 | GLQRPDGGDHR | 11 | 2 | MSTRG.398.2 | Chr1: 65973222–65973464 |
| 8 | LCGACGTASGTK | 12 | 3 | MSTRG.16432.2 | Chr5: 112688277–112688395 |
| 9 | LRHLNNVNILK | 11 | 2 | MSTRG.11709.1 | Chr2: 38729037–38735032 |
| 10 | NYFYTGAEIK | 10 | 1 | MSTRG.19785.1 | Chr7: 146523115–146545389 |
| 11 | QGISSISTFK | 10 | 3 | MSTRG.18410.1 | Chr7: 19857400–19857609 |
| 12 | SAQALVK | 7 | 2 | MSTRG.6483.20 | Chr14: 48326083–48326173 |
| 13 | SINEVIK | 7 | 1 | MSTRG.5839.2 | Chr13: 74088982–74089093 |
| 14 | SMGEDTVPK | 9 | 1 | MSTRG.6973.3 | Chr14: 121639196–121639264 |
| 15 | WLIEISK | 7 | 3 | MSTRG.8658.5 | Chr17: 3516990–3517538 |
| 16 | FLTVSTSPGFPGT | 13 | 1 | MSTRG.16436.1 | Chr5: 112684967–112698364 |
| 17 | AAGDAEPEDQAQPQPQPEPEPR | 22 | 4 | MSTRG.5528.4 | Chr13: 46517220–46517274 |
| 18 | LGHVGGADGANPSSAGSPQDGR | 22 | 1 | MSTRG.3412.1 | Chr11: 77839387–77839530 |
| 19 | SPGSEPQTQEAQEAGSDPQAARPQR | 25 | 2 | MSTRG.18532.2 | Chr7: 28289684–28289782 |
| 20 | ERPTPDVGDGQGPQLSESSSSPFSIPPDK | 29 | 11 | MSTRG.12232.3 | Chr2: 120330209–120330655 |
All 20 peptides were validated through matching MS/MS spectra of synthetic peptides. Note: PSMs: Peptide Spectrum Matches.
Figure 2Validation through matching MS/MS spectra of synthetic peptides. (A) MS/MS spectra of a novel peptide (WLIEISK) shown with a similar fragmentation pattern observed from the corresponding synthetic peptide. (B) MS/MS spectra of a novel peptide (SMGEDTVPK) shown with a similar fragmentation pattern observed from the corresponding synthetic peptide. (C) MS/MS spectra of a novel peptide (LCGACGTASGTK) shown with a similar fragmentation pattern observed from the corresponding synthetic peptide. Note: the term “peptide identified from sample” refers to the MS/MS spectra identified in mouse lens proteome[24], and the synthetic peptide refers to MS/MS spectra of the peptide synthesized by JPT Peptide Technologies (Berlin, Germany).
Data retrieved from previously published OMIC (Transcriptome and Proteome) datasets[22,24].
| A. Mouse Lens Transcriptome (RNA-Seq) | B. Mouse Lens Proteome (TMT) | ||||||
|---|---|---|---|---|---|---|---|
| Stage | Replicates | Genes | Total Genes | Stage | TMT | Proteins | Total Proteins |
| E15 | E15A | 13,274 | 14,465 | E15 | Set 1 | 4,630 | 5,404 |
| E18 | |||||||
| P0 | |||||||
| E18 | E18A | 13,900 | P3 | ||||
| P6 | |||||||
| P9 | |||||||
| P0 | P0A | 12,560 | E15 | Set 2 | 4,426 | ||
| E18 | |||||||
| P0 | |||||||
| P3 | P3A | 12,940 | P3 | ||||
| P6 | |||||||
| P9 | |||||||
| P6 | P6A | 12,130 | E15 | Set 3 | 3,747 | ||
| E18 | |||||||
| P0 | |||||||
| P9 | P9A | 12,229 | P3 | ||||
| P6 | |||||||
| P9 | |||||||
Note: E15 and E18 are embryonic days 15, and 18, and P0, P3, P6, and P9 are postnatal days 0, 3, 6, and 9, respectively.