| Literature DB >> 29922328 |
Fouzia Yeasmin1, Tetsushi Yada2, Nobuyoshi Akimitsu1.
Abstract
Integrative analysis using omics-based technologies results in the identification of a large number of putative short open reading frames (sORFs) with protein-coding capacity within transcripts previously identified as long noncoding RNAs (lncRNAs) or transcripts of unknown function (TUFs). sORFs were previously overlooked because of their diminutive size and the difficulty of identification by bioinformatics analyses. There is now growing evidence of the existence of potentially functional micropeptides produced from sORFs within cells of diverse species. Recent characterization of a few of these revealed their significant divergent roles in many fundamental biological processes, where some also show important relationships with pathogenesis. Recent works therefore provide new insights for exploring the wealth of information that may lie within sORF-encoded short proteins. Here, we summarize the current progress and view of micropeptides encoded in sORFs of protein-coding genes.Entities:
Keywords: TUFs; lncRNAs; micropeptides; sORFs; translation
Year: 2018 PMID: 29922328 PMCID: PMC5996887 DOI: 10.3389/fgene.2018.00144
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Computational and experimental approaches to protein-coding sORFs.
| Computing-based method | sORFfinder, HAItORF, uPEPperoni | Web based tools to locate sORF having coding potential | Hanada et al., |
| PhyloCSF | A computational method examines evolutionary conservation of a sORF across species | Lin et al., | |
| Transcriptomic-based method | Ribosome profiling | A deep sequencing- based tool of ribosome protected mRNA fragments to obtain a global snapshot of translation | Ingolia et al., |
| Poly-ribo seq | A combination of ribosome profiling and polysome to enrich more potent protein coding ORFs | Aspden et al., | |
| Ribosome releasing scores (RRS) | These three metrics are developed and combined with ribosome profiling to assist in identification of true protein coding ORFs | Guttman et al., | |
| Fragment length organization similarity score (FLOSS) | Ingolia et al., | ||
| ORF regression algorithm for translation evaluation RPFS (ribosome-protected mRNA fragments) (ORF-RATER) | Fields et al., | ||
| Proteomics-based | Proteo genomics | A combined approach of proteomics and genomics | Slavoff et al., |
Figure 1Diverse biological function of recently annotated micropeptides. Micropeptides are found to be involved in many biological processes. Myoregulin (MLN), phospholamban (PLN), sarcolipin (SLN), and another regulin (ALN) are a group of peptides that interact with the protein SERCA (a Ca2+ Pump) in sarcoplasmic and endoplasmic reticulum (S/ER) and maintain Ca2+ homeostasis in the cell. MOTS-c and humanin are mitochondrial sORF-encoded micropeptides that display important roles in metabolic homeostasis and apoptosis, respectively. Humanin suppresses apoptosis by preventing the translocation of an apoptosis inducing protein, Bax (Bcl2-associated X protein), from cytoplasm to mitochondria. Another micropeptide named MRI-2 is found to enhance non-homologous end joining (NHEJ) of double-strand DNA breaks (DSBs) by associating with other DNA end-binding proteins (Ku proteins). Myomixer, minion, SPAR, and NoBody, four other micropeptides that have been recently discovered, have distinct biological roles wherein myomixer and minion stimulate the fusion of myoblast to form myofiber during muscle formation by participating with another protein, myomaker. The micropeptide SPAR is localized into lysome where it interacts with the lysosomal v-ATPase complex and regulates mTORC1 protein activation during stress signaling. NoBody, a p-body (processing-body, which is involved in mRNA turnover) dissociating micropeptide, shows its function by interacting with the mRNA decapping complex.
Figure 2Various approaches for functional characterization of micropeptides. (A) Evolutionary conservation of a peptide sequence is suggestive of functionality. Homology- based searching among species thus can be performed to identify whether the target peptide sequence shares any functional similarity with other proteins. Here the blue and red boxes indicate the conserved sequences among species. (B) Functional proteomics is a commonly used approach for identifying the interacting proteins of a target protein. In this method, first, immunoprecipitation is conducted by using an antibody (Ab) that is designed either against the epitope tagged with a target micropeptide or directly against the micropeptide. Western blot is then performed followed by mass spectrometry analysis to separate and identify the interacting proteins. Red brackets indicate the bands of interacting proteins that are separated by western blot analysis. A negative control (NC) denotes an empty vector that also runs for comparison. The nature of the interacting protein will thus provide clues about the function of the target micropeptide. (C) CRISPR-cas9 mediated gene editing approaches can also be used to check the coding potential of sORFs. To verify the coding potential, an epitope tag (FLAG) can be inserted at the downstream of the sORF into the endogenous locus. CRISPR-cas9 mediated gene editing is started by the recognition of the target site, which is mediated by a guide RNA (gRNA). Guide RNA guides the cas9 endonuclease to a specific location in the genome sequence, which is immediately adjacent to a protospacer adjacent motif (PAM). Upon recognition, the cas9 creates a double strand break (DSB) at the target site. This DSB can then be repaired either by non-homologous end joining (NHEJ) or by homology directed repair (HDR). HDR is used to insert an epitope tag at the target site where a donor vector with homology to the targeted locus must be provided. The donor vector must contain the epitope tag that has to be knocked-in at the target site. Expression of the engineered fusion protein can then be verified by western blot analysis.
Micropeptides and their diverse biological functions.
| Plant | Early nodulin 40 (Enod 40) | Plants | Nodule organogenesis | 12.24 | Rohrig et al., | |
| POLARIS (PLS) | Gene expression analysis by promoter trapping; Mutation analysis | Leaf morphogenesis | 36 | Casson et al., | ||
| Brick1 (Brk) | Plants and animals | Mutation analysis | Leaf morphogenesis | 76 | Frank and Smith, | |
| ROTUNDIFOLIA (ROT4) | Plants | Screening of a mutant in | Leaf morphogenesis | 53 | Narita et al., | |
| ROT18/DLV1 | Plants | Gain of function screening of genes responsible for fruit growth and development in Arabidopsis | Plant organogenesis | 51 | Wen et al., | |
| Kiss of death (KOD) | Gene expression analysis by promoter trapping | Programmed cell death regulation | 25 | Blanvillain et al., | ||
| Zm401p10, Zm908p11 | Poaceae | Bioinformatics analysis | Pollen development | 89.97 | Ma et al., | |
| Animal | Polished rice (Pri) | Insects | Mutation analysis | Fly embryogenesis | 11–32 | Galindo et al., |
| Toddler | Vertebrates | Ribo-seq-based search for novel signaling peptides | Promotes cell migration | 58 | Pauli et al., | |
| AGD3 | Mammals | Sequencing analysis. | Involve in stem cell differentiation | 63 | Kikuchi et al., | |
| Myoregulin (MLN) | Mammals | Bioinformatics approaches; Homology-based characterization | Calcium homeostasis | 46 | Magny et al., | |
| DWORF | Lamprey | PhyloCSF search; Gain and loss of function | Enhance muscle performance | 34 | Nelson et al., | |
| Myomixer | Vertebrates | CRISPR-cas9 mediated loss of function screening of genes required for myoblast fusion | Functionally involve in controlling muscle performance | 84 | Bi et al., | |
| MRI-2 | Mammals | HPLC-MS/MS screening combining with RNA seq; characterized by functional proteomics | DNA repairing process | 69 | Slavoff et al., | |
| NoBody | Mammals | HPLC-MS/MS screening combining with RNA seq; characterized by functional Proteomics | mRNA recycling | 68 | D'Lima et al., | |
| SPAR | Human and mouse | Proteomics strategy | Regulate muscle regeneration | 90 | Matsumoto et al., | |
| Humanin | Different species | Functional expression screening | Involve in program cell death | 24 | Hashimoto et al., | |
| MOTS-c | 14 species | Metabolic Homeostasis | 16 | Lee et al., | ||
| Minion | Mammalian species | RNA seq analysis of uninjured and regenerating muscle | Muscle formation | 84 | Zhang et al., | |
| HOXB-AS3 | Primates | Ribosome profiling | Suppresses colon cancer growth | 53 | Huang et al., |