Literature DB >> 34304369

LncRNA-Encoded Short Peptides Identification Using Feature Subset Recombination and Ensemble Learning.

Siyuan Zhao1, Jun Meng2, Yushi Luan3.   

Abstract

Long non-coding RNA (lncRNA), which is a type of non-coding RNA, was reported to contain short open reading frames (sORFs). SORFs-encoded short peptides (SEPs) have been demonstrated to play a crucial role in regulating the biological processes such as growth, development, and resistance response. The identification of SEPs is vital to further understanding their function. However, there is still a lack of methods for identifying SEPs effectively and rapidly. In this study, a novel method for lncRNA-encoded short peptides identification based on feature subset recombination and ensemble learning, lncPepid, is developed. lncPepid transforms the data of Zea mays and Arabidopsis thaliana into hybrid features from two aspects including sequence composition and physicochemical properties separately. It optimizes hybrid features by proposing a novel weighted iteration-based feature selection method to recombine a stable subset that characterizes SEPs effectively. Different classification models with different optimized features are constructed and tested separately. The outputs of the optimal models are integrated for ensemble classification to improve efficiency. Experimental results manifest that the geometric mean of sensitivity and specificity of lncPepid is about 70% on the identification of functional SEPs derived from multiple species. It is an effective and rapid method for the identification of lncRNA-encoded short peptides. This study can be extended to the research on SEPs from other species and have crucial implications for further findings and studies of functional genomics.
© 2021. International Association of Scientists in the Interdisciplinary Areas.

Entities:  

Keywords:  Ensemble learning; Feature subset recombination; Long non-coding RNA; Short open reading frames; Short peptides

Mesh:

Substances:

Year:  2021        PMID: 34304369     DOI: 10.1007/s12539-021-00464-1

Source DB:  PubMed          Journal:  Interdiscip Sci        ISSN: 1867-1462            Impact factor:   2.233


  26 in total

1.  Soybean ENOD40 encodes two peptides that bind to sucrose synthase.

Authors:  Horst Rohrig; Jurgen Schmidt; Edvins Miklashevichs; Jeff Schell; Michael John
Journal:  Proc Natl Acad Sci U S A       Date:  2002-02-12       Impact factor: 11.205

2.  Control of muscle formation by the fusogenic micropeptide myomixer.

Authors:  Pengpeng Bi; Andres Ramirez-Martinez; Hui Li; Jessica Cannavino; John R McAnally; John M Shelton; Efrain Sánchez-Ortiz; Rhonda Bassel-Duby; Eric N Olson
Journal:  Science       Date:  2017-04-06       Impact factor: 47.728

3.  The Arabidopsis peptide kiss of death is an inducer of programmed cell death.

Authors:  Robert Blanvillain; Bennett Young; Yao-min Cai; Valérie Hecht; Fabrice Varoquaux; Valérie Delorme; Jean-Marc Lancelin; Michel Delseny; Patrick Gallois
Journal:  EMBO J       Date:  2011-02-15       Impact factor: 11.598

4.  DVL, a novel class of small polypeptides: overexpression alters Arabidopsis development.

Authors:  Jiangqi Wen; Kevin A Lease; John C Walker
Journal:  Plant J       Date:  2004-03       Impact factor: 6.417

5.  Zm401, a short-open reading-frame mRNA or noncoding RNA, is essential for tapetum and microspore development and can regulate the floret formation in maize.

Authors:  Jinxia Ma; Bingxue Yan; Yanying Qu; Fangfang Qin; Yantao Yang; Xiujing Hao; Jingjuan Yu; Qian Zhao; Dengyun Zhu; Guangming Ao
Journal:  J Cell Biochem       Date:  2008-09-01       Impact factor: 4.429

6.  A peptide encoded by a transcript annotated as long noncoding RNA enhances SERCA activity in muscle.

Authors:  Benjamin R Nelson; Catherine A Makarewich; Douglas M Anderson; Benjamin R Winders; Constantine D Troupes; Fenfen Wu; Austin L Reese; John R McAnally; Xiongwen Chen; Ege T Kavalali; Stephen C Cannon; Steven R Houser; Rhonda Bassel-Duby; Eric N Olson
Journal:  Science       Date:  2016-01-15       Impact factor: 47.728

7.  A Peptide Encoded by a Putative lncRNA HOXB-AS3 Suppresses Colon Cancer Growth.

Authors:  Jin-Zhou Huang; Min Chen; Xing-Cheng Gao; Song Zhu; Hongyang Huang; Min Hu; Huifang Zhu; Guang-Rong Yan
Journal:  Mol Cell       Date:  2017-10-05       Impact factor: 17.970

8.  The POLARIS peptide of Arabidopsis regulates auxin transport and root growth via effects on ethylene signaling.

Authors:  Paul M Chilley; Stuart A Casson; Petr Tarkowski; Nathan Hawkins; Kevin L-C Wang; Patrick J Hussey; Mike Beale; Joseph R Ecker; Göran K Sandberg; Keith Lindsey
Journal:  Plant Cell       Date:  2006-11-30       Impact factor: 11.277

9.  Mitoregulin: A lncRNA-Encoded Microprotein that Supports Mitochondrial Supercomplexes and Respiratory Efficiency.

Authors:  Colleen S Stein; Pooja Jadiya; Xiaoming Zhang; Jared M McLendon; Gabrielle M Abouassaly; Nathan H Witmer; Ethan J Anderson; John W Elrod; Ryan L Boudreau
Journal:  Cell Rep       Date:  2018-06-26       Impact factor: 9.423

10.  Transcripts of unknown function in multiple-signaling pathways involved in human stem cell differentiation.

Authors:  Kunio Kikuchi; Makiha Fukuda; Tomoya Ito; Mitsuko Inoue; Takahide Yokoi; Suenori Chiku; Toutai Mitsuyama; Kiyoshi Asai; Tetsuro Hirose; Yasunori Aizawa
Journal:  Nucleic Acids Res       Date:  2009-06-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.