Literature DB >> 23509110

Systematization of the protein sequence diversity in enzymes related to secondary metabolic pathways in plants, in the context of big data biology inspired by the KNApSAcK motorcycle database.

Shun Ikeda1, Takashi Abe, Yukiko Nakamura, Nelson Kibinge, Aki Hirai Morita, Atsushi Nakatani, Naoaki Ono, Toshimichi Ikemura, Kensuke Nakamura, Md Altaf-Ul-Amin, Shigehiko Kanaya.   

Abstract

Biology is increasingly becoming a data-intensive science with the recent progress of the omics fields, e.g. genomics, transcriptomics, proteomics and metabolomics. The species-metabolite relationship database, KNApSAcK Core, has been widely utilized and cited in metabolomics research, and chronological analysis of that research work has helped to reveal recent trends in metabolomics research. To meet the needs of these trends, the KNApSAcK database has been extended by incorporating a secondary metabolic pathway database called Motorcycle DB. We examined the enzyme sequence diversity related to secondary metabolism by means of batch-learning self-organizing maps (BL-SOMs). Initially, we constructed a map by using a big data matrix consisting of the frequencies of all possible dipeptides in the protein sequence segments of plants and bacteria. The enzyme sequence diversity of the secondary metabolic pathways was examined by identifying clusters of segments associated with certain enzyme groups in the resulting map. The extent of diversity of 15 secondary metabolic enzyme groups is discussed. Data-intensive approaches such as BL-SOM applied to big data matrices are needed for systematizing protein sequences. Handling big data has become an inevitable part of biology.

Entities:  

Keywords:  Batch-learning self-organizing map; Big data biology; Database; KNApSAcK family; Secondary metabolic pathway; Species–metabolite relationship

Mesh:

Substances:

Year:  2013        PMID: 23509110     DOI: 10.1093/pcp/pct041

Source DB:  PubMed          Journal:  Plant Cell Physiol        ISSN: 0032-0781            Impact factor:   4.927


  4 in total

Review 1.  Systems biology in the context of big data and networks.

Authors:  Md Altaf-Ul-Amin; Farit Mochamad Afendi; Samuel Kuria Kiboi; Shigehiko Kanaya
Journal:  Biomed Res Int       Date:  2014-05-27       Impact factor: 3.411

2.  Integration of residue attributes for sequence diversity characterization of terpenoid enzymes.

Authors:  Nelson Kibinge; Shun Ikeda; Naoaki Ono; Md Altaf-Ul-Amin; Shigehiko Kanaya
Journal:  Biomed Res Int       Date:  2014-05-11       Impact factor: 3.411

3.  Novel Approach to Classify Plants Based on Metabolite-Content Similarity.

Authors:  Kang Liu; Azian Azamimi Abdullah; Ming Huang; Takaaki Nishioka; Md Altaf-Ul-Amin; Shigehiko Kanaya
Journal:  Biomed Res Int       Date:  2017-01-09       Impact factor: 3.411

Review 4.  A Novel Bioinformatics Strategy to Analyze Microbial Big Sequence Data for Efficient Knowledge Discovery: Batch-Learning Self-Organizing Map (BLSOM).

Authors:  Yuki Iwasaki; Takashi Abe; Kennosuke Wada; Yoshiko Wada; Toshimichi Ikemura
Journal:  Microorganisms       Date:  2013-11-20
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.