Literature DB >> 29757649

Functional Annotation of Proteins Encoded by the Minimal Bacterial Genome Based on Secondary Structure Element Alignment.

Zhiyuan Yang1,2,3, Stephen Kwok-Wing Tsui2,3,4.   

Abstract

In synthetic biology, one of the key focuses is building a minimal artificial cell which can provide basic chassis for functional study. Recently, the J. Craig Venter Institute published the latest version of the minimal bacterial genome JCVI-syn3.0, which only encoded 438 essential proteins. However, among them functions of 149 proteins remain unknown because of the lack of effective annotation method. Here, we report a secondary structure element alignment method called SSEalign based on an effective training data set extracting from various bacterial genomes. The experimentally validated homologous genes in different species were selected as training positives, while unrelated genes in different species were selected as training negatives. Moreover, SSEalign used a set of well-defined basic alignment elements with the backtracking line search algorithm to derive the best parameters for accurate prediction. Experimental results showed that SSEalign achieved 88.2% test accuracy, which is better than the existing prediction methods. SSEalign was subsequently applied to identify the functions of those unannotated proteins in the latest published minimal bacteria genome JCVI-syn3.0. Results indicated that at least 136 proteins out of 149 unannotated proteins in the JCVI-syn3.0 genome could be annotated by SSEalign. Our method is effective for the identification of protein homology in JCVI-syn3.0 and can be used to annotate those hypothetical proteins in other bacterial genomes.

Keywords:  JCVI-syn3.0; essential gene; homology identification; protein secondary structure

Mesh:

Substances:

Year:  2018        PMID: 29757649     DOI: 10.1021/acs.jproteome.8b00262

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  7 in total

1.  Functions of Essential Genes and a Scale-Free Protein Interaction Network Revealed by Structure-Based Function and Interaction Prediction for a Minimal Genome.

Authors:  Chengxin Zhang; Wei Zheng; Micah Cheng; Gilbert S Omenn; Peter L Freddolino; Yang Zhang
Journal:  J Proteome Res       Date:  2021-01-04       Impact factor: 4.466

2.  Traditional protocols and optimization methods lead to absent expression in a mycoplasma cell-free gene expression platform.

Authors:  Andrei Sakai; Christopher R Deich; Frank H T Nelissen; Aafke J Jonker; Daniela M de C Bittencourt; Christopher P Kempes; Kim S Wise; Hans A Heus; Wilhelm T S Huck; Katarzyna P Adamala; John I Glass
Journal:  Synth Biol (Oxf)       Date:  2022-05-21

3.  Improved annotation of Lutzomyia longipalpis genome using bioinformatics analysis.

Authors:  Zhiyuan Yang; Ying Wu
Journal:  PeerJ       Date:  2019-10-09       Impact factor: 2.984

Review 4.  Bacterial genome reductions: Tools, applications, and challenges.

Authors:  Nicole LeBlanc; Trevor C Charles
Journal:  Front Genome Ed       Date:  2022-08-31

5.  Toward the Complete Functional Characterization of a Minimal Bacterial Proteome.

Authors:  David M Bianchi; James F Pelletier; Clyde A Hutchison; John I Glass; Zaida Luthey-Schulten
Journal:  J Phys Chem B       Date:  2022-09-01       Impact factor: 3.466

Review 6.  Hypothetical Proteins as Predecessors of Long Non-coding RNAs.

Authors:  Girik Malik; Tanu Agarwal; Utkarsh Raj; Vijayaraghava Seshadri Sundararajan; Obul Reddy Bandapalli; Prashanth Suravajhala
Journal:  Curr Genomics       Date:  2020-11       Impact factor: 2.236

7.  Genome-scale metabolic modeling reveals key features of a minimal gene set.

Authors:  Jean-Christophe Lachance; Dominick Matteau; Joëlle Brodeur; Colton J Lloyd; Nathan Mih; Zachary A King; Thomas F Knight; Adam M Feist; Jonathan M Monk; Bernhard O Palsson; Pierre-Étienne Jacques; Sébastien Rodrigue
Journal:  Mol Syst Biol       Date:  2021-07       Impact factor: 11.429

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.