Literature DB >> 34873061

Decoding the link of microbiome niches with homologous sequences enables accurately targeted protein structure prediction.

Pengshuo Yang1, Wei Zheng2, Kang Ning3, Yang Zhang4,5.   

Abstract

Information derived from metagenome sequences through deep-learning techniques has significantly improved the accuracy of template free protein structure modeling. However, most of the deep learning-based modeling studies are based on blind sequence database searches and suffer from low efficiency in computational resource utilization and model construction, especially when the sequence library becomes prohibitively large. We proposed a MetaSource model built on 4.25 billion microbiome sequences from four major biomes (Gut, Lake, Soil, and Fermentor) to decode the inherent linkage of microbial niches with protein homologous families. Large-scale protein family folding experiments on 8,700 unknown Pfam families showed that a microbiome targeted approach with multiple sequence alignment constructed from individual MetaSource biomes requires more than threefold less computer memory and CPU (central processing unit) time but generates contact-map and three-dimensional structure models with a significantly higher accuracy, compared with that using combined metagenome datasets. These results demonstrate an avenue to bridge the gap between the rapidly increasing metagenome databases and the limited computing resources for efficient genome-wide database mining, which provides a useful bluebook to guide future microbiome sequence database and modeling development for high-accuracy protein structure and function prediction.

Entities:  

Keywords:  deep learning; microbiome; multiple sequence alignments; protein homologous families; protein structure prediction

Mesh:

Substances:

Year:  2021        PMID: 34873061      PMCID: PMC8670487          DOI: 10.1073/pnas.2110828118

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   12.779


  61 in total

Review 1.  Progress and challenges in protein structure prediction.

Authors:  Yang Zhang
Journal:  Curr Opin Struct Biol       Date:  2008-04-22       Impact factor: 6.809

2.  Crystallography & NMR system: A new software suite for macromolecular structure determination.

Authors:  A T Brünger; P D Adams; G M Clore; W L DeLano; P Gros; R W Grosse-Kunstleve; J S Jiang; J Kuszewski; M Nilges; N S Pannu; R J Read; L M Rice; T Simonson; G L Warren
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  1998-09-01

3.  Improved protein structure prediction using predicted interresidue orientations.

Authors:  Jianyi Yang; Ivan Anishchenko; Hahnbeom Park; Zhenling Peng; Sergey Ovchinnikov; David Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2020-01-02       Impact factor: 11.205

4.  Improved protein structure prediction using potentials from deep learning.

Authors:  Andrew W Senior; Richard Evans; John Jumper; James Kirkpatrick; Laurent Sifre; Tim Green; Chongli Qin; Augustin Žídek; Alexander W R Nelson; Alex Bridgland; Hugo Penedones; Stig Petersen; Karen Simonyan; Steve Crossan; Pushmeet Kohli; David T Jones; David Silver; Koray Kavukcuoglu; Demis Hassabis
Journal:  Nature       Date:  2020-01-15       Impact factor: 49.962

5.  Gut microbiota utilize immunoglobulin A for mucosal colonization.

Authors:  G P Donaldson; M S Ladinsky; K B Yu; J G Sanders; B B Yoo; W-C Chou; M E Conner; A M Earl; R Knight; P J Bjorkman; S K Mazmanian
Journal:  Science       Date:  2018-05-03       Impact factor: 47.728

6.  Characteristics and in situ remediation effects of heavy metal immobilizing bacteria on cadmium and nickel co-contaminated soil.

Authors:  Ying Wang; Yao Luo; Guoquan Zeng; Xudong Wu; Bin Wu; Xue Li; Heng Xu
Journal:  Ecotoxicol Environ Saf       Date:  2020-02-07       Impact factor: 6.291

7.  Protein structure determination using metagenome sequence data.

Authors:  Sergey Ovchinnikov; Hahnbeom Park; Neha Varghese; Po-Ssu Huang; Georgios A Pavlopoulos; David E Kim; Hetunandan Kamisetty; Nikos C Kyrpides; David Baker
Journal:  Science       Date:  2017-01-20       Impact factor: 47.728

8.  Ultrasensitive proteome analysis using paramagnetic bead technology.

Authors:  Christopher S Hughes; Sophia Foehr; David A Garfield; Eileen E Furlong; Lars M Steinmetz; Jeroen Krijgsveld
Journal:  Mol Syst Biol       Date:  2014-10-30       Impact factor: 11.429

9.  UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches.

Authors:  Baris E Suzek; Yuqi Wang; Hongzhan Huang; Peter B McGarvey; Cathy H Wu
Journal:  Bioinformatics       Date:  2014-11-13       Impact factor: 6.937

Review 10.  Multi-omics of the gut microbial ecosystem in inflammatory bowel diseases.

Authors:  Ramnik J Xavier; Curtis Huttenhower; Jason Lloyd-Price; Cesar Arze; Ashwin N Ananthakrishnan; Melanie Schirmer; Julian Avila-Pacheco; Tiffany W Poon; Elizabeth Andrews; Nadim J Ajami; Kevin S Bonham; Colin J Brislawn; David Casero; Holly Courtney; Antonio Gonzalez; Thomas G Graeber; A Brantley Hall; Kathleen Lake; Carol J Landers; Himel Mallick; Damian R Plichta; Mahadev Prasad; Gholamali Rahnavard; Jenny Sauk; Dmitry Shungin; Yoshiki Vázquez-Baeza; Richard A White; Jonathan Braun; Lee A Denson; Janet K Jansson; Rob Knight; Subra Kugathasan; Dermot P B McGovern; Joseph F Petrosino; Thaddeus S Stappenbeck; Harland S Winter; Clary B Clish; Eric A Franzosa; Hera Vlamakis
Journal:  Nature       Date:  2019-05-29       Impact factor: 49.962

View more
  3 in total

1.  Characterization of Treponema denticola Major Surface Protein (Msp) by Deletion Analysis and Advanced Molecular Modeling.

Authors:  M Paula Goetting-Minesky; Valentina Godovikova; Wei Zheng; J Christopher Fenno
Journal:  J Bacteriol       Date:  2022-08-01       Impact factor: 3.476

2.  LOMETS3: integrating deep learning and profile alignment for advanced protein template recognition and function annotation.

Authors:  Wei Zheng; Qiqige Wuyun; Xiaogen Zhou; Yang Li; Peter L Freddolino; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2022-04-14       Impact factor: 19.160

Review 3.  Using metagenomic data to boost protein structure prediction and discovery.

Authors:  Qingzhen Hou; Fabrizio Pucci; Fengming Pan; Fuzhong Xue; Marianne Rooman; Qiang Feng
Journal:  Comput Struct Biotechnol J       Date:  2022-01-03       Impact factor: 7.271

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.