Gongchao Jing (1), Yufeng Zhang (1, 2), Wenzhi Cui (3), Lu Liu (1), Jian Xu (1), Xiaoquan Su (4).
1. Single-Cell Center, CAS Key Lab of Biofuels, Shandong Key Lab of Energy Genetics and Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao, China.
2. College of Computer Science and Technology, Qingdao University, Qingdao, China.
3. College of Control Science and Engineering, China University of Petroleum, Qingdao, China.
4. College of Computer Science and Technology, Qingdao University, Qingdao, China. suxq@qdu.edu.cn.
Abstract
BACKGROUND: Because they cost much less in experiment and computation than metagenomic whole-genome sequencing (WGS), 16S rRNA gene amplicons have been widely used to predict the functional profiles of microbiomes, via software tools such as PICRUSt 2. However, owing to potential PCR bias and gene profile variation among phylogenetically related genomes, functional profiles predicted from 16S amplicons may deviate from WGS-derived ones, producing misleading results.
RESULTS: Here we present Meta-Apo, which greatly reduces or even eliminates such deviation and thus yields much more consistent diversity patterns between the two approaches. Tests of Meta-Apo on more than 5000 16S rRNA amplicon human microbiome samples from 4 body sites showed that the deviation between the two strategies is significantly reduced using only 15 WGS-amplicon training sample pairs. Moreover, Meta-Apo enables cross-platform functional comparison between WGS and amplicon samples, thus greatly improving 16S-based microbiome diagnosis; for example, the accuracy of gingivitis diagnosis via 16S-derived functional profiles was elevated from 65% to 95% by WGS-based classification. Therefore, at the low cost of 16S-amplicon sequencing, Meta-Apo can produce a reliable, high-resolution view of microbiome function equivalent to that offered by shotgun WGS.
CONCLUSIONS: These results suggest that large-scale, function-oriented microbiome sequencing projects can probably benefit from the lower cost of the 16S-amplicon strategy without sacrificing the precision in functional reconstruction that otherwise requires WGS. An optimized C++ implementation of Meta-Apo is available on GitHub ( https://github.com/qibebt-bioinfo/meta-apo ) under a GNU GPL license. It takes the functional profiles of a few paired WGS:16S-amplicon samples as training data and outputs calibrated functional profiles for the much larger number of 16S-amplicon samples.
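The train-then-calibrate workflow described above can be illustrated with a minimal Python sketch. It is not Meta-Apo's actual algorithm, which the abstract does not specify: it assumes a simple per-gene scaling factor (mean WGS abundance divided by mean amplicon abundance over the training pairs) as a stand-in for the real calibration model. The names `Profile`, `train_factors` and `calibrate`, and the gene identifiers `K1` and `K2`, are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: gene/function identifier -> abundance.
Profile = Dict[str, float]


def normalize(profile: Profile) -> Profile:
    """Scale a profile so its abundances sum to 1 (unchanged if all zero)."""
    total = sum(profile.values())
    if total == 0:
        return dict(profile)
    return {gene: value / total for gene, value in profile.items()}


def train_factors(pairs: List[Tuple[Profile, Profile]]) -> Dict[str, float]:
    """Learn one scaling factor per gene from paired (WGS, amplicon) profiles.

    Assumption for illustration only: factor = summed WGS relative abundance
    divided by summed amplicon relative abundance across the training pairs.
    """
    wgs_sum: Dict[str, float] = {}
    amp_sum: Dict[str, float] = {}
    for wgs, amplicon in pairs:
        for gene, value in normalize(wgs).items():
            wgs_sum[gene] = wgs_sum.get(gene, 0.0) + value
        for gene, value in normalize(amplicon).items():
            amp_sum[gene] = amp_sum.get(gene, 0.0) + value
    factors: Dict[str, float] = {}
    for gene, amp_total in amp_sum.items():
        wgs_total = wgs_sum.get(gene, 0.0)
        # Genes missing or zero on either side get no factor (left as-is later).
        if amp_total > 0 and wgs_total > 0:
            factors[gene] = wgs_total / amp_total
    return factors


def calibrate(amplicon: Profile, factors: Dict[str, float]) -> Profile:
    """Apply the learned factors to a new amplicon profile and renormalize."""
    scaled = {
        gene: value * factors.get(gene, 1.0)
        for gene, value in normalize(amplicon).items()
    }
    return normalize(scaled)


# Toy usage: one training pair, then calibration of an amplicon-only sample.
training_pairs = [({"K1": 0.5, "K2": 0.5}, {"K1": 0.8, "K2": 0.2})]
factors = train_factors(training_pairs)
calibrated = calibrate({"K1": 0.8, "K2": 0.2}, factors)
```

The sketch keeps the two phases separate on purpose: the factors are learned once from a few paired samples and can then be applied to any number of amplicon-only profiles, which mirrors the input and output described for the tool.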