Tanya P Garcia1, Samuel Müller, Raymond J Carroll, Rosemary L Walzem. 1. Department of Epidemiology & Biostatistics, School of Rural Public Health, Texas A&M Health Science Center, College Station, TX 77843-1266, USA, School of Mathematics and Statistics, University of Sydney, NSW 2006 Australia, Department of Statistics, Texas A&M University, College Station, TX 77843-3143, USA and Department of Poultry Science, Intercollegiate Faculty of Nutrition, Texas A&M University, College Station, TX 77840, USA.
Abstract
MOTIVATION: Gut microbiota can be classified at multiple taxonomy levels. Strategies to use changes in microbiota composition to effect health improvements require knowing at which taxonomy level interventions should be aimed. Identifying these important levels is difficult, however, because most statistical methods only consider when the microbiota are classified at one taxonomy level, not multiple. RESULTS: Using L1 and L2 regularizations, we developed a new variable selection method that identifies important features at multiple taxonomy levels. The regularization parameters are chosen by a new, data-adaptive, repeated cross-validation approach, which performed well. In simulation studies, our method outperformed competing methods: it more often selected significant variables, and had small false discovery rates and acceptable false-positive rates. Applying our method to gut microbiota data, we found which taxonomic levels were most altered by specific interventions or physiological status. AVAILABILITY: The new approach is implemented in an R package, which is freely available from the corresponding author. CONTACT: tpgarcia@srph.tamhsc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Gut microbiota can be classified at multiple taxonomy levels. Strategies to use changes in microbiota composition to effect health improvements require knowing at which taxonomy level interventions should be aimed. Identifying these important levels is difficult, however, because most statistical methods only consider when the microbiota are classified at one taxonomy level, not multiple. RESULTS: Using L1 and L2 regularizations, we developed a new variable selection method that identifies important features at multiple taxonomy levels. The regularization parameters are chosen by a new, data-adaptive, repeated cross-validation approach, which performed well. In simulation studies, our method outperformed competing methods: it more often selected significant variables, and had small false discovery rates and acceptable false-positive rates. Applying our method to gut microbiota data, we found which taxonomic levels were most altered by specific interventions or physiological status. AVAILABILITY: The new approach is implemented in an R package, which is freely available from the corresponding author. CONTACT: tpgarcia@srph.tamhsc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Anne Vrieze; Els Van Nood; Frits Holleman; Jarkko Salojärvi; Ruud S Kootte; Joep F W M Bartelsman; Geesje M Dallinga-Thie; Mariette T Ackermans; Mireille J Serlie; Raish Oozeer; Muriel Derrien; Anne Druesne; Johan E T Van Hylckama Vlieg; Vincent W Bloks; Albert K Groen; Hans G H J Heilig; Erwin G Zoetendal; Erik S Stroes; Willem M de Vos; Joost B L Hoekstra; Max Nieuwdorp Journal: Gastroenterology Date: 2012-06-20 Impact factor: 22.682
Authors: Tanya P Garcia; Samuel Müller; Raymond J Carroll; Tamara N Dunn; Anthony P Thomas; Sean H Adams; Suresh D Pillai; Rosemary L Walzem Journal: Biostatistics Date: 2013-04-10 Impact factor: 5.899
Authors: Josue G Martinez; Raymond J Carroll; Samuel Müller; Joshua N Sampson; Nilanjan Chatterjee Journal: Am Stat Date: 2011-11-01 Impact factor: 8.710
Authors: Marc J van de Vijver; Yudong D He; Laura J van't Veer; Hongyue Dai; Augustinus A M Hart; Dorien W Voskuil; George J Schreiber; Johannes L Peterse; Chris Roberts; Matthew J Marton; Mark Parrish; Douwe Atsma; Anke Witteveen; Annuska Glas; Leonie Delahaye; Tony van der Velde; Harry Bartelink; Sjoerd Rodenhuis; Emiel T Rutgers; Stephen H Friend; René Bernards Journal: N Engl J Med Date: 2002-12-19 Impact factor: 91.245
Authors: Anthony P Thomas; Tamara N Dunn; Josephine B Drayton; Pieter J Oort; Sean H Adams Journal: Obesity (Silver Spring) Date: 2013-03 Impact factor: 5.002
Authors: Anthony P Thomas; Tamara N Dunn; Josephine B Drayton; Pieter J Oort; Sean H Adams Journal: Nutr Metab (Lond) Date: 2012-01-23 Impact factor: 4.169
Authors: Erica M Carlisle; Valeriy Poroyko; Michael S Caplan; John Alverdy; Michael J Morowitz; Donald Liu Journal: Ann Surg Date: 2013-02 Impact factor: 12.969
Authors: Michelle I Smith; Tanya Yatsunenko; Mark J Manary; Indi Trehan; Rajhab Mkakosya; Jiye Cheng; Andrew L Kau; Stephen S Rich; Patrick Concannon; Josyf C Mychaleckyj; Jie Liu; Eric Houpt; Jia V Li; Elaine Holmes; Jeremy Nicholson; Dan Knights; Luke K Ursell; Rob Knight; Jeffrey I Gordon Journal: Science Date: 2013-01-30 Impact factor: 47.728
Authors: Qunyuan Zhang; Haley Abel; Alan Wells; Petra Lenzini; Felicia Gomez; Michael A Province; Alan A Templeton; George M Weinstock; Nita H Salzman; Ingrid B Borecki Journal: Bioinformatics Date: 2015-01-06 Impact factor: 6.937
Authors: Jyoti Shankar; Sebastian Szpakowski; Norma V Solis; Stephanie Mounaud; Hong Liu; Liliana Losada; William C Nierman; Scott G Filler Journal: BMC Bioinformatics Date: 2015-02-01 Impact factor: 3.169
Authors: Emily Goren; Chong Wang; Zhulin He; Amy M Sheflin; Dawn Chiniquy; Jessica E Prenni; Susannah Tringe; Daniel P Schachtman; Peng Liu Journal: BMC Bioinformatics Date: 2021-07-06 Impact factor: 3.169