Zachary D Wallen1. 1. Department of Neurology, University of Alabama At Birmingham, Birmingham, AL, 35294, USA. zacharywallen@uabmc.edu.
Abstract
BACKGROUND: Testing for differential abundance of microbes in disease is a common practice in microbiome studies. Numerous differential abundance (DA) testing methods exist and range from traditional statistical tests to methods designed for microbiome data. Comparison studies of DA testing methods have been performed, but none performed on microbiome datasets collected for the study of real, complex disease. Due to this, DA testing was performed here using various DA methods in two large, uniformly collected gut microbiome datasets on Parkinson disease (PD), and their results compared. RESULTS: Overall, 78-92% of taxa tested were detected as differentially abundant by at least one method, while 5-22% were called differentially abundant by the majority of methods (depending on dataset and filtering of taxonomic data prior to testing). Concordances between method results ranged from 1 to 100%. Average concordance for datasets 1 and 2 were 24% and 28% respectively, and 27% for replicated DA signatures. Concordances increased when removing rarer taxa before testing, increasing average concordances by 2-32%. Certain methods consistently resulted in higher concordances (e.g. ANCOM-BC, LEfSe), while others consistently resulted in lower (e.g. edgeR, fitZIG). Hierarchical clustering revealed three groups of DA signatures that were (1) replicated by the majority of methods on average and included taxa previously associated with PD, (2) replicated by a subset of methods and included taxa largely enriched in PD, and (3) replicated by few to one method(s). CONCLUSIONS: Differential abundance tests yielded varied concordances, and amounts of detected DA signatures. Some methods were more concordant than others on both filtered and unfiltered data, therefore, if consistency with other study methodology is a key goal, one might choose among these methods. Even still, using one method on one dataset may find true associations, but may also detect false positives. To help lower false positives, one might analyze data with two or more DA methods to gauge concordance, and use a built-in replication dataset. This study will hopefully serve to complement previously reported DA method comparison studies by implementing and coalescing a large number of both previously and yet to be compared methods on two real gut microbiome datasets.
BACKGROUND: Testing for differential abundance of microbes in disease is a common practice in microbiome studies. Numerous differential abundance (DA) testing methods exist and range from traditional statistical tests to methods designed for microbiome data. Comparison studies of DA testing methods have been performed, but none performed on microbiome datasets collected for the study of real, complex disease. Due to this, DA testing was performed here using various DA methods in two large, uniformly collected gut microbiome datasets on Parkinson disease (PD), and their results compared. RESULTS: Overall, 78-92% of taxa tested were detected as differentially abundant by at least one method, while 5-22% were called differentially abundant by the majority of methods (depending on dataset and filtering of taxonomic data prior to testing). Concordances between method results ranged from 1 to 100%. Average concordance for datasets 1 and 2 were 24% and 28% respectively, and 27% for replicated DA signatures. Concordances increased when removing rarer taxa before testing, increasing average concordances by 2-32%. Certain methods consistently resulted in higher concordances (e.g. ANCOM-BC, LEfSe), while others consistently resulted in lower (e.g. edgeR, fitZIG). Hierarchical clustering revealed three groups of DA signatures that were (1) replicated by the majority of methods on average and included taxa previously associated with PD, (2) replicated by a subset of methods and included taxa largely enriched in PD, and (3) replicated by few to one method(s). CONCLUSIONS: Differential abundance tests yielded varied concordances, and amounts of detected DA signatures. Some methods were more concordant than others on both filtered and unfiltered data, therefore, if consistency with other study methodology is a key goal, one might choose among these methods. Even still, using one method on one dataset may find true associations, but may also detect false positives. To help lower false positives, one might analyze data with two or more DA methods to gauge concordance, and use a built-in replication dataset. This study will hopefully serve to complement previously reported DA method comparison studies by implementing and coalescing a large number of both previously and yet to be compared methods on two real gut microbiome datasets.
Authors: Siddhartha Mandal; Will Van Treuren; Richard A White; Merete Eggesbø; Rob Knight; Shyamal D Peddada Journal: Microb Ecol Health Dis Date: 2015-05-29
Authors: Jiye Kwon; Yong Kong; Martina Wade; Derek J Williams; Clarence Buddy Creech; Scott Evans; Emmanuel B Walter; Judy M Martin; Jeffrey S Gerber; Jason G Newland; Meghan E Hofto; Mary Allen Staat; Henry F Chambers; Vance G Fowler; W Charles Huskins; Melinda M Pettigrew Journal: J Infect Dis Date: 2022-09-21 Impact factor: 7.759
Authors: David T J Broderick; David W Waite; Robyn L Marsh; Carlos A Camargo; Paul Cardenas; Anne B Chang; William O C Cookson; Leah Cuthbertson; Wenkui Dai; Mark L Everard; Alain Gervaix; J Kirk Harris; Kohei Hasegawa; Lucas R Hoffman; Soo-Jong Hong; Laurence Josset; Matthew S Kelly; Bong-Soo Kim; Yong Kong; Shuai C Li; Jonathan M Mansbach; Asuncion Mejias; George A O'Toole; Laura Paalanen; Marcos Pérez-Losada; Melinda M Pettigrew; Maxime Pichon; Octavio Ramilo; Lasse Ruokolainen; Olga Sakwinska; Patrick C Seed; Christopher J van der Gast; Brandie D Wagner; Hana Yi; Edith T Zemanick; Yuejie Zheng; Naveen Pillarisetti; Michael W Taylor Journal: Front Microbiol Date: 2021-12-23 Impact factor: 5.640
Authors: Josip Rudar; Teresita M Porter; Michael Wright; G Brian Golding; Mehrdad Hajibabaei Journal: BMC Bioinformatics Date: 2022-03-31 Impact factor: 3.169
Authors: M M Pettigrew; J Kwon; J F Gent; Y Kong; M Wade; D J Williams; C B Creech; S Evans; Q Pan; E B Walter; J M Martin; J S Gerber; J G Newland; M E Hofto; M A Staat; V G Fowler; H F Chambers; W C Huskins Journal: mBio Date: 2022-03-24 Impact factor: 7.786