BACKGROUND: Ancestral gene order reconstruction for flowering plants has lagged behind developments in yeasts, insects and higher animals, because of the recency of widespread plant genome sequencing, sequencers' embargoes on public data use, paralogies due to whole genome duplication (WGD) and fractionation of undeleted duplicates, extensive paralogy from other sources, and the computational cost of existing methods. RESULTS: We address these problems, using the gene order of four core eudicot genomes (cacao, castor bean, papaya and grapevine) that have escaped any recent WGD events, and two others (poplar and cucumber) that descend from independent WGDs, in inferring the ancestral gene order of the rosid clade and those of its main subgroups, the fabids and malvids. We improve and adapt techniques including the OMG method for extracting large, paralogy-free, multiple orthologies from conflated pairwise synteny data among the six genomes and the PATHGROUPS approach for ancestral gene order reconstruction in a given phylogeny, where some genomes may be descendants of WGD events. We use the gene order evidence to evaluate the hypothesis that the order Malpighiales belongs to the malvids rather than as traditionally assigned to the fabids. CONCLUSIONS: Gene orders of ancestral eudicot species, involving 10,000 or more genes can be reconstructed in an efficient, parsimonious and consistent way, despite paralogies due to WGD and other processes. Pairwise genomic syntenies provide appropriate input to a parameter-free procedure of multiple ortholog identification followed by gene-order reconstruction in solving instances of the "small phylogeny" problem.
BACKGROUND: Ancestral gene order reconstruction for flowering plants has lagged behind developments in yeasts, insects and higher animals, because of the recency of widespread plant genome sequencing, sequencers' embargoes on public data use, paralogies due to whole genome duplication (WGD) and fractionation of undeleted duplicates, extensive paralogy from other sources, and the computational cost of existing methods. RESULTS: We address these problems, using the gene order of four core eudicot genomes (cacao, castor bean, papaya and grapevine) that have escaped any recent WGD events, and two others (poplar and cucumber) that descend from independent WGDs, in inferring the ancestral gene order of the rosid clade and those of its main subgroups, the fabids and malvids. We improve and adapt techniques including the OMG method for extracting large, paralogy-free, multiple orthologies from conflated pairwise synteny data among the six genomes and the PATHGROUPS approach for ancestral gene order reconstruction in a given phylogeny, where some genomes may be descendants of WGD events. We use the gene order evidence to evaluate the hypothesis that the order Malpighiales belongs to the malvids rather than as traditionally assigned to the fabids. CONCLUSIONS: Gene orders of ancestral eudicot species, involving 10,000 or more genes can be reconstructed in an efficient, parsimonious and consistent way, despite paralogies due to WGD and other processes. Pairwise genomic syntenies provide appropriate input to a parameter-free procedure of multiple ortholog identification followed by gene-order reconstruction in solving instances of the "small phylogeny" problem.
Authors: Vladimir Shulaev; Daniel J Sargent; Ross N Crowhurst; Todd C Mockler; Otto Folkerts; Arthur L Delcher; Pankaj Jaiswal; Keithanne Mockaitis; Aaron Liston; Shrinivasrao P Mane; Paul Burns; Thomas M Davis; Janet P Slovin; Nahla Bassil; Roger P Hellens; Clive Evans; Tim Harkins; Chinnappa Kodira; Brian Desany; Oswald R Crasta; Roderick V Jensen; Andrew C Allan; Todd P Michael; Joao Carlos Setubal; Jean-Marc Celton; D Jasper G Rees; Kelly P Williams; Sarah H Holt; Juan Jairo Ruiz Rojas; Mithu Chatterjee; Bo Liu; Herman Silva; Lee Meisel; Avital Adato; Sergei A Filichkin; Michela Troggio; Roberto Viola; Tia-Lynn Ashman; Hao Wang; Palitha Dharmawardhana; Justin Elser; Rajani Raja; Henry D Priest; Douglas W Bryant; Samuel E Fox; Scott A Givan; Larry J Wilhelm; Sushma Naithani; Alan Christoffels; David Y Salama; Jade Carter; Elena Lopez Girona; Anna Zdepski; Wenqin Wang; Randall A Kerstetter; Wilfried Schwab; Schuyler S Korban; Jahn Davik; Amparo Monfort; Beatrice Denoyes-Rothan; Pere Arus; Ron Mittler; Barry Flinn; Asaph Aharoni; Jeffrey L Bennetzen; Steven L Salzberg; Allan W Dickerman; Riccardo Velasco; Mark Borodovsky; Richard E Veilleux; Kevin M Folta Journal: Nat Genet Date: 2010-12-26 Impact factor: 38.330
Authors: William J Murphy; Denis M Larkin; Annelie Everts-van der Wind; Guillaume Bourque; Glenn Tesler; Loretta Auvil; Jonathan E Beever; Bhanu P Chowdhary; Francis Galibert; Lisa Gatzke; Christophe Hitte; Stacey N Meyers; Denis Milan; Elaine A Ostrander; Greg Pape; Heidi G Parker; Terje Raudsepp; Margarita B Rogatcheva; Lawrence B Schook; Loren C Skow; Michael Welge; James E Womack; Stephen J O'brien; Pavel A Pevzner; Harris A Lewin Journal: Science Date: 2005-07-22 Impact factor: 47.728
Authors: Douglas E Soltis; Victor A Albert; Jim Leebens-Mack; Charles D Bell; Andrew H Paterson; Chunfang Zheng; David Sankoff; Claude W Depamphilis; P Kerr Wall; Pamela S Soltis Journal: Am J Bot Date: 2009-01 Impact factor: 3.844
Authors: Ray Ming; Shaobin Hou; Yun Feng; Qingyi Yu; Alexandre Dionne-Laporte; Jimmy H Saw; Pavel Senin; Wei Wang; Benjamin V Ly; Kanako L T Lewis; Steven L Salzberg; Lu Feng; Meghan R Jones; Rachel L Skelton; Jan E Murray; Cuixia Chen; Wubin Qian; Junguo Shen; Peng Du; Moriah Eustice; Eric Tong; Haibao Tang; Eric Lyons; Robert E Paull; Todd P Michael; Kerr Wall; Danny W Rice; Henrik Albert; Ming-Li Wang; Yun J Zhu; Michael Schatz; Niranjan Nagarajan; Ricelle A Acob; Peizhu Guan; Andrea Blas; Ching Man Wai; Christine M Ackerman; Yan Ren; Chao Liu; Jianmei Wang; Jianping Wang; Jong-Kuk Na; Eugene V Shakirov; Brian Haas; Jyothi Thimmapuram; David Nelson; Xiyin Wang; John E Bowers; Andrea R Gschwend; Arthur L Delcher; Ratnesh Singh; Jon Y Suzuki; Savarni Tripathi; Kabi Neupane; Hairong Wei; Beth Irikura; Maya Paidi; Ning Jiang; Wenli Zhang; Gernot Presting; Aaron Windsor; Rafael Navajas-Pérez; Manuel J Torres; F Alex Feltus; Brad Porter; Yingjun Li; A Max Burroughs; Ming-Cheng Luo; Lei Liu; David A Christopher; Stephen M Mount; Paul H Moore; Tak Sugimura; Jiming Jiang; Mary A Schuler; Vikki Friedman; Thomas Mitchell-Olds; Dorothy E Shippen; Claude W dePamphilis; Jeffrey D Palmer; Michael Freeling; Andrew H Paterson; Dennis Gonsalves; Lei Wang; Maqsudul Alam Journal: Nature Date: 2008-04-24 Impact factor: 49.962
Authors: G Romagnoli; M D Verhoeven; R Mans; Y Fleury Rey; R Bel-Rhlid; M van den Broek; R Maleki Seifar; A Ten Pierick; M Thompson; V Müller; S A Wahl; J T Pronk; J M Daran Journal: Mol Microbiol Date: 2014-06-23 Impact factor: 3.501