Literature DB >> 28123115

Homology-Aware Phylogenomics at Gigabase Scales.

M J Sanderson1, Marius Nicolae2, M M McMahon3.   

Abstract

Obstacles to inferring species trees from whole genome data sets range from algorithmic and data management challenges to the wholesale discordance in evolutionary history found in different parts of a genome. Recent work that builds trees directly from genomes by parsing them into sets of small $k$-mer strings holds promise to streamline and simplify these efforts, but existing approaches do not account well for gene tree discordance. We describe a "seed and extend" protocol that finds nearly exact matching sets of orthologous $k$-mers and extends them to construct data sets that can properly account for genomic heterogeneity. Exploiting an efficient suffix array data structure, sets of whole genomes can be parsed and converted into phylogenetic data matrices rapidly, with contiguous blocks of $k$-mers from the same chromosome, gene, or scaffold concatenated as needed. Phylogenetic trees constructed from highly curated rice genome data and a diverse set of six other eukaryotic whole genome, transcriptome, and organellar genome data sets recovered trees nearly identical to published phylogenomic analyses, in a small fraction of the time, and requiring many fewer parameter choices. Our method's ability to retain local homology information was demonstrated by using it to characterize gene tree discordance across the rice genome, and by its robustness to the high rate of interchromosomal gene transfer found in several rice species.
© The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

Entities:  

Keywords:  Oryza; k-mer; lineage sorting; phylogenomics; suffix array

Mesh:

Year:  2017        PMID: 28123115      PMCID: PMC5790135          DOI: 10.1093/sysbio/syw104

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   9.160


  56 in total

1.  Highly conserved low-copy nuclear genes as effective markers for phylogenetic analyses in angiosperms.

Authors:  Ning Zhang; Liping Zeng; Hongyan Shan; Hong Ma
Journal:  New Phytol       Date:  2012-07-11       Impact factor: 10.151

2.  The average common substring approach to phylogenomic reconstruction.

Authors:  Igor Ulitsky; David Burstein; Tamir Tuller; Benny Chor
Journal:  J Comput Biol       Date:  2006-03       Impact factor: 1.479

3.  Phylogeny-aware gap placement prevents errors in sequence alignment and evolutionary analysis.

Authors:  Ari Löytynoja; Nick Goldman
Journal:  Science       Date:  2008-06-20       Impact factor: 47.728

4.  Terraces in phylogenetic tree space.

Authors:  Michael J Sanderson; Michelle M McMahon; Mike Steel
Journal:  Science       Date:  2011-06-16       Impact factor: 47.728

5.  Sources of error and confidence intervals in estimating the age of angiosperms from rbcL and 18S rDNA data.

Authors:  M J Sanderson; J A Doyle
Journal:  Am J Bot       Date:  2001-08       Impact factor: 3.844

6.  An Elegant Algorithm for the Construction of Suffix Arrays.

Authors:  Sanguthevar Rajasekaran; Marius Nicolae
Journal:  J Discrete Algorithms (Amst)       Date:  2014-07-01

7.  Mosquito genomics. Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes.

Authors:  Daniel E Neafsey; Robert M Waterhouse; Mohammad R Abai; Sergey S Aganezov; Max A Alekseyev; James E Allen; James Amon; Bruno Arcà; Peter Arensburger; Gleb Artemov; Lauren A Assour; Hamidreza Basseri; Aaron Berlin; Bruce W Birren; Stephanie A Blandin; Andrew I Brockman; Thomas R Burkot; Austin Burt; Clara S Chan; Cedric Chauve; Joanna C Chiu; Mikkel Christensen; Carlo Costantini; Victoria L M Davidson; Elena Deligianni; Tania Dottorini; Vicky Dritsou; Stacey B Gabriel; Wamdaogo M Guelbeogo; Andrew B Hall; Mira V Han; Thaung Hlaing; Daniel S T Hughes; Adam M Jenkins; Xiaofang Jiang; Irwin Jungreis; Evdoxia G Kakani; Maryam Kamali; Petri Kemppainen; Ryan C Kennedy; Ioannis K Kirmitzoglou; Lizette L Koekemoer; Njoroge Laban; Nicholas Langridge; Mara K N Lawniczak; Manolis Lirakis; Neil F Lobo; Ernesto Lowy; Robert M MacCallum; Chunhong Mao; Gareth Maslen; Charles Mbogo; Jenny McCarthy; Kristin Michel; Sara N Mitchell; Wendy Moore; Katherine A Murphy; Anastasia N Naumenko; Tony Nolan; Eva M Novoa; Samantha O'Loughlin; Chioma Oringanje; Mohammad A Oshaghi; Nazzy Pakpour; Philippos A Papathanos; Ashley N Peery; Michael Povelones; Anil Prakash; David P Price; Ashok Rajaraman; Lisa J Reimer; David C Rinker; Antonis Rokas; Tanya L Russell; N'Fale Sagnon; Maria V Sharakhova; Terrance Shea; Felipe A Simão; Frederic Simard; Michel A Slotman; Pradya Somboon; Vladimir Stegniy; Claudio J Struchiner; Gregg W C Thomas; Marta Tojo; Pantelis Topalis; José M C Tubio; Maria F Unger; John Vontas; Catherine Walton; Craig S Wilding; Judith H Willis; Yi-Chieh Wu; Guiyun Yan; Evgeny M Zdobnov; Xiaofan Zhou; Flaminia Catteruccia; George K Christophides; Frank H Collins; Robert S Cornman; Andrea Crisanti; Martin J Donnelly; Scott J Emrich; Michael C Fontaine; William Gelbart; Matthew W Hahn; Immo A Hansen; Paul I Howell; Fotis C Kafatos; Manolis Kellis; Daniel Lawson; Christos Louis; Shirley Luckhart; Marc A T Muskavitch; José M Ribeiro; Michael A Riehle; Igor V Sharakhov; Zhijian Tu; Laurence J Zwiebel; Nora J Besansky
Journal:  Science       Date:  2014-11-27       Impact factor: 47.728

8.  Resolution of Brassicaceae Phylogeny Using Nuclear Genes Uncovers Nested Radiations and Supports Convergent Morphological Evolution.

Authors:  Chien-Hsun Huang; Renran Sun; Yi Hu; Liping Zeng; Ning Zhang; Liming Cai; Qiang Zhang; Marcus A Koch; Ihsan Al-Shehbaz; Patrick P Edger; J Chris Pires; Dun-Yan Tan; Yang Zhong; Hong Ma
Journal:  Mol Biol Evol       Date:  2015-10-29       Impact factor: 16.240

9.  When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.

Authors:  Shea N Gardner; Barry G Hall
Journal:  PLoS One       Date:  2013-12-09       Impact factor: 3.240

View more
  1 in total

1.  Orchid Phylotranscriptomics: The Prospects of Repurposing Multi-Tissue Transcriptomes for Phylogenetic Analysis and Beyond.

Authors:  Darren C J Wong; Rod Peakall
Journal:  Front Plant Sci       Date:  2022-05-27       Impact factor: 6.627

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.