Literature DB >> 33177663

Progressive Cactus is a multiple-genome aligner for the thousand-genome era.

Joel Armstrong1, Glenn Hickey1, Mark Diekhans1, Ian T Fiddes1, Adam M Novak1, Alden Deran1, Qi Fang2,3, Duo Xie2,4, Shaohong Feng2,5, Josefin Stiller3, Diane Genereux6, Jeremy Johnson6, Voichita Dana Marinescu7, Jessica Alföldi6, Robert S Harris8, Kerstin Lindblad-Toh6,7, David Haussler1,9, Elinor Karlsson6,10,11, Erich D Jarvis9,12, Guojie Zhang13,14,15,16, Benedict Paten17.   

Abstract

New genome assemblies have been arriving at a rapidly increasing pace, thanks to decreases in sequencing costs and improvements in third-generation sequencing technologies1-3. For example, the number of vertebrate genome assemblies currently in the NCBI (National Center for Biotechnology Information) database4 increased by more than 50% to 1,485 assemblies in the year from July 2018 to July 2019. In addition to this influx of assemblies from different species, new human de novo assemblies5 are being produced, which enable the analysis of not only small polymorphisms, but also complex, large-scale structural differences between human individuals and haplotypes. This coming era and its unprecedented amount of data offer the opportunity to uncover many insights into genome evolution but also present challenges in how to adapt current analysis methods to meet the increased scale. Cactus6, a reference-free multiple genome alignment program, has been shown to be highly accurate, but the existing implementation scales poorly with increasing numbers of genomes, and struggles in regions of highly duplicated sequences. Here we describe progressive extensions to Cactus to create Progressive Cactus, which enables the reference-free alignment of tens to thousands of large vertebrate genomes while maintaining high alignment quality. We describe results from an alignment of more than 600 amniote genomes, which is to our knowledge the largest multiple vertebrate genome alignment created so far.

Entities:  

Mesh:

Year:  2020        PMID: 33177663      PMCID: PMC7673649          DOI: 10.1038/s41586-020-2871-y

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


  48 in total

1.  Cactus: Algorithms for genome multiple sequence alignment.

Authors:  Benedict Paten; Dent Earl; Ngan Nguyen; Mark Diekhans; Daniel Zerbino; David Haussler
Journal:  Genome Res       Date:  2011-06-10       Impact factor: 9.043

2.  PHAST and RPHAST: phylogenetic analysis with space/time models.

Authors:  Melissa J Hubisz; Katherine S Pollard; Adam Siepel
Journal:  Brief Bioinform       Date:  2010-12-21       Impact factor: 11.622

3.  A maximum pseudo-likelihood approach for estimating species trees under the coalescent model.

Authors:  Liang Liu; Lili Yu; Scott V Edwards
Journal:  BMC Evol Biol       Date:  2010-10-11       Impact factor: 3.260

4.  Simultaneous gene finding in multiple genomes.

Authors:  Stefanie König; Lars W Romoth; Lizzy Gerischer; Mario Stanke
Journal:  Bioinformatics       Date:  2016-07-27       Impact factor: 6.937

5.  Real-time DNA sequencing from single polymerase molecules.

Authors:  John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald Cicero; Sonya Clark; Ravindra Dalal; Alex Dewinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin; Paul Lundquist; Congcong Ma; Patrick Marks; Mark Maxham; Devon Murphy; Insil Park; Thang Pham; Michael Phillips; Joy Roy; Robert Sebra; Gene Shen; Jon Sorenson; Austin Tomaney; Kevin Travers; Mark Trulson; John Vieceli; Jeffrey Wegener; Dawn Wu; Alicia Yang; Denis Zaccarin; Peter Zhao; Frank Zhong; Jonas Korlach; Stephen Turner
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

6.  The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community.

Authors:  Miten Jain; Hugh E Olsen; Benedict Paten; Mark Akeson
Journal:  Genome Biol       Date:  2016-11-25       Impact factor: 13.583

7.  Nanopore sequencing and assembly of a human genome with ultra-long reads.

Authors:  Miten Jain; Sergey Koren; Karen H Miga; Josh Quick; Arthur C Rand; Thomas A Sasani; John R Tyson; Andrew D Beggs; Alexander T Dilthey; Ian T Fiddes; Sunir Malla; Hannah Marriott; Tom Nieto; Justin O'Grady; Hugh E Olsen; Brent S Pedersen; Arang Rhie; Hollian Richardson; Aaron R Quinlan; Terrance P Snutch; Louise Tee; Benedict Paten; Adam M Phillippy; Jared T Simpson; Nicholas J Loman; Matthew Loose
Journal:  Nat Biotechnol       Date:  2018-01-29       Impact factor: 54.908

8.  Direct determination of diploid genome sequences.

Authors:  Neil I Weisenfeld; Vijay Kumar; Preyas Shah; Deanna M Church; David B Jaffe
Journal:  Genome Res       Date:  2017-04-05       Impact factor: 9.043

9.  Assembly: a resource for assembled genomes at NCBI.

Authors:  Paul A Kitts; Deanna M Church; Françoise Thibaud-Nissen; Jinna Choi; Vichet Hem; Victor Sapojnikov; Robert G Smith; Tatiana Tatusova; Charlie Xiang; Andrey Zherikov; Michael DiCuccio; Terence D Murphy; Kim D Pruitt; Avi Kimchi
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

10.  ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees.

Authors:  Chao Zhang; Maryam Rabiee; Erfan Sayyari; Siavash Mirarab
Journal:  BMC Bioinformatics       Date:  2018-05-08       Impact factor: 3.169

View more
  46 in total

Review 1.  Pangenome Graphs.

Authors:  Jordan M Eizenga; Adam M Novak; Jonas A Sibbesen; Simon Heumos; Ali Ghaffaari; Glenn Hickey; Xian Chang; Josiah D Seaman; Robin Rounthwaite; Jana Ebler; Mikko Rautiainen; Shilpa Garg; Benedict Paten; Tobias Marschall; Jouni Sirén; Erik Garrison
Journal:  Annu Rev Genomics Hum Genet       Date:  2020-05-26       Impact factor: 8.929

2.  The germline mutational process in rhesus macaque and its implications for phylogenetic dating.

Authors:  Lucie A Bergeron; Søren Besenbacher; Jaco Bakker; Jiao Zheng; Panyi Li; George Pacheco; Mikkel-Holger S Sinding; Maria Kamilari; M Thomas P Gilbert; Mikkel H Schierup; Guojie Zhang
Journal:  Gigascience       Date:  2021-05-05       Impact factor: 6.524

3.  A complete, telomere-to-telomere human genome sequence presents new opportunities for evolutionary genomics.

Authors:  Yafei Mao; Guojie Zhang
Journal:  Nat Methods       Date:  2022-06       Impact factor: 28.547

4.  Widespread introgression across a phylogeny of 155 Drosophila genomes.

Authors:  Anton Suvorov; Bernard Y Kim; Jeremy Wang; Ellie E Armstrong; David Peede; Emmanuel R R D'Agostino; Donald K Price; Peter Waddell; Michael Lang; Virginie Courtier-Orgogozo; Jean R David; Dmitri Petrov; Daniel R Matute; Daniel R Schrider; Aaron A Comeault
Journal:  Curr Biol       Date:  2021-11-16       Impact factor: 10.834

5.  Two new hybrid populations expand the swordtail hybridization model system.

Authors:  Daniel L Powell; Benjamin M Moran; Bernard Y Kim; Shreya M Banerjee; Stepfanie M Aguillon; Paola Fascinetto-Zago; Quinn K Langdon; Molly Schumer
Journal:  Evolution       Date:  2021-09-13       Impact factor: 4.171

6.  Novel functional sequences uncovered through a bovine multiassembly graph.

Authors:  Danang Crysnanto; Alexander S Leonard; Zih-Hua Fang; Hubert Pausch
Journal:  Proc Natl Acad Sci U S A       Date:  2021-05-18       Impact factor: 11.205

7.  Comparative host-coronavirus protein interaction networks reveal pan-viral disease mechanisms.

Authors:  David E Gordon; Joseph Hiatt; Mehdi Bouhaddou; Veronica V Rezelj; Svenja Ulferts; Hannes Braberg; Alexander S Jureka; Kirsten Obernier; Jeffrey Z Guo; Jyoti Batra; Robyn M Kaake; Andrew R Weckstein; Tristan W Owens; Meghna Gupta; Sergei Pourmal; Erron W Titus; Merve Cakir; Margaret Soucheray; Michael McGregor; Zeynep Cakir; Gwendolyn Jang; Matthew J O'Meara; Tia A Tummino; Ziyang Zhang; Helene Foussard; Ajda Rojc; Yuan Zhou; Dmitry Kuchenov; Ruth Hüttenhain; Jiewei Xu; Manon Eckhardt; Danielle L Swaney; Jacqueline M Fabius; Manisha Ummadi; Beril Tutuncuoglu; Ujjwal Rathore; Maya Modak; Paige Haas; Kelsey M Haas; Zun Zar Chi Naing; Ernst H Pulido; Ying Shi; Inigo Barrio-Hernandez; Danish Memon; Eirini Petsalaki; Alistair Dunham; Miguel Correa Marrero; David Burke; Cassandra Koh; Thomas Vallet; Jesus A Silvas; Caleigh M Azumaya; Christian Billesbølle; Axel F Brilot; Melody G Campbell; Amy Diallo; Miles Sasha Dickinson; Devan Diwanji; Nadia Herrera; Nick Hoppe; Huong T Kratochvil; Yanxin Liu; Gregory E Merz; Michelle Moritz; Henry C Nguyen; Carlos Nowotny; Cristina Puchades; Alexandrea N Rizo; Ursula Schulze-Gahmen; Amber M Smith; Ming Sun; Iris D Young; Jianhua Zhao; Daniel Asarnow; Justin Biel; Alisa Bowen; Julian R Braxton; Jen Chen; Cynthia M Chio; Un Seng Chio; Ishan Deshpande; Loan Doan; Bryan Faust; Sebastian Flores; Mingliang Jin; Kate Kim; Victor L Lam; Fei Li; Junrui Li; Yen-Li Li; Yang Li; Xi Liu; Megan Lo; Kyle E Lopez; Arthur A Melo; Frank R Moss; Phuong Nguyen; Joana Paulino; Komal Ishwar Pawar; Jessica K Peters; Thomas H Pospiech; Maliheh Safari; Smriti Sangwan; Kaitlin Schaefer; Paul V Thomas; Aye C Thwin; Raphael Trenker; Eric Tse; Tsz Kin Martin Tsui; Feng Wang; Natalie Whitis; Zanlin Yu; Kaihua Zhang; Yang Zhang; Fengbo Zhou; Daniel Saltzberg; Anthony J Hodder; Amber S Shun-Shion; Daniel M Williams; Kris M White; Romel Rosales; Thomas Kehrer; Lisa Miorin; Elena Moreno; Arvind H Patel; Suzannah Rihn; Mir M Khalid; Albert Vallejo-Gracia; Parinaz Fozouni; Camille R Simoneau; Theodore L Roth; David Wu; Mohd Anisul Karim; Maya Ghoussaini; Ian Dunham; Francesco Berardi; Sebastian Weigang; Maxime Chazal; Jisoo Park; James Logue; Marisa McGrath; Stuart Weston; Robert Haupt; C James Hastie; Matthew Elliott; Fiona Brown; Kerry A Burness; Elaine Reid; Mark Dorward; Clare Johnson; Stuart G Wilkinson; Anna Geyer; Daniel M Giesel; Carla Baillie; Samantha Raggett; Hannah Leech; Rachel Toth; Nicola Goodman; Kathleen C Keough; Abigail L Lind; Reyna J Klesh; Kafi R Hemphill; Jared Carlson-Stevermer; Jennifer Oki; Kevin Holden; Travis Maures; Katherine S Pollard; Andrej Sali; David A Agard; Yifan Cheng; James S Fraser; Adam Frost; Natalia Jura; Tanja Kortemme; Aashish Manglik; Daniel R Southworth; Robert M Stroud; Dario R Alessi; Paul Davies; Matthew B Frieman; Trey Ideker; Carmen Abate; Nolwenn Jouvenet; Georg Kochs; Brian Shoichet; Melanie Ott; Massimo Palmarini; Kevan M Shokat; Adolfo García-Sastre; Jeremy A Rassen; Robert Grosse; Oren S Rosenberg; Kliment A Verba; Christopher F Basler; Marco Vignuzzi; Andrew A Peden; Pedro Beltrao; Nevan J Krogan
Journal:  Science       Date:  2020-10-15       Impact factor: 47.728

8.  Comparative genomics of Chlamydomonas.

Authors:  Rory J Craig; Ahmed R Hasan; Rob W Ness; Peter D Keightley
Journal:  Plant Cell       Date:  2021-05-31       Impact factor: 12.085

Review 9.  Towards population-scale long-read sequencing.

Authors:  Wouter De Coster; Matthias H Weissensteiner; Fritz J Sedlazeck
Journal:  Nat Rev Genet       Date:  2021-05-28       Impact factor: 53.242

10.  Highly contiguous assemblies of 101 drosophilid genomes.

Authors:  Bernard Y Kim; Jeremy R Wang; Daniel R Matute; Dmitri A Petrov; Danny E Miller; Olga Barmina; Emily Delaney; Ammon Thompson; Aaron A Comeault; David Peede; Emmanuel R R D'Agostino; Julianne Pelaez; Jessica M Aguilar; Diler Haji; Teruyuki Matsunaga; Ellie E Armstrong; Molly Zych; Yoshitaka Ogawa; Marina Stamenković-Radak; Mihailo Jelić; Marija Savić Veselinović; Marija Tanasković; Pavle Erić; Jian-Jun Gao; Takehiro K Katoh; Masanori J Toda; Hideaki Watabe; Masayoshi Watada; Jeremy S Davis; Leonie C Moyle; Giulia Manoli; Enrico Bertolini; Vladimír Košťál; R Scott Hawley; Aya Takahashi; Corbin D Jones; Donald K Price; Noah Whiteman; Artyom Kopp
Journal:  Elife       Date:  2021-07-19       Impact factor: 8.713

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.