
Chasing perfection: validation and polishing strategies for telomere-to-telomere genome assemblies.

Ann M McCartney, Kishwar Shafin, Michael Alonge, Andrey V Bzikadze, Giulio Formenti, Arkarachai Fungtammasan, Kerstin Howe, Chirag Jain, Sergey Koren, Glennis A Logsdon, Karen H Miga, Alla Mikheenko, Benedict Paten, Alaina Shumate, Daniela C Soto, Ivan Sović, Jonathan M D Wood, Justin M Zook, Adam M Phillippy, Arang Rhie.

Abstract

Advances in long-read sequencing technologies and genome assembly methods have enabled the recent completion of the first telomere-to-telomere human genome assembly, which resolves complex segmental duplications and large tandem repeats, including centromeric satellite arrays in a complete hydatidiform mole (CHM13). Although derived from highly accurate sequences, evaluation revealed evidence of small errors and structural misassemblies in the initial draft assembly. To correct these errors, we designed a new repeat-aware polishing strategy that made accurate assembly corrections in large repeats without overcorrection, ultimately fixing 51% of the existing errors and improving the assembly quality value from 70.2 to 73.9 measured from PacBio high-fidelity and Illumina k-mers. By comparing our results to standard automated polishing tools, we outline common polishing errors and offer practical suggestions for genome projects with limited resources. We also show how sequencing biases in both high-fidelity and Oxford Nanopore Technologies reads cause signature assembly errors that can be corrected with a diverse panel of sequencing technologies.
© 2022. This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply.
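
To put the reported quality values in perspective, the short sketch below converts a QV into a per-base error probability. It assumes the quality value follows the standard Phred scaling, QV = -10 log10(P); the abstract only states that the values were measured from k-mers, so the function names and the scaling assumption here are illustrative rather than the authors' exact procedure.

```python
import math


def qv_to_error_rate(qv: float) -> float:
    """Convert a Phred-scaled quality value to a per-base error probability.

    Assumption: QV = -10 * log10(P), the usual Phred definition.
    """
    return 10 ** (-qv / 10)


def error_rate_to_qv(p: float) -> float:
    """Inverse of qv_to_error_rate (same Phred assumption)."""
    return -10 * math.log10(p)


# QVs quoted in the abstract, before and after polishing.
before = qv_to_error_rate(70.2)
after = qv_to_error_rate(73.9)

print(f"QV 70.2 -> {before:.2e} estimated errors per base")
print(f"QV 73.9 -> {after:.2e} estimated errors per base")
print(f"error rate after / before: {after / before:.3f}")
```

Under this assumption, both values correspond to fewer than one estimated error per ten million bases, and the 3.7-point gain means the estimated error rate after polishing is less than half of what it was before.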

Year:  2022        PMID: 35361931     DOI: 10.1038/s41592-022-01440-3

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   47.990


  5 in total

1.  Satellite DNAs and human sex chromosome variation. (Review)

Authors:  Monika Cechova; Karen H Miga
Journal:  Semin Cell Dev Biol       Date:  2022-05-27       Impact factor: 7.499

2.  Complete genomic and epigenetic maps of human centromeres.

Authors:  Glennis A Logsdon; Andrey V Bzikadze; Pragya Sidhwani; Sasha A Langley; Gina V Caldas; Nicolas Altemose; Savannah J Hoyt; Lev Uralsky; Fedor D Ryabov; Colin J Shew; Michael E G Sauria; Matthew Borchers; Ariel Gershman; Alla Mikheenko; Valery A Shepelev; Tatiana Dvorkina; Olga Kunyavskaya; Mitchell R Vollger; Arang Rhie; Ann M McCartney; Mobin Asri; Ryan Lorig-Roach; Kishwar Shafin; Julian K Lucas; Sergey Aganezov; Daniel Olson; Leonardo Gomes de Lima; Tamara Potapova; Gabrielle A Hartley; Marina Haukness; Peter Kerpedjiev; Fedor Gusev; Kristof Tigyi; Shelise Brooks; Alice Young; Sergey Nurk; Sergey Koren; Sofie R Salama; Benedict Paten; Evgeny I Rogaev; Aaron Streets; Gary H Karpen; Abby F Dernburg; Beth A Sullivan; Aaron F Straight; Travis J Wheeler; Jennifer L Gerton; Evan E Eichler; Adam M Phillippy; Winston Timp; Megan Y Dennis; Rachel J O'Neill; Justin M Zook; Michael C Schatz; Pavel A Pevzner; Mark Diekhans; Charles H Langley; Ivan A Alexandrov; Karen H Miga
Journal:  Science       Date:  2022-04-01       Impact factor: 63.714

3.  A complete reference genome improves analysis of human genetic variation.

Authors:  Sergey Aganezov; Stephanie M Yan; Daniela C Soto; Melanie Kirsche; Samantha Zarate; Pavel Avdeyev; Dylan J Taylor; Kishwar Shafin; Alaina Shumate; Chunlin Xiao; Justin Wagner; Jennifer McDaniel; Nathan D Olson; Michael E G Sauria; Mitchell R Vollger; Arang Rhie; Melissa Meredith; Skylar Martin; Joyce Lee; Sergey Koren; Jeffrey A Rosenfeld; Benedict Paten; Ryan Layer; Chen-Shan Chin; Fritz J Sedlazeck; Nancy F Hansen; Danny E Miller; Adam M Phillippy; Karen H Miga; Rajiv C McCoy; Megan Y Dennis; Justin M Zook; Michael C Schatz
Journal:  Science       Date:  2022-04-01       Impact factor: 63.714

4.  From telomere to telomere: The transcriptional and epigenetic state of human repeat elements.

Authors:  Jessica M Storer; Gabrielle A Hartley; Patrick G S Grady; Ariel Gershman; Savannah J Hoyt; Leonardo G de Lima; Charles Limouse; Reza Halabian; Luke Wojenski; Matias Rodriguez; Nicolas Altemose; Arang Rhie; Leighton J Core; Jennifer L Gerton; Wojciech Makalowski; Daniel Olson; Jeb Rosen; Arian F A Smit; Aaron F Straight; Mitchell R Vollger; Travis J Wheeler; Michael C Schatz; Evan E Eichler; Adam M Phillippy; Winston Timp; Karen H Miga; Rachel J O'Neill
Journal:  Science       Date:  2022-04-01       Impact factor: 63.714

5.  The complete sequence of a human genome.

Authors:  Sergey Nurk; Sergey Koren; Arang Rhie; Mikko Rautiainen; Andrey V Bzikadze; Alla Mikheenko; Mitchell R Vollger; Nicolas Altemose; Lev Uralsky; Ariel Gershman; Sergey Aganezov; Savannah J Hoyt; Mark Diekhans; Glennis A Logsdon; Michael Alonge; Stylianos E Antonarakis; Matthew Borchers; Gerard G Bouffard; Shelise Y Brooks; Gina V Caldas; Nae-Chyun Chen; Haoyu Cheng; Chen-Shan Chin; William Chow; Leonardo G de Lima; Philip C Dishuck; Richard Durbin; Tatiana Dvorkina; Ian T Fiddes; Giulio Formenti; Robert S Fulton; Arkarachai Fungtammasan; Erik Garrison; Patrick G S Grady; Tina A Graves-Lindsay; Ira M Hall; Nancy F Hansen; Gabrielle A Hartley; Marina Haukness; Kerstin Howe; Michael W Hunkapiller; Chirag Jain; Miten Jain; Erich D Jarvis; Peter Kerpedjiev; Melanie Kirsche; Mikhail Kolmogorov; Jonas Korlach; Milinn Kremitzki; Heng Li; Valerie V Maduro; Tobias Marschall; Ann M McCartney; Jennifer McDaniel; Danny E Miller; James C Mullikin; Eugene W Myers; Nathan D Olson; Benedict Paten; Paul Peluso; Pavel A Pevzner; David Porubsky; Tamara Potapova; Evgeny I Rogaev; Jeffrey A Rosenfeld; Steven L Salzberg; Valerie A Schneider; Fritz J Sedlazeck; Kishwar Shafin; Colin J Shew; Alaina Shumate; Ying Sims; Arian F A Smit; Daniela C Soto; Ivan Sović; Jessica M Storer; Aaron Streets; Beth A Sullivan; Françoise Thibaud-Nissen; James Torrance; Justin Wagner; Brian P Walenz; Aaron Wenger; Jonathan M D Wood; Chunlin Xiao; Stephanie M Yan; Alice C Young; Samantha Zarate; Urvashi Surti; Rajiv C McCoy; Megan Y Dennis; Ivan A Alexandrov; Jennifer L Gerton; Rachel J O'Neill; Winston Timp; Justin M Zook; Michael C Schatz; Evan E Eichler; Karen H Miga; Adam M Phillippy
Journal:  Science       Date:  2022-03-31       Impact factor: 63.714

