Literature DB >> 32928274

Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies.

Arang Rhie1, Brian P Walenz2, Sergey Koren2, Adam M Phillippy2.   

Abstract

Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations. By comparing k-mers in a de novo assembly to those found in unassembled high-accuracy reads, Merqury estimates base-level accuracy and completeness. For trios, Merqury can also evaluate haplotype-specific accuracy, completeness, phase block continuity, and switch errors. Multiple visualizations, such as k-mer spectrum plots, can be generated for evaluation. We demonstrate on both human and plant genomes that Merqury is a fast and robust method for assembly validation.

Entities:  

Keywords:  Assembly validation; Benchmarking; Genome assembly; Haplotype phasing; K-mers; Trio binning

Mesh:

Year:  2020        PMID: 32928274      PMCID: PMC7488777          DOI: 10.1186/s13059-020-02134-9

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


  40 in total

1.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment.

Authors:  B Ewing; L Hillier; M C Wendl; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

2.  Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls.

Authors:  Justin M Zook; Brad Chapman; Jason Wang; David Mittelman; Oliver Hofmann; Winston Hide; Marc Salit
Journal:  Nat Biotechnol       Date:  2014-02-16       Impact factor: 54.908

3.  Best practices for benchmarking germline small-variant calls in human genomes.

Authors:  Peter Krusche; Len Trigg; Paul C Boutros; Christopher E Mason; Francisco M De La Vega; Benjamin L Moore; Mar Gonzalez-Porta; Michael A Eberle; Zivana Tezak; Samir Lababidi; Rebecca Truty; George Asimenos; Birgit Funke; Mark Fleharty; Brad A Chapman; Marc Salit; Justin M Zook
Journal:  Nat Biotechnol       Date:  2019-03-11       Impact factor: 54.908

4.  DNA template strand sequencing of single-cells maps genomic rearrangements at high resolution.

Authors:  Ester Falconer; Mark Hills; Ulrike Naumann; Steven S S Poon; Elizabeth A Chavez; Ashley D Sanders; Yongjun Zhao; Martin Hirst; Peter M Lansdorp
Journal:  Nat Methods       Date:  2012-10-07       Impact factor: 28.547

5.  KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies.

Authors:  Daniel Mapleson; Gonzalo Garcia Accinelli; George Kettleborough; Jonathan Wright; Bernardo J Clavijo
Journal:  Bioinformatics       Date:  2017-02-15       Impact factor: 6.937

6.  HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies.

Authors:  Peter Edge; Vineet Bafna; Vikas Bansal
Journal:  Genome Res       Date:  2016-12-09       Impact factor: 9.043

7.  Comparison of phasing strategies for whole human genomes.

Authors:  Yongwook Choi; Agnes P Chan; Ewen Kirkness; Amalio Telenti; Nicholas J Schork
Journal:  PLoS Genet       Date:  2018-04-05       Impact factor: 5.917

8.  Versatile genome assembly evaluation with QUAST-LG.

Authors:  Alla Mikheenko; Andrey Prjibelski; Vladislav Saveliev; Dmitry Antipov; Alexey Gurevich
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

9.  Improved genome inference in the MHC using a population reference graph.

Authors:  Alexander Dilthey; Charles Cox; Zamin Iqbal; Matthew R Nelson; Gil McVean
Journal:  Nat Genet       Date:  2015-04-27       Impact factor: 38.330

10.  De novo assembly of haplotype-resolved genomes with trio binning.

Authors:  Sergey Koren; Arang Rhie; Brian P Walenz; Alexander T Dilthey; Derek M Bickhart; Sarah B Kingan; Stefan Hiendleder; John L Williams; Timothy P L Smith; Adam M Phillippy
Journal:  Nat Biotechnol       Date:  2018-10-22       Impact factor: 54.908

View more
  77 in total

1.  HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads.

Authors:  Sergey Nurk; Brian P Walenz; Arang Rhie; Mitchell R Vollger; Glennis A Logsdon; Robert Grothe; Karen H Miga; Evan E Eichler; Adam M Phillippy; Sergey Koren
Journal:  Genome Res       Date:  2020-08-14       Impact factor: 9.043

2.  Detecting Structural Variants and Associated Gene Presence-Absence Variation Phenomena in the Genomes of Marine Organisms.

Authors:  Marco Sollitto; Nathan J Kenny; Samuele Greco; Carmen Federica Tucci; Andrew D Calcino; Marco Gerdol
Journal:  Methods Mol Biol       Date:  2022

3.  Enhancing Long-Read-Based Strain-Aware Metagenome Assembly.

Authors:  Xiao Luo; Xiongbin Kang; Alexander Schönhuth
Journal:  Front Genet       Date:  2022-05-13       Impact factor: 4.772

4.  Chromosome-Scale, Haplotype-Resolved Genome Assembly of Non-Sex-Reversal Females of Swamp Eel Using High-Fidelity Long Reads and Hi-C Data.

Authors:  Hai-Feng Tian; Qiaomu Hu; Hong-Yi Lu; Zhong Li
Journal:  Front Genet       Date:  2022-05-18       Impact factor: 4.772

5.  A Reference Genome Assembly of Simmental Cattle, Bos taurus taurus.

Authors:  Michael P Heaton; Timothy P L Smith; Derek M Bickhart; Brian L Vander Ley; Larry A Kuehn; Jonas Oppenheimer; Wade R Shafer; Fred T Schuetze; Brad Stroud; Jennifer C McClure; Jennifer P Barfield; Harvey D Blackburn; Theodore S Kalbfleisch; Kimberly M Davenport; Kristen L Kuhn; Richard E Green; Beth Shapiro; Benjamin D Rosen
Journal:  J Hered       Date:  2021-03-29       Impact factor: 2.645

6.  An Annotated Draft Genome for the Andean Bear, Tremarctos ornatus.

Authors:  Nedda F Saremi; Jonas Oppenheimer; Christopher Vollmers; Brendan O'Connell; Shard A Milne; Ashley Byrne; Li Yu; Oliver A Ryder; Richard E Green; Beth Shapiro
Journal:  J Hered       Date:  2021-07-15       Impact factor: 2.645

7.  Assembly of a Hybrid Formica aquilonia × F. polyctena Ant Genome From a Haploid Male.

Authors:  Pierre Nouhaud; Jack Beresford; Jonna Kulmuni
Journal:  J Hered       Date:  2022-07-09       Impact factor: 2.679

8.  Large structural variations in the haplotype-resolved African cassava genome.

Authors:  Ben N Mansfeld; Adam Boyher; Jeffrey C Berry; Mark Wilson; Shujun Ou; Seth Polydore; Todd P Michael; Noah Fahlgren; Rebecca S Bart
Journal:  Plant J       Date:  2021-11-10       Impact factor: 7.091

9.  LazyB: fast and cheap genome assembly.

Authors:  Thomas Gatter; Sarah von Löhneysen; Jörg Fallmann; Polina Drozdova; Tom Hartmann; Peter F Stadler
Journal:  Algorithms Mol Biol       Date:  2021-06-01       Impact factor: 1.405

10.  Highly contiguous assemblies of 101 drosophilid genomes.

Authors:  Bernard Y Kim; Jeremy R Wang; Daniel R Matute; Dmitri A Petrov; Danny E Miller; Olga Barmina; Emily Delaney; Ammon Thompson; Aaron A Comeault; David Peede; Emmanuel R R D'Agostino; Julianne Pelaez; Jessica M Aguilar; Diler Haji; Teruyuki Matsunaga; Ellie E Armstrong; Molly Zych; Yoshitaka Ogawa; Marina Stamenković-Radak; Mihailo Jelić; Marija Savić Veselinović; Marija Tanasković; Pavle Erić; Jian-Jun Gao; Takehiro K Katoh; Masanori J Toda; Hideaki Watabe; Masayoshi Watada; Jeremy S Davis; Leonie C Moyle; Giulia Manoli; Enrico Bertolini; Vladimír Košťál; R Scott Hawley; Aya Takahashi; Corbin D Jones; Donald K Price; Noah Whiteman; Artyom Kopp
Journal:  Elife       Date:  2021-07-19       Impact factor: 8.713

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.