Literature DB >> 33066802

The design and construction of reference pangenome graphs with minigraph.

Heng Li1,2, Xiaowen Feng3,4, Chong Chu4.   

Abstract

The recent advances in sequencing technologies enable the assembly of individual genomes to the quality of the reference genome. How to integrate multiple genomes from the same species and make the integrated representation accessible to biologists remains an open challenge. Here, we propose a graph-based data model and associated formats to represent multiple genomes while preserving the coordinate of the linear reference genome. We implement our ideas in the minigraph toolkit and demonstrate that we can efficiently construct a pangenome graph and compactly encode tens of thousands of structural variants missing from the current reference genome.

Entities:  

Keywords:  Bioinformatics; Genomics; Pangenome

Mesh:

Year:  2020        PMID: 33066802      PMCID: PMC7568353          DOI: 10.1186/s13059-020-02168-z

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


  55 in total

1.  An Eulerian path approach to DNA fragment assembly.

Authors:  P A Pevzner; H Tang; M S Waterman
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-14       Impact factor: 11.205

Review 2.  Ten years of pan-genome analyses.

Authors:  George Vernikos; Duccio Medini; David R Riley; Hervé Tettelin
Journal:  Curr Opin Microbiol       Date:  2014-12-05       Impact factor: 7.934

3.  SplitMEM: a graphical algorithm for pan-genome analysis with suffix skips.

Authors:  Shoshana Marcus; Hayan Lee; Michael C Schatz
Journal:  Bioinformatics       Date:  2014-11-13       Impact factor: 6.937

4.  Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2016-03-19       Impact factor: 6.937

5.  PBSIM: PacBio reads simulator--toward accurate genome assembly.

Authors:  Yukiteru Ono; Kiyoshi Asai; Michiaki Hamada
Journal:  Bioinformatics       Date:  2012-11-04       Impact factor: 6.937

6.  High-resolution comparative analysis of great ape genomes.

Authors:  Zev N Kronenberg; Ian T Fiddes; David Gordon; Shwetha Murali; Stuart Cantsilieris; Olivia S Meyerson; Jason G Underwood; Bradley J Nelson; Mark J P Chaisson; Max L Dougherty; Katherine M Munson; Alex R Hastie; Mark Diekhans; Fereydoun Hormozdiari; Nicola Lorusso; Kendra Hoekzema; Ruolan Qiu; Karen Clark; Archana Raja; AnneMarie E Welch; Melanie Sorensen; Carl Baker; Robert S Fulton; Joel Armstrong; Tina A Graves-Lindsay; Ahmet M Denli; Emma R Hoppe; PingHsun Hsieh; Christopher M Hill; Andy Wing Chun Pang; Joyce Lee; Ernest T Lam; Susan K Dutcher; Fred H Gage; Wesley C Warren; Jay Shendure; David Haussler; Valerie A Schneider; Han Cao; Mario Ventura; Richard K Wilson; Benedict Paten; Alex Pollen; Evan E Eichler
Journal:  Science       Date:  2018-06-08       Impact factor: 47.728

7.  Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.

Authors:  Valerie A Schneider; Tina Graves-Lindsay; Kerstin Howe; Nathan Bouk; Hsiu-Chuan Chen; Paul A Kitts; Terence D Murphy; Kim D Pruitt; Françoise Thibaud-Nissen; Derek Albracht; Robert S Fulton; Milinn Kremitzki; Vincent Magrini; Chris Markovic; Sean McGrath; Karyn Meltz Steinberg; Kate Auger; William Chow; Joanna Collins; Glenn Harden; Timothy Hubbard; Sarah Pelan; Jared T Simpson; Glen Threadgold; James Torrance; Jonathan M Wood; Laura Clarke; Sergey Koren; Matthew Boitano; Paul Peluso; Heng Li; Chen-Shan Chin; Adam M Phillippy; Richard Durbin; Richard K Wilson; Paul Flicek; Evan E Eichler; Deanna M Church
Journal:  Genome Res       Date:  2017-04-10       Impact factor: 9.043

8.  HLA*LA-HLA typing from linearly projected graph alignments.

Authors:  Alexander T Dilthey; Alexander J Mentzer; Raphael Carapito; Clare Cutland; Nezih Cereb; Shabir A Madhi; Arang Rhie; Sergey Koren; Seiamak Bahram; Gil McVean; Adam M Phillippy
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

9.  Genotyping structural variants in pangenome graphs using the vg toolkit.

Authors:  Glenn Hickey; David Heller; Jean Monlong; Jonas A Sibbesen; Jouni Sirén; Jordan Eizenga; Eric T Dawson; Erik Garrison; Adam M Novak; Benedict Paten
Journal:  Genome Biol       Date:  2020-02-12       Impact factor: 13.583

10.  Improved genome inference in the MHC using a population reference graph.

Authors:  Alexander Dilthey; Charles Cox; Zamin Iqbal; Matthew R Nelson; Gil McVean
Journal:  Nat Genet       Date:  2015-04-27       Impact factor: 38.330

View more
  44 in total

1.  The effect of genome graph expressiveness on the discrepancy between genome graph distance and string set distance.

Authors:  Yutong Qiu; Carl Kingsford
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

2.  Extensive variation within the pan-genome of cultivated and wild sorghum.

Authors:  Yongfu Tao; Hong Luo; Jiabao Xu; Alan Cruickshank; Xianrong Zhao; Fei Teng; Adrian Hathorn; Xiaoyuan Wu; Yuanming Liu; Tracey Shatte; David Jordan; Haichun Jing; Emma Mace
Journal:  Nat Plants       Date:  2021-05-20       Impact factor: 15.793

3.  Novel functional sequences uncovered through a bovine multiassembly graph.

Authors:  Danang Crysnanto; Alexander S Leonard; Zih-Hua Fang; Hubert Pausch
Journal:  Proc Natl Acad Sci U S A       Date:  2021-05-18       Impact factor: 11.205

4.  MONI: A Pangenomic Index for Finding Maximal Exact Matches.

Authors:  Massimiliano Rossi; Marco Oliva; Ben Langmead; Travis Gagie; Christina Boucher
Journal:  J Comput Biol       Date:  2022-01-17       Impact factor: 1.479

5.  Expanding the conservation genomics toolbox: Incorporating structural variants to enhance genomic studies for species of conservation concern.

Authors:  Jana Wold; Klaus-Peter Koepfli; Stephanie J Galla; David Eccles; Carolyn J Hogg; Marissa F Le Lec; Joseph Guhlin; Anna W Santure; Tammy E Steeves
Journal:  Mol Ecol       Date:  2021-09-12       Impact factor: 6.622

6.  The complete sequence of a human genome.

Authors:  Sergey Nurk; Sergey Koren; Arang Rhie; Mikko Rautiainen; Andrey V Bzikadze; Alla Mikheenko; Mitchell R Vollger; Nicolas Altemose; Lev Uralsky; Ariel Gershman; Sergey Aganezov; Savannah J Hoyt; Mark Diekhans; Glennis A Logsdon; Michael Alonge; Stylianos E Antonarakis; Matthew Borchers; Gerard G Bouffard; Shelise Y Brooks; Gina V Caldas; Nae-Chyun Chen; Haoyu Cheng; Chen-Shan Chin; William Chow; Leonardo G de Lima; Philip C Dishuck; Richard Durbin; Tatiana Dvorkina; Ian T Fiddes; Giulio Formenti; Robert S Fulton; Arkarachai Fungtammasan; Erik Garrison; Patrick G S Grady; Tina A Graves-Lindsay; Ira M Hall; Nancy F Hansen; Gabrielle A Hartley; Marina Haukness; Kerstin Howe; Michael W Hunkapiller; Chirag Jain; Miten Jain; Erich D Jarvis; Peter Kerpedjiev; Melanie Kirsche; Mikhail Kolmogorov; Jonas Korlach; Milinn Kremitzki; Heng Li; Valerie V Maduro; Tobias Marschall; Ann M McCartney; Jennifer McDaniel; Danny E Miller; James C Mullikin; Eugene W Myers; Nathan D Olson; Benedict Paten; Paul Peluso; Pavel A Pevzner; David Porubsky; Tamara Potapova; Evgeny I Rogaev; Jeffrey A Rosenfeld; Steven L Salzberg; Valerie A Schneider; Fritz J Sedlazeck; Kishwar Shafin; Colin J Shew; Alaina Shumate; Ying Sims; Arian F A Smit; Daniela C Soto; Ivan Sović; Jessica M Storer; Aaron Streets; Beth A Sullivan; Françoise Thibaud-Nissen; James Torrance; Justin Wagner; Brian P Walenz; Aaron Wenger; Jonathan M D Wood; Chunlin Xiao; Stephanie M Yan; Alice C Young; Samantha Zarate; Urvashi Surti; Rajiv C McCoy; Megan Y Dennis; Ivan A Alexandrov; Jennifer L Gerton; Rachel J O'Neill; Winston Timp; Justin M Zook; Michael C Schatz; Evan E Eichler; Karen H Miga; Adam M Phillippy
Journal:  Science       Date:  2022-03-31       Impact factor: 63.714

7.  Higher Rates of Processed Pseudogene Acquisition in Humans and Three Great Apes Revealed by Long-Read Assemblies.

Authors:  Xiaowen Feng; Heng Li
Journal:  Mol Biol Evol       Date:  2021-06-25       Impact factor: 16.240

Review 8.  Towards population-scale long-read sequencing.

Authors:  Wouter De Coster; Matthias H Weissensteiner; Fritz J Sedlazeck
Journal:  Nat Rev Genet       Date:  2021-05-28       Impact factor: 53.242

9.  Constructing small genome graphs via string compression.

Authors:  Yutong Qiu; Carl Kingsford
Journal:  Bioinformatics       Date:  2021-07-12       Impact factor: 6.937

10.  Comprehensive identification of transposable element insertions using multiple sequencing technologies.

Authors:  Chong Chu; Rebeca Borges-Monroy; Vinayak V Viswanadham; Soohyun Lee; Heng Li; Eunjung Alice Lee; Peter J Park
Journal:  Nat Commun       Date:  2021-06-22       Impact factor: 17.694

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.