Literature DB >> 33396198

Probably Correct: Rescuing Repeats with Short and Long Reads.

Monika Cechova1.   

Abstract

Ever since the introduction of high-throughput sequencing following the human genome project, assembling short reads into a reference of sufficient quality posed a significant problem as a large portion of the human genome-estimated 50-69%-is repetitive. As a result, a sizable proportion of sequencing reads is multi-mapping, i.e., without a unique placement in the genome. The two key parameters for whether or not a read is multi-mapping are the read length and genome complexity. Long reads are now able to span difficult, heterochromatic regions, including full centromeres, and characterize chromosomes from "telomere to telomere". Moreover, identical reads or repeat arrays can be differentiated based on their epigenetic marks, such as methylation patterns, aiding in the assembly process. This is despite the fact that long reads still contain a modest percentage of sequencing errors, disorienting the aligners and assemblers both in accuracy and speed. Here, I review the proposed and implemented solutions to the repeat resolution and the multi-mapping read problem, as well as the downstream consequences of reference choice, repeat masking, and proper representation of sex chromosomes. I also consider the forthcoming challenges and solutions with regards to long reads, where we expect the shift from the problem of repeat localization within a single individual to the problem of repeat positioning within pangenomes.

Entities:  

Keywords:  long reads; multi-mapping; reference; repeats; satellite

Mesh:

Year:  2020        PMID: 33396198      PMCID: PMC7823596          DOI: 10.3390/genes12010048

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


  102 in total

1.  Nanopore sequencing meets epigenetics.

Authors:  Michael C Schatz
Journal:  Nat Methods       Date:  2017-03-31       Impact factor: 28.547

2.  HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads.

Authors:  Sergey Nurk; Brian P Walenz; Arang Rhie; Mitchell R Vollger; Glennis A Logsdon; Robert Grothe; Karen H Miga; Evan E Eichler; Adam M Phillippy; Sergey Koren
Journal:  Genome Res       Date:  2020-08-14       Impact factor: 9.043

3.  Widespread adenine N6-methylation of active genes in fungi.

Authors:  Stephen J Mondo; Richard O Dannebaum; Rita C Kuo; Katherine B Louie; Adam J Bewick; Kurt LaButti; Sajeet Haridas; Alan Kuo; Asaf Salamov; Steven R Ahrendt; Rebecca Lau; Benjamin P Bowen; Anna Lipzen; William Sullivan; Bill B Andreopoulos; Alicia Clum; Erika Lindquist; Christopher Daum; Trent R Northen; Govindarajan Kunde-Ramamoorthy; Robert J Schmitz; Andrii Gryganskyi; David Culley; Jon Magnuson; Timothy Y James; Michelle A O'Malley; Jason E Stajich; Joseph W Spatafora; Axel Visel; Igor V Grigoriev
Journal:  Nat Genet       Date:  2017-05-08       Impact factor: 38.330

4.  Correlated variation and population differentiation in satellite DNA abundance among lines of Drosophila melanogaster.

Authors:  Kevin H-C Wei; Jennifer K Grenier; Daniel A Barbash; Andrew G Clark
Journal:  Proc Natl Acad Sci U S A       Date:  2014-12-15       Impact factor: 11.205

5.  TRTools: a toolkit for genome-wide analysis of tandem repeats.

Authors:  Nima Mousavi; Jonathan Margoliash; Neha Pusarla; Shubham Saini; Richard Yanicky; Melissa Gymrek
Journal:  Bioinformatics       Date:  2021-05-05       Impact factor: 6.937

6.  Centromere reference models for human chromosomes X and Y satellite arrays.

Authors:  Karen H Miga; Yulia Newton; Miten Jain; Nicolas Altemose; Huntington F Willard; W James Kent
Journal:  Genome Res       Date:  2014-02-05       Impact factor: 9.043

Review 7.  Handling multi-mapped reads in RNA-seq.

Authors:  Gabrielle Deschamps-Francoeur; Joël Simoneau; Michelle S Scott
Journal:  Comput Struct Biotechnol J       Date:  2020-06-12       Impact factor: 7.271

8.  Tandem-genotypes: robust detection of tandem repeat expansions from long DNA reads.

Authors:  Satomi Mitsuhashi; Martin C Frith; Takeshi Mizuguchi; Satoko Miyatake; Tomoko Toyota; Hiroaki Adachi; Yoko Oma; Yoshihiro Kino; Hiroaki Mitsuhashi; Naomichi Matsumoto
Journal:  Genome Biol       Date:  2019-03-19       Impact factor: 13.583

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

10.  Dynamic evolution of great ape Y chromosomes.

Authors:  Monika Cechova; Rahulsimham Vegesna; Marta Tomaszkiewicz; Robert S Harris; Di Chen; Samarth Rangavittal; Paul Medvedev; Kateryna D Makova
Journal:  Proc Natl Acad Sci U S A       Date:  2020-10-05       Impact factor: 11.205

View more
  2 in total

Review 1.  Satellite DNAs and human sex chromosome variation.

Authors:  Monika Cechova; Karen H Miga
Journal:  Semin Cell Dev Biol       Date:  2022-05-27       Impact factor: 7.499

Review 2.  Variation and Evolution of Human Centromeres: A Field Guide and Perspective.

Authors:  Karen H Miga; Ivan A Alexandrov
Journal:  Annu Rev Genet       Date:  2021-11-23       Impact factor: 13.826

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.