Literature DB >> 30476267

Deep repeat resolution-the assembly of the Drosophila Histone Complex.

Philipp Bongartz1, Siegfried Schloissnig1.   

Abstract

Though the advent of long-read sequencing technologies has led to a leap in contiguity of de novo genome assemblies, current reference genomes of higher organisms still do not provide unbroken sequences of complete chromosomes. Despite reads in excess of 30 000 base pairs, there are still repetitive structures that cannot be resolved by current state-of-the-art assemblers. The most challenging of these structures are tandemly arrayed repeats, which occur in the genomes of all eukaryotes. Untangling tandem repeat clusters is exceptionally difficult, since the rare differences between repeat copies are obscured by the high error rate of long reads. Solving this problem would constitute a major step towards computing fully assembled genomes. Here, we demonstrate by example of the Drosophila Histone Complex that via machine learning algorithms, it is possible to exploit the underlying distinguishing patterns of single nucleotide variants of repeats from very noisy data to resolve a large and highly conserved repeat cluster. The ideas explored in this paper are a first step towards the automated assembly of complex repeat structures and promise to be applicable to a wide range of eukaryotic genomes.
© The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30476267      PMCID: PMC6380962          DOI: 10.1093/nar/gky1194

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  16 in total

1.  Separation of nearly identical repeats in shotgun assemblies using defined nucleotide positions, DNPs.

Authors:  Martti T Tammi; Erik Arner; Tom Britton; Björn Andersson
Journal:  Bioinformatics       Date:  2002-03       Impact factor: 6.937

Review 2.  One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly.

Authors:  Sergey Koren; Adam M Phillippy
Journal:  Curr Opin Microbiol       Date:  2014-12-01       Impact factor: 7.934

Review 3.  Mechanisms of gene duplication and amplification.

Authors:  Andrew B Reams; John R Roth
Journal:  Cold Spring Harb Perspect Biol       Date:  2015-02-02       Impact factor: 10.005

4.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

Authors:  Chen-Shan Chin; David H Alexander; Patrick Marks; Aaron A Klammer; James Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E Eichler; Stephen W Turner; Jonas Korlach
Journal:  Nat Methods       Date:  2013-05-05       Impact factor: 28.547

5.  Nucleotide variation and divergence in the histone multigene family in Drosophila melanogaster.

Authors:  Y Matsuo; T Yamazaki
Journal:  Genetics       Date:  1989-05       Impact factor: 4.562

6.  The sequence of the human genome.

Authors:  J C Venter; M D Adams; E W Myers; P W Li; R J Mural; G G Sutton; H O Smith; M Yandell; C A Evans; R A Holt; J D Gocayne; P Amanatides; R M Ballew; D H Huson; J R Wortman; Q Zhang; C D Kodira; X H Zheng; L Chen; M Skupski; G Subramanian; P D Thomas; J Zhang; G L Gabor Miklos; C Nelson; S Broder; A G Clark; J Nadeau; V A McKusick; N Zinder; A J Levine; R J Roberts; M Simon; C Slayman; M Hunkapiller; R Bolanos; A Delcher; I Dew; D Fasulo; M Flanigan; L Florea; A Halpern; S Hannenhalli; S Kravitz; S Levy; C Mobarry; K Reinert; K Remington; J Abu-Threideh; E Beasley; K Biddick; V Bonazzi; R Brandon; M Cargill; I Chandramouliswaran; R Charlab; K Chaturvedi; Z Deng; V Di Francesco; P Dunn; K Eilbeck; C Evangelista; A E Gabrielian; W Gan; W Ge; F Gong; Z Gu; P Guan; T J Heiman; M E Higgins; R R Ji; Z Ke; K A Ketchum; Z Lai; Y Lei; Z Li; J Li; Y Liang; X Lin; F Lu; G V Merkulov; N Milshina; H M Moore; A K Naik; V A Narayan; B Neelam; D Nusskern; D B Rusch; S Salzberg; W Shao; B Shue; J Sun; Z Wang; A Wang; X Wang; J Wang; M Wei; R Wides; C Xiao; C Yan; A Yao; J Ye; M Zhan; W Zhang; H Zhang; Q Zhao; L Zheng; F Zhong; W Zhong; S Zhu; S Zhao; D Gilbert; S Baumhueter; G Spier; C Carter; A Cravchik; T Woodage; F Ali; H An; A Awe; D Baldwin; H Baden; M Barnstead; I Barrow; K Beeson; D Busam; A Carver; A Center; M L Cheng; L Curry; S Danaher; L Davenport; R Desilets; S Dietz; K Dodson; L Doup; S Ferriera; N Garg; A Gluecksmann; B Hart; J Haynes; C Haynes; C Heiner; S Hladun; D Hostin; J Houck; T Howland; C Ibegwam; J Johnson; F Kalush; L Kline; S Koduru; A Love; F Mann; D May; S McCawley; T McIntosh; I McMullen; M Moy; L Moy; B Murphy; K Nelson; C Pfannkoch; E Pratts; V Puri; H Qureshi; M Reardon; R Rodriguez; Y H Rogers; D Romblad; B Ruhfel; R Scott; C Sitter; M Smallwood; E Stewart; R Strong; E Suh; R Thomas; N N Tint; S Tse; C Vech; G Wang; J Wetter; S Williams; M Williams; S Windsor; E Winn-Deen; K Wolfe; J Zaveri; K Zaveri; J F Abril; R Guigó; M J Campbell; K V Sjolander; B Karlak; A Kejariwal; H Mi; B Lazareva; T Hatton; A Narechania; K Diemer; A Muruganujan; N Guo; S Sato; V Bafna; S Istrail; R Lippert; R Schwartz; B Walenz; S Yooseph; D Allen; A Basu; J Baxendale; L Blick; M Caminha; J Carnes-Stine; P Caulk; Y H Chiang; M Coyne; C Dahlke; A Deslattes Mays; M Dombroski; M Donnelly; D Ely; S Esparham; C Fosler; H Gire; S Glanowski; K Glasser; A Glodek; M Gorokhov; K Graham; B Gropman; M Harris; J Heil; S Henderson; J Hoover; D Jennings; C Jordan; J Jordan; J Kasha; L Kagan; C Kraft; A Levitsky; M Lewis; X Liu; J Lopez; D Ma; W Majoros; J McDaniel; S Murphy; M Newman; T Nguyen; N Nguyen; M Nodell; S Pan; J Peck; M Peterson; W Rowe; R Sanders; J Scott; M Simpson; T Smith; A Sprague; T Stockwell; R Turner; E Venter; M Wang; M Wen; D Wu; M Wu; A Xia; A Zandieh; X Zhu
Journal:  Science       Date:  2001-02-16       Impact factor: 47.728

7.  Long-read, whole-genome shotgun sequence data for five model organisms.

Authors:  Kristi E Kim; Paul Peluso; Primo Babayan; P Jane Yeadon; Charles Yu; William W Fisher; Chen-Shan Chin; Nicole A Rapicavoli; David R Rank; Joachim Li; David E A Catcheside; Susan E Celniker; Adam M Phillippy; Casey M Bergman; Jane M Landolin
Journal:  Sci Data       Date:  2014-11-25       Impact factor: 6.444

8.  The Release 6 reference sequence of the Drosophila melanogaster genome.

Authors:  Roger A Hoskins; Joseph W Carlson; Kenneth H Wan; Soo Park; Ivonne Mendez; Samuel E Galle; Benjamin W Booth; Barret D Pfeiffer; Reed A George; Robert Svirskas; Martin Krzywinski; Jacqueline Schein; Maria Carmela Accardo; Elisabetta Damia; Giovanni Messina; María Méndez-Lago; Beatriz de Pablos; Olga V Demakova; Evgeniya N Andreyeva; Lidiya V Boldyreva; Marco Marra; A Bernardo Carvalho; Patrizio Dimitri; Alfredo Villasante; Igor F Zhimulev; Gerald M Rubin; Gary H Karpen; Susan E Celniker
Journal:  Genome Res       Date:  2015-01-14       Impact factor: 9.043

9.  FlyBase: establishing a Gene Group resource for Drosophila melanogaster.

Authors:  Helen Attrill; Kathleen Falls; Joshua L Goodman; Gillian H Millburn; Giulia Antonazzo; Alix J Rey; Steven J Marygold
Journal:  Nucleic Acids Res       Date:  2015-10-13       Impact factor: 16.971

10.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Authors:  Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

View more
  4 in total

1.  CDK-Regulated Phase Separation Seeded by Histone Genes Ensures Precise Growth and Function of Histone Locus Bodies.

Authors:  Woonyung Hur; James P Kemp; Marco Tarzia; Victoria E Deneke; William F Marzluff; Robert J Duronio; Stefano Di Talia
Journal:  Dev Cell       Date:  2020-06-23       Impact factor: 12.270

2.  A region of SLBP outside the mRNA-processing domain is essential for deposition of histone mRNA into the Drosophila egg.

Authors:  Jennifer Michelle Potter-Birriel; Graydon B Gonsalvez; William F Marzluff
Journal:  J Cell Sci       Date:  2021-02-11       Impact factor: 5.285

3.  Drosophila histone locus body assembly and function involves multiple interactions.

Authors:  Kaitlin P Koreski; Leila E Rieder; Lyndsey M McLain; Ashlesha Chaubal; William F Marzluff; Robert J Duronio
Journal:  Mol Biol Cell       Date:  2020-05-13       Impact factor: 4.138

4.  Assembly of complete diploid-phased chromosomes from draft genome sequences.

Authors:  Andrea Minio; Noé Cochetel; Amanda M Vondras; Mélanie Massonnet; Dario Cantu
Journal:  G3 (Bethesda)       Date:  2022-07-29       Impact factor: 3.542

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.