Literature DB >> 32432328

Smash++: an alignment-free and memory-efficient tool to find genomic rearrangements.

Morteza Hosseini1, Diogo Pratas1,2, Burkhard Morgenstern3,4, Armando J Pinho1.   

Abstract

BACKGROUND: The development of high-throughput sequencing technologies and, as its result, the production of huge volumes of genomic data, has accelerated biological and medical research and discovery. Study on genomic rearrangements is crucial owing to their role in chromosomal evolution, genetic disorders, and cancer.
RESULTS: We present Smash++, an alignment-free and memory-efficient tool to find and visualize small- and large-scale genomic rearrangements between 2 DNA sequences. This computational solution extracts information contents of the 2 sequences, exploiting a data compression technique to find rearrangements. We also present Smash++ visualizer, a tool that allows the visualization of the detected rearrangements along with their self- and relative complexity, by generating an SVG (Scalable Vector Graphics) image.
CONCLUSIONS: Tested on several synthetic and real DNA sequences from bacteria, fungi, Aves, and Mammalia, the proposed tool was able to accurately find genomic rearrangements. The detected regions were in accordance with previous studies, which took alignment-based approaches or performed FISH (fluorescence in situ hybridization) analysis. The maximum peak memory usage among all experiments was ∼1 GB, which makes Smash++ feasible to run on present-day standard computers.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Keywords:  alignment-free; complexity; data compression; genome comparison; genome duplication; genomic rearrangement; high-throughput sequencing; information theory; probabilistic-algorithmic model; visualization

Year:  2020        PMID: 32432328      PMCID: PMC7238676          DOI: 10.1093/gigascience/giaa048

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  32 in total

1.  DRIMM-Synteny: decomposing genomes into evolutionary conserved segments.

Authors:  Son K Pham; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2010-08-24       Impact factor: 6.937

2.  TimeTree: A Resource for Timelines, Timetrees, and Divergence Times.

Authors:  Sudhir Kumar; Glen Stecher; Michael Suleski; S Blair Hedges
Journal:  Mol Biol Evol       Date:  2017-07-01       Impact factor: 16.240

3.  Microdeletion 4p16.3 in three unrelated patients with Wolf-Hirschhorn syndrome.

Authors:  A Dufke; J Seidel; M Schöning; M Döbler-Neumann; C Kelbova; T Liehr; V Beensen; C Backsch; U Klein-Vogler; H Enders
Journal:  Cytogenet Cell Genet       Date:  2000

4.  progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement.

Authors:  Aaron E Darling; Bob Mau; Nicole T Perna
Journal:  PLoS One       Date:  2010-06-25       Impact factor: 3.240

5.  Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo): genome assembly and analysis.

Authors:  Rami A Dalloul; Julie A Long; Aleksey V Zimin; Luqman Aslam; Kathryn Beal; Le Ann Blomberg; Pascal Bouffard; David W Burt; Oswald Crasta; Richard P M A Crooijmans; Kristal Cooper; Roger A Coulombe; Supriyo De; Mary E Delany; Jerry B Dodgson; Jennifer J Dong; Clive Evans; Karin M Frederickson; Paul Flicek; Liliana Florea; Otto Folkerts; Martien A M Groenen; Tim T Harkins; Javier Herrero; Steve Hoffmann; Hendrik-Jan Megens; Andrew Jiang; Pieter de Jong; Pete Kaiser; Heebal Kim; Kyu-Won Kim; Sungwon Kim; David Langenberger; Mi-Kyung Lee; Taeheon Lee; Shrinivasrao Mane; Guillaume Marcais; Manja Marz; Audrey P McElroy; Thero Modise; Mikhail Nefedov; Cédric Notredame; Ian R Paton; William S Payne; Geo Pertea; Dennis Prickett; Daniela Puiu; Dan Qioa; Emanuele Raineri; Magali Ruffier; Steven L Salzberg; Michael C Schatz; Chantel Scheuring; Carl J Schmidt; Steven Schroeder; Stephen M J Searle; Edward J Smith; Jacqueline Smith; Tad S Sonstegard; Peter F Stadler; Hakim Tafer; Zhijian Jake Tu; Curtis P Van Tassell; Albert J Vilella; Kelly P Williams; James A Yorke; Liqing Zhang; Hong-Bin Zhang; Xiaojun Zhang; Yang Zhang; Kent M Reed
Journal:  PLoS Biol       Date:  2010-09-07       Impact factor: 8.029

6.  Genome rearrangements in mammalian evolution: lessons from human and mouse genomes.

Authors:  Pavel Pevzner; Glenn Tesler
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

7.  An alignment-free method to find and visualise rearrangements between pairs of DNA sequences.

Authors:  Diogo Pratas; Raquel M Silva; Armando J Pinho; Paulo J S G Ferreira
Journal:  Sci Rep       Date:  2015-05-18       Impact factor: 4.379

8.  Synteny Portal: a web-based application portal for synteny block analysis.

Authors:  Jongin Lee; Woon-Young Hong; Minah Cho; Mikang Sim; Daehwan Lee; Younhee Ko; Jaebum Kim
Journal:  Nucleic Acids Res       Date:  2016-05-06       Impact factor: 16.971

9.  A survey of localized sequence rearrangements in human DNA.

Authors:  Martin C Frith; Sofia Khan
Journal:  Nucleic Acids Res       Date:  2018-02-28       Impact factor: 16.971

10.  Disorders caused by chromosome abnormalities.

Authors:  Aaron Theisen; Lisa G Shaffer
Journal:  Appl Clin Genet       Date:  2010-12-10
View more
  5 in total

1.  The complexity landscape of viral genomes.

Authors:  Jorge Miguel Silva; Diogo Pratas; Tânia Caetano; Sérgio Matos
Journal:  Gigascience       Date:  2022-08-11       Impact factor: 7.658

2.  Efficient DNA sequence compression with neural networks.

Authors:  Milton Silva; Diogo Pratas; Armando J Pinho
Journal:  Gigascience       Date:  2020-11-11       Impact factor: 6.524

3.  AC2: An Efficient Protein Sequence Compression Tool Using Artificial Neural Networks and Cache-Hash Models.

Authors:  Milton Silva; Diogo Pratas; Armando J Pinho
Journal:  Entropy (Basel)       Date:  2021-04-26       Impact factor: 2.524

4.  Genome and Methylome analysis of a phylogenetic novel Campylobacter coli cluster with C. jejuni introgression.

Authors:  Anastasia-Lisa Dieckmann; Thomas Riedel; Boyke Bunk; Cathrin Spröer; Jörg Overmann; Uwe Groß; Oliver Bader; Wolfgang Bohne; Burkhard Morgenstern; Morteza Hosseini; Andreas E Zautner
Journal:  Microb Genom       Date:  2021-10

5.  The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features.

Authors:  Weihong Qi; Yi-Wen Lim; Andrea Patrignani; Pascal Schläpfer; Anna Bratus-Neuenschwander; Simon Grüter; Christelle Chanez; Nathalie Rodde; Elisa Prat; Sonia Vautrin; Margaux-Alison Fustier; Diogo Pratas; Ralph Schlapbach; Wilhelm Gruissem
Journal:  Gigascience       Date:  2022-03-24       Impact factor: 6.524

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.