Literature DB >> 31312019

Ultra-fast genome comparison for large-scale genomic experiments.

Esteban Pérez-Wohlfeil1, Sergio Diaz-Del-Pino1, Oswaldo Trelles2.   

Abstract

In the last decade, a technological shift in the bioinformatics field has occurred: larger genomes can now be sequenced quickly and cost effectively, resulting in the computational need to efficiently compare large and abundant sequences. Furthermore, detecting conserved similarities across large collections of genomes remains a problem. The size of chromosomes, along with the substantial amount of noise and number of repeats found in DNA sequences (particularly in mammals and plants), leads to a scenario where executing and waiting for complete outputs is both time and resource consuming. Filtering steps, manual examination and annotation, very long execution times and a high demand for computational resources represent a few of the many difficulties faced in large genome comparisons. In this work, we provide a method designed for comparisons of considerable amounts of very long sequences that employs a heuristic algorithm capable of separating noise and repeats from conserved fragments in pairwise genomic comparisons. We provide software implementation that computes in linear time using one core as a minimum and a small, constant memory footprint. The method produces both a previsualization of the comparison and a collection of indices to drastically reduce computational complexity when performing exhaustive comparisons. Last, the method scores the comparison to automate classification of sequences and produces a list of detected synteny blocks to enable new evolutionary studies.

Entities:  

Mesh:

Year:  2019        PMID: 31312019      PMCID: PMC6635410          DOI: 10.1038/s41598-019-46773-w

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  25 in total

1.  Fast Edge Detection Using Structured Forests.

Authors:  Piotr Dollár; C Lawrence Zitnick
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-08       Impact factor: 6.226

2.  Alignment-free sequence comparison (I): statistics and power.

Authors:  Gesine Reinert; David Chew; Fengzhu Sun; Michael S Waterman
Journal:  J Comput Biol       Date:  2009-12       Impact factor: 1.479

3.  Using MUMmer to identify similar regions in large sequence sets.

Authors:  Arthur L Delcher; Steven L Salzberg; Adam M Phillippy
Journal:  Curr Protoc Bioinformatics       Date:  2003-02

4.  A computational approach to edge detection.

Authors:  J Canny
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  1986-06       Impact factor: 6.226

5.  Cgaln: fast and space-efficient whole-genome alignment.

Authors:  Ryuichiro Nakato; Osamu Gotoh
Journal:  BMC Bioinformatics       Date:  2010-04-30       Impact factor: 3.169

6.  Genome Sequence of "Candidatus Carsonella ruddii" Strain BC, a Nutritional Endosymbiont of Bactericera cockerelli.

Authors:  Alex B Riley; Dohyup Kim; Allison K Hansen
Journal:  Genome Announc       Date:  2017-04-27

7.  Narcisse: a mirror view of conserved syntenies.

Authors:  Emmanuel Courcelle; Yoann Beausse; Sébastien Letort; Olivier Stahl; Romain Fremez; Catherine Ngom-Bru; Jérôme Gouzy; Thomas Faraut
Journal:  Nucleic Acids Res       Date:  2007-11-02       Impact factor: 16.971

8.  Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies.

Authors:  David B Neale; Jill L Wegrzyn; Kristian A Stevens; Aleksey V Zimin; Daniela Puiu; Marc W Crepeau; Charis Cardeno; Maxim Koriabine; Ann E Holtz-Morris; John D Liechty; Pedro J Martínez-García; Hans A Vasquez-Gross; Brian Y Lin; Jacob J Zieve; William M Dougherty; Sara Fuentes-Soriano; Le-Shin Wu; Don Gilbert; Guillaume Marçais; Michael Roberts; Carson Holt; Mark Yandell; John M Davis; Katherine E Smith; Jeffrey F D Dean; W Walter Lorenz; Ross W Whetten; Ronald Sederoff; Nicholas Wheeler; Patrick E McGuire; Doreen Main; Carol A Loopstra; Keithanne Mockaitis; Pieter J deJong; James A Yorke; Steven L Salzberg; Charles H Langley
Journal:  Genome Biol       Date:  2014-03-04       Impact factor: 13.583

9.  SynFind: Compiling Syntenic Regions across Any Set of Genomes on Demand.

Authors:  Haibao Tang; Matthew D Bomhoff; Evan Briones; Liangsheng Zhang; James C Schnable; Eric Lyons
Journal:  Genome Biol Evol       Date:  2015-11-11       Impact factor: 3.416

10.  Ensembl 2018.

Authors:  Daniel R Zerbino; Premanand Achuthan; Wasiu Akanni; M Ridwan Amode; Daniel Barrell; Jyothish Bhai; Konstantinos Billis; Carla Cummins; Astrid Gall; Carlos García Girón; Laurent Gil; Leo Gordon; Leanne Haggerty; Erin Haskell; Thibaut Hourlier; Osagie G Izuogu; Sophie H Janacek; Thomas Juettemann; Jimmy Kiang To; Matthew R Laird; Ilias Lavidas; Zhicheng Liu; Jane E Loveland; Thomas Maurel; William McLaren; Benjamin Moore; Jonathan Mudge; Daniel N Murphy; Victoria Newman; Michael Nuhn; Denye Ogeh; Chuang Kee Ong; Anne Parker; Mateus Patricio; Harpreet Singh Riat; Helen Schuilenburg; Dan Sheppard; Helen Sparrow; Kieron Taylor; Anja Thormann; Alessandro Vullo; Brandon Walts; Amonida Zadissa; Adam Frankish; Sarah E Hunt; Myrto Kostadima; Nicholas Langridge; Fergal J Martin; Matthieu Muffato; Emily Perry; Magali Ruffier; Dan M Staines; Stephen J Trevanion; Bronwen L Aken; Fiona Cunningham; Andrew Yates; Paul Flicek
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

View more
  6 in total

1.  Chromosome-Scale, Haplotype-Resolved Genome Assembly of Non-Sex-Reversal Females of Swamp Eel Using High-Fidelity Long Reads and Hi-C Data.

Authors:  Hai-Feng Tian; Qiaomu Hu; Hong-Yi Lu; Zhong Li
Journal:  Front Genet       Date:  2022-05-18       Impact factor: 4.772

2.  Chromosome-level genome assembly of the shuttles hoppfish, Periophthalmus modestus.

Authors:  Youngik Yang; Ji Yong Yoo; Sang Ho Baek; Ha Yeun Song; Seonmi Jo; Seung-Hyun Jung; Jeong-Hyeon Choi
Journal:  Gigascience       Date:  2022-01-12       Impact factor: 6.524

3.  The pink salmon genome: Uncovering the genomic consequences of a two-year life cycle.

Authors:  Kris A Christensen; Eric B Rondeau; Dionne Sakhrani; Carlo A Biagi; Hollie Johnson; Jay Joshi; Anne-Marie Flores; Sreeja Leelakumari; Richard Moore; Pawan K Pandoh; Ruth E Withler; Terry D Beacham; Rosalind A Leggatt; Carolyn M Tarpey; Lisa W Seeb; James E Seeb; Steven J M Jones; Robert H Devlin; Ben F Koop
Journal:  PLoS One       Date:  2021-12-17       Impact factor: 3.240

4.  Robertsonian Fusion and Centromere Repositioning Contributed to the Formation of Satellite-free Centromeres During the Evolution of Zebras.

Authors:  Eleonora Cappelletti; Francesca M Piras; Lorenzo Sola; Marco Santagostino; Wasma A Abdelgadir; Elena Raimondi; Francesco Lescai; Solomon G Nergadze; Elena Giulotto
Journal:  Mol Biol Evol       Date:  2022-08-03       Impact factor: 8.800

5.  Chromosome-level genome assembly of Plazaster borealis sheds light on the morphogenesis of multiarmed starfish and its regenerative capacity.

Authors:  Yujung Lee; Bongsang Kim; Jaehoon Jung; Bomin Koh; So Yun Jhang; Chaeyoung Ban; Won-Jae Chi; Soonok Kim; Jaewoong Yu
Journal:  Gigascience       Date:  2022-07-09       Impact factor: 7.658

6.  Evolutionary diversification of the canonical Wnt signaling effector TCF/LEF in chordates.

Authors:  Nuria P Torres-Aguila; Marika Salonna; Stefan Hoppler; David E K Ferrier
Journal:  Dev Growth Differ       Date:  2022-02-03       Impact factor: 3.063

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.