Literature DB >> 20525342

Scaffold filling, contig fusion and comparative gene order inference.

Adriana Muñoz1, Chunfang Zheng, Qian Zhu, Victor A Albert, Steve Rounsley, David Sankoff.   

Abstract

BACKGROUND: There has been a trend in increasing the phylogenetic scope of genome sequencing without finishing the sequence of the genome. Increasing numbers of genomes are being published in scaffold or contig form. Rearrangement algorithms, however, including gene order-based phylogenetic tools, require whole genome data on gene order or syntenic block order. How then can we use rearrangement algorithms to compare genomes available in scaffold form only? Can the comparative evidence predict the location of unsequenced genes?
RESULTS: Our method involves optimally filling in genes missing from the scaffolds, while incorporating the augmented scaffolds directly into the rearrangement algorithms as if they were chromosomes. This is accomplished by an exact, polynomial-time algorithm. We then correct for the number of extra fusion/fission operations required to make scaffolds comparable to full assemblies. We model the relationship between the ratio of missing genes actually absent from the genome versus merely unsequenced ones, on one hand, and the increase of genomic distance after scaffold filling, on the other. We estimate the parameters of this model through simulations and by comparing the angiosperm genomes Ricinus communis and Vitis vinifera.
CONCLUSIONS: The algorithm solves the comparison of genomes with 18,300 genes, including 4500 missing from one genome, in less than a minute on a MacBook, putting virtually all genomes within range of the method.

Entities:  

Mesh:

Year:  2010        PMID: 20525342      PMCID: PMC2902449          DOI: 10.1186/1471-2105-11-304

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  12 in total

1.  Rearrangement phylogeny of genomes in contig form.

Authors:  Adriana Muñoz; David Sankoff
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2010 Oct-Dec       Impact factor: 3.710

2.  Efficient sorting of genomic permutations by translocation, inversion and block interchange.

Authors:  Sophia Yancopoulos; Oliver Attie; Richard Friedberg
Journal:  Bioinformatics       Date:  2005-06-09       Impact factor: 6.937

3.  MSOAR: a high-throughput ortholog assignment system based on genome rearrangement.

Authors:  Zheng Fu; Xin Chen; Vladimir Vacic; Peng Nan; Yang Zhong; Tao Jiang
Journal:  J Comput Biol       Date:  2007-11       Impact factor: 1.479

4.  Locating large-scale gene duplication events through reconciled trees: implications for identifying ancient polyploidy events in plants.

Authors:  J G Burleigh; M S Bansal; A Wehe; O Eulenstein
Journal:  J Comput Biol       Date:  2009-08       Impact factor: 1.479

5.  Polyploidy and angiosperm diversification.

Authors:  Douglas E Soltis; Victor A Albert; Jim Leebens-Mack; Charles D Bell; Andrew H Paterson; Chunfang Zheng; David Sankoff; Claude W Depamphilis; P Kerr Wall; Pamela S Soltis
Journal:  Am J Bot       Date:  2009-01       Impact factor: 3.844

6.  Gene loss under neighborhood selection following whole genome duplication and the reconstruction of the ancestral Populus genome.

Authors:  Chunfang Zheng; P Kerr Wall; James Leebens-Mack; Claude DE Pamphilis; Victor A Albert; David Sankoff
Journal:  J Bioinform Comput Biol       Date:  2009-06       Impact factor: 1.122

7.  Towards improved reconstruction of ancestral gene order in angiosperm phylogeny.

Authors:  David Sankoff; Chunfang Zheng; P Kerr Wall; Claude dePamphilis; Jim Leebens-Mack; Victor A Albert
Journal:  J Comput Biol       Date:  2009-10       Impact factor: 1.479

8.  OrthoMCL: identification of ortholog groups for eukaryotic genomes.

Authors:  Li Li; Christian J Stoeckert; David S Roos
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

9.  Inparanoid: a comprehensive database of eukaryotic orthologs.

Authors:  Kevin P O'Brien; Maido Remm; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  A high quality draft consensus sequence of the genome of a heterozygous grapevine variety.

Authors:  Riccardo Velasco; Andrey Zharkikh; Michela Troggio; Dustin A Cartwright; Alessandro Cestaro; Dmitry Pruss; Massimo Pindo; Lisa M Fitzgerald; Silvia Vezzulli; Julia Reid; Giulia Malacarne; Diana Iliev; Giuseppina Coppola; Bryan Wardell; Diego Micheletti; Teresita Macalma; Marco Facci; Jeff T Mitchell; Michele Perazzolli; Glenn Eldredge; Pamela Gatto; Rozan Oyzerski; Marco Moretto; Natalia Gutin; Marco Stefanini; Yang Chen; Cinzia Segala; Christine Davenport; Lorenzo Demattè; Amy Mraz; Juri Battilana; Keith Stormo; Fabrizio Costa; Quanzhou Tao; Azeddine Si-Ammour; Tim Harkins; Angie Lackey; Clotilde Perbost; Bruce Taillon; Alessandra Stella; Victor Solovyev; Jeffrey A Fawcett; Lieven Sterck; Klaas Vandepoele; Stella M Grando; Stefano Toppo; Claudio Moser; Jerry Lanchbury; Robert Bogden; Mark Skolnick; Vittorio Sgaramella; Satish K Bhatnagar; Paolo Fontana; Alexander Gutin; Yves Van de Peer; Francesco Salamini; Roberto Viola
Journal:  PLoS One       Date:  2007-12-19       Impact factor: 3.240

View more
  12 in total

1.  CSAR-web: a web server of contig scaffolding using algebraic rearrangements.

Authors:  Kun-Tze Chen; Chin Lung Lu
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

2.  Filling a Protein Scaffold With a Reference.

Authors:  Letu Qingge; Xiaowen Liu; Farong Zhong; Binhai Zhu
Journal:  IEEE Trans Nanobioscience       Date:  2017-02-09       Impact factor: 2.935

3.  Multi-CSAR: a web server for scaffolding contigs using multiple reference genomes.

Authors:  Shu-Cheng Liu; Yan-Ru Ju; Chin Lung Lu
Journal:  Nucleic Acids Res       Date:  2022-05-07       Impact factor: 19.160

4.  Comparative mapping in the Fagaceae and beyond with EST-SSRs.

Authors:  Catherine Bodénès; Emilie Chancerel; Oliver Gailing; Giovanni G Vendramin; Francesca Bagnoli; Jerome Durand; Pablo G Goicoechea; Carolina Soliani; Fiorella Villani; Claudia Mattioni; Hans Peter Koelewijn; Florent Murat; Jerome Salse; Guy Roussel; Christophe Boury; Florian Alberto; Antoine Kremer; Christophe Plomion
Journal:  BMC Plant Biol       Date:  2012-08-29       Impact factor: 4.215

5.  SIS: a program to generate draft genome sequence scaffolds for prokaryotes.

Authors:  Zanoni Dias; Ulisses Dias; João C Setubal
Journal:  BMC Bioinformatics       Date:  2012-05-14       Impact factor: 3.169

6.  CAR: contig assembly of prokaryotic draft genomes using rearrangements.

Authors:  Chin Lung Lu; Kun-Tze Chen; Shih-Yuan Huang; Hsien-Tai Chiu
Journal:  BMC Bioinformatics       Date:  2014-11-28       Impact factor: 3.169

7.  Multi-CAR: a tool of contig scaffolding using multiple references.

Authors:  Kun-Tze Chen; Cheih-Jung Chen; Hsin-Ting Shen; Chia-Liang Liu; Shang-Hao Huang; Chin Lung Lu
Journal:  BMC Bioinformatics       Date:  2016-12-23       Impact factor: 3.169

8.  Assembling contigs in draft genomes using reversals and block-interchanges.

Authors:  Chi-Long Li; Kun-Tze Chen; Chin Lung Lu
Journal:  BMC Bioinformatics       Date:  2013-04-10       Impact factor: 3.169

9.  Locating rearrangement events in a phylogeny based on highly fragmented assemblies.

Authors:  Chunfang Zheng; David Sankoff
Journal:  BMC Genomics       Date:  2016-01-11       Impact factor: 3.969

10.  Approaches for in silico finishing of microbial genome sequences.

Authors:  Frederico Schmitt Kremer; Alan John Alexander McBride; Luciano da Silva Pinto
Journal:  Genet Mol Biol       Date:  2017 Jul-Sep 01       Impact factor: 1.771

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.