| Literature DB >> 27318200 |
Jianwei Zhang1, Dave Kudrna2, Ting Mu3, Weiming Li3, Dario Copetti4, Yeisoo Yu2, Jose Luis Goicoechea2, Yang Lei3, Rod A Wing4.
Abstract
MOTIVATION: Next generation sequencing technologies have revolutionized our ability to rapidly and affordably generate vast quantities of sequence data. Once generated, raw sequences are assembled into contigs or scaffolds. However, these assemblies are mostly fragmented and inaccurate at the whole genome scale, largely due to the inability to integrate additional informative datasets (e.g. physical, optical and genetic maps). To address this problem, we developed a semi-automated software tool-Genome Puzzle Master (GPM)-that enables the integration of additional genomic signposts to edit and build 'new-gen-assemblies' that result in high-quality 'annotation-ready' pseudomolecules.Entities:
Mesh:
Year: 2016 PMID: 27318200 PMCID: PMC5048067 DOI: 10.1093/bioinformatics/btw370
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.GPM assemblyRun operations
Fig. 2.Visualization of typical available data in GPM. (A) GPM assemblyCtg view of a 500-KB region. AssemblySeqs, top and bottom, are shown as overlapping (yellow) and fully redundant assemblySeqs are gray. The retained (green) and removed (gray) portions of assemblySeqs are indicated. (B) Chromosome-scale view of a 500-KB region that compares two genome assemblies to a Reference sequence. The Reference Sequence is shown in the middle (bright green) with alignments (yellow) to each assemblyCtg (violet) at the top and bottom. The assemblyCtg order can be changed by drag-and-drop (Color version of this figure is available at Bioinformatics online.)
Fig. 3.Flowchart for processing unitigs with postHGAP