Literature DB >> 9146965

Hopper: software for automating data tracking and flow in DNA sequencing.

T M Smith1, C Abajian, L Hood.   

Abstract

MOTIVATION: Genome-scale DNA sequencing is a multistep process in which large numbers of small template clones are propagated, purified, sequenced and analyzed on acrylamide gels. A significant challenge to these projects is the scale at which the data handling must be done. Hence, large-scale sequencing facilities will benefit from tracking template DNA information (purification methods, reaction and electrophoresis conditions) in a systematic fashion. A lack of software tools that support automated sample entry, and automatic data storage, retrieval and analysis are a major hindrance to recording and using laboratory workflow information to monitor the overall quality of data production.
RESULTS: The UNIX file system has been used to prototype automation of the flow of data from the ABI sequencer to a data repository. Data are automatically processed by a central Perl program, Hopper, which runs a series of programs that analyze data quality (read length estimate, fraction of indeterminate bases, and number of contaminating and repetitive sequences), assemble shotgun sequence data, and generates simple reports describing the results.

Entities:  

Mesh:

Year:  1997        PMID: 9146965     DOI: 10.1093/bioinformatics/13.2.175

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  5 in total

1.  Automated sequence preprocessing in a large-scale sequencing environment.

Authors:  M C Wendl; S Dear; D Hodgson; L Hillier
Journal:  Genome Res       Date:  1998-09       Impact factor: 9.043

2.  Kaleidaseq: a Web-based tool to monitor data flow in a high throughput sequencing facility.

Authors:  N N Dedhia; W R McCombie
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

3.  PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities.

Authors:  Peter V Troshin; Vincent Lg Postis; Denise Ashworth; Stephen A Baldwin; Michael J McPherson; Geoffrey J Barton
Journal:  BMC Res Notes       Date:  2011-03-07

4.  MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools.

Authors:  Chun Liang; Feng Sun; Haiming Wang; Junfeng Qu; Robert M Freeman; Lee H Pratt; Marie-Michèle Cordonnier-Pratt
Journal:  BMC Bioinformatics       Date:  2006-03-07       Impact factor: 3.169

5.  Design and implementation of a generalized laboratory data model.

Authors:  Michael C Wendl; Scott Smith; Craig S Pohl; David J Dooling; Asif T Chinwalla; Kevin Crouse; Todd Hepler; Shin Leong; Lynn Carmichael; Mike Nhan; Benjamin J Oberkfell; Elaine R Mardis; LaDeana W Hillier; Richard K Wilson
Journal:  BMC Bioinformatics       Date:  2007-09-26       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.