Literature DB >> 24143057

AutoAssemblyD: a graphical user interface system for several genome assemblers.

Adonney Allan de Oliveira Veras1, Pablo Henrique Caracciolo Gomes de Sá, Vasco Azevedo, Artur Silva, Rommel Thiago Jucá Ramos.   

Abstract

UNLABELLED: Next-generation sequencing technologies have increased the amount of biological data generated. Thus, bioinformatics has become important because new methods and algorithms are necessary to manipulate and process such data. However, certain challenges have emerged, such as genome assembly using short reads and high-throughput platforms. In this context, several algorithms have been developed, such as Velvet, Abyss, Euler-SR, Mira, Edna, Maq, SHRiMP, Newbler, ALLPATHS, Bowtie and BWA. However, most such assemblers do not have a graphical interface, which makes their use difficult for users without computing experience given the complexity of the assembler syntax. Thus, to make the operation of such assemblers accessible to users without a computing background, we developed AutoAssemblyD, which is a graphical tool for genome assembly submission and remote management by multiple assemblers through XML templates. AVAILABILITY: AssemblyD is freely available at https://sourceforge.net/projects/autoassemblyd. It requires Sun jdk 6 or higher.

Entities:  

Keywords:  Bioinformatics; Genome Assembly; Next-generation sequencing

Year:  2013        PMID: 24143057      PMCID: PMC3796888          DOI: 10.6026/97320630009840

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Next-generation sequencing technologies (NGS) revolutionized biology, as they reduced costs and increased the speed of genome sequencing, consequently generating large amounts of data compared with the Sanger method [1]. However, challenges have emerged, such as processing the large amounts of data generated by such platforms, especially without a robust computer infrastructure, and conducting genome assembly from short reads, which requires multiple software programs and parameters for optimization [2-4]. The primary assemblers include Velvet, ABySS, AllPaths, SOAPdenovo and MAQ [5]. Most such assemblers operate through lengthy command lines composed of one or several parameters that influence the assembly results, which can be difficult for users with little computing experience [4-6]. Thus, a graphical interface facilitates algorithm use, such as in the VAGUE software [7], which uses a graphical interface for Velvet assembler operation that allows the user to set the parameter values necessary for operation. However, given the wide variety of tools available, such data analyses are limited where a graphical interface is only available for the Velvet assembler [5]. Therefore, this study presents AutoAssemblyD, which is a graphical tool used for submitting and managing genome assembly from different assemblers through XML templates. This tool also facilitates assembly on remote devices through distributed programming.

Methodology

Programming Language:

The Java programming language was used for AutoAssemblyD development. Remote method invocation (RMI) and Socket are used in the AutoAssemblyD application and were implemented through the java.net and java.io package [8].

The Template:

Standard Extensible Markup Language (XML) was used to generate the templates as well as identify the assembler, parameters and respective values in accordance with the userdefined configuration.

Remote Management:

RMI was used to manage the remote assembly; the clients establish communication with the server upon initiation, which can be selected to perform the assembly process. Communication between the clients and server is tested every 3 seconds, and the server application shows the connection status to the user. The Socket application is used for file transfer between the clients and server; this application initiates upon client and server execution.

AutoAssemblyD Modules:

AutoAssemblyD is composed of two modules: local and remote. The local module can be used when the computer has the necessary resources (memory, disk space and CPU) to process the data. The remote module facilitates file transfer and remote processing; thereafter, the results are transmitted to the server.

AutoAssemblyD:

AutoassemblyD comprises three interfaces: template creation, template update and assembly operation. The assembler commands, parameters and their respective values are input at the template creation interface (Figure 1).
Figure 1

Standard window for template creation.

After the assembler informations are defined, the user selects the option creates file to generate the XML template used for assembly operation. The template is updated through the UpdateXML window that facilitates template reuse in future operations with other input data and parameters, which increases its flexibility. To operate the assembly, the template file must be loaded, and the client that performs the process must be selected. The user guide is available at: https://sourceforge.net/projects/ autoassemblyd.
  7 in total

1.  Assemblathon 1: a competitive assessment of de novo short read assembly methods.

Authors:  Dent Earl; Keith Bradnam; John St John; Aaron Darling; Dawei Lin; Joseph Fass; Hung On Ken Yu; Vince Buffalo; Daniel R Zerbino; Mark Diekhans; Ngan Nguyen; Pramila Nuwantha Ariyaratne; Wing-Kin Sung; Zemin Ning; Matthias Haimel; Jared T Simpson; Nuno A Fonseca; İnanç Birol; T Roderick Docking; Isaac Y Ho; Daniel S Rokhsar; Rayan Chikhi; Dominique Lavenier; Guillaume Chapuis; Delphine Naquin; Nicolas Maillet; Michael C Schatz; David R Kelley; Adam M Phillippy; Sergey Koren; Shiaw-Pyng Yang; Wei Wu; Wen-Chi Chou; Anuj Srivastava; Timothy I Shaw; J Graham Ruby; Peter Skewes-Cox; Miguel Betegon; Michelle T Dimon; Victor Solovyev; Igor Seledtsov; Petr Kosarev; Denis Vorobyev; Ricardo Ramirez-Gonzalez; Richard Leggett; Dan MacLean; Fangfang Xia; Ruibang Luo; Zhenyu Li; Yinlong Xie; Binghang Liu; Sante Gnerre; Iain MacCallum; Dariusz Przybylski; Filipe J Ribeiro; Shuangye Yin; Ted Sharpe; Giles Hall; Paul J Kersey; Richard Durbin; Shaun D Jackman; Jarrod A Chapman; Xiaoqiu Huang; Joseph L DeRisi; Mario Caccamo; Yingrui Li; David B Jaffe; Richard E Green; David Haussler; Ian Korf; Benedict Paten
Journal:  Genome Res       Date:  2011-09-16       Impact factor: 9.043

2.  VAGUE: a graphical user interface for the Velvet assembler.

Authors:  David R Powell; Torsten Seemann
Journal:  Bioinformatics       Date:  2012-11-17       Impact factor: 6.937

3.  Rapid hybrid de novo assembly of a microbial genome using only short reads: Corynebacterium pseudotuberculosis I19 as a case study.

Authors:  Louise Teixeira Cerdeira; Adriana Ribeiro Carneiro; Rommel Thiago Jucá Ramos; Sintia Silva de Almeida; Vivian D'Afonseca; Maria Paula Cruz Schneider; Jan Baumbach; Andreas Tauch; John Anthony McCulloch; Vasco Ariston Carvalho Azevedo; Artur Silva
Journal:  J Microbiol Methods       Date:  2011-05-18       Impact factor: 2.363

Review 4.  Next-generation sequencing and large genome assemblies.

Authors:  Joseph Henson; German Tischler; Zemin Ning
Journal:  Pharmacogenomics       Date:  2012-06       Impact factor: 2.533

5.  Simplifier: a web tool to eliminate redundant NGS contigs.

Authors:  Rommel Thiago Jucá Ramos; Adriana Ribeiro Carneiro; Vasco Azevedo; Maria Paula Schneider; Debmalya Barh; Artur Silva
Journal:  Bioinformation       Date:  2012-10-13

6.  Feature-by-feature--evaluating de novo sequence assembly.

Authors:  Francesco Vezzi; Giuseppe Narzisi; Bud Mishra
Journal:  PLoS One       Date:  2012-02-03       Impact factor: 3.240

7.  Tips and tricks for the assembly of a Corynebacterium pseudotuberculosis genome using a semiconductor sequencer.

Authors:  Rommel Thiago Jucá Ramos; Adriana Ribeiro Carneiro; Siomar de Castro Soares; Anderson Rodrigues dos Santos; Sintia Almeida; Luis Guimarães; Flávia Figueira; Eudes Barbosa; Andreas Tauch; Vasco Azevedo; Artur Silva
Journal:  Microb Biotechnol       Date:  2012-12-02       Impact factor: 5.813

  7 in total
  1 in total

1.  Comparative Analysis of WRKY Genes Potentially Involved in Salt Stress Responses in Triticum turgidum L. ssp. durum.

Authors:  Fatma-Ezzahra Yousfi; Emna Makhloufi; William Marande; Abdel W Ghorbel; Mondher Bouzayen; Hélène Bergès
Journal:  Front Plant Sci       Date:  2017-01-31       Impact factor: 5.753

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.