Literature DB >> 27171416

GapBlaster-A Graphical Gap Filler for Prokaryote Genomes.

Pablo H C G de Sá1, Fábio Miranda1, Adonney Veras1, Diego Magalhães de Melo1, Siomar Soares2, Kenny Pinheiro1, Luis Guimarães1, Vasco Azevedo3, Artur Silva1, Rommel T J Ramos1.   

Abstract

The advent of NGS (Next Generation Sequencing) technologies has resulted in an exponential increase in the number of complete genomes available in biological databases. This advance has allowed the development of several computational tools enabling analyses of large amounts of data in each of the various steps, from processing and quality filtering to gap filling and manual curation. The tools developed for gap closure are very useful as they result in more complete genomes, which will influence downstream analyses of genomic plasticity and comparative genomics. However, the gap filling step remains a challenge for genome assembly, often requiring manual intervention. Here, we present GapBlaster, a graphical application to evaluate and close gaps. GapBlaster was developed via Java programming language. The software uses contigs obtained in the assembly of the genome to perform an alignment against a draft of the genome/scaffold, using BLAST or Mummer to close gaps. Then, all identified alignments of contigs that extend through the gaps in the draft sequence are presented to the user for further evaluation via the GapBlaster graphical interface. GapBlaster presents significant results compared to other similar software and has the advantage of offering a graphical interface for manual curation of the gaps. GapBlaster program, the user guide and the test datasets are freely available at https://sourceforge.net/projects/gapblaster2015/. It requires Sun JDK 8 and Blast or Mummer.

Entities:  

Mesh:

Year:  2016        PMID: 27171416      PMCID: PMC4865197          DOI: 10.1371/journal.pone.0155327

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Next generation sequencing (NGS) platforms have reduced sequencing costs and increased the amount of data generated, resulting in a greater number of complete genomes for eukaryotes and prokaryotes, which are subsequently deposited in public databases [1,2]. Several computational tools have been developed for processing reads, such as error correction and quality filters, as well as additional programs and pipelines that perform genome assemblies of reads generated by NGS platforms, producing complete genomes or scaffolds [3,4]. As a result of assembly reads, many contigs are produced. These reads or reference genomes can be used to order the contigs to produce a scaffold. Some regions in the scaffold have no assigned bases (A,C,T or G) due to the limitations of sequencing technology or assembly algorithms; these regions are called gaps and are usually represented by Ns [5-7]. Beyond commercial programs, such as CLC Genomic Workbench and Lasergene Suite, which have available options for finishing genome assemblies, including steps that fill gaps, open source programs are available. For example the open source programs G4ALL [8], GapCloser [3], GapFiller [6], and FGAP [9] use different approaches, such as paired reads or results of assemblies obtained with different software, to fill gap regions. The FGAP program was implemented in Matlab language and uses a draft of the assembly and a set of contigs that are mapped against genome draft to close gaps using BLAST algorithms. Both a fasta and a log file that report the filled gaps are generated at the end of the process. However, FGAP has no graphical interface [9]. G4ALL was implemented via JAVA programming language. The software has a graphical interface that allows the user to perform gap closure through manual curation of the scaffolds by comparing the BLAST results of the assembled contigs to the assembled scaffolds, similar to the GapBlaster method. G4ALL is useful for extending the contigs based on the overlap between them; however, it does not use contigs to close the gap regions [8]. GapCloser uses the information from paired reads to extend the sequences of contigs between gaps. Thus, the gaps can be closed or reduced [3]. Similar to GapCloser, the GapFiller program uses paired reads and is able to use data from different sequencing rounds simultaneously [6]. It is one of the available tools for closing gaps in prokaryotic and eukaryotic genomes of sizes up to ~100 Mb [10]. Genomes that have gaps may impair further studies because they may only partially represent an organism’s gene repertoire. Incomplete genomes can affect downstream analyses of genomic plasticity and comparative genomics [11]. Therefore, it is important to use complete genomes for comparative studies to properly characterize genome structure variations and gene content. This characterization allows the identification of genes that are 1) shared among all isolates and are thus useful for applied issues, such as vaccine and drug design [12]; 2) shared by some organisms, but not all studied organisms, and are thus useful for studying the reference lab activities for pathogenic bacteria [13,14]; and 3) present in a single isolate providing information regarding bacteria lifestyle [15]. Thus, this study presents a computational tool with a graphical user interface that helps reduce gaps through manual curation to increase the completion of genome assembly, rather than relying on the complete automation of this task.

Materials and Methods

Implementation

The GapBlaster was developed via JAVA programming language (http://java.sun.com/) using the paradigm of object orientation and the Swing library to create the visual resources (http://java.sun.com/docs/books/tutorial/uiswing). Through the main interface of GapBlaster (S1 Fig), the user can input the scaffold and the contig files in FASTA format. After processing, another screen (S2 Fig) shows the alignment results. The user is then able to perform manual curation and select alignments that fill gaps confidently, as when the user finds a contig aligned in the gap flanks, closing the gap completely, as shown in S3 Fig. GapBlaster performs five steps to identify possible gaps to be filled. All of the contigs obtained in the assembly are aligned against the draft genome or scaffold using BLAST Legacy [16], Blast+ or Mummer [17] based on user choice, and the alignment result is converted to the GapBlaster format. The contigs are subsequently ordered according to the mapping position in the scaffold. The program searches the alignments of the same contig that flank gap regions. A new ordination of the alignments is performed to determine the best option for gap closure. All identified alignments that fill gaps are presented to the user for evaluation (accepted or rejected) through the GapBlaster interface, and a log of changes made is generated. The selection of the alignment and the parameters can be defined by the user through the GapBlaster interface.

Test data

To evaluate the GapBlaster program, analyses were conducted using two datasets: the first used sequencing data of Corynebacterium pseudotuberculosis, and the second was obtained from the GAGE (Genome Assembly Gold-Standard Evaluation) assembly of genomes [18]. C. pseudotuberculosis is a facultative intracellular gram-positive bacterium that causes caseous lymphadenitis (CLA), an infectious disease that affects small ruminants and belongs to CMNR group (Corynebacterium, Mycobacterium, Nocardia, and Rhodococcus) [19]. The sequencing of C. pseudotuberculosis was performed by an Ion Torrent PGM platform (Table 1). The reads (available in SRA database: SRR3312980) were assembled by a de novo strategy using SPADES version 3.1.0, with default parameters for Ion Torrent PGM data [20]. The scaffolds and contigs files produced in the assembly (available in https://sourceforge.net/projects/gapblaster2015/files/test_dataset/) were used as inputs in GapBlaster.
Table 1

Sequencing information of the genomes used in the analysis.

OrganismPlatformLibraryRead LengthInsert SizeNumber of Reads
Corynebacterium pseudotuberculosis 262Ion Torrent PGMFragment~220 bpN/A1765213
Staphylococcus aureus A-S391_USA300IlluminaPaired-end~101 bp180 bp1294104
Staphylococcus aureus A-S391_USA300IlluminaMate-Pair~37 bp3500 bp3494070
Rhodobacter sphaeroides 2.4.1IlluminaPaired-end~101 bp180 bp2050868
Rhodobacter sphaeroides 2.4.1IlluminaMate-Pair~101 bp3500 bp2050868
The GAGE dataset had the assemblies of the Staphylococcus aureus and Rhodobacter sphaeroides genomes, containing contigs and scaffolds generated by the following assemblers: Abyss, ABySS2, AllPaths-LG, Bambus2, MSR-CA, SGA, SOAPdenovo and Velvet for both organisms, whereas the CABOG was used for only Rhodobacter sphaeroides [18]. The data are available at http://gage.cbcb.umd.edu/, and the genome sequencing information can be seen in Table 1.

GapBlaster

All contigs and scaffolds of the datasets were manually evaluated with GapBlaster version 1.1.1 to close gaps. In our analysis, we used one scaffold and one contig file for each organism/assembly, with the parameter Flank Length = 11 and the aligner Blast+ (the parameters in the GapBlaster should be set to reproduce our results). To close gaps, regions flanking the gaps (represented by Ns) were considered only when they had high identity (the threshold should be defined by the user).

Gap closure comparison

To compare gap filling performance, GapBlaster, GapFiller and FGAP software were used in a gap closure analysis of the GAGE dataset and C. pseudotuberculosis. The GAGE dataset with the mate-pair reads was analyzed with GapFiller [6] and FGAP [9] based on gap closure performance. Both types of software were used under default parameters, and the results were subsequently compared to GapBlaster. The C. pseudotuberculosis genome was analyzed with FGAP only as GapFiller software requires paired-end libraries, and C. pseudotuberculosis was sequenced using fragmented libraries. The results of FGAP were compared to GapBlaster. Additionally, GapBlaster was used to reduce gaps in the output files of FGAP and GapFiller software.

Results evaluation

To validate the gap filling analysis, an in-house script was developed to evaluate the amount of gaps and Ns for each of the tests. The FASTA file (original scaffolds and the results of GapBlaster, FGAP, and GapFiller) was used as an input to count the number of gaps and their respective sizes. This script and a brief manual are available at https://sourceforge.net/projects/gapblaster2015/upload/scripts/. To confirm if the gaps were correctly closed, the validation script of GAGE was used (http://gage.cbcb.umd.edu/results/gage-paper-validation.tar.gz). The input of this script was the reference genome (Table 2) and the original scaffold or gap-filled scaffold file.
Table 2

Information of the reference genomes used to validate the filled-in gaps.

OrganismCorynebacterium Pseudotuberculosis 262Staphylococcus Aureus A-S391_USA300Rhodobacter Sphaeroides 2.4.1
Genome Size232574928729164628173
GC Content52,170,327668,77
Number of Chrs112
Number of Plasmids005
GenbankCP012022.1CP007690.1GCA_000273405.1
150 pb Repeats80071070938073
250 pb Repeats6612946035353

Results and Discussion

The assembly results (number of bases and scaffolds) of the C. pseudotuberculosis genome produced by SPADES and the information concerning several assemblies of S. aureus and R. sphaeroides produced by various types of assemblers are shown in Table 3.
Table 3

Genome assembly information for C. pseudotuberculosis 262, S. aureus and R. sphaeroides.

OrganismAssemblerBases (with N)#Scaffolds
C. pseudotuberculosis 262---------------------
SPADES28938574611
S. aureus---------------------
ABySS38931855012
ABySS23821622125
Allpaths-LG288067619
Bambus2286293017
MSR-CA287290517
SGA3128388546
SOAPdenovo2924135175
Velvet2877995173
R. sphaeroides---------------------
ABySS51601672714
ABySS25331930480
Allpaths-LG460978538
Bambus2442861292
CABOG4259679130
MSR-CA449855944
SGA56146932096
SOAPdenovo4627058312
Velvet4615068382
The results of the gap closure process for the Corynebacterium data assembled by SPADES are shown in Table 4.
Table 4

Gap closure results for the Corynebacterium genome.

#Gaps#N#Gaps GB#N GB#Gaps FGAP#N FGAP
Corynebacterium pseudotuberculosis241794119315360

Results of gap closure analysis of Corynebacterium, showing the #Gaps (amount of gaps) and #N (gap length); #Gaps GB and #N GB show the amount of remaining gaps and Ns, respectively, after the use of GapBlaster. The #Gaps FGAP and #N FGAP show the amount of remaining gaps and Ns, respectively, after the use of FGAP.

Results of gap closure analysis of Corynebacterium, showing the #Gaps (amount of gaps) and #N (gap length); #Gaps GB and #N GB show the amount of remaining gaps and Ns, respectively, after the use of GapBlaster. The #Gaps FGAP and #N FGAP show the amount of remaining gaps and Ns, respectively, after the use of FGAP. For C. pseudotuberculosis the amount of gaps was reduced from 24 to 11 with GapBlaster, and from 24 to 5, with FGAP. Gap length was also reduced for the Corynebacterium genome, as shown in Table 4. The C. pseudotuberculosis genome was sequenced using fragment libraries; thus, they could not be submitted to GapFiller. The GAGE data of S. aureus and R. sphaeroides were assembled by several assemblers, and the results (contigs and scaffolds) were submitted to GapBlaster, FGAP and GapFiller. All assemblies of S. aureus revealed reductions in gaps and Ns when analyzed by GapBlaster. For R. sphaeroides, only the data for SGA did not show a reduction in gaps by GapBlaster (Table 5). It is important to highlight that GapBlaster allows manual curation; it allows less stringent criteria with careful manual evaluation, which is able to produce better results.
Table 5

Gap closure results for GAGE Assemblies.

Staphylococcus aureus#Gaps#N#Gaps GB#N GB#Gaps FGAP#N FGAP#Gaps GF#N GF
AbySS6655882554761445511276956355
AbySS23393912777801748503510003
Allpaths-LG2398752094461587554010472
Bambus29529201932915980274599830771
MSR-CA81103537278684778618011651
SGA654300607642292067634298252654312284
SOAPdenovo94857848377470895010
Velvet1281768812417473941540612719863
Rhodobacter sphaeroides#Gaps#N#Gaps GB#N GB#Gaps FGAP#N FGAP#Gaps GF#N GF
AbySS261114525261114525256113886306118298
AbySS223562570233621282286032329068052
Allpaths-LG90213298720733821950016424001
Bambus28557041835640280559908456930
CABOG19321547192208921902106519125011
MSR-CA35632628349261893473117433637494
SGA9381145600938114560093011449559301159235
SOAPdenovo381046137960137100973811176
Velvet42786815424867854048606341594150

Results of the gap closure process for the data produced by GAGE with several assemblers for S. aureus and R. sphaeroides. Showing the #Gaps (amount of gaps) and #N (gap length); #Gaps GB and #N GB show the amount of remaining gaps and Ns, respectively, after the use of GapBlaster. The #Gaps FGAP and #N FGAP show the amount of remaining gaps and Ns, respectively, after the use of FGAP. The #Gaps GF and #N GF show the amount of remaining gaps and Ns, respectively, after the use of GapFiller.

Results of the gap closure process for the data produced by GAGE with several assemblers for S. aureus and R. sphaeroides. Showing the #Gaps (amount of gaps) and #N (gap length); #Gaps GB and #N GB show the amount of remaining gaps and Ns, respectively, after the use of GapBlaster. The #Gaps FGAP and #N FGAP show the amount of remaining gaps and Ns, respectively, after the use of FGAP. The #Gaps GF and #N GF show the amount of remaining gaps and Ns, respectively, after the use of GapFiller. The FGAP and GapFiller programs were used to perform the gap closure step, and these results were compared with those obtained by GapBlaster (Table 5). GapFiller increased the numbers of gaps in most of the analyzed assemblies due the insert length, which was used to align against the reference sequences. In other cases, any gap that was closed had its length (the amount of Ns) increased, which occurred for the assemblies from SGA and SOAPdenovo for S. aureus and for the assemblies from SOAPdenovo for R. sphaeroides. Other results showed that GapFiller reduced the amount of gaps but increased their length (amount of Ns), which was observed for MSR-CA for S. aureus and CABOG, MSR-CA, SGA and for Velvet for R. sphaeroides (Table 5). Despite GapFiller having closed more gaps than GapBlaster for CABOG, MSR-CA, SGA and Velvet for R. sphaeroides, GapBlaster was superior to GapFiller. GapBlaster was able to fill more gaps and reduce the number of Ns in the sequences for nearly all GAGE assemblies, although it did not use paired reads. FGAP filled more gaps than GapBlaster for all assemblies of the GAGE dataset. Nevertheless, GapBlaster filled more Ns than FGAP for ABySS and SGA for S. aureus and CABOG, MSR-CA and Velvet for R. sphaeroides (Table 5). Despite FGAP performing the gap filling analysis automatically while GapBlaster performed the analysis manually, they achieved very similar results with respect to the number of gaps and N reductions for SOAPdenovo for S. aureus and Bambus2, CABOG, MSR-CA and SOAPdenovo for R. sphaeroides (Table 5). FGAP showed better results for both the C. pseudotuberculosis and the GAGE datasets. We performed the gap filling analysis of the FGAP results with the original contigs of each organism and assembly through GapBlaster to determine whether GapBlaster could improve the results produced by FGAP. The results are shown in Table 6. Compared with the FGAP results, GapBlaster improved 55.55% of all assemblies of the GAGE dataset and C. pseudotuberculosis. GapFiller was not used for this comparison of Corynebacterium data because only a fragment library was available for this organism.
Table 6

Comparison of the original results of FGAP and after manual curation with GapBlaster.

Staphylococcus aureus#Gaps FGAP#N FGAP#Gaps after GB#N after GB
AbySS45511274145439
MSR-CA477861466359
SGA634298252629290825
Rhodobacter sphaeroides#Gaps FGAP#N FGAP#Gaps after GB#N after GB
AbySS22286032322760040
Allpaths-LG82195008119494
Bambus280559907955402
CABOG1902106518819568
MSR-CA3473117434325592
SOAPdenovo3710097369237
#Gaps FGAP#N FGAP#Gaps after GB#N after GB
Corynebacterium pseudotuberculosis53603251

The results produced by FGAP were used as input for GapBlaster, and the organism/assemblies that were improved are shown. The #Gaps FGAP and #N FGAP show the amount of gaps and Ns, respectively, for the results of FGAP. The #Gaps after GB and #N after GB show the amounts of remaining gaps and Ns, respectively, after the use of GapBlaster.

The results produced by FGAP were used as input for GapBlaster, and the organism/assemblies that were improved are shown. The #Gaps FGAP and #N FGAP show the amount of gaps and Ns, respectively, for the results of FGAP. The #Gaps after GB and #N after GB show the amounts of remaining gaps and Ns, respectively, after the use of GapBlaster. GapBlaster improved the results of FGAP for C. pseudotuberculosis in that it reduced the number of gaps from 5 to 3. Therefore, GapBlaster improved the gap filling results for several assemblies for S. aureus and R. sphaeroides, as shown in Table 6. This analysis shows that despite its usefulness for closing gaps through its GUI, GapBlaster is also useful for gap filling when used in combination with another tools. Similar to the analysis of the FGAP results, we conducted an evaluation of the GapFiller output files and the original contigs of each organism/assembler of the GAGE dataset via GapBlaster. Compared with the GapFiller results, GapBlaster improved 70.58% of all assemblies of the GAGE dataset (Table 7).
Table 7

Comparison of the original results of GapFiller and after manual curation with GapBlaster.

Staphylococcus aureus#Gaps GF#N GF#Gaps after GB#N after GB
AbySS69563556654837
AbySS23510003308741
Allpaths-LG40104723910455
Bambus298307719730725
MSR-CA8011651769794
SGA654312284646307095
Rhodobacter sphaeroides#Gaps GF#N GF#Gaps after GB#N after GB
AbySS306118298304118287
AbySS22906805228867740
Allpaths-LG1642400116323780
CABOG1912501119024336
MSR-CA3363749433333590
SGA93011592359291159162

The results produced by GapFiller were used as input for GapBlaster, and the organism/assemblies that were improved are shown. The #Gaps GF and #N GF show the amount of gaps and Ns, respectively, in the results of GapFiller. The #Gaps after GB and #N after GB show the amount of remaining gaps and Ns, respectively, after the use of GapBlaster.

The results produced by GapFiller were used as input for GapBlaster, and the organism/assemblies that were improved are shown. The #Gaps GF and #N GF show the amount of gaps and Ns, respectively, in the results of GapFiller. The #Gaps after GB and #N after GB show the amount of remaining gaps and Ns, respectively, after the use of GapBlaster. GapBlaster improved the results of GapFiller for almost all of the CAGE data (Table 7). The best gap filling results were ABySS2 and SGA for S. aureus, where the gaps decreased from 35 to 30 and 654 to 646, respectively (Table 7). Beyond being a very useful tool with an interface for manual curation, GapBlaster is a valuable open source program that can be used with other tools in the gap filling analysis to produce more complete genome drafts. To evaluate the accuracy of the closed gaps, all results produced by GapBlaster, FGAP, GapFiller and the original files (scaffolds) were aligned against their respective genome reference (Table 2). The results show that all of the files produced in the gap filling analysis showed similar alignment percentages with the original files, which confirms that the bases introduced in the filled gaps were correct (S1 Table). Despite the three methods (Blast+, Blast Legacy, and Mummer) implemented in GapBlaster, we used only Blast+ to fill gaps as this method is the same used for FGAP software. However, we tested all of the algorithms for GAGE data, and Blast Legacy and Blast+ presented similar results (S2 Table). The comparisons of the features of GapBlaster, FGAP and GapFiller helped to identify the main advantage of GapBlaster, the graphical interface, which uses contigs to fill gaps and allows manual curation (Table 8).
Table 8

Comparison of the features of GapBlaster, FGAP and GapFiller.

FeaturesGapBlasterFGAPGapFiller
Alignment methodBlast+ or Blast Legacy or MummerBlast+Bowtie or BWA
Set Flank AlignmentYesYesYes
Allow Manual CurationYesNoNo
Perform Automatic AnalysisYesYesYes
Based on paired-readsNoNoYes
Use contigs to fill gapsYesYesNo
Graphical interfaceYesNoNo
Improve gap filling results of other softwaresYesNot testedNot tested
Correctly fill gaps?YesYesYes

Conclusions

Despite the efficiency of tools such as FGAP and GapFiller, the gap closure process continues to be a step that requires manual curation for the acquisition of high quality results, such as those presented by GapBlaster, the use of which is simplified by the graphical interface. GapBlaster revealed improved gap filling performance using contigs compared to GapFiller for nearly all data evaluated despite the use of paired reads in GapFiller. In addition to presenting better results, the GapBlaster program has the advantage of introducing fewer errors, based on the ability of the interface to allow the user to decide if a gap is filled properly. As an alternative, GapBlaster can be used in addition to other gap closer programs to facilitate genome completion through manual manipulation, as was shown in the analysis of the GapBlaster program to improve the results of FGAP and GapFiller.

GapBlaster main interface.

The main graphical interface through which the user can input the contigs and scaffold files and set the alignment preferences. (TIF) Click here for additional data file.

Alignment interface.

The screen shows the results of the alignment of a contig against a scaffold. All alignments produced are listed. The user can check if the alignments are correct and select them. (TIF) Click here for additional data file.

Selected Alignment.

The aligned contig filled the gap with high accuracy due to the high identity found in the gap flanks. (TIF) Click here for additional data file.

GapBlaster, FGAP and GapFiller accuracy.

This table shows information about the percentage of bases aligned against the GapBlaster, FGAP, GapFiller results and the original scaffolds to the reference genome to evaluate if the filled gaps introduced the correct bases in the analysis. (XLS) Click here for additional data file.

GapBlaster algorithm comparison.

Comparison of the three alignment algorithms implemented in GapBlaster (Blast+, Blast Legacy, Mummer) to evaluate the number of alignments identified, closed gaps and N removed after the gap filling process. (XLS) Click here for additional data file.
  17 in total

1.  GAGE: A critical evaluation of genome assemblies and assembly algorithms.

Authors:  Steven L Salzberg; Adam M Phillippy; Aleksey Zimin; Daniela Puiu; Tanja Magoc; Sergey Koren; Todd J Treangen; Michael C Schatz; Arthur L Delcher; Michael Roberts; Guillaume Marçais; Mihai Pop; James A Yorke
Journal:  Genome Res       Date:  2012-01-06       Impact factor: 9.043

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  Evaluation of new biomarker genes for differentiating Haemophilus influenzae from Haemophilus haemolyticus.

Authors:  M Jordan Theodore; Raydel D Anderson; Xin Wang; Lee S Katz; Jeni T Vuong; Melissa E Bell; Billie A Juni; Sara A Lowther; Ruth Lynfield; Jessica R MacNeil; Leonard W Mayer
Journal:  J Clin Microbiol       Date:  2012-02-01       Impact factor: 5.948

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

Review 5.  The bacterial pan-genome:a new paradigm in microbiology.

Authors:  Alex Mira; Ana B Martín-Cuadrado; Giuseppe D'Auria; Francisco Rodríguez-Valera
Journal:  Int Microbiol       Date:  2010-06       Impact factor: 2.479

6.  Alignment of whole genomes.

Authors:  A L Delcher; S Kasif; R D Fleischmann; J Peterson; O White; S L Salzberg
Journal:  Nucleic Acids Res       Date:  1999-06-01       Impact factor: 16.971

Review 7.  Corynebacterium pseudotuberculosis: microbiology, biochemical properties, pathogenesis and molecular studies of virulence.

Authors:  Fernanda Alves Dorella; Luis Gustavo Carvalho Pacheco; Sergio Costa Oliveira; Anderson Miyoshi; Vasco Azevedo
Journal:  Vet Res       Date:  2006 Mar-Apr       Impact factor: 3.683

8.  Graphical contig analyzer for all sequencing platforms (G4ALL): a new stand-alone tool for finishing and draft generation of bacterial genomes.

Authors:  Rommel Thiago Jucá Ramos; Adriana R Carneiro; Pablo H Caracciolo; Vasco Azevedo; Maria Paula C Schneider; Debmalya Barh; Artur Silva
Journal:  Bioinformation       Date:  2013-06-29

9.  Draft genome sequences of Bordetella holmesii strains from blood (F627) and nasopharynx (H558).

Authors:  Kathleen M Tatti; Vladimir N Loparev; Satishkumar Ranganathanganakammal; Shankar Changayil; Michael Frace; Michael Ryan Weil; Scott Sammons; Duncan Maccannell; Leonard W Mayer; M Lucia Tondella
Journal:  Genome Announc       Date:  2013-03-21

10.  FGAP: an automated gap closing tool.

Authors:  Vitor C Piro; Helisson Faoro; Vinicius A Weiss; Maria B R Steffens; Fabio O Pedrosa; Emanuel M Souza; Roberto T Raittz
Journal:  BMC Res Notes       Date:  2014-06-18
View more
  10 in total

1.  LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.

Authors:  Gui-Cai Xu; Tian-Jun Xu; Rui Zhu; Yan Zhang; Shang-Qi Li; Hong-Wei Wang; Jiong-Tang Li
Journal:  Gigascience       Date:  2019-01-01       Impact factor: 6.524

2.  TGS-GapCloser: A fast and accurate gap closer for large genomes with low coverage of error-prone long reads.

Authors:  Mengyang Xu; Lidong Guo; Shengqiang Gu; Ou Wang; Rui Zhang; Brock A Peters; Guangyi Fan; Xin Liu; Xun Xu; Li Deng; Yongwei Zhang
Journal:  Gigascience       Date:  2020-09-01       Impact factor: 6.524

3.  Genomic analysis of four strains of Corynebacterium pseudotuberculosis bv. Equi isolated from horses showing distinct signs of infection.

Authors:  Rafael A Baraúna; Rommel T J Ramos; Adonney A O Veras; Pablo H C G de Sá; Luís C Guimarães; Diego A das Graças; Adriana R Carneiro; Judy M Edman; Sharon J Spier; Vasco Azevedo; Artur Silva
Journal:  Stand Genomic Sci       Date:  2017-01-31

4.  Draft Genome Sequences of Mycobacterium kansasii Clinical Strains.

Authors:  Paulina Borówka; Jakub Lach; Zofia Bakuła; Jakko van Ingen; Aleksandra Safianowska; Anna Brzostek; Jarosław Dziadek; Dominik Strapagiel; Tomasz Jagielski
Journal:  Genome Announc       Date:  2017-06-01

5.  Draft Genome Sequence of Corynebacterium pseudotuberculosis Strain PA05 Isolated from an Ovine Host in Pará State, Brazil.

Authors:  Alyne Cristina Sodré Lima; Vitória Almeida Gonçalves de Moura; Kenny da Costa Pinheiro; Carla Thais Moreira Paixão; Wana Lailan Oliveira da Costa; Adriana Ribeiro Carneiro Folador; Ana Luiza de Mattos Guaraldi; Rommel T J Ramos; Artur Silva; Joana Montezano Marques
Journal:  Genome Announc       Date:  2017-03-30

6.  High-Quality Complete Genome Sequences of Three Bovine Shiga Toxin-Producing Escherichia coli O177:H- (fliCH25) Isolates Harboring Virulent stx2 and Multiple Plasmids.

Authors:  Haiqing Sheng; Mingrui Duan; Samuel S Hunter; Scott A Minnich; Matthew L Settles; Daniel D New; Jennifer R Chase; Matthew W Fagnan; Carolyn J Hovde
Journal:  Genome Announc       Date:  2018-02-15

7.  Re-sequencing and optical mapping reveals misassemblies and real inversions on Corynebacterium pseudotuberculosis genomes.

Authors:  Thiago de Jesus Sousa; Doglas Parise; Rodrigo Profeta; Mariana Teixeira Dornelles Parise; Anne Cybelle Pinto Gomide; Rodrigo Bentos Kato; Felipe Luiz Pereira; Henrique Cesar Pereira Figueiredo; Rommel Ramos; Bertram Brenig; Artur Luiz da Costa da Silva; Preetam Ghosh; Debmalya Barh; Aristóteles Góes-Neto; Vasco Azevedo
Journal:  Sci Rep       Date:  2019-11-08       Impact factor: 4.379

8.  Probiogenomics of Lactobacillus delbrueckii subsp. lactis CIDCA 133: In Silico, In Vitro, and In Vivo Approaches.

Authors:  Luís Cláudio Lima de Jesus; Mariana Martins Drumond; Flávia Figueira Aburjaile; Thiago de Jesus Sousa; Nina Dias Coelho-Rocha; Rodrigo Profeta; Bertram Brenig; Pamela Mancha-Agresti; Vasco Azevedo
Journal:  Microorganisms       Date:  2021-04-14

9.  Complete genome reveals genetic repertoire and potential metabolic strategies involved in lignin degradation by environmental ligninolytic Klebsiella variicola P1CD1.

Authors:  Amanda Oliveira Dos Santos Melo-Nascimento; Brena Mota Moitinho Sant Anna; Carolyne Caetano Gonçalves; Giovanna Santos; Eliane Noronha; Nádia Parachin; Milton Ricardo de Abreu Roque; Thiago Bruce
Journal:  PLoS One       Date:  2020-12-22       Impact factor: 3.240

10.  Approaches for in silico finishing of microbial genome sequences.

Authors:  Frederico Schmitt Kremer; Alan John Alexander McBride; Luciano da Silva Pinto
Journal:  Genet Mol Biol       Date:  2017 Jul-Sep 01       Impact factor: 1.771

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.