Literature DB >> 27074260

Draft genome of the Leptospira interrogans strains, Acegua, RCA, Prea, and Capivara, obtained from wildlife maintenance hosts and infected domestic animals.

Frederico S Kremer1, Marcus R Eslabão1, Sérgio Jorge1, Natasha R Oliveira1, Julia Labonde1, Monize N P Santos1, Leonardo G Monte1, André A Grassmann1, Carlos E P Cunha1, Karine M Forster1, Luísa Z Moreno2, Andrea M Moreno2, Vinicius F Campos1, Alan J A McBride1, Luciano S Pinto1, Odir A Dellagostin1.   

Abstract

In the present paper, we announce new draft genomes of four Leptospira interrogans strains named Acegua, RCA, Prea, and Capivara. These strains were isolated in the state of Rio Grande do Sul, Brazil, from cattle, dog, Brazilian guinea pig, and capybara, respectively.

Entities:  

Mesh:

Year:  2016        PMID: 27074260      PMCID: PMC4830120          DOI: 10.1590/0074-02760160010

Source DB:  PubMed          Journal:  Mem Inst Oswaldo Cruz        ISSN: 0074-0276            Impact factor:   2.743


The Leptospira genus comprises at least 22 different species, some of which, like Leptospira interrogans, Leptospira borgpetersenii, Leptospira santarosai, Leptospira noguchii, and Leptospira kirschneri, are pathogenic and may cause leptospirosis (Boonsilp et al. 2013,Bourhy et al 2014). This neglected zoonosis is globally distributed and has become a reemerging public health problem in many countries, with stronger impact in tropical regions (Evangelista & Coburn 2010, Guerra 2013). Commonly found in rodents, leptospires may also infect and be hosted by different domestic and wildlife animals (Bharti et al. 2003). This wide variety of reservoirs may play a key role in the maintenance and transmission of the disease (Levett 2001). Therefore, genome sequencing of isolates from different hosts potentially provides a starting point to towards understanding the ability of Leptospira spp to adapt to specific host and the basis of pathogen-host interaction. In the present study, whole-genome sequencing was performed for the strains Acegua, isolated from a stillborn bovine foetus (Monte et al. 2015), RCA, isolated from a domestic dog with clinical leptospirosis, Prea, isolated from Brazilian guinea pig (Cavia aperea) (Monte et al. 2013), and Capivara, isolated from capybara (Hydrochoerus hydrochaeris) (Jorge et al. 2012). The isolates were cultured in Ellinghausen-McCullough-Johnson-Harris (EMJH) medium supplemented with 10% Leptospira enrichment EMJH (Difco, USA), 200 μg/mL 5-fluorouracil, and 5% foetal calf serum in an incubator at 30ºC without agitation. DNA extraction was performed using the commercial Illustra Bacteria GenomicPrep Mini Spin kit (GE Healthcare, USA), following the manufacturer instructions. The whole genome sequences were obtained using an Illumina MiSeq paired-end library for Acegua, an Illumina MiSeq paired-end library and an Ion Torrent PGM fragment library for RCA and Prea, and an Ion Torrent PGM fragment library for Capivara. The raw reads were filtered by quality using Fastx-Toolkit (hannonlab.cshl.edu/fastx_toolkit/) and the paired-end reads were trimmed using Trimmomatic (Bolger et al. 2014). De novo assembly was performed using A5 (Tritt et al. 2012), SGA (Simpson & Durbin 2012), and Ray (Boisvert et al. 2010) for Acegua, A5, SGA, Ray, MIRA (chevreux.org/), Newbler (roche.com/), and SPAdes (Bankevich et al. 2012) for RCA and Prea, and MIRA, Newbler, and SPAdes for Capivara. For each strain the de novo assemblies were merged using CISA (Lin & Liao 2013) and evaluated using QUAST (Gurevich et al. 2013). Genome annotation was performed as previous described (Kremer et al. 2015) using Prodigal (Hyatt et al. 2010), NCBI-BLAST+ (Altschul et al. 1990, Camacho et al. 2009), Uniprot (Apweiler et al. 2004), HMMER (Eddy 2011), AntiFam (Eberhardt et al. 2012), tRNAscan-SE (Lowe & Eddy 1997), RNAmmer (Lagesen et al. 2007), INFERNAL (Nawrocki et al. 2009), Aragorn (Laslett 2004), and Rfam (Griffiths-Jones et al. 2003), and manually reviewed using Artemis (Rutherford et al. 2000). In silico multilocus sequence typing (MLST) was performed using BLASTn from the NCBI-BLAST+ and allele data from the Leptospira MLST scheme 1 (Boonsilp et al. 2013), obtained from PubMLST repository (pubmlst.org/). The results of the de novo assemblies are presented in Table I. The isolates were initially sequenced using only the Illumina platform, but the high fragmentation in the resulting assembly for Prea and RCA isolates (data not showed) motivated the use of a second next-generation sequencing technology to improve the original draft sequences. Although usually not required, the combination of data of two or more platforms in the sequencing of a given genome may result in a more accurate assembly, considering that each sequencing technology has it owns bias. The most common errors associated with Illumina data occurs on CG-poor and CG-rich regions, while IonTorrent, duo to its chemistry, has a high error-rate in homopolymeric regions. In fact, both characteristics are found inLeptospira genomes.
TABLE I

Summary of the assembly results

IsolateScaffoldsa(n)Assembly length (Mb)N50 (bp)CG (%)
Acegua1584.663,48935.07
RCA894.4355,78235.06
Prea1064.4446,50835.17
Capivara1604.5145,52634.98

a: all assembled sequences joined (or not) by linkage information.

a: all assembled sequences joined (or not) by linkage information. During genome annotation (Table II), by using our pipeline, in addition to the coding DNA sequences, we were also able to identify many noncoding feature in all four genomes, including not only transfer RNAs and ribosomal RNAs, but also transfer-messenger RNAs (tmRNAs), RNase P loci, and riboswitches. There is an increasing interest in the analysis of gene expression inLeptospira, especially during infection (Matsui et al. 2012, Lehmann et al. 2013, Caimano et al. 2014, Eshghi et al. 2014). Recent studies have already performed whole-transcriptome sequencing of L. interrogans and many noncoding features associated with gene expression regulation and transcriptional/translational processing were identified, including RNase P, tmRNAs, riboswitches, as well other families of noncoding RNA. Therefore, the identification of noncoding features in the annotation of newly sequenced genomes may allow a more accurate description of the resulting transcriptome.
TABLE II

Summary of the annotation results

IsolateCDStRNAsrRNAsOther ncRNAsa Riboswitches
Acegua373437423
RCA3591343105
Prea3616335103
Capivara414637373

a: includes the noncoding RNAs identified by Rfam and Aragorn; CDS: coding DNA sequence; ncRNAs: noncoding RNA; rRNAs: ribosomal RNAs; tRNAs: transfer RNAs.

a: includes the noncoding RNAs identified by Rfam and Aragorn; CDS: coding DNA sequence; ncRNAs: noncoding RNA; rRNAs: ribosomal RNAs; tRNAs: transfer RNAs. The in silico MLST sequence types (ST) for the four isolates are presented in Table III. Previously identified by variable-number tandem-repeat asL. interrogans serogroup Australis serovar Muenchen (Monte et al. 2015), the Acegua isolate was a match for ST24 that contains two L. interrogans serogroup Australis isolates, while the Capivara isolate was identified as ST17 that includes nine L. interrogans serogroup Icterohaemorrhagiae isolates (5 belonging to serovar Copenhageni and 2 to serovar Icterohaemorrhagiae). Preliminary analysis revealed that the pfkBlocus was absent in the draft assemblies of RCA and Prea. To investigate this fact, the raw reads from these isolates were aligned using BLASTn against a reference set of pfkB alleles obtained from the PubMLST repository. The BLAST XML output was analysed by a Python script to identify reads that correspond to this locus using an identity threshold of 95%. The selected reads were saved in FASTQ format and filtered by quality using a minimum Phred score of 20 in at least 95% of the bases. After filtering, 83 reads remained in the Prea set, and 90 in the RCA set, corresponding to mean coverages of about 18 and 20-fold, respectively. Therefore, the absence of this locus in both draft genomes was a result of an assembly artifact. For each genome, the reads that aligned to the pfkB database were assembled using CAP3 (Huang & Madan 1999) and the resulting contigs were aligned against the same database to identify the corresponding alleles in the MLST scheme 1, that are showed in Table III.
TABLE III

Sequence types (ST) profiles of the Acegua, RCA, Prea, and Capivara strains based on the Leptospira multilocus sequence typing scheme 1

Isolate glmU pntA sucA tpiA pfkB mreA caiB ST
Acegua142153424
RCA112210a 4817
Prea112210a 4817
Capivara1122104817

a: hit not found in the BLAST against the draft genome assembly.

a: hit not found in the BLAST against the draft genome assembly. The Leptospira genus comprises more than 300 serovars and pathogenic species were already reported in a wide variety of animal hosts. However, from the 233 genome sequences indexed in BioProject database and available at GenBank with host information, the major part (166) was obtained from human samples (ncbi.nlm.nih.gov/bioproject/). The sequencing of isolates obtained from wildlife animals, like C. aperea and H. hydrochaeris, both rodents and natural reservoirs, provide data for future pangenome and pathogenome analysis intending to understand the factors that guide the pathogen-host interactions. Additionally, the isolate Acegua, obtained from a bovine stillborn, also represents an interesting source of information about these interactions, since abortion induced by leptospirosis in cattle is usually associated to the serovar Hardjo of the speciesL. interrogans and L. borgpetersenii, not to Muenchen, although this serovar has been associated to abortions in pigs (Ellis et al. 1986). Finally, the analysis of these isolates also provide new insights into the serogroups circulating in the south of Brazil, suggesting that while L. interrogans serogroup Icterohaemorrhagiae serovars Icterohaemorrhagiae and Copenhageni are present, they are not the only ones. Based on the MLST profiles, serovars belonging to serogroup Australis are also circulating among wild and domestic animals, and the comparative analysis of genomic data may be applied to trace their distribution and evolution. Furthermore, the availability of these new genome sequences from four L. interrogans strains, isolated from diverse hosts, will provide useful data towards understanding the molecular diversity and pathogenesis of these new strains. Nucleotide sequence accessions - These Whole Genome Shotgun projects have been deposited at DDBJ/EMBL/GenBank under the accessions LCZF00000000 for Acegua, LJBP00000000 for RCA, LJBO00000000 for Prea, and LJBQ00000000 for Capivara. The versions described in this paper are LCZF01000000, LJBP01000000, LJBO01000000, and LJBQ01000000, respectively.
  35 in total

Review 1.  Leptospirosis: a zoonotic disease of global importance.

Authors:  Ajay R Bharti; Jarlath E Nally; Jessica N Ricaldi; Michael A Matthias; Monica M Diaz; Michael A Lovett; Paul N Levett; Robert H Gilman; Michael R Willig; Eduardo Gotuzzo; Joseph M Vinetz
Journal:  Lancet Infect Dis       Date:  2003-12       Impact factor: 25.071

2.  Rfam: an RNA family database.

Authors:  Sam Griffiths-Jones; Alex Bateman; Mhairi Marshall; Ajay Khanna; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

Review 3.  Leptospira as an emerging pathogen: a review of its biology, pathogenesis and host immune responses.

Authors:  Karen V Evangelista; Jenifer Coburn
Journal:  Future Microbiol       Date:  2010-09       Impact factor: 3.165

4.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

5.  Efficient de novo assembly of large genomes using compressed data structures.

Authors:  Jared T Simpson; Richard Durbin
Journal:  Genome Res       Date:  2011-12-07       Impact factor: 9.043

Review 6.  Leptospirosis: public health perspectives.

Authors:  Marta A Guerra
Journal:  Biologicals       Date:  2013-07-10       Impact factor: 1.856

7.  Detection of virulence factors and molecular typing of pathogenic Leptospira from capybara (Hydrochaeris hydrochaeris).

Authors:  Sérgio Jorge; Leonardo G Monte; Marco Antonio Coimbra; Ana Paula Albano; Daiane D Hartwig; Caroline Lucas; Fabiana K Seixas; Odir A Dellagostin; Cláudia P Hartleben
Journal:  Curr Microbiol       Date:  2012-07-11       Impact factor: 2.188

8.  CISA: contig integrator for sequence assembly of bacterial genomes.

Authors:  Shin-Hung Lin; Yu-Chieh Liao
Journal:  PLoS One       Date:  2013-03-28       Impact factor: 3.240

9.  Pathogenomic inference of virulence-associated genes in Leptospira interrogans.

Authors:  Jason S Lehmann; Derrick E Fouts; Daniel H Haft; Anthony P Cannella; Jessica N Ricaldi; Lauren Brinkac; Derek Harkins; Scott Durkin; Ravi Sanka; Granger Sutton; Angelo Moreno; Joseph M Vinetz; Michael A Matthias
Journal:  PLoS Negl Trop Dis       Date:  2013-10-03

10.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

View more
  4 in total

Review 1.  Reverse Vaccinology: An Approach for Identifying Leptospiral Vaccine Candidates.

Authors:  Odir A Dellagostin; André A Grassmann; Caroline Rizzi; Rodrigo A Schuch; Sérgio Jorge; Thais L Oliveira; Alan J A McBride; Daiane D Hartwig
Journal:  Int J Mol Sci       Date:  2017-01-14       Impact factor: 5.923

2.  Genome of Leptospira borgpetersenii strain 4E, a highly virulent isolate obtained from Mus musculus in southern Brazil.

Authors:  Marcus Redü Eslabão; Frederico Schmitt Kremer; Rommel Thiago Juca Ramos; Artur Luiz da Costa da Silva; Vasco Ariston de Carvalho Azevedo; Luciano da Silva Pinto; Éverton Fagonde da Silva; Odir Antônio Dellagostin
Journal:  Mem Inst Oswaldo Cruz       Date:  2018-02       Impact factor: 2.743

3.  A Universal Vaccine against Leptospirosis: Are We Going in the Right Direction?

Authors:  André Alex Grassmann; Jéssica Dias Souza; Alan John Alexander McBride
Journal:  Front Immunol       Date:  2017-03-09       Impact factor: 7.561

4.  Whole-genome sequencing of Leptospira interrogans from southern Brazil: genetic features of a highly virulent strain.

Authors:  Sérgio Jorge; Frederico Schmitt Kremer; Natasha Rodrigues de Oliveira; Gabrielle de Oliveira Sanches Valerio Navarro; Amanda Munari Guimarães; Christian Domingues Sanchez; Rafael Danelon Dos Santos Woloski; Karine Forster Ridieri; Vinícius Farias Campos; Luciano da Silva Pinto; Odir Antônio Dellagostin
Journal:  Mem Inst Oswaldo Cruz       Date:  2018-02       Impact factor: 2.743

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.