Literature DB >> 32327507

Complete Genome Sequence of a Virulent Leptospira interrogans Serovar Copenhageni Strain, Assembled with a Combination of Nanopore and Illumina Reads.

Alejandro Llanes1, Dhani Prakoso2, Carlos Mario Restrepo1, Sreekumari Rajeev3.   

Abstract

Here, we present the complete genome sequence of a highly virulent Leptospira interrogans serovar Copenhageni strain isolated from a dog with severe leptospirosis. In this work, a gapless genome draft was assembled with a combination of Nanopore and Illumina data of relatively low coverage.
Copyright © 2020 Llanes et al.

Entities:  

Year:  2020        PMID: 32327507      PMCID: PMC7180280          DOI: 10.1128/MRA.00200-20

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Leptospira species are spirochete bacteria that colonize proximal renal tubules of mammals, and infections may cause fatal disease in humans and animals. In this study, we assembled the genome of Leptospira interrogans strain SK1 and compared it with those of L. interrogans serovar Copenhageni (strain Fiocruz L1-130) (1) and L. interrogans serovar Lai (strain 56601) (2). The SK1 strain was isolated from the blood of a canine patient that succumbed to severe clinical leptospirosis (3). The strain was isolated through routine Leptospira culture protocols using Ellinghausen-McCullough-Johnson-Harris medium, as described previously (1). Genomic DNA from strain SK1 was isolated with the MasterPure complete DNA and RNA purification kit (Epicentre, USA). Sequencing libraries for the MiSeq and MinION platforms were prepared with the Nextera XT DNA library preparation kit (Illumina, USA) and the ligation sequencing kit (SQK-LSK109; Oxford Nanopore Technologies, UK), respectively, following the manufacturers’ instructions. Quality control was performed with FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc). In the case of Nanopore reads, we used Guppy v.3.2.4 for base calling and Porechop v.0.2.4 (https://github.com/rrwick/Porechop) for adapter trimming. De novo assembly was performed with Canu v.2.0 (4) using only the Nanopore reads. The Canu assembly was initially polished with Medaka v.0.11.4 (https://nanoporetech.github.io/medaka) using probabilistic model r941_min_fast_g303. Pilon v.1.23 (5) was used for hybrid polishing with the Illumina reads. To estimate the number of sequencing errors after each polishing step, Illumina reads were aligned to the Nanopore assembly with BWA v.0.7.12 (6). BCFtools v.1.9 (7) was then used to call for single-nucleotide variants and insertion/deletions (indels), which were considered to be errors if their mapping quality score was above 20. RATT was used to transfer the genes annotated in the Fiocruz L1-130 genome to the SK1 assembly. Prokka v.1.14.3 (8) was used to complement the RATT results. Default parameters were used for all software except where otherwise noted. Illumina and Nanopore sequencing generated 1.1 million 300-bp paired-end reads and 142,300 reads with a median length of 8 kb, respectively. De novo assembly of Nanopore reads resulted in 2 contigs, consistent with the 2 Leptospira chromosomes. The number of suspected errors decreased from 14,672 (0.3%) in the original Canu assembly to 92 (0.002%) after polishing with Pilon. Of these, only 6 errors seemed to affect putative coding sequences and were corrected manually. The SK1 genome is very similar to that of strain Fiocruz L1-130 in terms of synteny (Fig. 1) and global metrics, including a total size of 4,630,180 bp, a G+C content of 35%, and 3,731 protein-coding genes. However, at least 11 pseudogenes from Fiocruz L1-130 appear to have functional orthologs in the genomes of strains SK1 and 56601, despite the latter belonging to L. interrogans serovar Lai. Conversely, genes folA and folC, encoding dihydrofolate reductase and dihydrofolate synthase, respectively, appear to have become pseudogenes in SK1, which suggests impaired folate metabolism in this strain.
Fig 1

Comparison of the genomes of L. interrogans serovar Copenhageni strains SK1 and Fiocruz L1-130. Pink bands represent regions of shared sequence similarity. Coverage of Nanopore and Illumina reads is plotted along the SK1 genome, averaged over a 500-bp sliding window. Chr, chromosome.

Comparison of the genomes of L. interrogans serovar Copenhageni strains SK1 and Fiocruz L1-130. Pink bands represent regions of shared sequence similarity. Coverage of Nanopore and Illumina reads is plotted along the SK1 genome, averaged over a 500-bp sliding window. Chr, chromosome.

Data availability.

The genome sequence of the SK1 strain and associated data were deposited under BioProject number PRJNA600490, GenBank accession numbers CP048830 (chromosome I) and CP048831 (chromosome II), and SRA accession numbers SRX7657977 (Illumina reads) and SRX7730232 (MinION reads).
  8 in total

1.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-09-08       Impact factor: 6.937

2.  Prokka: rapid prokaryotic genome annotation.

Authors:  Torsten Seemann
Journal:  Bioinformatics       Date:  2014-03-18       Impact factor: 6.937

3.  Unique physiological and pathogenic features of Leptospira interrogans revealed by whole-genome sequencing.

Authors:  Shuang-Xi Ren; Gang Fu; Xiu-Gao Jiang; Rong Zeng; You-Gang Miao; Hai Xu; Yi-Xuan Zhang; Hui Xiong; Gang Lu; Ling-Feng Lu; Hong-Quan Jiang; Jia Jia; Yue-Feng Tu; Ju-Xing Jiang; Wen-Yi Gu; Yue-Qing Zhang; Zhen Cai; Hai-Hui Sheng; Hai-Feng Yin; Yi Zhang; Gen-Feng Zhu; Ma Wan; Hong-Lei Huang; Zhen Qian; Sheng-Yue Wang; Wei Ma; Zhi-Jian Yao; Yan Shen; Bo-Qin Qiang; Qi-Chang Xia; Xiao-Kui Guo; Antoine Danchin; Isabelle Saint Girons; Ronald L Somerville; Yu-Mei Wen; Man-Hua Shi; Zhu Chen; Jian-Guo Xu; Guo-Ping Zhao
Journal:  Nature       Date:  2003-04-24       Impact factor: 49.962

4.  Comparative genomics of two Leptospira interrogans serovars reveals novel insights into physiology and pathogenesis.

Authors:  A L T O Nascimento; A I Ko; E A L Martins; C B Monteiro-Vitorello; P L Ho; D A Haake; S Verjovski-Almeida; R A Hartskeerl; M V Marques; M C Oliveira; C F M Menck; L C C Leite; H Carrer; L L Coutinho; W M Degrave; O A Dellagostin; H El-Dorry; E S Ferro; M I T Ferro; L R Furlan; M Gamberini; E A Giglioti; A Góes-Neto; G H Goldman; M H S Goldman; R Harakava; S M B Jerônimo; I L M Junqueira-de-Azevedo; E T Kimura; E E Kuramae; E G M Lemos; M V F Lemos; C L Marino; L R Nunes; R C de Oliveira; G G Pereira; M S Reis; A Schriefer; W J Siqueira; P Sommer; S M Tsai; A J G Simpson; J A Ferro; L E A Camargo; J P Kitajima; J C Setubal; M A Van Sluys
Journal:  J Bacteriol       Date:  2004-04       Impact factor: 3.490

5.  Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.

Authors:  Bruce J Walker; Thomas Abeel; Terrance Shea; Margaret Priest; Amr Abouelliel; Sharadha Sakthikumar; Christina A Cuomo; Qiandong Zeng; Jennifer Wortman; Sarah K Young; Ashlee M Earl
Journal:  PLoS One       Date:  2014-11-19       Impact factor: 3.240

6.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Authors:  Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

7.  Isolation and characterization of Leptospira interrogans serovar Copenhageni from a dog from Saint Kitts.

Authors:  Christopher R Larson; Michelle Dennis; Rajeev V Nair; Alejandro Llanes; Andrea Peda; Shamara Welcome; Sreekumari Rajeev
Journal:  JMM Case Rep       Date:  2017-10-23

8.  Fast and accurate long-read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2010-01-15       Impact factor: 6.937

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.