Complete Genome Sequence of Lysinibacillus sphaericus B1-CDA, a Bacterium That Accumulates Arsenic.

Aminur Rahman¹, Noor Nahar², Jana Jass³, Björn Olsson², Abul Mandal⁴.

Abstract

Here, we report the genomic sequence and genetic composition of an arsenic-resistant bacterium, Lysinibacillus sphaericus B1-CDA. Assembly of the sequencing reads revealed that the genome size is ~4.5 Mb, encompassing ~80% of the chromosomal DNA.

Year: 2016 PMID: 26798084 PMCID: PMC4722251 DOI: 10.1128/genomeA.00999-15

Source DB: PubMed Journal: Genome Announc

GENOME ANNOUNCEMENT

The arsenic-resistant strain B1-CDA was isolated from arsenic-contaminated land in Bangladesh (1). Its genomic DNA was sequenced on an Illumina HiSeq 2500 PE100 sequencer with a single sequencing index. Assembly began with Illumina 100-bp paired-end reads of genomic DNA with an insert length of 300 bp. Read quality was checked using FastQC (2). The raw reads were quality trimmed and corrected using Quake (3). Properly paired reads ≥30 bp in length were selected from the pool of corrected reads, and the remaining singleton reads were treated as single-end reads. Both types of reads were then used in k-mer-based de novo assembly with SOAPdenovo (4). K-mers ranging from 29 to 99 were evaluated, and the set of scaffolds with the largest N50 was selected. These scaffold sequences were then subjected to gap closing using the corrected paired-end reads. The resulting scaffolds of length ≥300 bp were chosen as the final assembly (5).

Illumina deep sequencing generated a total of 11,105,899 read pairs. Analysis of the raw reads with FastQC showed that the average per-base Phred score was ≥32 at all positions and that the mean per-sequence Phred score was 38. The overall G+C content was 38%. After quality trimming, error correction, and removal of the TruSeq adapter sequence, 10,940,654 read pairs (98.5%) and 145,888 single-end sequences remained for further analysis. The set of scaffold sequences with the maximal N50 (507,225 bp) was produced at a k-mer of 91. After gap closure and length filtering (≥300 bp), the final assembly was 4,509,276 bp and consisted of 31 scaffolds ranging from 314 bp to 1,145,744 bp. The assembled genome sequence was annotated with RAST (6).
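
The scaffold selection step described above (evaluate several k-mers, keep the scaffold set with the largest N50, then retain scaffolds of at least 300 bp) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names and the toy scaffold lengths are hypothetical, and N50 is computed with the common definition, namely the length of the scaffold at which the cumulative length of the scaffolds, sorted from longest to shortest, first reaches half of the total assembly length.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("no scaffold lengths given")


def best_k(assemblies):
    """Return (k, N50) for the k-mer whose scaffold set has the largest N50.

    `assemblies` maps a k-mer size to the scaffold lengths obtained with it.
    """
    scores = {k: n50(lengths) for k, lengths in assemblies.items()}
    k = max(scores, key=scores.get)
    return k, scores[k]


def filter_scaffolds(lengths, min_len=300):
    """Keep only scaffolds of at least `min_len` bp."""
    return [n for n in lengths if n >= min_len]


# Toy scaffold lengths (hypothetical, NOT the B1-CDA data), one set per k.
toy_assemblies = {
    29: [500, 400, 300, 200],
    61: [900, 300, 200],
    91: [1200, 150, 50],
}

chosen_k, chosen_n50 = best_k(toy_assemblies)
final_scaffolds = filter_scaffolds(toy_assemblies[chosen_k])
print(chosen_k, chosen_n50, final_scaffolds)
```

In this sketch the length filter is applied only after the best k-mer has been chosen, mirroring the order of steps given in the text; gap closing is omitted because it requires the reads themselves.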
The RAST analysis pipeline uses tRNAscan-SE to predict tRNA genes (7) and the Glimmer algorithm to predict protein-coding genes (8). RAST predicted 77 tRNA genes and 11 rRNA genes, the latter comprising seven 5S, one 16S, and three 23S genes. A total of 4,513 protein-coding genes were predicted using the Glimmer algorithm, of which 2,671 were annotated by RAST's automated homology analysis and assigned to functional categories. The GeneMark (9) and FgenesB (10) algorithms were also applied, yielding 4,562 and 4,323 genes, respectively. Functional annotation by RAST and Blast2GO (11) indicated that B1-CDA contains many genes that are responsive to metal ions such as arsenic, cobalt, copper, iron, nickel, potassium, manganese, and zinc. All protein-coding sequences predicted by GeneMark were submitted to Blast2GO for functional annotation. Based on phylogenetic trees inferred with the neighbor-joining method (12) as implemented in the MEGA6 software (13), B1-CDA most closely resembles Lysinibacillus sphaericus strains G10, R-27024, and CICR-X12. In summary, strain B1-CDA carries several metal-responsive genes that might be utilized in bioremediation of toxic metals in polluted environments.
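
As a quick consistency check on the gene counts reported above, the following Python sketch tallies the rRNA genes, computes the share of Glimmer-predicted protein-coding genes that received a RAST functional annotation, and measures how far the three gene predictors disagree. All numbers are taken from the text; the variable names are illustrative only.

```python
# Counts as reported in the text.
rrna_genes = {"5S": 7, "16S": 1, "23S": 3}
predicted_genes = {"Glimmer": 4513, "GeneMark": 4562, "FgenesB": 4323}
rast_annotated = 2671

# The three rRNA classes should add up to the reported total of 11.
rrna_total = sum(rrna_genes.values())

# Percentage of Glimmer-predicted genes with a RAST functional annotation.
annotated_share = 100 * rast_annotated / predicted_genes["Glimmer"]

# Difference between the largest and smallest gene-count prediction.
spread = max(predicted_genes.values()) - min(predicted_genes.values())

print(f"rRNA genes: {rrna_total}")
print(f"annotated by RAST: {annotated_share:.1f}% of Glimmer predictions")
print(f"spread between predictors: {spread} genes")
```

Roughly three in five of the Glimmer-predicted genes were assigned a function, and the three predictors differ by a few hundred genes, which is why the text reports each count separately rather than a single gene total.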

Nucleotide sequence accession numbers.

The genome sequence of strain B1-CDA has been deposited in GenBank under accession number LJYY00000000. The version described in this paper is the first version, LJYY00000000.1.

11 in total

1. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors: Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal: Mol Biol Evol Date: 2013-10-16 Impact factor: 16.240

2. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors: T M Lowe; S R Eddy
Journal: Nucleic Acids Res Date: 1997-03-01 Impact factor: 16.971

3. Ab initio gene finding in Drosophila genomic DNA.

Authors: A A Salamov; V V Solovyev
Journal: Genome Res Date: 2000-04 Impact factor: 9.043

4. The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors: N Saitou; M Nei
Journal: Mol Biol Evol Date: 1987-07 Impact factor: 16.240

5. Microbial gene identification using interpolated Markov models.

Authors: S L Salzberg; A L Delcher; S Kasif; O White
Journal: Nucleic Acids Res Date: 1998-01-15 Impact factor: 16.971

6. Comparative genome analysis of Lysinibacillus B1-CDA, a bacterium that accumulates arsenics.

Authors: Aminur Rahman; Noor Nahar; Neelu N Nawani; Jana Jass; Sibdas Ghosh; Björn Olsson; Abul Mandal
Journal: Genomics Date: 2015-09-24 Impact factor: 5.736

7. Isolation and characterization of a Lysinibacillus strain B1-CDA showing potential for bioremediation of arsenics from contaminated water.

Authors: Aminur Rahman; Noor Nahar; Neelu N Nawani; Jana Jass; Prithviraj Desale; Balu P Kapadnis; Khaled Hossain; Ananda K Saha; Sibdas Ghosh; Björn Olsson; Abul Mandal
Journal: J Environ Sci Health A Tox Hazard Subst Environ Eng Date: 2014 Impact factor: 2.269

8. Quake: quality-aware detection and correction of sequencing errors.

Authors: David R Kelley; Michael C Schatz; Steven L Salzberg
Journal: Genome Biol Date: 2010-11-29 Impact factor: 13.583

9. The RAST Server: rapid annotations using subsystems technology.

Authors: Ramy K Aziz; Daniela Bartels; Aaron A Best; Matthew DeJongh; Terrence Disz; Robert A Edwards; Kevin Formsma; Svetlana Gerdes; Elizabeth M Glass; Michael Kubal; Folker Meyer; Gary J Olsen; Robert Olson; Andrei L Osterman; Ross A Overbeek; Leslie K McNeil; Daniel Paarmann; Tobias Paczian; Bruce Parrello; Gordon D Pusch; Claudia Reich; Rick Stevens; Olga Vassieva; Veronika Vonstein; Andreas Wilke; Olga Zagnitko
Journal: BMC Genomics Date: 2008-02-08 Impact factor: 3.969

10. High-throughput functional annotation and data mining with the Blast2GO suite.

Authors: Stefan Götz; Juan Miguel García-Gómez; Javier Terol; Tim D Williams; Shivashankar H Nagaraj; María José Nueda; Montserrat Robles; Manuel Talón; Joaquín Dopazo; Ana Conesa
Journal: Nucleic Acids Res Date: 2008-04-29 Impact factor: 16.971

2 in total

Review 1. Understanding and Designing the Strategies for the Microbe-Mediated Remediation of Environmental Contaminants Using Omics Approaches.

Authors: Muneer A Malla; Anamika Dubey; Shweta Yadav; Ashwani Kumar; Abeer Hashem; Elsayed Fathi Abd Allah
Journal: Front Microbiol Date: 2018-06-04 Impact factor: 5.640

2. Complete Genome Sequence of Enterobacter cloacae B2-DHA, a Chromium-Resistant Bacterium.

Authors: Aminur Rahman; Noor Nahar; Björn Olsson; Abul Mandal
Journal: Genome Announc Date: 2016-06-02
