Literature DB >> 32439666

Genome Sequences of Burkholderia thailandensis Strains E421, E426, and DW503.

Catherine M Mageeney1, Anupama Sinha1, Kelly P Williams1, Steven S Branda2.   

Abstract

We present the draft genome sequences of three Burkholderia thailandensis strains, E421, E426, and DW503. E421 consists of 90 contigs of 6,639,935 bp and 67.73% GC content. E426 consists of 106 contigs of 6,587,853 bp and 67.73% GC content. DW503 consists of 102 contigs of 6,458,767 bp and 67.64% GC content.
Copyright © 2020 Mageeney et al.

Entities:  

Year:  2020        PMID: 32439666      PMCID: PMC7242668          DOI: 10.1128/MRA.00312-20

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Burkholderia thailandensis, occurring naturally in soil and water, serves as a surrogate for the study of its more pathogenic congeners, select agent bacteria Burkholderia pseudomallei and Burkholderia mallei, with which it shares many genome features and pathogenesis mechanisms, and it provokes similar immune responses in mammalian systems (1). We report the genome sequences of three B. thailandensis strains obtained from BEI Resources (Manassas, VA, USA). B. thailandensis E421 and E426 were isolated independently from a rice field in Northeast Thailand. B. thailandensis DW503 is an allelic exchange mutant of B. thailandensis E264 in which the gene cluster comprising amrR, amrA, amrB, and oprA is deleted (2). E421, E426, and DW503 were grown in Luria broth in Erlenmeyer flasks at 37°C for ∼16 h. From each culture, 2 ml was centrifuged at 4,000 × g for 4 min, and DNA was extracted from the pelleted bacteria using the DNeasy blood and tissue kit (Qiagen, Hilden, Germany). DNA sequencing libraries were generated using the Nextera DNA library prep and DNA sample preparation index kits (Illumina, San Diego, CA, USA). Library concentrations were measured using the Qubit fluorometer with its high-sensitivity DNA assay kit (Thermo Fisher Scientific, Waltham, MA, USA). Equal amounts of the three libraries were mixed to generate a multiplexed library; its concentration was measured using the Qubit device, and fragment sizes were measured using the Bioanalyzer electrophoresis system with its high-sensitivity DNA analysis kit (Agilent, Santa Clara, CA, USA). The multiplexed library was sequenced in paired-end mode using a NextSeq 500 instrument with its high-output (300 cycles) kit (Illumina). Reads were quality filtered using BBDuk (27 June 2016 release with the following parameters: ktrim = r, k = 21, mink = 11, hdist = 1) (http://jgi.doe.gov/data-and-tools/bb-tools/). Assembly of the sequencing reads was completed using the default settings for SPAdes v. 3.9.0 (3) (Table 1). FastANI alignment (4), run using default parameters, revealed that E421 and E426 share 99.8% average nucleotide identity (ANI) with E264. DW503 shares 99.9978% ANI with E264; its deletion at nucleotide position 129447 (GenBank accession number JAAAQG010000016) spans 148 kbp (CP000086.1, coordinates 2668007 to 2816380) and 99 genes (chromosomal loci BTH_I2364 to BTH_I2461), including the gene cluster comprising amrR, amrA, amrB, and oprA, as confirmed by analysis of the raw reads with readStepper (5). Sequence annotation was performed using our tater.pl pipeline (8), which calls Prokka v. 1.11 (7), tFind, and rFind software (8); counts for coding regions, miscellaneous RNA (miscRNA), rRNA sequences, and tRNA sequences (6) are listed in Table 1. All the strains had hits to the following Rfam (9) miscRNA profiles: 6S, Bacteria_large_SRP, Bacteria_small_SRP, Betaproteobacteria_toxic_sRNA, cspA, FMN, Intron_gpI, mini-ykkC, P9, pfl, RNaseP_bact_a, SAH_riboswitch, SECIS_3, sucA, and TPP. In addition, E421 and E426 had hits to C4, cobalamin, and isrK. Additionally, sequences were analyzed to identify CRISPR-Cas arrays (CRISPRCasFinder [10]), prophages, and other genomic islands (6, 8). The TIGER package (6), including tater.pl, genomic island prediction software, and prophage calling software, is available at github.com/sandialabs/TIGER.
TABLE 1

Assembly statistics and genome sequence features

Featurea Data for strain:
E421E426DW503
No. of reads133,200,415136,919,90661,127,893
Coverage (fold)2,8242,9971,440
No. of contigs90106102
Total size (bp)6,639,9356,587,8536,458,767
N50 (bp)159,164121,401156,901
GC content (%)67.7367.7367.64
No. of ambiguous bases109240
No. of CDS5,6085,5245,427
No. of rRNA operons322
No. of tRNA/tmRNA genes524750
No. of miscRNA genes272521
No. of CRISPR-Cas arrays134
No. of GIs758
Prophage coordinatesb 45oatA|selB 02:159872–20474541Hyp 04:49998–7581816mutS 12:52204–68122
17mutS 10:52214–6893144pcaK 33:9582–54013
40F 16:5517–4527711R 49:24269–32704, 84:10537–2253
Other GI coordinatesb 26dapA|sbp 04:49955–757608G 01:381563–389876 49CbiA|HBT 08:149573–198632
33T 19:84650–1175203Phage_GPD 14:8710–4950219Hyp 20:23740–42637
22S 21:23808–4551919A 16:4992–802722A 55:4198–4233, 68:1–10395, 82:2548–1, 89:1–1497, 55:1–4200
24P 32:39753–6419526dapA|sbp 23:49376–6823548R 50:2569–32098, 89:1–1497, 67:1–10868, 83:2372–1, 50:2517–2568
3Acyl_transf_3 42:53140–55361, 05:2308–2523, 17:137203–136908
GenBank accession no.JAAAQH000000000JAAAWT000000000JAAAQG000000000
NCBI SRA project no.PRJNA600278PRJNA600279PRJNA600280
NCBI run no.SRR11144399SRR11142218SRR11073226

CDS, coding DNA sequences; tmRNA, transfer-messenger RNA.

Names of prophages and other genomic islands (GIs) are in bold, followed by their corresponding contig numbers and nucleotide positions (note that DW503 GIs 11R, 22A, 48R, and 3Acyl_transf_3 are split between two or more contigs).

Assembly statistics and genome sequence features CDS, coding DNA sequences; tmRNA, transfer-messenger RNA. Names of prophages and other genomic islands (GIs) are in bold, followed by their corresponding contig numbers and nucleotide positions (note that DW503 GIs 11R, 22A, 48R, and 3Acyl_transf_3 are split between two or more contigs). Names of prophages and other genomic islands (GIs), contig numbers, and coordinates are listed in Table 1. DW503 is a deletion mutant of E264 (GenBank assembly number GCA_000012365) (2); we previously reported the coordinates for E264 prophages and GIs (6). Some of the GIs in E264 were found to be split between two or more DW503 contigs (separated by commas in Table 1).

Data availability.

The assemblies have been deposited at GenBank under the following accession numbers: JAAAQH000000000 (E421), JAAAWT000000000 (E426), and JAAAQG000000000 (DW503). The raw data have been deposited at the NCBI Sequence Read Archive (SRA) under the BioProject and run accession numbers PRJNA600278 and SRR11144399 (E421), PRJNA600279 and SRR11142218 (E426), and PRJNA600280 and SRR11073226 (DW503), respectively.
  10 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Prokka: rapid prokaryotic genome annotation.

Authors:  Torsten Seemann
Journal:  Bioinformatics       Date:  2014-03-18       Impact factor: 6.937

3.  Identification of the acid phosphatase (acpA) gene homologues in pathogenic and non-pathogenic Burkholderia spp. facilitates TnphoA mutagenesis.

Authors:  M Burtnick; A Bolton; P Brett; D Watanabe; D Woods
Journal:  Microbiology       Date:  2001-01       Impact factor: 2.777

4.  Non-Coding RNA Analysis Using the Rfam Database.

Authors:  Ioanna Kalvari; Eric P Nawrocki; Joanna Argasinska; Natalia Quinones-Olvera; Robert D Finn; Alex Bateman; Anton I Petrov
Journal:  Curr Protoc Bioinformatics       Date:  2018-06-05

5.  Inflammation patterns induced by different Burkholderia species in mice.

Authors:  W Joost Wiersinga; Alex F de Vos; Regina de Beer; Catharina W Wieland; Joris J T H Roelofs; Donald E Woods; Tom van der Poll
Journal:  Cell Microbiol       Date:  2007-07-20       Impact factor: 3.715

6.  Islander: a database of precisely mapped genomic islands in tRNA and tmRNA genes.

Authors:  Corey M Hudson; Britney Y Lau; Kelly P Williams
Journal:  Nucleic Acids Res       Date:  2014-11-05       Impact factor: 16.971

7.  Resistance determinants and mobile genetic elements of an NDM-1-encoding Klebsiella pneumoniae strain.

Authors:  Corey M Hudson; Zachary W Bent; Robert J Meagher; Kelly P Williams
Journal:  PLoS One       Date:  2014-06-06       Impact factor: 3.240

8.  High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.

Authors:  Chirag Jain; Luis M Rodriguez-R; Adam M Phillippy; Konstantinos T Konstantinidis; Srinivas Aluru
Journal:  Nat Commun       Date:  2018-11-30       Impact factor: 14.919

9.  CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins.

Authors:  David Couvin; Aude Bernheim; Claire Toffano-Nioche; Marie Touchon; Juraj Michalik; Bertrand Néron; Eduardo P C Rocha; Gilles Vergnaud; Daniel Gautheret; Christine Pourcel
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

10.  New candidates for regulated gene integrity revealed through precise mapping of integrative genetic elements.

Authors:  Catherine M Mageeney; Britney Y Lau; Julian M Wagner; Corey M Hudson; Joseph S Schoeniger; Raga Krishnakumar; Kelly P Williams
Journal:  Nucleic Acids Res       Date:  2020-05-07       Impact factor: 16.971

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.