Literature DB >> 33986081

Four Complete Genome Sequences for Bradyrhizobium sp. Strains Isolated from an Endemic Australian Acacia Legume Reveal Structural Variation.

Ming-Dao Chia1, Anna K Simonsen2,3,4.   

Abstract

Bradyrhizobium sp. strains were isolated from root nodules of the Australian legume, Acacia acuminata (Fabaceae). Here, we report the complete genome sequences of four strains using a hybrid long- and short-read assembly approach. The genome sizes range between ∼7.1 Mbp and ∼8.1 Mbp, each with one single circular chromosome. Whole-genome alignments show extensive structural rearrangement.
Copyright © 2021 Chia and Simonsen.

Entities:  

Year:  2021        PMID: 33986081      PMCID: PMC8142567          DOI: 10.1128/MRA.00229-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Diazotrophs in the genus Bradyrhizobium (Bradyrhizobiaceae) are common and widespread root symbionts of many legume species worldwide. Within Australia, Acacia (Fabaceae) is a highly diverse and functionally important legume genus, and its symbiosis with Bradyrhizobium provides critical ecosystem services to native Australian vegetation (1–4). Here, we report 4 complete Bradyrhizobium sp. genome sequences originally isolated from Acacia acuminata (a host endemic to southwest Australia, a global biodiversity hot spot). These genome sequences were completed to provide preliminary insight into the chromosome structural variation; the strains sequenced were selected to maximize genetic variability from a larger population genomic study comprising 375 closely related yet genetically diverse Bradyrhizobium strains sampled along a large climate gradient in the same region (5). All 4 Bradyrhizobium strains cluster within a single species, having >99.5% 16S rRNA sequence identity, and phylogenetically cluster with Bradyrhizobium diazoefficiens (5). All strains were grown on yeast extract mannitol plates from frozen stock cultures (70% glycerol, −80°C), previously isolated from root nodules of Acacia acuminata (3, 5). DNA was extracted from a single colony using a modified MoBio Ultraclean microbial isolation protocol, where cells were heat treated (60°C, 5 min) in lysis buffer prior to mechanical lysing. For genome assembly, we used a hybrid approach using short Illumina reads and long reads from either the PacBio or Nanopore sequencing platform. Short-read whole-genome paired-end 150-bp Illumina data were generated on two HiSeq 2000 lanes using Illumina Nextera XT library kits, following standard Illumina protocols (5), and trimmed using Trimmomatic v0.36 (6) (ILLUMINACLIP:adapters.fasta:2:30:10 LEADING:5 TRAILING:5 SLIDINGWINDOW:4:20 MINLEN:100). In 3 of 4 strains, long-read data were generated on a PacBio RS II system at the Macrogen sequencing facilities in South Korea; SMRTbell libraries were created using the protocol “Procedure and Checklist—10 kb Template Preparation and Sequencing (with Low-Input DNA)” (7), and each strain was sequenced on two single-molecule real-time (SMRT) cells. Long reads for the fourth strain were generated in-house at Research School of Biology labs, Australian National University. In brief, DNA was isolated using the high-molecular-weight method of Schalamun et al. (8) (excluding the chloroform cleanup). Unsheared DNA extract was then prepared using the Oxford Nanopore library kit (SQK-LSK108) and sequenced on one R9.4 FLO-MIN106 flow cell. The reads were base called using Guppy v3.0.3. All genomes were assembled using Unicycler v0.4.8 with default settings (9). With the long and short reads combined, the total sequence data generated for each strain exceeded 113× coverage across each genome. The genome size, GC content, and gene number varied across strains (Table 1), but all strains contained one single chromosome and were designated with a complete circular status according to Unicycler. Starting genes were found for strains 65_7, 38_8, and 36_1, and the circular contigs were rotated accordingly, with the starting gene at the beginning of the forward strand. Unicycler did not find starting genes on strain 41_2. All genomes were annotated using NCBI’s PGAP v5.0 with default settings (10). Whole-genome alignments, using progressiveMauve v2.4.0 with default settings (11), confirm large structural rearrangements (Fig. 1) among the strains.
TABLE 1

Summary of the sequencing and genome assembly details for each strain

StrainGenBank accession no.Data for Illumina reads:
Data for long reads:
Genome size (bp)Estimated coverage (×)GC content (%)Total no. of genes
No. of readsSRA accession no.PlatformLibrary DNA input (μg)No. of readsSRA accession no.N50 (bp)
65_7CP0670418,512,491SRR12822213, SRR12821956PacBio5332,793SRR12919153, SRR1291915291,3217,100,87818063.16,613
38_8CP0671006,105,011SRR12822231, SRR12821976PacBio5273,397SRR12919157, SRR1291915674,9347,668,73412063.87,194
41_2CP0671015,913,600SRR12822277, SRR12822021PacBio5218,088SRR12919155, SRR1291915452,1017,144,34612563.56,707
36_1CP0671026,115,736SRR12822150, SRR12821895Nanopore1242,922SRR1291916028,5178,085,09511363.47,539
FIG 1

Pairwise whole-genome Mauve alignment output confirms the presence of large structural variation among circular chromosomes. Comparisons between strains where starting genes could be identified (36_1, 38_8, and 65_7; genome lengths in base pairs) are shown. For each comparison (a, b, and c), matching colored blocks and connecting lines indicate homologous genome sections between each pair. Inversions are indicated in the bottom genome of each pair (inversions are represented as matching color blocks below the black line). For example, one inverted genomic segment is visible between the matching purple blocks in panel a and the matching brown blocks in panel b. To facilitate visualization of the larger chromosomal rearrangements, the Mauve LCB weights (which adjust the single nucleotide polymorphism [SNP] similarity threshold) are adjusted to 13,166 (a), 12,932 (b), and 6,985 (c) for each pairwise comparison. The fourth strain (41_2) is not included in the comparison since a starting gene could not be identified, and it would visually indicate some false genomic rearrangements in Mauve’s linear chromosome alignment tool if included.

Pairwise whole-genome Mauve alignment output confirms the presence of large structural variation among circular chromosomes. Comparisons between strains where starting genes could be identified (36_1, 38_8, and 65_7; genome lengths in base pairs) are shown. For each comparison (a, b, and c), matching colored blocks and connecting lines indicate homologous genome sections between each pair. Inversions are indicated in the bottom genome of each pair (inversions are represented as matching color blocks below the black line). For example, one inverted genomic segment is visible between the matching purple blocks in panel a and the matching brown blocks in panel b. To facilitate visualization of the larger chromosomal rearrangements, the Mauve LCB weights (which adjust the single nucleotide polymorphism [SNP] similarity threshold) are adjusted to 13,166 (a), 12,932 (b), and 6,985 (c) for each pairwise comparison. The fourth strain (41_2) is not included in the comparison since a starting gene could not be identified, and it would visually indicate some false genomic rearrangements in Mauve’s linear chromosome alignment tool if included. Summary of the sequencing and genome assembly details for each strain

Data availability.

The genome data are available in GenBank under BioProject accession number PRJNA669073 (SRA accession numbers are provided in Table 1). The Unicycler log files and Mauve alignment files are available on Figshare (https://doi.org/10.6084/m9.figshare.14134169).
  9 in total

1.  Small-subunit rRNA genotyping of rhizobia nodulating Australian Acacia spp.

Authors:  B Lafay; J J Burdon
Journal:  Appl Environ Microbiol       Date:  2001-01       Impact factor: 4.792

2.  Genetic diversity of rhizobia associated with Acacia longifolia in two stages of invasion of coastal sand dunes.

Authors:  Susana Rodríguez-Echeverría; João A Crisóstomo; Helena Freitas
Journal:  Appl Environ Microbiol       Date:  2007-06-01       Impact factor: 4.792

3.  Bradyrhizobia nodulating the Acacia mangium x A. auriculiformis interspecific hybrid are specific and differ from those associated with both parental species.

Authors:  Christine Le Roux; Diana Tentchev; Yves Prin; Doreen Goh; Yani Japarudin; Marie-Mathilde Perrineau; Robin Duponnois; Odile Domergue; Philippe de Lajudie; Antoine Galiana
Journal:  Appl Environ Microbiol       Date:  2009-10-23       Impact factor: 4.792

4.  Novel model-based clustering reveals ecologically differentiated bacterial genomes across a large climate gradient.

Authors:  Anna K Simonsen; Luke G Barrett; Peter H Thrall; Suzanne M Prober
Journal:  Ecol Lett       Date:  2019-10-14       Impact factor: 9.492

5.  progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement.

Authors:  Aaron E Darling; Bob Mau; Nicole T Perna
Journal:  PLoS One       Date:  2010-06-25       Impact factor: 3.240

6.  Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.

Authors:  Ryan R Wick; Louise M Judd; Claire L Gorrie; Kathryn E Holt
Journal:  PLoS Comput Biol       Date:  2017-06-08       Impact factor: 4.475

7.  Harnessing the MinION: An example of how to establish long-read sequencing in a laboratory using challenging plant tissue from Eucalyptus pauciflora.

Authors:  Miriam Schalamun; Ramawatar Nagar; David Kainer; Eleanor Beavan; David Eccles; John P Rathjen; Robert Lanfear; Benjamin Schwessinger
Journal:  Mol Ecol Resour       Date:  2018-10-05       Impact factor: 7.090

8.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

9.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.