Literature DB >> 33986081

Four Complete Genome Sequences for Bradyrhizobium sp. Strains Isolated from an Endemic Australian Acacia Legume Reveal Structural Variation.

Abstract

Bradyrhizobium sp. strains were isolated from root nodules of the Australian legume, Acacia acuminata (Fabaceae). Here, we report the complete genome sequences of four strains using a hybrid long- and short-read assembly approach. The genome sizes range between ∼7.1 Mbp and ∼8.1 Mbp, each with one single circular chromosome. Whole-genome alignments show extensive structural rearrangement.

Entities: Chemical Disease Species

Year: 2021 PMID： 33986081 PMCID： PMC8142567 DOI： 10.1128/MRA.00229-21

Source DB: PubMed Journal: Microbiol Resour Announc ISSN： 2576-098X

ANNOUNCEMENT

Diazotrophs in the genus Bradyrhizobium (Bradyrhizobiaceae) are common and widespread root symbionts of many legume species worldwide. Within Australia, Acacia (Fabaceae) is a highly diverse and functionally important legume genus, and its symbiosis with Bradyrhizobium provides critical ecosystem services to native Australian vegetation (1–4). Here, we report 4 complete Bradyrhizobium sp. genome sequences originally isolated from Acacia acuminata (a host endemic to southwest Australia, a global biodiversity hot spot). These genome sequences were completed to provide preliminary insight into the chromosome structural variation; the strains sequenced were selected to maximize genetic variability from a larger population genomic study comprising 375 closely related yet genetically diverse Bradyrhizobium strains sampled along a large climate gradient in the same region (5). All 4 Bradyrhizobium strains cluster within a single species, having >99.5% 16S rRNA sequence identity, and phylogenetically cluster with Bradyrhizobium diazoefficiens (5). All strains were grown on yeast extract mannitol plates from frozen stock cultures (70% glycerol, −80°C), previously isolated from root nodules of Acacia acuminata (3, 5). DNA was extracted from a single colony using a modified MoBio Ultraclean microbial isolation protocol, where cells were heat treated (60°C, 5 min) in lysis buffer prior to mechanical lysing. For genome assembly, we used a hybrid approach using short Illumina reads and long reads from either the PacBio or Nanopore sequencing platform. Short-read whole-genome paired-end 150-bp Illumina data were generated on two HiSeq 2000 lanes using Illumina Nextera XT library kits, following standard Illumina protocols (5), and trimmed using Trimmomatic v0.36 (6) (ILLUMINACLIP:adapters.fasta:2:30:10 LEADING:5 TRAILING:5 SLIDINGWINDOW:4:20 MINLEN:100). In 3 of 4 strains, long-read data were generated on a PacBio RS II system at the Macrogen sequencing facilities in South Korea; SMRTbell libraries were created using the protocol “Procedure and Checklist—10 kb Template Preparation and Sequencing (with Low-Input DNA)” (7), and each strain was sequenced on two single-molecule real-time (SMRT) cells. Long reads for the fourth strain were generated in-house at Research School of Biology labs, Australian National University. In brief, DNA was isolated using the high-molecular-weight method of Schalamun et al. (8) (excluding the chloroform cleanup). Unsheared DNA extract was then prepared using the Oxford Nanopore library kit (SQK-LSK108) and sequenced on one R9.4 FLO-MIN106 flow cell. The reads were base called using Guppy v3.0.3. All genomes were assembled using Unicycler v0.4.8 with default settings (9). With the long and short reads combined, the total sequence data generated for each strain exceeded 113× coverage across each genome. The genome size, GC content, and gene number varied across strains (Table 1), but all strains contained one single chromosome and were designated with a complete circular status according to Unicycler. Starting genes were found for strains 65_7, 38_8, and 36_1, and the circular contigs were rotated accordingly, with the starting gene at the beginning of the forward strand. Unicycler did not find starting genes on strain 41_2. All genomes were annotated using NCBI’s PGAP v5.0 with default settings (10). Whole-genome alignments, using progressiveMauve v2.4.0 with default settings (11), confirm large structural rearrangements (Fig. 1) among the strains.

TABLE 1

Summary of the sequencing and genome assembly details for each strain

Strain	GenBank accession no.	Data for Illumina reads:		Data for long reads:					Genome size (bp)	Estimated coverage (×)	GC content (%)	Total no. of genes
Strain	GenBank accession no.	No. of reads	SRA accession no.	Platform	Library DNA input (μg)	No. of reads	SRA accession no.	N₅₀ (bp)	Genome size (bp)	Estimated coverage (×)	GC content (%)	Total no. of genes
65_7	CP067041	8,512,491	SRR12822213, SRR12821956	PacBio	5	332,793	SRR12919153, SRR12919152	91,321	7,100,878	180	63.1	6,613
38_8	CP067100	6,105,011	SRR12822231, SRR12821976	PacBio	5	273,397	SRR12919157, SRR12919156	74,934	7,668,734	120	63.8	7,194
41_2	CP067101	5,913,600	SRR12822277, SRR12822021	PacBio	5	218,088	SRR12919155, SRR12919154	52,101	7,144,346	125	63.5	6,707
36_1	CP067102	6,115,736	SRR12822150, SRR12821895	Nanopore	1	242,922	SRR12919160	28,517	8,085,095	113	63.4	7,539

FIG 1

Pairwise whole-genome Mauve alignment output confirms the presence of large structural variation among circular chromosomes. Comparisons between strains where starting genes could be identified (36_1, 38_8, and 65_7; genome lengths in base pairs) are shown. For each comparison (a, b, and c), matching colored blocks and connecting lines indicate homologous genome sections between each pair. Inversions are indicated in the bottom genome of each pair (inversions are represented as matching color blocks below the black line). For example, one inverted genomic segment is visible between the matching purple blocks in panel a and the matching brown blocks in panel b. To facilitate visualization of the larger chromosomal rearrangements, the Mauve LCB weights (which adjust the single nucleotide polymorphism [SNP] similarity threshold) are adjusted to 13,166 (a), 12,932 (b), and 6,985 (c) for each pairwise comparison. The fourth strain (41_2) is not included in the comparison since a starting gene could not be identified, and it would visually indicate some false genomic rearrangements in Mauve’s linear chromosome alignment tool if included.

Data availability.

The genome data are available in GenBank under BioProject accession number PRJNA669073 (SRA accession numbers are provided in Table 1). The Unicycler log files and Mauve alignment files are available on Figshare (https://doi.org/10.6084/m9.figshare.14134169).

9 in total

1. Small-subunit rRNA genotyping of rhizobia nodulating Australian Acacia spp.

Authors: B Lafay; J J Burdon
Journal: Appl Environ Microbiol Date: 2001-01 Impact factor: 4.792

2. Genetic diversity of rhizobia associated with Acacia longifolia in two stages of invasion of coastal sand dunes.

Authors: Susana Rodríguez-Echeverría; João A Crisóstomo; Helena Freitas
Journal: Appl Environ Microbiol Date: 2007-06-01 Impact factor: 4.792

3. Bradyrhizobia nodulating the Acacia mangium x A. auriculiformis interspecific hybrid are specific and differ from those associated with both parental species.

Authors: Christine Le Roux; Diana Tentchev; Yves Prin; Doreen Goh; Yani Japarudin; Marie-Mathilde Perrineau; Robin Duponnois; Odile Domergue; Philippe de Lajudie; Antoine Galiana
Journal: Appl Environ Microbiol Date: 2009-10-23 Impact factor: 4.792

4. Novel model-based clustering reveals ecologically differentiated bacterial genomes across a large climate gradient.

Authors: Anna K Simonsen; Luke G Barrett; Peter H Thrall; Suzanne M Prober
Journal: Ecol Lett Date: 2019-10-14 Impact factor: 9.492

5. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement.

Authors: Aaron E Darling; Bob Mau; Nicole T Perna
Journal: PLoS One Date: 2010-06-25 Impact factor: 3.240

6. Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.

Authors: Ryan R Wick; Louise M Judd; Claire L Gorrie; Kathryn E Holt
Journal: PLoS Comput Biol Date: 2017-06-08 Impact factor: 4.475

7. Harnessing the MinION: An example of how to establish long-read sequencing in a laboratory using challenging plant tissue from Eucalyptus pauciflora.

Authors: Miriam Schalamun; Ramawatar Nagar; David Kainer; Eleanor Beavan; David Eccles; John P Rathjen; Robert Lanfear; Benjamin Schwessinger
Journal: Mol Ecol Resour Date: 2018-10-05 Impact factor: 7.090

8. Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors: Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal: Bioinformatics Date: 2014-04-01 Impact factor: 6.937

9. NCBI prokaryotic genome annotation pipeline.

Authors: Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal: Nucleic Acids Res Date: 2016-06-24 Impact factor: 16.971

9 in total