| Literature DB >> 28878862 |
Amir Ktari1, Imen Nouioui1, Teal Furnholm2, Erik Swanson2, Faten Ghodhbane-Gtari1, Louis S Tisa2, Maher Gtari1.
Abstract
Frankia sp. NRRL B-16219 was directly isolated from a soil sample obtained from the rhizosphere of Ceanothus jepsonii growing in the USA. Its host plant range includes members of Elaeagnaceae species. Phylogenetically, strain NRRL B-16219 is closely related to "Frankia discariae" with a 16S rRNA gene similarity of 99.78%. Because of the lack of genetic tools for Frankia, our understanding of the bacterial signals involved during the plant infection process and the development of actinorhizal root nodules is very limited. Since the first three Frankia genomes were sequenced, additional genome sequences covering more diverse strains have helped provide insight into the depth of the pangenome and attempts to identify bacterial signaling molecules like the rhizobial canonical nod genes. The genome sequence of Frankia sp. strain NRRL B-16219 was generated and assembled into 289 contigs containing 8,032,739 bp with 71.7% GC content. Annotation of the genome identified 6211 protein-coding genes, 561 pseudogenes, 1758 hypothetical proteins and 53 RNA genes including 4 rRNA genes. The NRRL B-16219 draft genome contained genes homologous to the rhizobial common nodulation genes clustered in two areas. The first cluster contains nodACIJH genes whereas the second has nodAB and nodH genes in the upstream region. Phylogenetic analysis shows that Frankia nod genes are more deeply rooted than their sister groups from rhizobia. PCR-sequencing suggested the widespread occurrence of highly homologous nodA and nodB genes in microsymbionts of field collected Ceanothus americanus.Entities:
Keywords: Actinorhizal symbiosis; Canonical nod genes; Ceanothus; Frankia; Genome; Plant-microbe interactions
Year: 2017 PMID: 28878862 PMCID: PMC5584510 DOI: 10.1186/s40793-017-0261-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Frankia sp. strain NRRL B-16219 according to MIGS [45]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | IDR | ||
| Strain NRRL B-16219IDA | |||
| Gram stain | Positive | IDA | |
| Cell shape | Filament-shaped | IDA | |
| Motility | Non-motile | NAS | |
| Sporulation | Sporulating | NAS | |
| Temperature range | 25–35 °C | TAS [ | |
| Optimum temperature | 28 °C | TAS [ | |
| pH range; Optimum | pH 6.3 – pH 6.8 | NAS | |
| Carbon source | short fatty acids, TCA-cycle intermediates and carbohydrates | IDA | |
| MIGS-6 | Habitat | Soil and Host-associated | IDA |
| MIGS-6.3 | Salinity | Not reported | |
| MIGS-22 | Oxygen requirement | Aerobic | NAS |
| MIGS-15 | Biotic relationship | Free-living and Host plant-associated | NAS |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| MIGS-4 | Geographic location | Soil beneath | IDA |
| MIGS-5 | Sample collection | 1982 | IDA |
| MIGS-4.1 | Latitude | Not reported | - |
| MIGS-4.2 | Longitude | Not reported | - |
| MIGS-4.4 | Altitude | Not reported | - |
a Evidence codes – IDA INFERRED FROM DIRECT ASSAY, TAS traceable author statement (i.e., a direct report exists in the literature) NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence)
Fig. 1Maximum likelihood (ML) phylogenetic tree based on the 16S rRNA gene sequences (1400 nt), showing the relationships between Frankia NRRL B-16219 and Frankia species. The ML tree was inferred using the GTR + GAMMA model and rooted by midpoint-rooting; the branches are scaled in terms of the expected number of substitutions per site. The numbers above the branches are support values when larger than 60% from ML (left) and MP (right) bootstrapping
Fig. 2Scanning electron micrograph of strain NRRL B-16219 after growth for 4 weeks in liquid DPM medium at 28 °C showing hyphae (h), vesicles (v) and sporangia (s)
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Draft genome |
| MIGS-28 | Libraries used | Illumina Standard library |
| MIGS 29 | Sequencing platforms | Illumina HiSeq2500 platform |
| MIGS 31.2 | Fold coverage | 120.5× |
| MIGS 30 | Assemblers | Spades version 3.5, ALLPaths-LG version r52488 |
| MIGS 32 | Gene calling method | GeneMarkS+ v3.3 |
| Locus Tag | BBK14_RS02460 | |
| Genbank ID | MAXA00000000.1 | |
| Genbank Date of Release | October 30, 2016 | |
| GOLD ID | Gp0153653 | |
| BIOPROJECT | PRJNA224116 | |
| MIGS 13 | Source Material Identifier | NRRL B-16219 |
| Project relevance | Agricultural |
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 8,032,739 | 100.0 |
| DNA coding (bp) | 6,603,166 | 82.20 |
| DNA G + C (bp) | 5,760,840 | 71.72 |
| DNA Contigs289100 | .0 | |
| Total genes | 6859 | 100.0 |
| Protein coding genes | 6, 211 | 91.01 |
| RNA genes | 53 | 0.77 |
| Pseudo genesb | 561 | 8.18 |
| Genes in internal clusters | - | - |
| Genes with function prediction | 5046 | 73.60 |
| Genes assigned to COGs | 3609 | 52.64 |
| Genes with Pfam domains | 4735 | 69.06 |
| Genes with signal peptides | 176 | 2.57 |
| Genes with transmembrane helices | 296 | 4.32 |
| CRISPR repeats | 2 | - |
aThe total is based on either the size of the genome in base pairs or the total genes in the annotated genome
bPseudo genes may also be counted as protein coding or RNA genes, so is not additive under total gene count
Number of genes associated with the general COG functional categories
| Code | Value | % agea | Description |
|---|---|---|---|
| J | 178 | 4.27 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.02 | RNA processing and modification |
| K | 408 | 9.79 | Transcription |
| L | 109 | 2.62 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 32 | 0.77 | Cell cycle control, cell division, chromosome partitioning |
| V | 135 | 3.24 | Defense mechanisms |
| T | 249 | 5.98 | Signal transduction mechanisms |
| M | 173 | 4.15 | Cell wall/membrane biogenesis |
| N | 21 | 0.5 | Cell motility |
| U | 30 | 0.72 | Intracellular trafficking, secretion, and vesicular transport |
| O | 140 | 3.36 | Posttranslational modification, protein turnover, chaperones |
| C | 250 | 6 | Energy production and conversion |
| G | 207 | 4.97 | Carbohydrate transport and metabolism |
| E | 297 | 7.13 | Amino acid transport and metabolism |
| F | 94 | 2.26 | Nucleotide transport and metabolism |
| H | 262 | 6.29 | Coenzyme transport and metabolism |
| I | 351 | 8.42 | Lipid transport and metabolism |
| P | 210 | 5.04 | Inorganic ion transport and metabolism |
| Q | 256 | 6.14 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 508 | 12.19 | General function prediction only |
| S | 178 | 4.27 | Function unknown |
| - | 3247 | 47.36 | Not in COGs |
aThe total is based on the total number of protein-coding genes in the genome
Fig. 3Organization of nod genes in Frankia NRRL B-16219 and Candidatus Frankia datiscae Dg1 genomes. Sizes, localization and orientation of the genes are displayed proportionally. These genes are not detectable in any other Frankia genome except Candidatus Frankia Dg2
Fig. 4Maximum likelihood phylogeny based on amino acids of nodA (a), nodB (b), nodC (c) and nodH (d). GC-content is provided for nod genes and for genomes (in parenthesis). Bootstrap and probability values larger than 50% are only shown
Fig. 5Neighbor-Joining phylogenetic tree based on glnA gene sequences. Bootstrap and probability values larger than 50% are only shown. Marked in bold are Frankia strains or microsymbionts with nod genes as present in their genomes or detected by PCR-sequencing analysis