| Literature DB >> 31588247 |
Lorine Bethencourt1, Florian Vautrin1, Najwa Taib2, Audrey Dubost1, Lucia Castro-Garcia2, Olivier Imbaud2, Danis Abrouk1, Pascale Fournier1, Jérôme Briolay3, Agnès Nguyen4, Philippe Normand1, Maria P Fernandez1, Céline Brochier-Armanet2, Aude Herrera-Belaroussi1.
Abstract
Actinobacteria from genus Frankia are able to form symbiotic associations with actinorhizal plants including alders. Among them, Sp+ strains are characterized by their ability to differentiate numerous sporangia inside host plant cells (unlike "Sp-" strains unable of in-planta sporulation). Here, we report the first genome sequences of three unisolated Sp+ strains: AgTrS, AiOr and AvVan obtained from Alnus glutinosa, A. incana and A. alnobetula (previously known as viridis), respectively (with genome completeness estimated at more than 98%). They represent new Frankia species based on Average Nucleotide Identity (ANI) calculations, and the smallest Alnus-infective Frankia genomes so far sequenced (~5 Mbp), with 5,178, 6,192 and 5,751 candidate protein-encoding genes for AgTrS, AiOr and AvVan, respectively. © The author(s).Entities:
Keywords: AgTrS; AiOr; AvVan; Frankia
Year: 2019 PMID: 31588247 PMCID: PMC6775861 DOI: 10.7150/jgen.35875
Source DB: PubMed Journal: J Genomics
Assembly data of the three Sp+ Frankia genomes sequenced, before and after read sorting based on their G+C content (to remove the reads with G+C content ≤ 54 %).
| Data before read sorting | Data after read sorting | |||||
|---|---|---|---|---|---|---|
| Sp+ strain | Read number | Total contig number | Mean contig size (pb) | Read number | Total contig number | Mean contig size (pb) |
| AgTrS | 3,480,805 | 3,559 (366) | 2,120 | 2,401,363 (69%) | 612 (281) | 7,978 |
| AiOr | 4,413 305 | 1,860 (376) | 4,542 | 3,977,168 (90%) | 669 (302) | 8,228 |
| AvVan | 1,805,928 | 7,798 (457) | 2,876 | 549,771 (30%) | 1,228 (322) | 3,962 |
Genome features of the three Sp+ Frankia genomes sequenced (in grey), compared to available Frankia genomes (only genomes associated with described or being described Frankia species were included in this table and they are all Sp- strains).
| Species | Cluster | Strain | Genome accession # | No. of contigs | Genome length (nt) | Genomic G+C content (mol%) | N50 genomic values | Genome coverage | Genome completeness (%) (CheckM) | Original host genus | Associated hosts | Reference |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| - | 1 | AgTrS | PRJEB30934 | 281 (≥ 5 kb) | 4,882,652 | 71.6 | 15.3 | 295.1 | 98.1 | This study, | ||
| - | 1 | AiOr | PRJEB30935 | 302 (≥ 5 kb) | 5,504,816 | 71.6 | 17.4 | 433.5 | 99.4 | This study, | ||
| - | 1 | AvVan | SSXH00000000 | 322 (≥ 5 kb) | 4,877,887 | 71.4 | 6.6 | 67.6 | 98.1 | This study, | ||
| 1 | DSM 45986T (ACN14aT) | NC_008278.1 | 1 | 7,497,934 | 72.8 | - | - | - | ||||
| 1 | DSM 44263T (CpI1-ST) | JYFN00000000 | 153 | 7,624,758 | 72.4 | - | - | - | ||||
| 1 | DSM 45898T (ARgP5T) | OESX01000001 | 568 | 7,730,285 | 72.4 | - | - | - | ||||
| 1 | DSM 45818T (CcI3T) | CP000249.1 | 1 | 5,433,628 | 70.1 | - | - | - | Casuarinaceae | |||
| 2 | DSM 100624T (BMG5.1T) | JWIO00000000 | 116 | 5,795,263 | 71.0 | - | - | - | ||||
| 2 | Dg2 | FLUV00000000 | 1066 | 5,929,312 | 67.9 | - | - | - | Rosaceae, | |||
| 2 | Dg1 | CP002801 | 1 | 5,323,186 | 70.0 | - | - | - | ||||
| 3 | DSM 46785T (BCU110501T) | ARDT00000000 | 207 | 7,891,711 | 72.3 | - | - | - | Rhamnaceae, Elaeagnaceae, | |||
| 3 | DSM 46783T (BMG5.12T) | ARFH00000000 | 139 | 7,589,313 | 71.7 | - | - | - | Rhamnaceae, Elaeagnaceae, | |||
| 3 | DSM 45899T (G2T) | FAOZ00000000 | 83 | 9,537,992 | 70.9 | - | - | - | Rhamnaceae, Elaeagnaceae, | |||
| 4 | DSM 45817T (EuI1cT) | CP002299.1 | 1 | 8,815,781 | 72.3 | - | - | - | Elaeagnaceae, | |||
| 4 | DSM 105290T (Cn3T) | AGJN00000000 | 2 | 9,978,592 | 71.8 | - | - | - | - | |||
| 4 | DSM 100626T (M16386T) | MOMC00000000.1 | 174 | 9,435,764 | 72.0 | - | - | - | - |
Figure 1COG functional classification of proteins encoded on the three sequenced Sp+ Proportions (%) of proteins in each of the COG super-functional categories “Cellular processes and signalling”, “Information processing and storage”, “Metabolism” and “Poorly characterized”, predicted for AgTrS (white bars), AiOr (black bars) and AvVan (grey bars) genomes.
Figure 2Position of the three Sp+ sequenced In addition to Sp+ genomes, a total of 28 sequenced Frankia strains were used. For all the 31 genomes, 51 ribosomal protein sequences (total size = 18,582 nt) were included in a supermatrix and the phylogenetic tree was constructed based on the model GTR+I+R4.