| Literature DB >> 34150961 |
Maria K Syrokou1, Spiros Paramithiotis1, Panagiotis N Skandamis1, Eleftherios H Drosinos1, Loulouda Bosnea2, Marios Mataragas2.
Abstract
Lactiplantibacillus plantarum is a species found in a wide range of foods and other commodities. It can be used as starter or adjunct culture in fermented foods. Herein the annotated high-quality draft genome (scaffolds) of six L. plantarum subsp. argentoratensis strains (LQC 2320, LQC 2422, LQC 2441, LQC 2485, LQC 2516 and LQC 2520) isolated from various Greek wheat sourdoughs is presented. Raw sequence reads were quality checked, assembled into larger contiguous sequences and scaffolds were annotated. The total size of the genomes ranged from 3.13 Mb to 3.49 Mb and the GC content from 45.02% to 45.13%. The total number of coding and non-coding genes were between 3268 and 3723 (3091 to 3492 protein-coding genes, 62 to 107 repeat-region, 54 to 59 tRNAs and 2 to 5 rRNAs, 20 to 30 crispr-repeats, 17 to 26 crispr-spacers and 2 to 4 crispr-arrays). The Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession numbers JAEQMR000000000, JAEQMQ000000000, JAEQMP000000000, JAEQMO000000000, JAEQMN000000000 and JAEQMM000000000. The version described in this paper is version JAEQMR010000000, JAEQMQ010000000, JAEQMP010000000, JAEQMO010000000, JAEQMN010000000 and JAEQMM010000000. Raw sequence reads have been submitted in the Sequence Read Archive (SRA) under the BioProject accession number PRJNA689714 (BioSample accession numbers SAMN17215143, SAMN17215144, SAMN17215145, SAMN17215146, SAMN17215147 and SAMN17215148 and SRA accession numbers SRR13357463, SRR13357464, SRR13357465, SRR13357466, SRR13357467, SRR13357468).Entities:
Keywords: Bioinformatics; Fermentation; Genomics; Lactic acid bacteria; Lactobacillus; Sourdough; Whole-Genome Sequencing
Year: 2021 PMID: 34150961 PMCID: PMC8193102 DOI: 10.1016/j.dib.2021.107172
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1QUAST report comparing different assemblers for the L. plantarum subsp. argentoratensis LQC 2441 strain (assigned as G1_strain).
Characteristics and quality metrics of the six L. plantarum subsp. argentoratensis genomes after genome assembly into scaffoldsa.
| CheckM | BUSCO | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Strain | No of scaffolds | Genome length (bp) | N50 (bp) | GC content (%) | SkewI metric | Completeness (%) | Contamination (%) | Complete and single copy (%) | Complete and duplicate copy (%) | Fragmented (%) | Missing (%) |
| LQC 2441 | 19 | 3,147,789 | 2,993,011 | 45.04 | 0.982 | 100 | 0.2 | 99.8 | 0.0 | 0.0 | 0.2 |
| LQC 2485 | 46 | 3,494,755 | 3,148,808 | 45.02 | 0.934 | 100 | 4.8 | 99.3 | 0.5 | 0.0 | 0.2 |
| LQC 2422 | 20 | 3,128,861 | 2,990,528 | 45.09 | 0.983 | 100 | 0.2 | 99.8 | 0.0 | 0.0 | 0.2 |
| LQC 2320 | 8 | 3,181,752 | 3,129,011 | 45.13 | 0.991 | 100 | 0.0 | 100 | 0.0 | 0.0 | 0.0 |
| LQC 2516 | 19 | 3,148,153 | 3,000,101 | 45.04 | 0.993 | 100 | 0.2 | 99.8 | 0.0 | 0.0 | 0.2 |
| LQC 2520 | 10 | 3,175,498 | 3,140,405 | 45.12 | 0.987 | 100 | 0.0 | 100 | 0.0 | 0.0 | 0.0 |
The percentage of Ns for each genome was 0.07% (LQC 2441), 0.12% (LQC 2485), 0.07% (LQC 2422), 0.05% (LQC 2320), 0.07% (LQC 2516) and 0.04% (LQC 2520).
Quality and functional properties of the six L. plantarum subsp. argentoratensis genomes after genome annotationa.
| Non-coding | Consistency | BUSCO | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Strain | Protein-coding genes (CDS) | repeat-region | tRNA | rRNA | crispr-repeat | crispr-spacer | crispr-array | Coarse (%) | Fine (%) | Complete and single copy (%) | Complete and duplicate copy (%) | Fragmented (%) | Missing (%) |
| LQC 2441 | 3098 | 78 | 54 | 2 | 23 | 20 | 3 | 98.3 | 96.5 | 98.4 | 1.4 | 0.0 | 0.2 |
| LQC 2485 | 3492 | 107 | 59 | 5 | 30 | 26 | 4 | 98.3 | 94.6 | 93.7 | 6.1 | 0.0 | 0.2 |
| LQC 2422 | 3091 | 71 | 54 | 2 | 25 | 22 | 3 | 98.3 | 96.5 | 98.4 | 1.4 | 0.0 | 0.2 |
| LQC 2320 | 3131 | 62 | 54 | 2 | 21 | 19 | 2 | 98.2 | 96.8 | 98.6 | 1.4 | 0.0 | 0.0 |
| LQC 2516 | 3109 | 79 | 54 | 2 | 20 | 17 | 3 | 98.4 | 96.8 | 98.4 | 1.4 | 0.0 | 0.2 |
| LQC 2520 | 3132 | 63 | 54 | 2 | 21 | 19 | 2 | 98.2 | 96.8 | 98.6 | 1.4 | 0.0 | 0.0 |
Total number of genes for each genome was 3278 (LQC 2441), 3723 (LQC 2485), 3268 (LQC 2422), 3291 (LQC 2320), 3284 (LQC 2516) and 3293 (LQC 2520) of which the number of specialty genes was 38 (11 transporters and 27 antibiotic resistance for LQC 2441), 41 (11 transporters and 30 antibiotic resistance for LQC 2485), 38 (11 transporters and 27 antibiotic resistance for LQC 2422), 39 (12 transporters, 1 drug target and 26 antibiotic resistance for LQC 2320), 38 (11 transporters and 27 antibiotic resistance for LQC 2516) and 39 (12 transporters, 1 drug target and 26 antibiotic resistance for LQC 2520).
Fig. 2Circular view of the genome of L. plantarum subsp. argentoratensis LQC 2320 strain.
Fig. 3GC skew analysis of the genome of L. plantarum subsp. argentoratensis LQC 2516 strain (above) and skewI threshold value for the genus of Lactobacillus (below).
Fig. 4Subsystem analysis of the genome of L. plantarum subsp. argentoratensis LQC 2520 strain.
| Subject | Food Science: Food Microbiology |
| Specific subject area | Genomics |
| Type of data | Table |
| How data were acquired | Whole-Genome Sequencing: Illumina Novaseq 6000 (Illumina, CA) |
| Data format | Raw sequence reads, genome assembly and annotation |
| Parameters for data collection | Genomic DNA from pure microbial cultures |
| Description of data collection | Purification of genomic DNA, Whole-Genome Sequencing, genome assembly and genome annotation |
| Data source location | Institution: Laboratory of Food Quality Control and Hygiene of Agricultural University of Athens |
| Data accessibility | |
| Repository name: Sequence Read Archive (SRA) |