Literature DB >> 35515993

Genome sequence data of Lactiplantibacillus plantarum IMI 507028.

Ivana Nikodinoska1, Jenny Makkonen2, Daniel Blande2, Colm Moran3.   

Abstract

Here, we report the genome sequencing data for the fermented milk isolate, Lactiplantibacillus plantarum (L. plantarum) IMI 507028. The Bioproject, SRA, and GenBank data were deposited at NCBI under accession numbers PRJNA801616, SRR18323693, and JAKMAX000000000, respectively. The size of the genome was 3,231,321 bp, with a GC% of 44.52. Before sequence trimming, the genome contained 40 contigs, in which 35 contigs were annotated, revealing 2937 coding sequences out of 3052 total genes. The strain was identified as L. plantarum with an average nucleotide identity (ANI) value of 99.9922% between IMI 507028 and L. plantarum JDM1. Genes related to antimicrobial resistance or pathogenic factors were not found during screening.
© 2022 The Author(s). Published by Elsevier Inc.

Entities:  

Keywords:  Antibiotic resistance; Lactiplantibacillus plantarum; Microbial identity; Microbial safety; Virulent factors; Whole genome sequencing

Year:  2022        PMID: 35515993      PMCID: PMC9062256          DOI: 10.1016/j.dib.2022.108190

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

Lactiplantibacillus plantarum is widely isolated from fermented foods and plant sources, and is versatile for its use in products aimed at food and feed preservation. Lactiplantibacillus plantarum IMI 507028 is a strain that can find applications in the agri-food sector; therefore, its characterization is important in terms of identity and safety. The genome data presented herein could enrich safety-related knowledge of the lactic acid bacteria, that could be used for comparative purposes.

Data Description

Lactiplantibacillus plantarum (L. plantarum) IMI 507028 genome data, taxonomic identification, genome searches for antimicrobial resistance, and virulence (pathogenic) factors are described. The analyzed genome sequencing data produced an annotated assembly of 35 contigs with a sequence length of 3,230,670 bp, GC% of 44.52, N50 contig length of 366,540 bp, and genome coverage of 529x. The total number of genes after annotation was 3052 genes, of which 2937 were coding sequences, 2 were ribosomal RNAs, and 59 were transfer RNA. Genome comparison performed with alignment-free genome distance estimation with Mash using MinHash [1] showed the best hit (low distance and high matching) for L. plantarum JDM1 (Table 1).
Table 1

Taxonomic identification of IMI 507028 used as query-ID with Mash, a fast genome distance estimation using MinHash. The search was made for the closest matching NCBI RefSeq genomes. Mash distance approximates the mutation rate. The sketches are set to include 400 hashes (derived from k-mers of size 16) selected randomly from each genome for pairwise comparison. For example, IMI 507028 and the top hit L. plantarum JDM1 share 398 out of the 400 hashes.

Reference IDMash distancep-valueMatching hashesAssembly accession
L. plantarum JDM10.000156800 398/400GCF_000023085.1
L. plantarum DmCS_0010.01011520296/400GCF_000743895.1
L. plantarum subsp. plantarum NC80.01011520296/400GCF_000247735.1
L. plantarum 21650.01072930291/400GCF_000466845.1
L. plantarum LP910.0108540290/400GCF_000473935.1
Taxonomic identification of IMI 507028 used as query-ID with Mash, a fast genome distance estimation using MinHash. The search was made for the closest matching NCBI RefSeq genomes. Mash distance approximates the mutation rate. The sketches are set to include 400 hashes (derived from k-mers of size 16) selected randomly from each genome for pairwise comparison. For example, IMI 507028 and the top hit L. plantarum JDM1 share 398 out of the 400 hashes. The alignment-based calculation of ANI is the most widely used comparative measurement for overall genome relatedness indices (OGRI) between two genomes to define boundaries between species with a proposed species boundary cut-off of 95–96% [2]. The genomes sequences of the indicated species included in the orthoANI genome analyses are shown in Table 2.
Table 2

Sequences included in the OrthoANI analysis. Type strains are marked with (T).

StrainAssembly AccessionContigsSize (bp)GC%
L. plantarum JDM1GCF_000023085.137323369744.51
L. plantarum subsp. plantarum ATCC 14917 (T)GCF_000143745.11319775944.66
L. plantarum subsp. plantarum ST-IIIGCF_000148815.29321226144.48
L. plantarum subsp. plantarum NC8GCF_000247735.12330793644.48
L. plantarum LP91GCF_000473935.110320722444.56
L. plantarum Lp90GCF_000731855.1145292558445.06
L. plantarum DmCS_001GCF_000743895.133332407644.32
L. plantarum 90skGCF_000830535.183319468744.54
L. plantarum DSM 13273GCF_001436855.147337145844.26
L. pentosus RI-031GCF_002751855.190343980044.24
Sequences included in the OrthoANI analysis. Type strains are marked with (T). Data from the OrthoANI comparison [3] of IMI 507028 with closely related Lactiplantibacillus strains are shown in Table 3. Pairwise comparisons indicated that the IMI 507028 genome shared 99.9922% identity with the strain L. plantarum JDM1. The cut-off value for species identification was set at 95% [4].
Table 3

OrthoANI (%) calculations between IMI 507028 and selected Lactiplantibacillus strains.

IMIJDM1ATCC 14917ST-IIINC8LP91LP90DmCS_00190skDSM 13273L. pentosus RI-031
IMI 50702810099.992299.132799.052899.069499.192999.023999.073199.118299.046579.9773
JDM110099.12399.072499.094599.172199.05299.076399.157299.093779.8184
ATCC 1491710099.231499.546999.282999.292899.515399.814699.23779.7437
ST-III10099.377899.257799.063999.381199.21199.668279.9368
NC810099.29799.315199.993599.533499.39579.9225
LP9110099.290399.242499.22599.242479.8461
LP9010099.343699.295299.014379.7679
DmCS_00110099.53399.377379.7947
90sk10099.208679.9542
DSM 1327310079.8667
L. pentosus RI-031100
OrthoANI (%) calculations between IMI 507028 and selected Lactiplantibacillus strains. To assess the antibiotic resistance genes present in the sequenced genome IMI 507028, three different databases were used, and no hits with ≥80% identity and 70% coverage were identified, with criteria and thresholds proposed by the European Food Safety Authority (EFSA) for whole genome sequencing data analysis [4]. In addition, no hits were identified for toxin and virulence factor genes potentially carried by IMI 507028 when screened against an updated virulence factor database (VFDB). The search for potential plasmids was performed using PlasmidFinder, and contig 23 showed a partial match with L. plantarum subsp. plantarum P-8 plasmid LBPp6.

Experimental Design, Materials and Methods

Detailed genome sequencing data analysis was performed as previously described [5] except for the annotation performed using NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v6.0 [6]. Sequencing produced 6,141,736 raw reads and 1,854,804,272 base pairs. After trimming 5,850,376 paired reads, 1,708,275,654 base pairs were obtained.

CRediT authorship contribution statement

Ivana Nikodinoska: Writing – original draft, Data curation. Jenny Makkonen: Methodology, Software. Daniel Blande: Software, Formal analysis, Writing – review & editing. Colm Moran: Writing – review & editing, Project administration.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
SubjectMicrobiology
Specific subject areaMicrobial genomics
Type of dataRaw reads, assembled and annotated data from the L. plantarum IMI 507028 genome sequencing
How the data were acquiredIllumina NovaSeq 6000, Unicycler v 0.4.8, PGAP v6.0, NCBI Bacterial Antimicrobial Resistance Reference Gene Database v. 2021-06-01.1, ResFinder, Virulence Factor Database (VFDB), PlasmidFinder.
Data formatRawAnalyzed
Description of data collectionLactiplantibacillus plantarum IMI 507028 was isolated from fermented milk and the extracted DNA was sequenced using NovaSeq 6000 Platform (Illumina). The obtained sequencing data were used for genome analysis, strain identification, and search of genes of concern.
Data source locationInstitution: Alltech Inc.City/Town/Region: Nicholasville, KentuckyCountry: USA
Data accessibilityBioproject Accession Number: PRJNA801616NCBI GenBank Accession Number: JAKMAX000000000NCBI SRA Accession Number: SRR18323693Direct URL to data: https://www.ncbi.nlm.nih.gov/bioproject/PRJNA801616
  6 in total

1.  OrthoANI: An improved algorithm and software for calculating average nucleotide identity.

Authors:  Imchang Lee; Yeong Ouk Kim; Sang-Cheol Park; Jongsik Chun
Journal:  Int J Syst Evol Microbiol       Date:  2015-11-09       Impact factor: 2.747

2.  Proposed minimal standards for the use of genome data for the taxonomy of prokaryotes.

Authors:  Jongsik Chun; Aharon Oren; Antonio Ventosa; Henrik Christensen; David Ruiz Arahal; Milton S da Costa; Alejandro P Rooney; Hana Yi; Xue-Wei Xu; Sofie De Meyer; Martha E Trujillo
Journal:  Int J Syst Evol Microbiol       Date:  2018-01       Impact factor: 2.747

3.  Mash: fast genome and metagenome distance estimation using MinHash.

Authors:  Brian D Ondov; Todd J Treangen; Páll Melsted; Adam B Mallonee; Nicholas H Bergman; Sergey Koren; Adam M Phillippy
Journal:  Genome Biol       Date:  2016-06-20       Impact factor: 13.583

4.  Whole genome sequence data of Lactiplantibacillus plantarum IMI 507027.

Authors:  Ivana Nikodinoska; Jenny Makkonen; Daniel Blande; Colm Moran
Journal:  Data Brief       Date:  2022-03-06

5.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

6.  EFSA statement on the requirements for whole genome sequence analysis of microorganisms intentionally used in the food chain.

Authors: 
Journal:  EFSA J       Date:  2021-07-28
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.