| Literature DB >> 34335919 |
.
Abstract
Microorganisms, genetically modified or not, may be used in the food chain as such or as production organisms of substances of interest. The placement of such microorganisms or derived substances/products in the European market may be subject to a pre-market authorisation process. The authorisation process defines the need to perform a risk assessment to establish the safety and/or the efficacy of the microorganisms when used in the food chain as such or as production strains of substances of interest. In order to perform a risk assessment, the microorganism/s subject to the application for authorisation need/s to be characterised. In this regard, data obtained from whole genome sequence analysis can provide information on the unequivocal taxonomic identification of the strains and on the characterisation of their potential functional traits of concern which may include virulence factors, resistance to antimicrobials of clinical relevance for humans and animals, production of known toxic metabolites. In fact, in some areas of the regulated products, the use of whole genome sequence-based data has been established as a requirement for the risk assessment. This document provides recommendations to applicants on how to describe the process and results which should be provided to the risk assessor in the context of an application for market authorisation of a regulated product. Indications are given on how to perform WGS and the quality criteria/thresholds that should be reached as well as the data and relevant information that need to be sent along whenever such kind of data is required.Entities:
Keywords: food chain; intentional use; microorganisms; whole genome sequence‐based data
Year: 2021 PMID: 34335919 PMCID: PMC8317053 DOI: 10.2903/j.efsa.2021.6506
Source DB: PubMed Journal: EFSA J ISSN: 1831-4732
| Section | Item | Provided | Comments | |
|---|---|---|---|---|
| Yes | NA | |||
|
| ||||
|
|
| |||
| Identifier for the microorganism/s subject of the application for authorisation (same used in other sections of the dossier) |
|
| ||
| Confirmation of the correspondence of the samples used for DNA extraction, sequencing, WGS‐based data analysis and results reported with the microorganism/s subject of the application |
|
| ||
|
|
| |||
|
|
| |||
| Library construction method (including the DNA fragmentation method and any selection of fragments) |
|
| ||
|
|
| |||
| Sequencing strategy and instrumentation used (base‐calling method, where applicable) |
|
| ||
| Trimming and adaptor removal criteria applied, software version and parameters used, PHRED score, number of reads and total base pairs of sequence data before and after trimming |
|
| ||
| Average read depth |
|
| ||
| Contamination in the sequencing data – Percent of reads assigned to unexpected organism/s Tool used, the software version and parameters used, and results; the database used, its version and/or date of accession |
|
| ||
|
|
| |||
| Assembler software, version and parameters (including those applied in post‐assembly processing) |
|
| ||
| Data related to the contigs: total number of contigs produced by the assembler, total length of contigs, N50 |
|
| ||
| Number of highly conserved genes present for yeast and filamentous fungi |
|
| ||
| Annotation software name, version and parameters used, databases used, version and/or date of accession |
|
| ||
|
|
| |||
|
|
| |||
| For |
|
| ||
| For read‐mapping approach, the reference genome used, the software including version number and parameters used, and the results |
|
| ||
|
|
| |||
| Sequences and methodology used for analyses and comparison |
|
| ||
| Alignments between the GMM and the parental/recipient strain |
|
| ||
| Map or graphic presentation, including all insertions, deletions and substitutions found in the genetically modified strain, coding and non‐coding sequences (e.g. promoters, terminators) |
|
| ||
|
|
| |||
| Strategy, software and parameters used to identify genes of interest and database/s used (including version and/or accession date) |
|
| ||
|
|
|
| ||
| For relevant hits:
Subject sequence (including name, accession number and function of the encoded protein) Sequence identity Percentage length of the subject sequence covered |
|
| ||
| Reference‐based read mapping approach |
|
| ||
| For relevant hits:
Subject sequence (including name, accession number and function of the encoded protein) Sequence identity Median depth of mapping Percentage length of the subject sequence covered |
|
| ||
|
|
| |||
| The sequencing reads, and after trimming where relevant, should be submitted in FASTQ or similar formats, compressed or not, pair or single end |
|
| ||
| Assembled sequences can be submitted in FASTA format (e.g.,*.fa) |
|
| ||
| Annotation |
|
| ||
| Alignments to characterise the genetic modification should be provided in Sequence Alignment/Map format (SAM), or Binary Alignment/Map format (BAM) or similar file formats |
|
| ||
| Name: Date: Signature: | ||||
Certain applications may require data supporting the stability of the genetic modification.
In case the annotation format includes the nucleotide sequence, data in FASTA format is not required.