| Literature DB >> 30103816 |
Martin Norling1,2, Niclas Jareborg1,3, Jacques Dainat4,5.
Abstract
OBJECTIVE: The state-of-the-art genome annotation tools output GFF3 format files, while this format is not accepted as submission format by the International Nucleotide Sequence Database Collaboration (INSDC) databases. Converting the GFF3 format to a format accepted by one of the three INSDC databases is a key step in the achievement of genome annotation projects. However, the flexibility existing in the GFF3 format makes this conversion task difficult to perform. Until now, no converter is able to handle any GFF3 flavour regardless of source.Entities:
Keywords: Annotation; Converter; EMBL; GFF3; Submission
Mesh:
Substances:
Year: 2018 PMID: 30103816 PMCID: PMC6090716 DOI: 10.1186/s13104-018-3686-x
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
EMBLmyGFF3 performance assessment on different genome annotations
| Species | File size of the genome (Mbases) | File size of the gff annotation file (Mbytes) | Memory usage (Gbytes) | Compute time (Min) |
|---|---|---|---|---|
|
| 4.6 | 3.5 | 0.17 | 2.2 |
|
| 13 | 3 | 0.14 | 1.8 |
|
| 36 | 17 | 0.40 | 10 |
|
| 227 | 63 | 1.60 | 49 |
|
| 927 | 122 | 3.11 | 164 |
The tests have been run on a 2.8 GHz Intel Core i7 computer with 8 Gb memory except for E. subulatus run on AMD Opteron(tm) Processor 6174 with 256 Gb memory