| Literature DB >> 30239681 |
Nik Shazana Nik Mohd Sanusi1, Rozana Rosli1,2, Mohd Amin Ab Halim1, Kuang-Lim Chan1, Jayanthi Nagappan1, Norazah Azizi1, Nadzirah Amiruddin1, Tatiana V Tatarinova3, Eng-Ti Leslie Low1.
Abstract
A set of Elaeis guineensis genes had been generated by combining two gene prediction pipelines: Fgenesh++ developed by Softberry and Seqping by the Malaysian Palm Oil Board. PalmXplore was developed to provide a scalable data repository and a user-friendly search engine system to efficiently store, manage and retrieve the oil palm gene sequences and annotations. Information deposited in PalmXplore includes predicted genes, their genomic coordinates, as well as the annotations derived from external databases, such as Pfam, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes. Information about genes related to important traits, such as those involved in fatty acid biosynthesis (FAB) and disease resistance, is also provided. The system offers Basic Local Alignment Search Tool homology search, where the results can be downloaded or visualized in the oil palm genome browser (MYPalmViewer). PalmXplore is regularly updated offering new features, improvements to genome annotation and new genomic sequences. The system is freely accessible at http://palmxplore.mpob.gov.my.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30239681 PMCID: PMC6146135 DOI: 10.1093/database/bay095
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1Overview of the PalmXplore system architecture. The system architecture is based on the client/server architecture. The PalmXplore-DB contains a list of predicted oil palm genes, functional annotations of the genes with integrative access to external databases and oil palm genome scaffolds and sequences.
Figure 2ERD of PalmXplore-DB. The ERD shows the conceptual data structure used in PalmXplore-DB. Entities and relationships are represented as boxes and dotted lines between the boxes, respectively. The database structure consists of 15 tables presented in three modules (represented by different colours).
Figure 3Oil palm genes classification based on GO annotation.
Summary of the data deposited in PalmXplore system
|
| |||
|---|---|---|---|
| Total number of scaffolds | 40 360 | ||
| Average / N50 / largest scaffold sizes (bp) | 38 036 / 1 045 414 / 22 100 610 | ||
| Number of bases (bp) | 1 535 150 282 | ||
|
| |||
| Representative | Fgenesh++ | Seqping | |
| Number of genes | 26 059 | 27 915 | 17 680 |
| Average length (bp) | 1 239 | 1 120 | 1 193 |
| Gene density (gene/Mb) | 16.98 | 18.19 | 11.52 |
| Average exon per gene | 5.4 | 5.1 | 6.0 |
| Average exon length (bp) | 252 | 237 | 197 |
| Number of genes annotated to GO term(s) | 21 572 | - | - |
| Number of genes with Enzyme Code (KEGG) | 6 195 | - | - |
|
| |||
| Intronless genes | 3 658 | ||
| Resistance (R) genes | 210 | ||
| FAB genes | 42 | ||
Figure 4Search and browse options in the PalmXplore system. (A) Basic search: search oil palm genes by Gene ID, Scaffold ID, chromosome number, keywords, specific gene or location on the genome. (B) Advanced search: refine search results by entering multiple options. (C) MYPalmViewer: visualize and navigate searched gene in the oil palm genome sequence. Annotation data is also available. (D) CDS browser: browse the list of predicted genes.
Figure 5Interoperability with web portal and Bioinformatics analysis tools. (A) Genomsawit portal: a web portal for the oil palm genome information. Genome assemblies and gene models are available for download here. (B) PalmXplore system manages oil palm gene data deposition and queries. (C) BLAST as an alignment tool. (D) MYPalmViewer to visualise oil palm genomes, genes, genetic markers and others; (E) sequence retrieval of oil palm genomic data; and (F) data download facility.