| Literature DB >> 31733062 |
Olivier Arnaiz1, Eric Meyer2, Linda Sperling1.
Abstract
ParameciumDB (https://paramecium.i2bc.paris-saclay.fr) is a community model organism database for the genome and genetics of the ciliate Paramecium. ParameciumDB development relies on the GMOD (www.gmod.org) toolkit. The ParameciumDB web site has been publicly available since 2006 when the P. tetraurelia somatic genome sequence was released, revealing that a series of whole genome duplications punctuated the evolutionary history of the species. The genome is linked to available genetic data and stocks. ParameciumDB has undergone major changes in its content and website since the last update published in 2011. Genomes from multiple Paramecium species, especially from the P. aurelia complex, are now included in ParameciumDB. A new modern web interface accompanies this transition to a database for the whole Paramecium genus. Gene pages have been enriched with orthology relationships, among the Paramecium species and with a panel of model organisms across the eukaryotic tree. This update also presents expert curation of Paramecium mitochondrial genomes.Entities:
Mesh:
Year: 2020 PMID: 31733062 PMCID: PMC7145670 DOI: 10.1093/nar/gkz948
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Available genomes. For each genome, the species, strain, cellular compartment (macronucleus, MAC, micronucleus, MIC or mitochondrion, MITO), assembly complexity, number of annotated coding genes and bibliographic reference are provided
| Species | Strain | Compartment | Complexity | N50 | # coding genes | Reference |
|---|---|---|---|---|---|---|
|
| V1-4 | MAC | 76976592 | 145535 | 40261 | ( |
| V1-4 | MITO | 39731 | 39731 | 46 | ( | |
|
| 110224 | MAC | 29155737 | 96293 | 17226 | ( |
|
| 43c3d | MAC | 30525943 | 306679 | 18673 | ( |
| 43c3d | MITO | 43620 | 43620 | 46 | ( | |
| C026 | MITO | 44414 | 44413 | 46 | ( | |
| C083 | MITO | 44180 | 44175 | 46 | ( | |
| C104 | MITO | 44421 | 44420 | 46 | ( | |
| GBE | MITO | 43663 | 43657 | 46 | ( | |
|
| 223 | MAC | 71912400 | 189418 | 40810 | ( |
| 223 | MITO | 40127 | 40127 | 46 | ( | |
|
| 274 | MAC | 71627707 | 176048 | 41085 | ( |
| 274 | MITO | 40070 | 40070 | 40070 | ( | |
|
| M | MAC | 65347890 | 212635 | 37098 | ( |
| M | MITO | 40138 | 40138 | 46 | ( | |
|
| MO3c4 | MAC | 35730210 | 436665 | 17834 | ( |
| MO4 | MITO | 38064 | 38062 | 42 | ( | |
| Peniche 3I | MITO | 39192 | 39192 | 42 | ( | |
|
| TE | MAC | 64789109 | 78722 | 35534 | ( |
| TE | MITO | 39671 | 39671 | 46 | ( | |
|
| K8 | MAC | 72980862 | 439553 | 38668 | ( |
| 138 | MITO | 39802 | 39802 | 46 | ||
|
| 87 | MITO | 39865 | 39865 | 46 | |
|
| Ir4-2 | MAC | 71017591 | 460739 | 34474 | ( |
| AZ9-3 | MITO | 39763 | 39763 | 46 | ||
|
| N1A | MAC | 59122419 | 223945 | 33793 | ( |
| N1A | MITO | 39781 | 39781 | 46 | ( | |
|
| AZ8-4 | MAC | 68020722 | 420472 | 36094 | ( |
| AZ8-4 | MITO | 39745 | 39745 | 46 | ( | |
| 128 | MITO | 39939 | 39939 | 46 | ( | |
| 130 | MITO | 39946 | 39946 | 46 | ( | |
| 133 | MITO | 39984 | 39983 | 46 | ( | |
|
| 30995 | MITO | 40274 | 40274 | 46 | |
|
| d4-2 | MAC | 72094543 | 410619 | 39642 | ( |
| 51 | MAC | 72102941 | 412881 | 40460 | ( | |
| 51 | MITO | 40040 | 40040 | 46 | ( | |
| 51 | MIC | 98489268 | 37181 | NA | ( | |
| 32 | MITO | 39835 | 39835 | 46 | ||
|
| 209 | MAC | 65931501 | 490675 | 36179 | ( |
| 209 | MITO | 39834 | 39834 | 46 | ( |
Figure 1.Phylogenetic tree of Paramecium species based on the alignment of all 46 mitochondrial proteins. The tree was built by MAFFT alignment of concatenated protein sequences, where all N- and C-termini are correctly aligned; strict Gblocks curation selects 9135 residues aligned in all species. PhyML aLRT analysis was carried out using the service at https://www.phylogeny.fr with default parameters. This analysis places the caudatum species as closest relatives of the aurelia species. Note the good support for a sexaurelia-sonneborni-jenningsi clade.
Tools and software components
| Tool | Software | Description | Reference | |
|---|---|---|---|---|
| Database | chado | V1.31; PostgreSQL | Data storage | ( |
| lucene | 2.3.3.4 | Quick search indexation |
| |
| bio-db-seqfeature-store | MySQL (mariadb v5.5.60), BioPerl (v1) | Data storage | ( | |
| Interface | Template toolkit | v2.24 | Web content system |
|
| Web design | Jquery (v2.1.4), Bootstrap (v3.3.5), font-awesome (v4.5.0) | Javascript and CSS libraries |
| |
|
| Blast | ncbi-blast (v2.3.0+) | Search for a sequence in a nucleotide or protein database, using NCBI Basic Local Alignment Search Tool | ( |
| BioMart | Biomart (v0.9) | Advanced query and data retrieval interface, powered by BioMart software | ( | |
| Get Sequence | Samtools (v0.1.19) | Retrieve a single nucleotide or protein sequence from a ParameciumDB database | ( | |
| ID converter | In-house | Find corresponding gene IDs between different gene annotation versions | ||
| Search motif | EMBOSS (v6.6.0.0) | Searches sequences with a sequence motif using patmatdb program from the EMBOSS package | ( | |
|
| JBrowse | Jbrowse (v1.12.1) | A fast, Interactive genome browser | ( |
| GBrowse | Gbrowse2 (v2.54) | The Generic Genome Browser | ( | |
| Chromosome Synteny | Circos (v0.69-3) | Visualize gene paralogy or orthology relationships between scaffolds, based on protein similarity | ( | |
| Stock Tubes | In-house | Visualize available strains, genotypes and phenotypes | ||
| Publications | In-house | Paramecium publications, linked to PubMed and to ParameciumDB gene pages | ||
| Codon Usage | In-house | Codon usage tables and tool to find alternate codons for design of synthetic genes resistant to RNAi | ||
| ParaWiki | Mediawiki (v1.27.1) | Community pages with information about Paramecium biology, protocols, meetings and more |
| |
|
| RNAi Off-target | BWA (v0.7.12) | Determine whether a sequence has off-target siRNA matches. | ( |
| Multiple alignment | MUSCLE (v3.8.425) | Construct a multiple alignment of nucleotide or protein sequences with MUSCLE. | ( |
The tools and software components used by ParameciumDB are organized by category (Database, Interface, Search, Browse, Analysis). Each tool has a brief description. For each software component, the version is provided as well as bibliographic reference or URL.
Figure 2.Gene page example. Gene page of ND7, showing the different collapsable sections available for this gene, indicated in the Table of Contents: Overview with general description and cross references, Sequence, Transcription unit structure, Bibliographic references, Protein domain predictions, Homology within Paramecium species and with other organisms, Expression data (RNA-seq, microarray, proteomic), intragenic IES information and information about ND7 alleles, stocks and mutant phenotypes. Selecting proteins adds them to the shopping cart as shown for the Homology section. The shopping cart can be visualized from the User dropdown menu (top right of page) as shown by the inset screenshot at the bottom of the figure. Actions are available for the shopping cart items: multiple alignment (with MUSCLE (34)), expression profiles, submit to a blast at ParameciumDB or NCBI. The items can also be exported, in fasta format by choosing the ‘Sequence(s)’ tab.