| Literature DB >> 30799499 |
Tian Li1, Xiao-Fei Xu2, Hui-Hui Du1, Li Li2, Neng-Zhang Li3, Ze-Yang Zhou1,4, Yuan-Yi Peng3.
Abstract
Pasteurella multocida can infect a wide range of host, including humans and animals of economic importance. Genomics studies on the pathogen have produced a large amount of omics data, which are deposited in GenBank but lacks a dedicated and comprehensive resource for further analysis and integration so that need to be brought together centrally in a coherent and systematic manner. Here we have collected the genomic data for 176 P. multocida strains that are categorized into 11 host groups and 9 serotype groups, and developed the open-access P. multocida Database (PamulDB) to make this resource readily available. The PamulDB implements and integrates Chado for genome data management, Drupal for web content management, and bioinformatics tools like NCBI BLAST, HMMER, PSORTb and OrthoMCL for data analysis. All the P. multocida genomes have been further annotated for search and analysis of homologous sequence, phylogeny, gene ontology, transposon, protein subcellular localization and secreted protein. Transcriptomic data of P. multocida are also selectively adopted for gene expression analysis. The PamulDB has been developing and improving to better aid researchers with identifying and classifying of pathogens, dissecting mechanisms of the pathogen infection and host response.Entities:
Mesh:
Year: 2019 PMID: 30799499 PMCID: PMC6387869 DOI: 10.1093/database/baz025
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Summary of P. multocida strains and major data types available in PamulDB
| No. of records | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Subspecies | Serotype | Total | |||||||||||||
|
|
|
| unclassified | A | A1 | A3 | A5 | B | B2 | D | F | NA | |||
| Strains | 3 | 15 | 46 | 112 | 13 | 5 | 5 | 1 | 2 | 17 | 1 | 3 | 129 | 176 | |
| Assembly | 502 | 301 | 7166 | 4460 | 244 | 84 | 64 | 6 | 39 | 732 | 1 | 7 | 11 252 | 12 429 | |
| Gene | 4274 | 32 292 | 91 544 | 235 985 | 27 604 | 10 532 | 10 722 | 2 176 | 4 023 | 36 903 | 2 256 | 6 259 | 263 620 | 364 095 | |
| Protein | 4165 | 31 495 | 89 007 | 231 999 | 26 821 | 10 223 | 10 409 | 2102 | 3962 | 35 942 | 2183 | 6111 | 258 913 | 356 666 | |
| ncRNA | 109 | 797 | 2 537 | 3 987 | 784 | 309 | 313 | 74 | 61 | 961 | 73 | 148 | 4707 | 7430 | |
| Transposon | 101 | 570 | 1740 | 4129 | 516 | 192 | 181 | 35 | 72 | 646 | 38 | 123 | 4737 | 6540 | |
| Gene ontology | |||||||||||||||
| Molecular function | 708 | 740 | 760 | 760 | 734 | 716 | 722 | 712 | 679 | 719 | 43 | 709 | 771 | 771 | |
| Biological process | 441 | 471 | 504 | 500 | 472 | 446 | 448 | 443 | 427 | 447 | 38 | 440 | 513 | 513 | |
| Cellular component | 44 | 45 | 51 | 53 | 45 | 44 | 47 | 45 | 43 | 45 | 9 | 44 | 55 | 55 | |
aNot available; btRNA and rRNA.
Figure 1Highlights for the homepage, tools, overview pages and entry details of PamulDB. (A) The homepage with quick accesses to search, analysis tools and data for each strain. (B–E) An example of searching results and browsing genes, protein domains and subcellular localizations. (F) PamulDB gene ontology browser shows GO terms at multiple levels and can export sequences for each level. (G) Details for a genome feature, including data summary, genome location, sequences, protein domains, protein locations, gene ontology, homologs, phylogeny and publications. (H) PamulDB genome browser is built with JBrowse and used to graphically view genome features. (I and J) Analysis tools of BLAST and HMMER used for searching homologous sequences. (K) Analysis tool of SubLocPred used to predict protein subcellular localizations.