| Literature DB >> 22140100 |
Yukio Maruyama1, Yoshifumi Kawamura, Tetsuo Nishikawa, Takao Isogai, Nobuo Nomura, Naoki Goshima.
Abstract
The Human Gene and Protein Database (HGPD; http://www.HGPD.jp/) is a unique database that stores information on a set of human Gateway entry clones in addition to protein expression and protein synthesis data. The HGPD was launched in November 2008, and 33,275 human Gateway entry clones have been constructed from the open reading frames (ORFs) of full-length cDNA, thus representing the largest collection in the world. Recently, research objectives have focused on the development of new medicines and the establishment of novel diagnostic methods and medical treatments. And, studies using proteins and protein information, which are closely related to gene function, have been undertaken. For this update, we constructed an additional 9974 human Gateway entry clones, giving a total of 43,249. This set of human Gateway entry clones was named the Human Proteome Expression Resource, known as the 'HuPEX'. In addition, we also classified the clones into 10 groups according to protein function. Moreover, in vivo cellular localization data of proteins for 32,651 human Gateway entry clones were included for retrieval from the HGPD. In 'Information Overview', which presents the search results, the ORF region of each cDNA is now displayed allowing the Gateway entry clones to be searched more easily.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22140100 PMCID: PMC3245012 DOI: 10.1093/nar/gkr1188
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Improvement of entries of HGPD
| Data set | V. 1.0 | V. 2.0 (in this update) |
|---|---|---|
| Gateway entry clones | 33 275 (14 590 loci | 43 249 (16 484 loci |
| (N-type: 12 754) | (N-type: 17 802) | |
| (F-type: 20 521) | (F-type: 25 447) | |
| 13 364 | 17 821 | |
| – | 32 651 |
aNumbers of locus in HGPD are represented.
Figure 1.Improved display of the HGPD. In ‘Information Overview’, (a) open reading frames (ORFs) of each cDNA clone can be visualized. In ‘Protein Info’, the left endpoint represents the N-terminus of each ‘Locus’ (b) and the right endpoint represents the C-terminus (c). ‘SL’, the subcellular localization of human proteins, is linked by the ‘SL’ button in ‘Protein Info’. (d) Protein subcellular localization by visual observation of the images. (e) The subcellular localization image. ‘PE’ or ‘ESPRESSO’ information is linked by the ‘PE’ button in ‘Protein Info’.
Classification under protein function
| Category | Numbers of gene symbol | Numbers of Gateway entry clone |
|---|---|---|
| Protein kinase | 385 | 1322 |
| Protein phosphatase | 84 | 291 |
| G-protein coupled receptor | 208 | 405 |
| Small GTPase | 134 | 255 |
| Transcription factor | 1704 | 4319 |
| Protease | 399 | 1104 |
| Gricans | 159 | 405 |
| Ion channel | 177 | 395 |
| Epigenetics | 171 | 519 |
| Mitochondrial protein | 637 | 1669 |
Figure 2.Category Search. This interface is accessed through ‘Advanced Search’ under ‘Top View’ (a). (b) A category search is initiated by clicking the ‘Category Search’ tab. (c) In each category, only one item (term) can be selected. (d) The number of hits from a selected item (term) is then counted, and (f) displayed by clicking the ‘Submit’ button (e). (h) The results are then linked to ‘Information Overview’ by clicking ‘IO’. (g) The ‘Gene Symbol’ interface is linked to NCBI Entrez Gene. (i) The list of search results is obtained by clicking the ‘Download’ button.