| Literature DB >> 29088455 |
Guantao Zheng1,2, Yijie Ma1, Yang Zou1, An Yin2, Wushuang Li1, Dong Dong1.
Abstract
Metastasis is the main event leading to death in cancer patients. Over the past decade, high-throughput technologies have provided genome-wide view of transcriptomic changes associated with cancer metastases. Many microarray and RNA sequencing studies have addressed metastases-related expression patterns in various types of cancer, and the number of relevant works continues to increase rapidly. These works have characterized genes that orchestrate the metastatic phenotype of cancer cells. However, these expression data have been deposited in various repositories, and efficiently analyzing these data is still difficult because of the lack of an integrated data mining platform. To facilitate the in-depth analyses of transcriptome data on metastasis, it is quite important to make a comprehensive integration of these metastases-related expression data. Here, we presented a database, HCMDB (the human cancer metastasis database, http://hcmdb.i-sanger.com/index), which is freely accessible to the research community query cross-platform transcriptome data on metastases. HCMDB is developed and maintained as a useful resource for building the systems-biology understanding of metastasis.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29088455 PMCID: PMC5753185 DOI: 10.1093/nar/gkx1008
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The framework for constructing HCMDB. The transcriptome data were derived from GEO, SRA, TCGA database. A series of filters were used to ensure the data quality of HCMDB. Metastasis-related literatures were manually to annotate the expression dysregulated genes.
Experiment and sample size in the current version of HCMDB
| Primary cancers | No. of experiments | No. of samples | Data sources |
|---|---|---|---|
| Bladder cancer | 17 | 423 | TCGA |
| Brain cancer | 1 | 22 | GEO |
| Breast cancer | 93 | 3054 | GEO, TCGA |
| Cervical cancer | 11 | 968 | GEO, TCGA |
| Colorectal cancer | 75 | 2440 | GEO, TCGA, SRA |
| Esophagus cancer | 12 | 171 | TCGA |
| Ewing's sarcoma | 1 | 37 | GEO |
| Eye cancer | 3 | 121 | GEO, TCGA |
| Gastric cancer | 3 | 404 | TCGA |
| Head and neck cancer | 2 | 30 | GEO |
| Kindey cancer | 25 | 353 | GEO, TCGA |
| Laryngeal cancer | 2 | 15 | GEO |
| Liver cancer | 30 | 273 | GEO, SRA |
| Lung cancer | 4 | 46 | GEO |
| Midgut carcinoid tumor | 4 | 39 | GEO |
| Nasopharynx cancer | 2 | 22 | GEO |
| Oral cancer | 2 | 27 | GEO |
| Osteosarcoma | 3 | 30 | GEO |
| Ovarian cancer | 1 | 18 | GEO |
| Pancreatic cancer | 27 | 293 | GEO, TCGA |
| Pancreatic neuroendocrine tumor | 18 | 94 | GEO |
| Penis cancer | 3 | 33 | GEO |
| Prostate cancer | 49 | 863 | GEO, TCGA, SRA |
| Skin cancer | 38 | 598 | GEO |
| Small intestine cancer | 11 | 87 | GEO |
| Synovial sarcoma | 1 | 34 | GEO |
| Testicular cancer | 2 | 142 | GEO, TCGA |
| Thymoma | 2 | 121 | TCGA |
| Thyroid cancer | 15 | 667 | GEO, TCGA |
| Total | 455 | 11 425 |
Figure 2.The percentage of categories of differential expression analyses.
Figure 3.The schematic workflow of HCMDB.