| Literature DB >> 35473935 |
Qili Shi1, Teng Liu2,3, Wei Hu2, Zhiao Chen1, Xianghuo He4, Shengli Li5.
Abstract
The production of functional mature RNA transcripts from genes undergoes various pre-transcriptional regulation and post-transcriptional modifications. Accumulating studies demonstrated that gene transcription carries out in tissue and cancer type-dependent ways. However, RNA transcript-level specificity analysis in large-scale transcriptomics data across different normal tissue and cancer types is lacking. We applied reference-based de novo transcript assembly and quantification of 27,741 samples across 33 cancer types, 29 tissue types, and 25 cancer cell line types. We totally identified 231,836 specific RNA transcripts (SRTs) across various tissue and cancer types, most of which are found independent of specific genes. Almost half of tumor SRTs are also tissue-specific but in different tissues. Furthermore, we found that 10 ~ 20% of tumor SRTs in most tumor types were testis-specific. The SRT database (SRTdb) was constructed based on these resources. Taking liver cancer as an example, we showed how SRTdb resource is utilized to optimize the identification of RNA transcripts for more precision diagnosis of particular cancers. Our results provide a useful resource for exploring transcript specificity across various cancer and tissue types, and boost the precision medicine for tumor patients.Entities:
Keywords: Pan-cancer analysis; Precision tumor diagnosis; RNA transcript; Tissue specificity analysis; Transcriptional diversity
Year: 2022 PMID: 35473935 PMCID: PMC9044872 DOI: 10.1186/s40364-022-00377-1
Source DB: PubMed Journal: Biomark Res ISSN: 2050-7771
The numbers of samples in each cancer, tissue, and cell line type
| Type | Normal tissue | Tumor tissuea | Cancer cell line |
|---|---|---|---|
| Adipose tissue | 1204 | 0 | 0 |
| Adrenal gland | 258 | 79 (ACC) | 0 |
| Bladder | 21 | 415 (BLCA) | 0 |
| Blood | 929 | 151 (LAML) | 0 |
| Blood vessel | 1335 | 0 | 0 |
| Brain | 1668 | 530 (LGG);169 (GBM) | 65 |
| Breast | 457 | 1109 (BRCA) | 57 |
| Cervix | 19 | 302 (CESC) | 3 |
| Colon | 779 | 480 (COAD) | 59 |
| Esophagus | 1434 | 160 (ESCA) | 26 |
| Heart | 857 | 0 | 0 |
| Kidney | 89 | 289 (KIRP); 539 (KIRC); 65 (KICH) | 32 |
| Liver | 226 | 372 (LIHC) | 25 |
| Lung | 573 | 536 (LUAD); 502 (LUSC) | 191 |
| Muscle | 799 | 0 | 0 |
| Nerve | 619 | 183 (PCPG) | 16 |
| Ovary | 180 | 379 (OV) | 47 |
| Pancreas | 328 | 178 (PAAD) | 41 |
| Pituitary | 283 | 0 | 0 |
| Prostate | 242 | 500 (PRAD) | 8 |
| Salivary gland | 162 | 0 | 2 |
| Skin | 1806 | 471 (SKCM) | 56 |
| Small intestine | 187 | 0 | 1 |
| Spleen | 241 | 0 | 0 |
| Stomach | 359 | 375 (STAD) | 37 |
| Testis | 361 | 156 (TGCT) | 0 |
| Thyroid | 653 | 509 (THCA) | 11 |
| Uterus | 142 | 56 (UCS); 552 (UCEC) | 27 |
| Vagina | 156 | 0 | 0 |
| Biliary tract | 0 | 36 (CHOL) | 8 |
| Soft tissue | 0 | 263 (SARC) | 31 |
| Lymphoma | 0 | 48 (DLBC) | 176 |
| Head and Neck | 0 | 502 (HNSC) | 32 |
| Mesothelium | 0 | 86 (MESO) | 0 |
| Rectum | 0 | 167 (READ) | 0 |
| Thymus | 0 | 119 (THYM) | 0 |
| Eye | 0 | 80 (UVM) | 0 |
| Bone | 0 | 0 | 28 |
| Pleura | 0 | 0 | 11 |
| Urinary tract | 0 | 0 | 26 |
aACC adrenocortical carcinoma, BLCA Bladder Urothelial Carcinoma, LAML Acute Myeloid Leukemia, LGG Brain Lower Grade Glioma, GBM Glioblastoma multiforme, BRCA Breast invasive carcinoma, CESC Cervical squamous cell carcinoma and endocervical adenocarcinoma, COAD Colon adenocarcinoma, ESCA Esophageal carcinoma, KIRP Kidney renal papillary cell carcinoma, KIRC Kidney renal clear cell carcinoma, KICH Kidney Chromophobe, LIHC Liver hepatocellular carcinoma, LUAD Lung adenocarcinoma, LUSC Lung squamous cell carcinoma, PCPG Pheochromocytoma and Paraganglioma, OV Ovarian serous cystadenocarcinoma, PAAD Pancreatic adenocarcinoma, PRAD Prostate adenocarcinoma, SKCM Skin Cutaneous Melanoma, STAD Stomach adenocarcinoma, TGCT Testicular Germ Cell Tumors, THCA Thyroid carcinoma, UCS Uterine Carcinosarcoma, UCEC Uterine Corpus Endometrial Carcinoma, CHOL Cholangiocarcinoma, SARC Sarcoma, DLBC Lymphoid Neoplasm Diffuse Large B-cell Lymphoma, HNSC Head and Neck squamous cell carcinoma, MESO Mesothelioma, READ Rectum adenocarcinoma, THYM Thymoma, UVM Uveal Melanoma
Fig. 1The infrastructure of online SRTdb database. A Transcripts were generated from de novo transcript assembly of 10,358 tumor samples across 33 different tumor types and 1,160,216 transcripts were totally identified. Transcript quantification was also performed in 16,367 normal tissue samples across 29 different tissue types and 1016 cancer cell lines across 25 different primary sites. Tumor, tissue, and cancer cell type-specific scores were calculated. SRTdb offers features of browse, search, visualization, and download for all users. B Piechart shows the percentages of annotated and novel transcripts. C The distribution of transcript length. D The distribution of exon numbers in transcripts
Fig. 2Exploration of specific RNA transcripts with the SRTdb resource. A UMAP visualization of tumor samples by using top 2000 variable transcripts. B UMAP visualization of normal samples by using top 2000 variable transcripts. C The number distributions of SRTs across different tumor types, cancer cell line types, and normal tissue types. D The percentages of SRTs in host SRGs across tumor (top panel) and normal tissue (bottom panel) samples
Fig. 3The tissue origins of tumor SRTs. A The percentages of tumor SRTs specific in original tissue types, other tissue types or not tissue-specific. B The expression distribution of liver cancer-specific transcript, ENST00000522365.1, across multiple cancer and normal tissue types. C The number distribution of tissue SRTs specific in matched tumor types, other cancer types or not cancer-specific. D The numbers of testis SRTs across different tumor types
Fig. 4The diagnostic value of liver cancer-specific transcripts. A The specific diagnostic scores of transcripts for liver cancer. The expression distribution of transcript ENST00000481511.4 (B), ENST00000465758.1 (C), and ENST00000379236.3 (D) across different tumor and normal tissue types