| Literature DB >> 33385022 |
Juliana Ferreira de Sousa1, Patrick da Silva2, Rodolfo Bortolozo Serafim2,3, Ricardo Perecin Nociti3,4, Cristiano Gallina Moreira2, Wilson Araujo Silva3,5,6, Valeria Valente2,3.
Abstract
Astrocytomas are the most common and aggressive type of primary brain tumors in adults. The World Health Organization (WHO) assorts them into grades, from I to IV, based on histopathological features that reflect their malignancy [1]. Alongside with tumor progression, comes an increased proliferation, genomic instability, infiltration in normal brain tissue and resistance to treatments. The high genomic instability forges tumor cells enhancing key proteins that avoid cells from collapsing and favor therapy resistance [2]. To explore genes and pathways associated with tumor progression phenotypes we analyzed gene expression in a panel of non-tumor and glioma cell lines, namely: ACBRI371, non-tumor human astrocytes; HDPC, fibroblasts derived from dental pulp; Res186, Res259, Res286 and UW467 that include grade I, II and III astrocytoma cell lines derived from pediatric tumors; and T98G, U343MG, U87MG, U138MG and U251MG, all derived from GBM (grade IV). We also profiled gene expression changes caused by exogenously induced replicative stress, performing RNA sequencing with camptothecin (CPT)-treated cells. Here we describe the RNA-sequencing data set acquired, including quality of reads and sequencing consistency, as well as the bioinformatics strategy used to analyze it. We also compared gene expression patterns and pathway enrichment between non-tumor versus lower-grade (LGG), non-tumor versus GBM, LGG versus GBM, and CPT-treated versus non-treated cells. In brief, a total of 6467 genes showed differential expression and 5 pathways were enriched in tumor progression, while 2279 genes and 7 pathways were altered under the replication stress condition. The raw data was deposited in the NCBI BioProject database under the accession number PRJNA631805. Our dataset is valuable for researchers interested in differential gene expression among different astrocytoma grades and in expression changes caused by replicative stress, facilitating studies that seek novel biomarkers of glioma progression and treatment resistance.Entities:
Keywords: Astrocytoma; Camptothecin (CPT); Gene expression profiling; Glioblastoma; RNAseq; Replicative stress; Tumor progression
Year: 2020 PMID: 33385022 PMCID: PMC7772531 DOI: 10.1016/j.dib.2020.106643
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
RNA Sequencing metrics. The total amounts of reads produced for each cell line or condition analyzed were grouped into two datasets (A and B) for the first run (#1) or four datasets (A, B, C and D) for the second run (#2). Counting of the number of raw reads, timmmed reads and aligned reads are shown.
| First run #1 | |||||
|---|---|---|---|---|---|
| Sample | Datasets* | raw reads | trimmed reads | aligned reads | total aligned reads per condition |
| ACBRI371.A | A | 18,777,557 | 16,073,974 | 15,938,790 | 32,064,952 |
| ACBRI371.B | B | 18,812,311 | 16,251,936 | 16,126,162 | |
| HDPC.A | A | 15,943,724 | 13,606,444 | 13,504,133 | 27,185,197 |
| HDPC.B | B | 15,979,931 | 13,774,367 | 13,681,064 | |
| T98.A | A | 17,728,147 | 15,076,520 | 14,976,750 | 30,171,998 |
| T98.B | B | 17,785,963 | 15,286,117 | 15,195,248 | |
| U138.A | A | 16,704,458 | 14,166,959 | 14,071,552 | 28,340,144 |
| U138.B | B | 16,744,624 | 14,355,064 | 14,268,592 | |
| U251.A | A | 18,962,975 | 16,244,799 | 16,142,384 | 32,480,625 |
| U251.B | B | 19,015,489 | 16,431,033 | 16,338,241 | |
| U343.A | A | 19,758,340 | 16,810,865 | 16,692,122 | 33,601,621 |
| U343.B | B | 19,791,840 | 17,016,541 | 16,909,499 | |
| U87.A | A | 19,764,536 | 16,898,823 | 16,788,200 | 32,998,714 |
| U87.B | B | 18,881,408 | 16,305,409 | 16,210,514 | |
| Second run #2 | |||||
| ACBRI371 + cpt18hs.A | A | 21,518,098 | 21,095,090 | 20,862,592 | 83,405,190 |
| ACBRI371 + cpt18hs.B | B | 21,195,306 | 20,755,940 | 20,502,694 | |
| ACBRI371 + cpt18hs.C | C | 21,913,608 | 21,476,156 | 21,245,612 | |
| ACBRI371 + cpt18hs.D | D | 21,516,074 | 21,073,316 | 20,794,292 | |
| R186.A | A | 19,595,942 | 19,245,970 | 18,417,842 | 73,482,118 |
| R186.B | B | 19,230,796 | 18,859,758 | 18,014,710 | |
| R186.C | C | 19,960,034 | 19,593,568 | 18,755,774 | |
| R186.D | D | 19,544,228 | 19,171,266 | 18,293,792 | |
| R259.A | A | 18,566,902 | 18,175,510 | 16,400,986 | 65,617,994 |
| R259.B | B | 18,311,806 | 17,885,020 | 16,128,140 | |
| R259.C | C | 18,900,300 | 18,492,500 | 16,693,170 | |
| R259.D | D | 18,628,414 | 18,200,300 | 16,395,698 | |
| R286.A | A | 19,534,344 | 19,152,002 | 17,501,632 | 69,847,434 |
| R286.B | B | 19,192,134 | 18,795,248 | 17,144,668 | |
| R286.C | C | 19,876,010 | 19,479,954 | 17,808,962 | |
| R286.D | D | 19,487,048 | 19,087,884 | 17,392,172 | |
| U138 + cpt18hs.A | A | 22,201,702 | 21,832,016 | 14,562,900 | 58,244,234 |
| U138 + cpt18hs.B | B | 21,898,446 | 21,490,134 | 14,317,296 | |
| U138 + cpt18hs.C | C | 22,596,730 | 22,211,560 | 14,821,870 | |
| U138 + cpt18hs.D | D | 22,234,620 | 21,823,154 | 14,542,168 | |
| U251 + cpt18hs.A | A | 22,296,224 | 21,818,672 | 21,557,698 | 86,098,692 |
| U251 + cpt18hs.B | B | 21,929,756 | 21,428,122 | 21,135,756 | |
| U251 + cpt18hs.C | C | 22,702,128 | 22,210,046 | 21,951,788 | |
| U251 + cpt18hs.D | D | 22,280,758 | 21,776,944 | 21,453,450 | |
| UW467.A | A | 20,931,108 | 20,520,954 | 18,115,334 | 72,368,544 |
| UW467.B | B | 20,578,786 | 20,146,762 | 17,756,036 | |
| UW467.C | C | 21,313,478 | 20,886,716 | 18,446,260 | |
| UW467.D | D | 20,930,310 | 20,494,222 | 18,050,914 | |
*Refers to groups of reads obtained from different lanes of sequencing runs.
Fig. 1Scatter plots showing the correlations among datasets of reads for each sample analyzed. Scatter plots were generated with the number of reads per gene for all genes whose expression was identified in each sequenced sample. Plots were clustered into four different groups, according to the cell line origin and/or treatment condition: non-tumor cells, LGG cells, GBM cells and CPT-treated GBM cells. #1 refers to samples sequenced in the Genome Analyzer IIx, while #2 refers to samples sequenced in the NextSeq 500, Illumina Inc. The letters A, B, C and D, accompanying the name of each cell line, refer to reads obtained from different lanes of sequencing for the same sample.
Fig. 2Volcano plots displaying the degree of altered gene expression among groups of cell lines and treated versus non-treated cells. Volcano plots were produced using the fold change values and p-values generated through the DESeq2 R package analysis to compare the mRNA expression changes between LGG vs GBM (A), ACBRI371 vs LGG (B), ACBRI371 vs GBM (C); and Control cells vs CPT treated cells for: ACBRI371 (D), U138MG (E) and U251MG (F). Blue dots show genes with significant p-value. Green dots show genes with significant fold change. Red dots represent genes with significance in both p-value and fold change.
KEGG pathways enrichment analysis. Differentially expressed genes (q-values ≤ 0.0001 and log fold change > 2 or < −2) were submitted to KEGG evaluation. Enriched pathways found in each comparison and details of the analysis results are shown.
| DOWN LGG - UP GBM | ||||||||
|---|---|---|---|---|---|---|---|---|
| Gene Set | Description | Size | Expect | Ratio | p value | FDR | LOGFOLD > 2 | Enrichment% |
| hsa05200 | Pathways in cancer | 526 | 28.24 | 1.8059 | 0.00002066 | 0.001347 | 51 | 9.69581749 |
| hsa04151 | PI3K-Akt signaling pathway | 354 | 19.006 | 2.1572 | 2.0102E-06 | 0.00021844 | 41 | 11.5819209 |
| hsa05165 | Human papillomavirus infection | 339 | 18.2 | 1.978 | 0.000058394 | 0.0027195 | 36 | 10.61946903 |
| hsa04510 | Focal adhesion | 199 | 10.684 | 2.8079 | 2.13E-07 | 0.000069599 | 30 | 15.07537688 |
| hsa04010 | MAPK signaling pathway | 295 | 15.838 | 1.8942 | 0.00051407 | 0.016759 | 30 | 10.16949153 |
| hsa04514 | Cell adhesion molecules (CAMs) | 144 | 7.7312 | 2.5869 | 0.000078204 | 0.0031868 | 20 | 13.88888889 |
| hsa04512 | ECM-receptor interaction | 82 | 4.4025 | 3.8615 | 1.1754E-06 | 0.00019159 | 17 | 20.73170732 |
| hsa04933 | AGE-RAGE signaling pathway in diabetic complications | 99 | 5.3152 | 3.1984 | 0.000017218 | 0.001347 | 17 | 17.17171717 |
| hsa04668 | TNF signaling pathway | 110 | 5.9057 | 2.7092 | 0.00023505 | 0.0085141 | 16 | 14.54545455 |
| hsa05144 | Malaria | 49 | 2.6307 | 4.1813 | 0.000042617 | 0.0023155 | 11 | 22.44897959 |
| No enriched pathway | ||||||||
| No enriched pathway | ||||||||
| hsa04080 | Neuroactive ligand-receptor interaction | 277 | 10.866 | 2.1166 | 0.00051907 | 0.022 | 23 | 8.303249097 |
| hsa04020 | Calcium signaling pathway | 183 | 7.1789 | 2.9253 | 8.7111E-06 | 0.0014199 | 21 | 11.47540984 |
| hsa04723 | Retrograde endocannabinoid signaling | 148 | 5.8059 | 2.7558 | 0.00021105 | 0.01376 | 16 | 10.81081081 |
| hsa04713 | Circadian entrainment | 96 | 3.766 | 3.7175 | 0.000019704 | 0.0021412 | 14 | 14.58333333 |
| hsa05032 | Morphine addiction | 91 | 3.5698 | 3.6416 | 0.000048805 | 0.0039776 | 13 | 14.28571429 |
| hsa04724 | Glutamatergic synapse | 114 | 4.4721 | 2.9069 | 0.00049148 | 0.022 | 13 | 11.40350877 |
| hsa04727 | GABAergic synapse | 88 | 3.4521 | 3.1864 | 0.00060736 | 0.022 | 11 | 12.5 |
| hsa05033 | Nicotine addiction | 40 | 1.5692 | 6.3729 | 2.2051E-06 | 0.00071887 | 10 | 25 |
| hsa05133 | Pertussis | 76 | 2.9814 | 3.3541 | 0.00071129 | 0.023188 | 10 | 13.15789474 |
| hsa05144 | Malaria | 49 | 1.9222 | 4.1619 | 0.00056511 | 0.022 | 8 | 16.32653061 |
| No enriched pathway | ||||||||
| hsa04080 | Neuroactive ligand-receptor interaction | 277 | 20.88 | 1.8678 | 0.000094512 | 0.0034882 | 39 | 14.07942238 |
| hsa04514 | Cell adhesion molecules (CAMs) | 144 | 10.854 | 3.3166 | 6.42E-11 | 2.09E-08 | 36 | 25 |
| hsa04510 | Focal adhesion | 199 | 15 | 2.0666 | 0.000076063 | 0.0034882 | 31 | 15.57788945 |
| hsa04512 | ECM-receptor interaction | 82 | 6.181 | 3.3975 | 4.26E-07 | 0.000065465 | 21 | 25.6097561 |
| hsa05032 | Morphine addiction | 91 | 6.8594 | 3.0615 | 2.6922E-06 | 0.00021942 | 21 | 23.07692308 |
| hsa04012 | ErbB signaling pathway | 85 | 6.4071 | 2.9654 | 0.000013146 | 0.00085709 | 19 | 22.35294118 |
| hsa05133 | Pertussis | 76 | 5.7287 | 2.9675 | 0.000036973 | 0.0020089 | 17 | 22.36842105 |
| hsa05140 | Leishmaniasis | 74 | 5.578 | 2.8684 | 0.000096299 | 0.0034882 | 16 | 21.62162162 |
| hsa05033 | Nicotine addiction | 40 | 3.0151 | 4.6433 | 6.02E-07 | 0.000065465 | 14 | 35 |
| hsa05150 | Staphylococcus aureus infection | 56 | 4.2212 | 3.0797 | 0.00020337 | 0.0061666 | 13 | 23.21428571 |
| hsa04115 | p53 signaling pathway | 72 | 10,218 | 88,078 | 6.71E-03 | 0.00021880 | 9 | 12.5 |
| hsa05210 | Apoptosis | 136 | 19,301 | 41,448 | 0.00066233 | 0.042506 | 8 | 5.882352941 |
| hsa05222 | Colorectal cancer | 86 | 12,205 | 57,353 | 0.00020349 | 0.030728 | 7 | 8.139534884 |
| hsa04064 | Small cell lung cancer | 93 | 13,199 | 53,036 | 0.00033082 | 0.030728 | 7 | 7.52688172 |
| hsa04210 | NF-kappa B signaling pathway | 95 | 13,482 | 51,920 | 0.00037703 | 0.030728 | 7 | 7.368421053 |
| hsa03018 | TNF signaling pathway | 110 | 15,611 | 44,840 | 0.00091270 | 0.042506 | 7 | 6.363636364 |
| hsa04668 | RNA degradation | 79 | 11,212 | 53,516 | 0.00085155 | 0.042506 | 6 | 7.594936709 |
| hsa05033 | Neuroactive ligand-receptor interaction | 277 | 11,126 | 26,065 | 0.0000017977 | 0.000058604 | 29 | 10.46931408 |
| hsa04723 | Retrograde endocannabinoid signaling | 148 | 59,446 | 45,420 | 2.30E-07 | 3.75E-05 | 27 | 18.24324324 |
| hsa04724 | Glutamatergic synapse | 114 | 45,789 | 48,046 | 5.75E-06 | 6.24E-04 | 22 | 19.29824561 |
| hsa04727 | Dopaminergic synapse | 131 | 52,617 | 38,010 | 2.20E-03 | 0.000010239 | 20 | 15.26717557 |
| hsa05032 | Cell adhesion molecules (CAMs) | 144 | 57,839 | 34,579 | 0.0000010483 | 0.000042717 | 20 | 13.88888889 |
| hsa04713 | Nicotine addiction | 40 | 16,066 | 11,826 | 1.11E-12 | 3.62E-10 | 19 | 47.5 |
| hsa04728 | GABAergic synapse | 88 | 35,346 | 50,925 | 8.42E-05 | 6.86E-03 | 18 | 20.45454545 |
| hsa04514 | Morphine addiction | 91 | 36,551 | 49,246 | 1.47E-04 | 9.61E-03 | 18 | 19.78021978 |
| hsa05150 | Circadian entrainment | 96 | 38,559 | 46,681 | 3.56E-04 | 0.0000019319 | 18 | 18.75 |
| hsa04080 | Staphylococcus aureus infection | 56 | 22,493 | 5335 | 0.0000016155 | 0.000058518 | 12 | 21.42857143 |
| No enriched pathway | ||||||||
| hsa04151 | PI3K-Akt signaling pathway | 354 | 73,938 | 24,345 | 0.00039531 | 0.021461 | 18 | 5.084745763 |
| hsa04360 | Axon guidance | 175 | 36,551 | 49,246 | 1.93E-04 | 0.0000062837 | 18 | 10.28571429 |
| hsa04015 | Rap1 signaling pathway | 206 | 43,026 | 37,187 | 0.0000055099 | 0.00089812 | 16 | 7.766990291 |
| hsa04510 | Focal adhesion | 199 | 41,564 | 36,089 | 0.000015772 | 0.0017139 | 15 | 7.537688442 |
| hsa04020 | Calcium signaling pathway | 183 | 38,222 | 36,628 | 0.000025978 | 0.0021172 | 14 | 7.650273224 |
| hsa04540 | Gap junction | 88 | 18,380 | 48,966 | 0.000083868 | 0.0054682 | 9 | 10.22727273 |
| hsa05146 | Amoebiasis | 96 | 20,051 | 39,899 | 0.00084677 | 0.029830 | 8 | 8.333333333 |
| hsa04720 | Long-term potentiation | 67 | 13,994 | 50,022 | 0.00046081 | 0.021461 | 7 | 10.44776119 |
| hsa04971 | Gastric acid secretion | 75 | 15,665 | 44,686 | 0.00091504 | 0.029830 | 7 | 9.333333333 |
| hsa05143 | African trypanosomiasis | 35 | 0.73102 | 68,397 | 0.00072934 | 0.029720 | 5 | 14.28571429 |
| No enriched pathway | ||||||||
| hsa05330 | Allograft rejection | 38 | 0.020351 | 98,276 | 0.00015027 | 0.020956 | 2 | 5.263157895 |
| hsa05332 | Graft-versus-host disease | 41 | 0.021957 | 91,085 | 0.00017519 | 0.020956 | 2 | 4.87804878 |
| hsa04940 | Type I diabetes mellitus | 43 | 0.023029 | 86,849 | 0.00019285 | 0.020956 | 2 | 4.651162791 |
| hsa05320 | Autoimmune thyroid disease | 53 | 0.028384 | 70,462 | 0.00029377 | 0.023756 | 2 | 3.773584906 |
| hsa05416 | Viral myocarditis | 59 | 0.031597 | 63,297 | 0.00036436 | 0.023756 | 2 | 3.389830508 |
| hsa04612 | Antigen processing and presentation | 77 | 0.041237 | 48.5 | 0.00062109 | 0.033746 | 2 | 2.597402597 |
Fig. 3Enriched KEGG pathways in each collection of genes presenting altered expression in the indicated comparisons. All genes with q-values ≤ 0.0001 (adjusted p-value set to avoid identification of false positive enrichments) and log fold change > 2 or < −2 of each comparison were subjected to pathway analysis by KEGG. Comparisons that revealed enriched pathways are shown. A False Discovery Rate (FDR) ≤ 0.05 were used as threshold to select significant pathways. Graphs were plotted with GraphPad Prism 4.0 software.
| Subject | Cancer Research |
| Specific subject area | Transcriptomic changes in different grade astrocytoma cells, comparing gene expression changes that occur in tumor progression or under replicative stress induced by the topoisomerase I inhibitor, Camptothecin (CPT). |
| Type of data | Transcriptomic data |
| How data were acquired | RNA sequencing: Bioanalyzer Instrument (Agilent), lllumina Sequencers Genome Analyzer IIx and NextSeq 500 (Illumina Inc.). |
| Data format | Raw: sra file format (repository link below) |
| Parameters for data collection | Cells were grown under standard conditions or treated with CPT for 18 h, then RNA isolation and sequencing were performed. |
| Description of data collection | Total RNA was isolated using RNeasy mini kit (Qiagen), RNA quality was evaluated by Bioanalyzer (Agilent), rRNA was removed from samples and then samples were clustered and sequenced. |
| Data source location | Institution: Ribeirão Preto Blood Bank |
| Data accessibility | Raw data is available at NCBI BioProject repository under the identification number PRJNA631805. Direct URL to data: |