| Literature DB >> 35814037 |
Abstract
High RNA integrity is essential for good quality of transcriptomics profiling. Nevertheless, in some cases samples with low RNA integrity is the only available material to study. This work was set to investigate the impact of thermal-induced RNA degradation on the transcriptomic profiles of human leukemic cells. DNA microarray-based transcriptomics was conducted on two groups of samples; high RNA integrity samples (n = 4) and low RNA integrity samples (n = 5). RNA degradation caused limited but noticeable changes in the transcriptomes. Only 1945 (6.7%) of 29,230 genes showed altered quantitation (fold change ≥ two-fold, p value ≤ 0.03, corrected p value ≤ 0.05). RNA degradation had the most impact on short transcripts and those with short distance between their 5'end and the probe binding position. Overall, the present work identified the genes whose relative quantification is sensitive to RNA degradation. Therefore, altered expression of these genes should be interpreted with caution when studied in low integrity RNA samples. Supplementary Information: The online version contains supplementary material available at 10.1007/s13205-022-03223-1. © King Abdulaziz City for Science and Technology 2022.Entities:
Keywords: Leukemia; Misquantitation; RNA Degradation; Transcriptomics
Year: 2022 PMID: 35814037 PMCID: PMC9259771 DOI: 10.1007/s13205-022-03223-1
Source DB: PubMed Journal: 3 Biotech ISSN: 2190-5738 Impact factor: 2.893
Fig. 1Workflow of the major steps of the study’s methodology. Four leukemia cell lines were cultured. Next, RNA was extracted from the leukemia cell lines and 300 ng RNA from each leukemia cell line was transferred into one tube to make a pooled RNA sample (1200 ng). Then, 100 ng from the pooled RNA sample was transferred into nine tubes (100 ng each). Four tubes were kept in ice for 30 min and the other five tubes were incubated at 60 °C for 30 min. Post incubation, RIN was measured in the nine tubes. Next, transcriptomics analyses were conducted on the high RIN samples (n = 4) and the low RIN samples (n = 5). Finally, bioinformatics and data analysis were conducted on the transcriptomics profiles. RIN RNA integrity number
Fig. 2Heat-induced degradation of RNA. The RNA integrity number (RIN) was determined using Nano 6000 chips and the Agilent 2100 Bioanalyzer in two group of samples: RNA samples incubated in ice for 30 min (n = 4) and RNA samples incubated at 60 °C for 30 min (n = 5). Representative examples of electrophoretic trace of RNA samples are shown (A, B). Heat induced a significant decrease in the RIN values (C). The p values were calculated on this basis Student’s t-test using Excel software. Asterisk symbol: **** is p value ≤ 0.0001
Fig. 3Heat-induced degradation of RNA alters the transcriptome of human cancer cells. Principle component analysis (A) shows that low RIN samples (RIN ≤ 3.8) were grouped together and distinctly separated from the high RIN samples (RIN ≥ 7.9). Volcano plot (B) compares the gene expression in the two groups of samples. Cluster analysis with heatmap presentation (C) also grouped the low RIN samples together and the high RIN samples together; and identifies genes with altered quantification (FC ≥ two-fold, p value ≤ 0.03, corrected p value ≤ 0.05)
Fig. 4Comparison of the types of RNA in the three groups of genes. Ensemble database was interrogated to identify the type of RNA transcribed from the stable genes, the over-represented genes and the down-represented genes. lncRNA long noncoding RNA
Fig. 5Sequence characteristics of the three groups of genes. Sequence features were extracted from Ensemble database and NCBI reference RNA sequence database. 5' UTR length of transcripts (A); 3' UTR length of transcripts (B); transcript length (C); gene length (D); percentage of GC content in transcripts (E); distance between probe positions and the 5’ end of transcripts (F). UTR untranslated region. The p values were calculated on this basis Student’s t-test using Excel software. Asterisk symbols: * is p value ≤ 0.05; ** is p value ≤ 0.01; *** is p value ≤ 0.001; **** is p value ≤ 0.0001
Gene ontology enrichment analysis of the down-represented genes
| Term name | Corrected | Term size | Intersection size | |
|---|---|---|---|---|
| Molecular function | Protein binding | 2.66 × 10–8 | 14,752 | 749 |
| Binding | 0.0007 | 17,052 | 816 | |
| Catalytic activity | 0.0007 | 5879 | 334 | |
| Enzyme binding | 0.0007 | 1866 | 128 | |
| Biological process | Cellular metabolic process | 0.00007 | 10,855 | 574 |
| Metabolic process | 0.01 | 11,998 | 606 | |
| Response to nutrient | 0.01 | 183 | 23 | |
| Cellular component organization or biogenesis | 0.01 | 6767 | 367 | |
| Protein metabolic process | 0.01 | 5806 | 321 | |
| Phosphorus metabolic process | 0.01 | 3125 | 188 | |
| Tricarboxylic acid cycle | 0.01 | 36 | 9 | |
| Vacuole organization | 0.01 | 182 | 22 | |
| Protein modification process | 0.02 | 4027 | 232 | |
| Protein catabolic process | 0.02 | 983 | 72 | |
| Autophagy | 0.03 | 545 | 45 | |
| Macromolecule modification | 0.03 | 4245 | 240 | |
| Macroautophagy | 0.03 | 312 | 30 | |
| Organic substance catabolic process | 0.03 | 2271 | 140 | |
| Organic substance metabolic process | 0.04 | 11,403 | 571 | |
| Endoplasmic reticulum to Golgi vesicle-mediated transport | 0.04 | 207 | 22 | |
| Autophagosome organization | 0.04 | 102 | 14 | |
| Cell component | Cytoplasm | 2.69 × 10–19 | 11,888 | 659 |
| Membrane-bounded organelle | 1.47 × 10–18 | 12,836 | 693 | |
| Vesicle | 5.61 × 10–10 | 4071 | 265 | |
| Nuclear lumen | 6.05 × 10–8 | 4343 | 269 | |
| Endomembrane system | 1.76 × 10–7 | 4633 | 281 | |
| Endomembrane system | 1.76 × 10–7 | 4633 | 281 | |
| Nucleus | 0.0003 | 7565 | 399 | |
| Nucleolus | 0.0008 | 938 | 69 | |
| Golgi-associated vesicle | 0.001 | 178 | 21 | |
| Endoplasmic reticulum | 0.002 | 1975 | 123 |
FDR False discovery rate, term size the number of genes that were annotated to a term, intersection size the number of genes found in the present study that were annotated to a term
Gene ontology enrichment analysis of the over-represented genes
| Term name | Corrected | Term size | Intersection size | |
|---|---|---|---|---|
| Molecular Function | Protein binding | 4.92 × 10–5 | 14,752 | 330 |
| Histone binding | 0.03 | 241 | 15 | |
| Nucleic acid binding | 0.04 | 4353 | 115 | |
| DNA binding | 0.04 | 2519 | 74 | |
| Chromatin binding | 0.05 | 604 | 25 | |
| Biological Process | Regulation of RNA metabolic process | 3.97 × 10–5 | 3850 | 118 |
| Regulation of nucleobase-containing compound metabolic process | 3.97 × 10–5 | 4123 | 125 | |
| Regulation of gene expression | 3.97 × 10–5 | 5162 | 147 | |
| Regulation of macromolecule metabolic process | 6.34 × 10–5 | 6626 | 176 | |
| Nucleic acid metabolic process | 8.93 × 10–5 | 5247 | 146 | |
| Positive regulation of gene expression | 0.0002 | 2396 | 79 | |
| Positive regulation of nucleic acid-templated transcription | 0.0002 | 1679 | 61 | |
| Positive regulation of RNA biosynthetic process | 0.0002 | 1680 | 61 | |
| Nucleobase-containing compound metabolic process | 0.0002 | 5766 | 154 | |
| Positive regulation of transcription by RNA polymerase II | 0.0002 | 1276 | 50 | |
| mRNA transport | 0.0003 | 154 | 14 | |
| Positive regulation of transcription, DNA-templated | 0.0003 | 1592 | 58 | |
| RNA metabolic process | 0.0003 | 4732 | 131 | |
| Gene expression | 0.0006 | 6270 | 161 | |
| Nucleobase-containing compound biosynthetic process | 0.0007 | 4211 | 117 | |
| Cellular macromolecule metabolic process | 0.0007 | 8357 | 202 | |
| Regulation of transcription, DNA-templated | 0.0008 | 3494 | 101 | |
| Nitrogen compound metabolic process | 0.001 | 10,057 | 233 | |
| Nucleic acid-templated transcription | 0.001 | 3734 | 105 | |
| RNA biosynthetic process | 0.002 | 3749 | 105 | |
| Transcription by RNA polymerase II | 0.002 | 2735 | 82 | |
| Organic substance metabolic process | 0.007 | 11,403 | 253 | |
| Protein metabolic process | 0.02 | 5806 | 142 | |
| Cell cycle | 0.03 | 1914 | 56 | |
| Cell Component | Nucleoplasm | 5.21 × 10–6 | 3991 | 122 |
| Nuclear lumen | 4.1 × 10–6 | 4343 | 124 | |
| Nucleus | 0.001 | 7565 | 182 | |
| Plasma membrane region | 0.04 | 1255 | 38 |
FDR False discovery rate, term size the number of genes that were annotated to a term, intersection size the number of genes found in the present study that were annotated to a term