| Literature DB >> 34095379 |
George Krasnov1,2,3, Timur Shkrigunov1, Sergey Radko1, Konstantin Ptitsyn1, Valeriya Shapovalova4, Olga Timoshenko1, Svetlana Khmeleva1, Leonid Kurbatov1, Yana Kiseleva5, Ekaterina Ilgisonis1, Olga Kiseleva1, Igor Vakhrushev1, Anastasia Tsvetkova1, Ivan Buromski6, Sergey Markin1, Alexander Archakov1, Andrey Lisitsa1, Elena Ponomarenko1.
Abstract
The chromosome-centric dataset was created by applying several technologies of transcriptome profiling. The described dataset is available at NCBI repository (BioProject ID PRJNA635536). The dataset referred to the same type of tissue, cell lines, transcriptome sequencing technologies, and was accomplished in a period of 8 years (the first data were obtained in 2013 while the last ones - in 2020). The high-throughput sequencing technologies were employed along with the quantitative PCR (qPCR) approach, for data generation using the gene expression level assessment. qPCR was performed for a limited group of genes, encoded on human chromosome 18, for the Russian part of the Chromosome-Centric Human Proteome Project. The data of high-throughput sequencing are provided as Excel spreadsheets, where the data on FPKM and TMP values were evaluated for the whole transcriptome with both Illumina HiSeq and Oxford Nanopore Technologies MinION sequencing.Entities:
Keywords: HepG2; Human proteome project; Illumina HiSeq; Liver tissue; Oxford nanopore Technologies; Transcriptome
Year: 2021 PMID: 34095379 PMCID: PMC8166769 DOI: 10.1016/j.dib.2021.107130
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Dataset origin and structure. Quantitative data for samples of liver tissue and HepG2 cells from several datasets, obtained in 2013 and 2020 with SOLiD, Illumina GII/HiSeq, qPCR and Oxford Nanopore (MinION), and mapped to the Chr18-encoded proteins.
Fig. 2Relationships between clusters within dataset. Each leaf in the dendrogram corresponds to the certain source of the data about the expression level of Chromosome 18 genes, measured with different methods and processed with different bioinformatics pipelines (bowtie2-RSEM..RefSeq, Salmon, bowtie2-RSEM, STAR-RSEM, Salmon..Gencode, Tophat-Cufflinks, Salmon..Ensembl).
Properties of the dataset. Headers of rows and columns denote clusters formed as a result of the correlation analysis between Chr18 transcriptome profiles derived by various methods (qPCR or RNA-seq) for different types of specimen (human liver or cultured HepG2 cells).
| Main clusters | LIVER polyA amplification (n = 17) | HepG2 (n = 22) | LIVER polyA extraction (n = 16) | qPCR (n = 6) |
|---|---|---|---|---|
rp— Pearson correlation coefficient; rs— Spearman rank correlation coefficient
± — standard deviation
| Subject | Biological sciences — Biochemistry |
| Specific subject area | Chromosome-Centric Human Proteome Project |
| Type of data | Table |
| How data were acquired | Quantitative PCR (RT-PCR or ddPCR); high throughput RNA sequencing by Illumina HiSeq 2500 system and ONT MinION nanopore sequencer |
| Data format | Raw (FastQ) |
| Parameters for data collection | RNA preparations from three human hepatocellular carcinoma samples and one sample of cultured HepG2 cells were used to analyze: total RNA for profiling the Chr18 transcriptome by quantitative PCR; amplified polyA+ RNA for the whole transcriptome sequencing with Illumina HiSeq; extracted polyA+ RNA for the direct sequencing of whole transcriptome with ONT MinION. |
| Description of data collection | The samples collected were immediately placed in RNAlater RNA stabilization solution (Thermo Fisher Scientific) and stored at −20°C. Total RNA extraction was performed using RNeasy Mini Kit (Qiagen). The rRNA-depleted RNA was obtained with Ribo-Zero rRNA depletion kit (Illumina). The polyA+ RNA was isolated with Dynabeads mRNA Purification Kit (Thermo Fisher Scientific). The sequencing libraries construction and sequencing were carried out according to manufacturers’ protocols. The PCR analysis was conducted in duplicates, Illumina HiSeq sequencing – with 2 replicates, ONT MinION sequencing – with a single run. |
| Data source location | Institute of Biomedical Chemistry, Moscow, Russia |
| Data accessibility | |
| Related research article | K.A. Deinichenko, G.S. Krasnov, S.P. Radko, K.G. Ptitsyn, V. V Shapovalova, O.S. Timoshenko, S.A. Khmeleva, L.K. Kurbatov, Y.Y. Kiseleva, E. V Ilgisonis, M.A. Pyatnitskiy, E. V Poverennaya, O.I. Kiseleva, I. V Vakhrushev, A. V Tsvetkova, I. V Buromski, S.S. Markin, V.G. Zgoda, A.I. Archakov, A. V Lisitsa, E.A. Ponomarenko, Human Chr18: "Stakhanovite" Genes, Missing and uPE1 Proteins in Liver Tissue and HepG2 Cells, Biomedical Chemistry: Research and Methods (2021) 4, e00144. |