| Literature DB >> 26594381 |
Jocelyn Y H Choy1, Priscilla L S Boon1, Nicolas Bertin1, Melissa J Fullwood2.
Abstract
Gene expression is the most fundamental level at which the genotype leads to the phenotype of the organism. Enabled by ultra-high-throughput next-generation DNA sequencing, RNA-Seq involves shotgun sequencing of fragmented RNA transcripts by next-generation sequencing followed by in silico assembly, and is rapidly becoming the most popular method for gene expression analysis. Poly[A]+ RNA-Seq analyses of normal human adult tissue samples such as Illumina's Human BodyMap 2.0 Project and the RNA-Seq atlas have provided a useful global resource and framework for comparisons with diseased tissues such as cancer. However, these analyses have failed to provide information on poly[A]-RNA, which is abundant in our cells. The most recent advances in RNA-Seq analyses use ribosomal RNA-depletion to provide information on both poly[A]+ and poly[A]-RNA. In this paper, we describe the use of Illumina's HiSeq 2000 to generate high quality rRNA-depleted RNA-Seq datasets from human fetal and adult tissues. The datasets reported here will be useful in understanding the different expression profiles in different tissues.Entities:
Keywords: Development; RNA sequencing; Transcriptomics
Mesh:
Substances:
Year: 2015 PMID: 26594381 PMCID: PMC4640133 DOI: 10.1038/sdata.2015.63
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Flow Chart of the RNA-seq experiment and data analysis.
Details of total RNA used for RNA-seq library construction, corresponding names used in the manuscript and key QC metrics
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
| Total number of reads, percentage of reads mapped and percentage of reads mapping uniquely were obtained from the output of the alignment using STAR. The Picard MarkDuplicate tool was used to calculate the percentage of duplicated reads for each library. The percentages of mapped reads that map to genes and rRNA was obtained using RseQC. The percentage of mapped reads that map to mitochondrial chromosome was obtained from the output of featureCounts. | |||||||||||
| Agilent_Fetal_Colon | Agilent Technologies | F | Single donor: 20 weeks gestation (Lot no.: 0006071667) | Agilent Fetal Colon | 174262897 | 69.49 | 65.28 | 57.5 | 23.5 | 28.7 | 16.2 |
| Agilent_Fetal_Stomach | Agilent Technologies | M | Pool of 10: 18 (2 donors), 19 (3 donors), 20 (4 donors), 21 weeks gestation (Lot no.: 0006049059) | Agilent Fetal Stomach | 174661455 | 55.49 | 49.89 | 93.1 | 16.0 | 40.0 | 9.5 |
| Biochain_Fetal_Colon | Biochain | F | Pool of 1 donor, 37 weeks old (Lot no.: B207218) | Biochain Fetal Stomach | 78419198 | 95.35 | 82.83 | 27.8 | 68.4 | 1.2 | 1.1 |
| Biochain_Fetal_Stomach | Biochain | M | Pool of 1 donor, 24 weeks old (Lot no.: B303126) | Biochain Fetal Stomach | 81875607 | 96.28 | 88.62 | 16.0 | 48.9 | 1.0 | 0.7 |
| Agilent_Adult_Colon | Agilent Technologies | M | Single donor; 82 years old (Lot no.: 0006055759) | Agilent Adult Colon | 71105369 | 68.03 | 63.37 | 61.4 | 31.9 | 25.7 | 27.3 |
| Agilent_Adult_Heart | Agilent Technologies | F | Single donor: 73 years old (Lot no.: 0006097996) | Agilent Adult Heart | 67643565 | 65.39 | 60.58 | 78.5 | 31.5 | 32.4 | 34.3 |
| Agilent_Adult_Kidney | Agilent Technologies | F | Single donor: 76 years old (Lot no.: 0006068269) | Agilent Adult Kidney | 92807528 | 63.79 | 58.67 | 79.4 | 30.0 | 30.9 | 33.8 |
| Agilent_Adult_Liver | Agilent Technologies | M+F | Pool of 3: 30, 44 and 55 years (Lot no.: 0006063161) | Agilent Adult Liver | 78137222 | 68.63 | 62.94 | 62.6 | 36.9 | 23.7 | 20.6 |
| Agilent_Adult_Lung | Agilent Technologies | F | Single donor: 40 years old (Lot no.: 0006106003) | Agilent Adult Lung | 77032217 | 58.3 | 52.88 | 83.0 | 17.7 | 41.8 | 11.7 |
| Agilent_Adult_Stomach | Agilent Technologies | F | Pool of 3 Donors, 39, 70, & 52 years old (Lot no.: 0006056559) | Agilent Adult Stomach | 149287775 | 57.84 | 49.92 | 87.4 | 23.9 | 35.6 | 19.3 |
| Biochain_Adult_Colon | Biochain | M | Single donor; 29 years old (Lot no.: 302060) | Biochain Adult Colon | 68628998 | 65.95 | 62.3 | 69.7 | 24.7 | 32.4 | 32.3 |
| Biochain_Adult_Heart | Biochain | M | Single donor; 24 years old (Lot no.: B604038) | Biochain Adult Heart | 75787942 | 73.27 | 69.42 | 66.0 | 41.0 | 21.4 | 37.1 |
| Biochain_Adult_Kidney | Biochain | M | Single donor; 26 years old (Lot no.: B106007) | Biochain Adult Kidney | 80053022 | 69.81 | 65.83 | 68.5 | 30.2 | 27.4 | 36.3 |
| Biochain_Adult_Liver | Biochain | M | Single donor; 64 years old (Lot no.: B510092) | Biochain Adult Liver | 85220810 | 61.91 | 56.68 | 78.0 | 28.8 | 30.6 | 19.8 |
| Biochain_Adult_Lung | Biochain | M | Single donor; 20 years old (Lot no.:B307203) | Biochain Adult Lung | 82080766 | 62.28 | 57.92 | 56.4 | 21.0 | 25.7 | 17.3 |
| Biochain_Adult_Stomach | Biochain | M | Single donor; 24 years old (Lot no.: A506301) | Biochain Adult Stomach | 83186579 | 57.41 | 51.05 | 90.2 | 17.7 | 39.7 | 22 |
| Origene_Adult_Stomach_0288 | Origene | F | Single donor; 44 years old (Cat no.: CR560288) | Origene Adult Stomach 1 | 88595062 | 58.29 | 51.56 | 74.8 | 21.9 | 31.3 | 18.4 |
| Origene_Adult_Stomach_0393 | Origene | F | Single donor; 32 years old (Cat. no.: CR560393) | Origene Adult Stomach 2 | 89447266 | 61.15 | 54.74 | 66.0 | 24.7 | 27.8 | 18.4 |
| Origene_Adult_Stomach_1840 | Origene | M | Single donor; 59 years old (Cat. no.: CR561840) | Origene Adult Stomach 3 | 81103684 | 56.28 | 51.5 | 83.6 | 14.4 | 38.5 | 27.3 |
Figure 2Summary of key quality control metrics.
(a) Boxplot of average sequence quality per base per sample. (b) The distribution of duplicated reads relative to the total number of sequences for all libraries. (c) The distribution of unique reads for all libraries. (d) XIST versus ChrY: normalized expression of female specific transcript XIST (x-axis) versus sum of normalized expression of all Y chromosome transcripts excluding those in the pseudo-autosomal regions (y-axis).
Figure 3A representation of tissue specific genes with transcripts from corresponding RNA-seq libraries contrasted with transcripts from other tissue samples.
(a) PHGR1, (b) TNNI3, (c) UMOD, (d) APOC1, (e) SFTPA1 and (f) PGA(3–5) are highly expressed in the colon, heart, kidney, liver, lung and stomach respectively. RNA-seq libraries from the same tissue type show higher expression of transcripts when compared to RNA-seq libraries of different tissue types.
Figure 4A multidimensional scaling plot for all libraries.
The different tissue types are colored differently. Similar tissue types cluster together.