| Literature DB >> 27044256 |
Mathias Uhlén1, Björn M Hallström2, Cecilia Lindskog3, Adil Mardinoglu4, Fredrik Pontén3, Jens Nielsen5.
Abstract
Quantifying the differential expression of genes in various human organs, tissues, and cell types is vital to understand human physiology and disease. Recently, several large-scale transcriptomics studies have analyzed the expression of protein-coding genes across tissues. These datasets provide a framework for defining the molecular constituents of the human body as well as for generating comprehensive lists of proteins expressed across tissues or in a tissue-restricted manner. Here, we review publicly available human transcriptome resources and discuss body-wide data from independent genome-wide transcriptome analyses of different tissues. Gene expression measurements from these independent datasets, generated using samples from fresh frozen surgical specimens and postmortem tissues, are consistent. Overall, the different genome-wide analyses support a distribution in which many proteins are found in all tissues and relatively few in a tissue-restricted manner. Moreover, we discuss the applications of publicly available omics data for building genome-scale metabolic models, used for analyzing cell and tissue functions both in physiological and in disease contexts.Entities:
Keywords: genome‐scale metabolic models; proteomics; transcriptomics
Mesh:
Year: 2016 PMID: 27044256 PMCID: PMC4848759 DOI: 10.15252/msb.20155865
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 11.429
Figure 1Global transcriptomics analysis of human tissues and organs
Overview of the tissues and organs analyzed using RNA‐seq by the Human Protein Atlas consortium (HPA, green), tissues studied with cap analysis gene expression (CAGE) within the FANTOM consortium (purple), and tissues analyzed using RNA‐seq by the genome‐based tissue expression consortium (GTEx, orange). Altogether, 22 tissues and organs were studied with both the HPA and FANTOM datasets, while 21 tissues overlapped between the HPA and GTEx datasets.
Data resources for RNA expression data with relevance for human protein‐encoding genes
| Resource | Affiliation | Description | Link (URL) | References |
|---|---|---|---|---|
| Human Protein Atlas | Science for Life Lab (Sweden) | Tissue‐based RNA data based on surgically removed tissues (RNA‐Seq) |
| (Uhlen |
| GTEx | Broad Institute (USA) | Tissue‐based RNA data based on postmortem samples (RNA‐Seq) |
| (Keen & Moore, |
| FANTOM | Riken Institute (Japan) | Tissue‐based RNA data based on CAGE |
| (Yu |
| RNA‐Seq Atlas | J. Gutenberg University (Germany) | A reference database for gene expression profiling in normal tissue by next‐generation sequencing |
| (Krupp |
| Allen Brain Atlas | Allen Institute (USA) | An anatomically comprehensive atlas of the adult human brain transcriptome |
| (Hawrylycz |
| Evolution of gene expression | University of Lausanne (Switzerland) | The evolution of gene expression levels in mammalian organs |
| (Brawand |
| AltIso | MIT (USA) | Alternative isoform regulation in human tissue transcriptomes. |
| (Wang |
| Expression Atlas | EBI (UK) | Repository for RNA expression data (both microarray and RNA‐Seq) |
| (Petryszak |
| ArrayExpress | EBI (UK) | International functional genomics public data repositories |
| (Rustici |
| Illumina Body Map | Illumina (USA) | RNA‐Seq of 16 human individual tissues |
| (Rustici |
| Gene Expression Omnibus | NCBI (USA) | Repository for RNA expression data (both microarray and RNA‐Seq) |
| (Barrett |
Classification of all human protein‐coding genes based on transcript expression levels in tissues and organs. The columns HPA and GTEx indicate the number of genes identified in the different categories using the datasets (Keen & Moore, 2015; Uhlen et al, 2015) from these two consortia
| Category | Definition | HPA | GTEx |
|---|---|---|---|
| Tissue enriched | At least fivefold higher mRNA levels (FPKM) in a particular tissue as compared to all other tissues | 2,359 | 2,289 |
| Group enriched | At least fivefold higher mRNA levels in a group of tissues (2–7) | 1,208 | 1,307 |
| Enhanced | At least fivefold higher mRNA levels in a particular tissue as compared to the average levels in all tissues | 3,227 | 3,077 |
| Expressed in all | Detected in all tissues | 8,385 | 8,459 |
| Mixed | Detected in at least two tissues, but not in all, and not part of any of the categories above | 2,484 | 2,537 |
| Not detected | Not present in any of the analyzed tissues (under cutoff) | 1,021 | 1,015 |
| Total | Total number of genes analyzed | 18,684 | 18,684 |
| Total elevated | Total number of tissue‐enriched, group‐enriched, and tissue‐enhanced genes | 6,794 | 6,673 |
Figure 2Classification of all protein‐coding genes using transcriptomics data
(A) Pie chart showing the number of genes that fall into each expression specificity category, based on the classifications of HPA (32 tissues, 137 samples) (with a cutoff of 0.5 FPKM). (B) The number of protein‐coding genes classified as tissue enriched (dark blue), group enriched (medium blue), and tissue enhanced (light blue) based on the HPA dataset. (C) Pie chart showing the number of genes that fall into each expression specificity category, based on the classifications of GTEx (30 tissues, 2,510 samples) (Keen & Moore, 2015) (with a cutoff of 0.5 FPKM). (D) The number of protein‐coding genes classified as above based on GTEx dataset. (E). Barplot showing the overlap of tissue‐enriched genes between the two datasets. All genes that are tissue enriched in either dataset are depicted. Genes classified as tissue enriched/group enriched/tissue enhanced in the same tissue in both datasets are shown in blue; genes only enriched in one of the datasets are shown in yellow (only HPA) or orange (only GTEx).
Figure 3Protein classification and interindividual variations
(A) Venn diagrams showing the overlap between tissue‐elevated genes between the two datasets, HPA in light green and GTEx in light blue. (B) Venn diagram showing the overlap between genes classified as “expressed in all tissues” between the two datasets. The pie charts show the classification of the non‐overlapping genes in the dataset where the gene was not detected in all tissues. (C) Comparison of interindividual variation between genes that are annotated as “expressed in all tissues” and all other genes, in lung, brain, and skin (these tissues were selected because they have a large number of biological replicates). The plots illustrate the distribution of the coefficient of variation (CV) within the tissue for all genes in each of the two classes (red: expressed in all, black: other). The CV is shifted toward the lower side in the “expressed in all” category (P ≪ 0.001), suggesting that genes that are expressed in all tissues have lower variation between individuals.
Figure 4Genome‐scale metabolic models for human cells/tissues
(A) GEMs incorporate the known biochemical reactions and their catalyzing enzymes in a particular cell/tissue type. The information related to the reaction–gene association is used for the reconstruction of context‐specific GEMs. (B) The continuously increasing number of reactions, metabolites, and genes included in generic human GEMs and manually curated cell‐/tissue‐specific GEMs generated in the recent years is shown. (C) High‐throughput omics data including proteomics, transcriptomics, and metabolomics have been used for reconstructing cell‐/tissue‐specific GEMs based on generic human GEMs. (D) The metabolic tasks that are known to occur in a given human cell/tissue need to be defined to generate functional cell‐/tissue‐specific GEMs. The definition of the metabolic task related to bile acid synthesis in the liver is presented. Glucose, cysteine, phenylalanine, oxygen (O2), and phosphate (Pi) must be taken up, whereas urea, water (H2O), and carbon dioxide (CO2) must be secreted in order to successfully simulate bile acid synthesis in liver GEM.
List of generic and cell‐/tissue‐specific human GEMs
| Model name | Application | References |
|---|---|---|
| Generic human GEMs | ||
| Recon1 | Integration of genomic and bibliomic data | (Duarte |
| EHMN | Integration of genomic and bibliomic data | (Ma |
| HMR | Integration of previous generic human GEMs and publicly available databases | (Agren |
| Recon2 | Community‐based reconstruction of human metabolism | (Thiele |
| HMR2 | Incorporation of extensive lipid metabolism into the generic human GEM | (Mardinoglu |
| Cell‐/tissue‐specific GEMs | ||
| Red blood cell | Analysis of the metabolic loads in red blood cells | (Wiback & Palsson, |
| Mitochondria | Study of the human mitochondrial metabolism | (Vo |
| Fibroblasts | Metabolic alterations in Leigh syndrome | (Vo |
| HepatoNet1 | Investigation of hepatic enzyme deficiencies | (Gille |
| Computational liver model | Discovery of biomarkers of liver disorders including hyperammonemia and hyperglutaminemia | (Jerby |
| Kidney | Prediction of causal drug off‐targets that impact kidney function | (Chang |
| Brain (three neuron types and astrocytes) | Revealing the metabolic alterations in Alzheimer's disease | (Lewis |
| IAB‐AMQ‐1410 | Analysis of the host–pathogen interactions with | (Bordbar |
| Multitissue (hepatocytes, myocytes, and adipocytes) | Revealing the metabolic alterations in T2D | (Bordbar |
| Erythrocyte (iAB‐RBC‐283) | Revealing the complexity in the functional capabilities of human erythrocyte metabolism | (Bordbar |
| 69 cell‐specific GEMs | Studying the metabolic differences between healthy cells and cancers | (Agren |
| 126 tissue‐specific GEMs | Comparative analysis between healthy tissues and tumor | (Wang |
| CardioNet | The effect of oxygen and substrate supply on the efficiency of selected metabolic functions of cardiomyocytes | (Karlstaedt |
|
| Revealing the metabolic differences in obese subjects | (Mardinoglu |
| Tissue‐specific GEMs | Studying the metabolic differences between healthy tissues and cancers | (Nam |
| Liver GEM | Studying urea metabolism in liver tissue | (Vlassis |
| 83 cell‐specific GEMs | Defining the major metabolic functions in human cell types | (Agren |
|
| Revealing the metabolic alterations in response to NAFLD | (Mardinoglu |
|
| Revealing the metabolic alterations in response to T2D | (Varemo |
| 32 tissue‐specific GEMs | Global analysis of the metabolic functions in major human tissues | (Uhlen |