| Literature DB >> 35073880 |
Max Karlsson1, Evelina Sjöstedt2,3, Yonglun Luo4,5,6,7, Mathias Uhlen8,9, Per Oksvold1, Åsa Sivertsson1, Jinrong Huang4,5,6, María Bueno Álvez1, Muhammad Arif1, Xiangyu Li1, Lin Lin6,7, Jiaying Yu4,5, Tao Ma10, Fengping Xu4,5, Peng Han5, Hui Jiang10, Adil Mardinoglu1, Cheng Zhang1, Kalle von Feilitzen1, Xun Xu4, Jian Wang4, Huanming Yang4, Lars Bolund4,5,6, Wen Zhong1, Linn Fagerberg1, Cecilia Lindskog3, Fredrik Pontén3, Jan Mulder2.
Abstract
BACKGROUND: There is a need for functional genome-wide annotation of the protein-coding genes to get a deeper understanding of mammalian biology. Here, a new annotation strategy is introduced based on dimensionality reduction and density-based clustering of whole-body co-expression patterns. This strategy has been used to explore the gene expression landscape in pig, and we present a whole-body map of all protein-coding genes in all major pig tissues and organs.Entities:
Keywords: Annotation; Gene expression; Genome-wide; Protein-coding genes; Tissue expression profile; Transcriptome
Mesh:
Year: 2022 PMID: 35073880 PMCID: PMC8788080 DOI: 10.1186/s12915-022-01229-y
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Fig. 1Whole-body expression analysis of the pig. A Organ schematic drawing of the pig body, following the established color code. B The 98 tissue types analyzed from the Bama minipig are grouped into 44 grouped tissues, each belonging to one of 14 organ systems. C Circular dendrogram based on Ward’s criterion on pairwise Spearman correlation between tissue types. Branch lengths have been scaled to reduce visual complexity. A selection of branches is annotated based on common biological features
Fig. 2Principle component analysis (PCA) plot showing the relation and clustering of all tissue samples. Brain samples are shown in more detail in the zoom-in box (right)
Fig. 3Gene classification based on tissue expression. A Network plot indicating the number of genes with tissue or group enriched expression for combinations of tissue types (tissues: gray nodes; tissue enriched: red nodes; group enriched: orange nodes). Nodes were filtered based on rules listed in “Methods” section to reduce visual complexity. B Immunohistochemical (IHC) staining (left) and RNA expression (right) of tissue enriched genes: MOG (Brain), ASGR1 (liver), CRISP2 (testis), TNNT1 (skeletal muscle), and DSC1 (skin)
Fig. 4UMAP gene cluster annotation and visualization based on gene expression clustering. UMAP plot showing clustering of 22,342 genes based on their expression in 350 pig tissue samples. The resulting 84 gene clusters are outlined and are color coded by mixing the colors associated with each organ system in proportion to the mean squared fraction of total expression among tissues for genes in the cluster. Top: Color legend and cluster map showing cluster ID numbers. Bottom: Annotated cluster names. See Additional file 1: Fig. S4, for basic gene UMAP visualizations
Fig. 5Comparison between UMAP clusters and tissue specificity classification. Bubble heatmap showing the –log10(FDR) of the hypergeometric test comparing the overlap of cluster genes with genes classified as elevated in different tissues. FDR values are capped at 10−100 to allow for higher contrast in the figure. Only statistically significant overlaps (FDR < 0.001) are shown
Fig. 6Comparison of gene expression between human and pig. A UMAP of human and pig tissues based on expression normalized for ubiquitous expression level differences between species. Lines connect the same tissue between the two species. B Alluvial diagram showing the overlap between human and pig, in terms of tissue specificity categories, based on 32 tissue types available in human and pig datasets. C Statistical assessment of overlap using a hypergeometric test. The heatmap shows the adjusted p-values for statistically significant overlap (FDR < 0.001) between genes classified as tissue elevated in either pig or human tissues. FDR values are capped at 10−100 to increase the contrast in the figure. D IHC (left) and RNA expression (right) examples: PLN (overlapping expression) and CYP19A1 (not overlapping). The scale bar represents 50 μm