| Literature DB >> 35265836 |
Kim M Summers1, Stephen J Bush2, Chunlei Wu3, David A Hume1.
Abstract
The laboratory rat is an important model for biomedical research. To generate a comprehensive rat transcriptomic atlas, we curated and downloaded 7700 rat RNA-seq datasets from public repositories, downsampled them to a common depth and quantified expression. Data from 585 rat tissues and cells, averaged from each BioProject, can be visualized and queried at http://biogps.org/ratatlas. Gene co-expression network (GCN) analysis revealed clusters of transcripts that were tissue or cell type restricted and contained transcription factors implicated in lineage determination. Other clusters were enriched for transcripts associated with biological processes. Many of these clusters overlap with previous data from analysis of other species, while some (e.g. expressed specifically in immune cells, retina/pineal gland, pituitary and germ cells) are unique to these data. GCN analysis on large subsets of the data related specifically to liver, nervous system, kidney, musculoskeletal system and cardiovascular system enabled deconvolution of cell type-specific signatures. The approach is extensible and the dataset can be used as a point of reference from which to analyse the transcriptomes of cell types and tissues that have not yet been sampled. Sets of strictly co-expressed transcripts provide a resource for critical interpretation of single-cell RNA-seq data.Entities:
Year: 2022 PMID: 35265836 PMCID: PMC8900154 DOI: 10.1093/nargab/lqac017
Source DB: PubMed Journal: NAR Genom Bioinform ISSN: 2631-9268
Figure 1.Sample-to-sample network graph for samples averaged by BioProject, age and tissue type. The 585 averaged RNA-seq profiles generated as described in the ‘Materials and Methods’ section. A pairwise sample-to-sample Pearson correlation coefficient (r) was calculated. The resulting matrix is displayed as a network graph using BioLayout. The individual samples (nodes, coloured balls) are connected by edges (lines) that reflect the chosen r value threshold. For panels (A) and (B), a correlation coefficient threshold of r > 0.21 was used to include all samples. For panels (C) and (D), the threshold was increased to a more stringent r > 0.7, which removed 15 nodes that make no connection at this r threshold and increased the separation of the remaining nodes. In panels (A) and (C), the nodes are coloured by organ system: dark red, auditory system; light red, cardiovascular system; salmon, digestive system; orange, endocrine system; olive, liver; bright green, female reproductive system; teal, immune system; dark teal, integumentary system; dark green, male reproductive system; black, mixed tissues; light blue, nervous system; dark blue, primordia/early development; purple, renal system; pink, respiratory system; mauve, skeletomuscular system; and grey, whole body (embryo). In panels (B) and (D), nodes are coloured by BioProject, data being generated by different laboratories. Note that in panels (A) and (C), where nodes are coloured by organ system, nodes of the same colour cluster together, whereas there is no pattern of association when the same nodes are coloured based on BioProject in panels (B) and (D).
Gene expression clusters from rat tissues and cells
| Cluster number | Number of transcripts | Specificity | Index genes and TFs | Functional annotation |
|---|---|---|---|---|
| 1 and 70 | 1514 and 27 | Testis | Acr, Amhr2, Ccna1, Fshr, Meioc, Spata16, Tnp1/2, Rec8, Stag3, | Spermatogenesis, motility, meiosis |
| 2 | 1303 | CNS neurons | Amigo1, Camk2a, Cx3cl1, Gabbr1/2, Grik1–5, Nfasc, Snca, | Neurotransmission, neural development |
| 3 | 583 | Non-specific variable | Atm, Birc6, Ccnt1/2, Cdk12/13, Ddx5/6, Fancb, Herc1/2, Hipk1, | Misfolded protein/stress response, tumour suppressors |
| 5 | 342 | Liver | Afm, Alb, Apoc1–4, C3, Cfb, Cth, Cyp2a1, F2, Fetub, Gcgr, Ghr, Hpx, Igf1, Plg, Serpina1, | Hepatocyte secretory products, xenobiotic metabolism |
| 6 | 310 | Oocyte | Axin2, Bmp15, Bub1b, Ccnb3, Dlgap5, Esrp1, Eya1/3, Gdf9, Gpr1, Zp1–4, | Oocyte-specific transcription, zona pellucida structure, meiosis |
| 7 | 213 | Skeletal muscle | Acta1, Casq1, Ckm, Des, Mb, Myh2, Myl1, Pfkm, Ryr1, | Muscle contraction, calcium signalling |
| 8 | 211 | Kidney | Aco1, Adm2, Cyp4a2/a8, Klk1, Nox4, Pth1r, Slc5a2 | Tubule function, resorption, metabolism |
| 9 | 194 | Oocyte | Aurkc, Ccnb1, Magoh, Mnd1, Mos, Nanos2, Ooep, | Stem cell renewal, meiosis |
| 11 | 188 | Variable, not tissue specific | Bub1, Ccna2, Cdk1/2, Cenpk, Lig1, Mki67, Orc1, Pcna, Pola1, | Cell division cycle, DNA synthesis/repair, mitosis |
| 12 | 165 | ES cells | Dppa3/a4, Dusp10, Fgf17, Fzd6, Slc2a3, | Stem cell maintenance |
| 14 | 124 | Intestine | Ace2, Cdh17, Cldn7, Defa family, Dgat1, Heph, Il20ra, Krt20, Lgals4, Muc13, Vil1, | Intestinal barrier function |
| 15 | 111 | Stimulated T cells | Cd2, Cd3e, Cd69, Dock2, Il2rg, Ltb, Ptprc, Sla, Was, | T-cell function |
| 17 | 96 | Pineal gland/retina | Aanat, Arr3, Asmt, Gch1, Opn1sw, | Pineal function, melatonin synthesis |
| 18 | 95 | Retina/pineal gland | Cnga1, Gabbr1/2, Opn1mw, Pde6a/b/g/h, Rd3, Rdh8, Rp1, Rtbdn, | Retinal function |
| 19 | 94 | Thymus | Ccl25, Cd3d, Cd8a/b, Fas, Rag1, Tap2, Tbata, | Thymic differentiation, selection |
| 20 | 94 | Liver, kidney | Cyp2c23, Dcxr, Fbp1, G6pc, Gk, H6pd, Pck1, Slc22a1, Slc37a4, | Gluconeogenesis |
| 21 | 94 | Macrophage, microglia | C1qa/b/c, Csf1r, Ctss, Gpr84, Hexb, Mpeg1, P2ry12/13, Siglec5, Tgfbr1, Trem2, Tyrobp, | Innate immune function, microglial differentiation |
| 22 | 90 | Skin | Cdsn, Csta, Klk9/10/12, Krt4/13/23, Lce3d/e, Lipk, Ppl, Trex2, Vsig8, | Skin barrier function |
| 23 | 87 | T cells, NK cells | Ccl1, Ccr4/5/8, Cd40lg, Gpr183, Ifng, Il17a, Il2, Il2ra/b, Lta, Zap70, | Activation, cytokine secretion |
| 24 | 85 | Dorsal root ganglia | Acp3, Calca/b, Grik1, Htr1d, Nfeh/l/m, Nmb, Piezo2, Prokr1, Ret, | Ganglion cell differentiation |
| 27, 28 and 33 | 75, 74 and 65 | Skin | Adgrf4, Ces4a, Col17a1, keratins, Krtaps, Lce family, Lgals7, Lipm, Perp, | Skin barrier function |
| 29 | 69 | Testis | Aard, Clec12b, Gk5, Hormad1, Inca1, Shbg, Sycp1/2, | Sertoli cell differentiation, synaptonemal complex |
| 30 | 68 | B cell | Btla, Cd19, Cd79a/b, Cxcr5, Fcna, Gpr174, Ighm, Jchain, | B-cell differentiation, immunoglobulin production |
| 34 | 65 | Prostate | Andpro, Cyss, Dach2, Eaf2, Fut4, Lao1, Lyc2, Mc5r, Pbsn, Sbp, Semg1, | Prostate differentiation, secretion |
| 35 | 64 | Adrenal | Cbr1, Cyp11a1/b2/b3, Cyp1b1, Fdx1, Kcnk3/9, Mc2r, Pcsk5, Pnmt, Soat1, Star, | Steroid hormone production, adrenalin |
| 36 and 40 | 64 and 59 | Placenta | Ceacam3/9/11/12, Cts7/8, Faslg, Fcrla/b, Ifnk, Il17f, Il23a, Lcn9, Mmp1, Peg10, Prl family, Wnt8a, | Trophoblast differentiation, secretion |
| 38 | 60 | Brain | Crmp1, Ephb2, Gpc2, Gpr85, Marcksl1, Mdga1, Mex3b, | Neurogenic progenitor cell differentiation |
| 42 | 56 | Variable | Bub3, Ddx39a, Dkc1, Srsf2/3, Trip13, | Genotoxic damage response, tumour suppressors |
| 43 | 52 | Cochlea, middle ear | Cd164l2, Chrna9/10, Cldn9, Fbxo2, Grxcr1/2, Kncn, Loxhd1, Otoa/r/s | Hearing, cochlear function |
| 44 | 51 | Blood | Cxcr2, Gp9, Gypa, Kel, Pf4, S100a9, Tpt1, Tspo2 | Platelets, granulocytes |
| 46 | 49 | Lung | Ager, Aqp5, Clec14a, Cyp2a3, Dram1, Fmo2, Lamp3, Lyz2, Scgb1a1/3a1/3a2, Sftpa1/b/c/d, Wnt3a, | Alveolar type 1 and type II cell function and secretion |
| 47 and 83 | 48 and 24 | Heart | Actc1, Cav3, Fgf16, Myh7, Myl2, Palld, Ryr2, Tnnc1, | Cardiac-specific muscle contraction. |
| 48 | 48 | Monocyte, macrophage | C5ar1, Ccr1, Cd14, Csf2ra, Cyba, Fcgr1a, Itgam, Msr1, Ncf1/2/4, Nlrp3, Slc11a1 | Innate immune function, free radical production |
| 49 | 46 | Kidney | Acre2, Aqp2/3, Cldn8, Insrr, Kcne1, Oxgr1, | Distal tubule, collecting duct, water resorption |
| 51 | 45 | ES cells | Fgf4, Fgf19, Gdf3, | Regulation of pluripotency |
| 55 | 38 | Granulocytes | Camp, Ctsg, Elane, Fncb, Mpo, Prg2/3, S100a8 | Neutrophil granule proteins |
| 63 | 33 | Brain | Aqp4, Edil3, Gpr37/62 Mag, Mbp, Mobp, Opalin, Plp1, Sema4d, | Myelination, oligodendrocytes |
| 64 | 33 | Pancreas | Amy2a3, Cel, Cela1/2a/3b, Cpa1/2, Ctrc/l Pnlip, Pnliprp1/2 | Pancreatic enzymes, secretion |
| 66 | 29 | Stomach | Atp4a/b, Chia, Ctse, Cym, Ghrl, Gkn1/2, Pgc | Acidification, digestive enzymes |
| 68 | 27 | Brain, PC12 cells | P2rx2, Prph, Th, Vgf, | Sympathetic neurons? |
| 77 | 26 | Mast cell?, lymphatic | Adgrg5, Cma1, Cpa3, Lilrb3a, Lyve1, Selp, Sirpd, Slpi, Timd4, | |
| 82 | 24 | Adipose | Adipoq, Fabp4, Lep, Lipe, Lpl, Oxtr, Plin1, Pnpla2, Retn, Sucnr1, Tshr, | Fat storage, lipolysis, adipokines |
| 87 | 21 | Lens | Cryb family, Cryg family, Lim2, Opn4 | Lens structural proteins |
| 88 | 20 | Macrophage | Adam8, Cd68, Ctsb, Ctsd, Gpnmb, P2rx4 | Endosome/lysosome |
| 90 | 20 | Colon | Krt19, Lypd8, Phgr1, Pla2g10, Tspan1, | Colon epithelium differentiation, secretion |
| 92 | 19 | Cerebellum | Ca8, Cbln1/3, Chn2, Fat2, Gabra6, Grm4, | Purkinje cell differentiation, granule proteins |
| 95 | 19 | Variable in many tissues | Adgrl4, Cd93, Cdh5, Dll4, Egfl7, Kdr, Pcdh12, Pecam1, Tie1, | Endothelial cell differentiation |
| 97 | 19 | Cartilage growth plate | Acan, Clec11a, Col9a1/2/3, Loxl3, Rflna, | Cartilage structural proteins |
| 98 | 18 | Activated T cells, thymus | Ccr7, Cd7, Cd96, Heca, | Immune cell activation |
| 101 | 18 | Macrophage | Acod1, Cxcl10, Il1a/b, Nos2 | Response to LPS |
| 106 | 16 | Cartilage, tendon | Col2a1, Col10a1, Col11a1/2, Myh3, Ptx4, | Cartilage structural proteins |
Clusters were generated at r ≥ 0.75 and MCL inflation value 2.2. Selected transcripts encoding transcription factors are highlighted in bold. The full lists of transcripts in these clusters and the average expression profiles are provided in Supplementary Table S2. Index genes were chosen for illustrative purpose based upon known function in the indicated tissue confirmed by a PubMed search on gene name AND tissue. Where two cluster numbers are shown, the two clusters are in the same region of the network graph and show closely related expression profiles.
Figure 2.Gene expression profiles for genes that did not fall within a cluster. Y-axis shows the expression level in TPM. X-axis shows the organ system, coloured as in Supplementary Table S2. Reading from left to right: light red, nervous system; blue, auditory system; light green, respiratory system; yellow, cardiovascular system; pink, digestive system; turquoise, endocrine system; salmon, liver; grey, renal system; dark red, skeletomuscular system; dark blue, integumentary system; dark green, immune system; olive, male reproductive system; dark pink, female reproductive system; dark turquoise, primordia/early development; black, whole body (embryo); and red, mixed tissues.
Gene expression clusters from rat liver
| Liver cluster number | Number of nodes | Description and index genes |
|---|---|---|
| 1 | 6292 | Widely expressed, high in bile duct ligation model; growth, protein synthesis, inflammation, fibrosis, connective tissue |
| 2 | 752 | High in foetal liver; cell cycle, haematopoiesis, embryonic liver; |
| 3 | 414 | General expression, metabolic regulation; |
| 4 | 278 | General expression, control of lipid metabolism; |
| 5 | 206 | Isolated samples, gall bladder, neuroendocrine; |
| 6 | 166 | E14 liver, fibrosis model; |
| 7 | 148 | Foetal liver, fibrosis, Zucker rats: myeloid infiltration; |
| 10 | 98 | Variable expression: proteasome complex, proteolysis; |
| 11 | 76 | Variable, low in foetal liver, periportal hepatocytes, urea synthesis; |
| 13 | 67 | Variable, low in foetal liver, fibrosis model, fatty acid β-oxidation; |
| 16 and 70 | 105 and 10 | Variable, cholesterol and fatty acid syntheses; |
| 18 | 54 | Fibrosis; |
| 24 | 41 | Variable, mast cells; |
| 25 | 41 | Variable, interferon response; |
| 26 | 41 | Variable, mitochondrial; |
| 31 | 34 | One BioProject, pancreas contamination; |
| 33 | 32 | One BioProject, NK cells; |
| 34 | 31 | Highly variable, hepatic stellate cell activation? |
| 43 | 21 | Variable, interferon response; |
| 56 | 12 | Kupffer cell; |
| 63 | 10 | Endothelial cell; |
| 65 | 10 | Class II MHC; |
| 66 | 10 | Male-specific; |
| 69 | 10 | Xenobiotic-induced; |
| 84 | 9 | Female-specific; |
Clusters were generated at r ≥ 0.75 and MCL inflation value 1.7. The full gene lists for each of the clusters are provided in Supplementary Table S3A. Transcription factors are highlighted in bold. Index genes were chosen for illustrative purpose based upon known function.
Figure 3.Gene expression profiles for macrophage-related genes. Y-axis shows the expression level in TPM. X-axis shows the organ system, coloured as in Supplementary Table S2. Reading from left to right: light red, nervous system; blue, auditory system; light green, respiratory system; yellow, cardiovascular system; pink, digestive system; turquoise, endocrine system; salmon, liver; grey, renal system; dark red, skeletomuscular system; dark blue, integumentary system; dark green, immune system; olive, male reproductive system; dark pink, female reproductive system; dark turquoise, primordia/early development; black, whole body (embryo); and red, mixed tissues.