| Literature DB >> 32117413 |
Kim M Summers1, Stephen J Bush2, Chunlei Wu3, Andrew I Su3, Charity Muriuki4, Emily L Clark4, Heather A Finlayson4, Lel Eory4, Lindsey A Waddell4, Richard Talbot4, Alan L Archibald4, David A Hume1.
Abstract
The domestic pig (Sus scrofa) is both an economically important livestock species and a model for biomedical research. Two highly contiguous pig reference genomes have recently been released. To support functional annotation of the pig genomes and comparative analysis with large human transcriptomic data sets, we aimed to create a pig gene expression atlas. To achieve this objective, we extended a previous approach developed for the chicken. We downloaded RNAseq data sets from public repositories, down-sampled to a common depth, and quantified expression against a reference transcriptome using the mRNA quantitation tool, Kallisto. We then used the network analysis tool Graphia to identify clusters of transcripts that were coexpressed across the merged data set. Consistent with the principle of guilt-by-association, we identified coexpression clusters that were highly tissue or cell-type restricted and contained transcription factors that have previously been implicated in lineage determination. Other clusters were enriched for transcripts associated with biological processes, such as the cell cycle and oxidative phosphorylation. The same approach was used to identify coexpression clusters within RNAseq data from multiple individual liver and brain samples, highlighting cell type, process, and region-specific gene expression. Evidence of conserved expression can add confidence to assignment of orthology between pig and human genes. Many transcripts currently identified as novel genes with ENSSSCG or LOC IDs were found to be coexpressed with annotated neighbouring transcripts in the same orientation, indicating they may be products of the same transcriptional unit. The meta-analytic approach to utilising public RNAseq data is extendable to include new data sets and new species and provides a framework to support the Functional Annotation of Animals Genomes (FAANG) initiative.Entities:
Keywords: Functional Annotation of Animals Genomes; expression atlas; network analysis; pig; transcriptomics
Year: 2020 PMID: 32117413 PMCID: PMC7034361 DOI: 10.3389/fgene.2019.01355
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Gene expression clusters from pig tissues and cells. Clusters were generated at r ≥ 0.7 and MCL inflation value 2.2. Clusters of ≥40 nodes are shown.
| Cluster number | Number of transcripts | Expression pattern | Class | Functional annotation |
|---|---|---|---|---|
|
| 1,902 | Digestive tract, CNS > immune > others | Housekeeping | Regulation of transcription, cell cycle |
|
| 1,555 | Ovary (some) > immune, kidney, adipose > liver, muscle, adipose | Protein synthesis | Protein biosynthesis, primary metabolism, ribosome, retrograde transport, endosome to Golgi |
|
| 1,505 | Testis only; many unannotated genes | Male reproductive | Sexual reproduction, male gamete generation, spermatogenesis, spermatid development, sperm motility |
|
| 1,504 | CNS | Central nervous system | Transmission of nerve impulse, synaptic transmission, cell communication, axon guidance, synapse organisation |
|
| 787 | Ovary (some) only; many unannotated genes | Female reproductive | Transcription, defence response |
|
| 651 | Spleen > lung, peripheral blood > lymph nodes, thymus, tonsil, macrophages | Immune system | Immune response, defence response, response to pest, pathogen or parasite, inflammatory response, innate immune response |
|
| 560 | Oocyte only | Female reproductive | Ubiquitin cycle, female gamete generation, nucleoplasm |
|
| 502 | Kidney >> liver, ovary (two), mixture of tissues | Kidney | Excretion, organic acid metabolism, sodium ion transport, cell communication |
|
| 445 | Liver > mixture of tissues | Liver | Organic acid metabolism |
|
| 321 | Lung >>> adipose, mixture of tissues | Respiratory system | Morphogenesis, organ development, regulation of liquid surface tension, angiogenesis, patterning of blood vessels, heart development |
|
| 319 | Muscle >> heart, mixture of tissues | Muscular system | Muscle contraction, muscle development, myofibril assembly, glucose/hexose/carbohydrate metabolism, glycolysis, gluconeogenesis |
|
| 279 | Fetal thymus, cell lines > immune, ovary > digestive tract, adipose, kidney | Cell division | Cell cycle, cell proliferation, mitosis, DNA replication, microtubule based movement, G1/S transition of mitotic cell cycle |
|
| 264 | Mesenchymal cells > adipose, muscle (some), ovary, lung | Connective tissue | Development, phosphate transport, cell communication, skeletal development, extracellular matrix, collagen triple helix repeat |
|
| 262 | Stem cells > brain > immune, ovary, adipose, digestive tract | Development | RNA processing, organelle organisation and biogenesis, mRNA splicing |
|
| 274 | Ear >>> adipose (some) | Integumentary system | Epidermis development, ectoderm development, catabolism, keratinocyte differentiation |
|
| 187 | BMDM >> alveolar macrophages, immune | Immune system | Immune response, defence response, response to pest, pathogen or parasite, response to wounding, inflammatory response, lysosome, monocyte chemotaxis |
|
| 181 | Fetal thymus >>> immune system, blood | Immune system | Defence response, immune response, T-cell activation, lymphocyte differentiation, hemopoiesis, |
|
| 170 | Adipose only | Integumentary system | Lipid metabolism, energy derivation by oxidation of organic compounds, cell-matrix adhesion, regulation of lipolysis in adipocytes |
|
| 169 | Large intestine > small intestine | Digestive system | Mineral absorption, ion transport |
|
| 156 | Heart > muscle > kidney, mixture of tissues > digestive tract > liver, lung, immune | Mitochondria | Oxidative phosphorylation, electron transport, ATP synthesis coupled electron transport, tricarboxylic acid cycle, respiratory chain, aerobic respiration |
|
| 154 | Ovary (three only); many unannotated genes | Female reproductive | Steroid metabolism, lipid metabolism, cholesterol metabolic process, ovarian steroidogenesis, reproductive physiological process |
|
| 147 | Inner cell mass from day 7-8 embryos only | Development | Cell communication, membrane lipid catabolism, |
|
| 151 | Inner ear stria vascularis only | Integumentary system | Sensory perception of mechanical stimulus, perception of sound, ion transport, morphogenesis, skeletal development, signal, collagen triple helix repeat |
|
| 131 | Epididymis | Male reproductive | Defensin, defence response to bacterium, innate immune response, |
|
| 127 | Uterine endometrium > chorion | Female reproductive | Protein catabolic process, glycosphingolipid metabolism |
|
| 125 | Peripheral blood only | Immune system | Response to biotic stimulus, defence response, response to pest, pathogen or parasite, immune response, response to wounding, osteoclast differentiation |
|
| 116 | Ileum >>> other digestive tract | Digestive tract | Nucleotide transport, digestion, bile secretion |
|
| 157 | Stem cells >> iPSC; mainly unannotated genes | Development | Pattern specification, axis specification |
|
| 112 | Hippocampus (fetal) > oocyte > random samples; mainly unannotated genes | Olfactory receptors | Perception of smell, sensory perception, neurophysiological process |
|
| 107 | Fibroblasts > somatic cells | Integumentary system | Neuromuscular junction, heparin binding, extracellular exosome, morphogenesis, development, growth |
|
| 104 | Heart >> muscle > mixture of tissues | Cardiovascular system | Cardiac muscle contraction, ventricular cardiac muscle tissue morphogenesis, circulation, muscle development, regulation of heart contraction rate |
|
| 85 | Duodenum >> ileum, caecum, colon | Digestive tract | Digestion, cobalt/metal ion transport, extracellular region |
|
| 86 | Longissimus dorsi muscle (two only) >> other muscle, heart, chorion | Muscular system | Muscle development, muscle contraction, myogenesis |
|
| 74 | Pituitary gland only; many unannotated genes | Endocrine system | Sex differentiation, pregnancy, neuroactive ligand-receptor interaction, hormone, extracellular region, hormone mediated signalling pathway |
|
| 73 | Embryo, inner cell mass > ovary (most), fetal thymus, random others | Development | RNA metabolism, RNA processing, protein folding, chaperonin-containing T complex, positive regulation of protein localisation of Cajal body, binding of sperm to zona pellucida, metabolism, toxin transport |
|
| 72 | Hippocampal formation, amygdala (one) >> other CNS | Nervous system | Locomotory behaviour, neuropeptide signalling pathway, adenylate cyclase-activating dopamine receptor signalling pathway, signalling pathway, cell communication, synaptic transmission |
|
| 72 | Digestive tract, spleen, ovary > adipose, lung, blood > muscle, heart, liver, kidney | Pathway | Protein biosynthesis, translation, ribosomal assembly, metabolism |
|
| 70 | Stomach >>> fibroblasts, somatic cells | Digestive tract | Smooth muscle contraction, muscle development, cGMP-PKG signalling pathway |
|
| 69 | Alveolar macrophages > BMDMs > lung | Immune system | Defence response, immune response, response to (external) biotic stimulus, phagosome |
|
| 66 | Penis >> tonsil >> alveolar macrophages | Immune system | Inflammatory response, response to wounding, immune response, calcium ion binding |
|
| 64 | Primordia of developing teeth only | Development | Neurogenesis, organogenesis, morphogenesis, homeodomain |
|
| 64 | Kidney cell line | Kidney | Integrin mediated signalling pathway, cell-matrix adhesion |
|
| 58 | iPSC (one) > other stem cells | Development | Regulation of metabolism, regulation of transcription, development, signalling pathways regulating pluripotency of stem cells, neurogenesis |
|
| 55 | Immune, digestive tract | Immune system | Defence response, response to biotic stimulus, immune response, external side of plasma membrane |
|
| 53 | Mixture of multiple tissues (one) >> random other samples; many unannotated genes | Mixture | Cortical cytoskeleton |
|
| 53 | iPSC >> CNS, digestive tract | Development | Cellular metabolism |
|
| 51 | BMDM after LPS, some spleen, some lung > blood, some adipose, some ovary > digestive tract | Immune system | Response to biotic stimulus, immune response, defence response, defence response to virus, negative regulation of viral genome replication, ubiquitin cycle, ISG15 protein conjugation |
|
| 50 | Oocyte > digestive tract, CNS, kidney, lung, spleen, immune, ovary > liver, heart, muscle, adipose | Development | Ubiquitin cycle |
|
| 50 | Digestive tract, forebrain, tonsil, lymph nodes, spleen (one) > ovary (two), male reproductive | Digestive tract | Nucleosome assembly, chromatin assembly or disassembly, DNA packaging |
|
| 49 | Duodenum > other intestine, spleen, alveolar macrophages | Digestive tract | Humoral immune response, response to pest, pathogen or parasite, antimicrobial humoral response, inerleukin-10 biosynthesis, interleukin-4 biosynthesis, interleukin-13 biosynthesis |
|
| 48 | Ovary (most), CNS | Female reproductive | Transport, localisation |
|
| 47 | Kidney only, one sample very high | Kidney | Perception of small, sensory perception |
|
| 47 | Endometrium of pregnancy >> uterine endometrium | Female reproductive | Di-, tri-valent inorganic cation transporter |
|
| 44 | Adipose, lung, ovary, spleen > digestive tract, macrophages > heart | Connective tissue |
|
|
| 43 | Ovary (one) >> ovary (two); many unannotated genes | Female reproductive | Hormone biosynthesis |
|
| 42 | Testis > CNS, digestive tract > other male reproductive | Male reproductive |
|
|
| 41 | Pituitary gland only; many unannotated genes | Endocrine system | Secretory granule |
|
| 41 | Inner cell mass >> adipose, kidney, lung, spleen, BMDM, ovary | Development |
|
|
| 41 | Liver (one); mainly unannotated genes | Liver |
|
Figure 1Expression of Y chromosome genes across tissues in the atlas. Top panel shows average expression of cluster 143, which contains genes mapped to the Y chromosome. Remaining panels show expression of Y chromosome genes from cluster 143. Y axis shows normalized expression levels (TPM) derived from Kallisto (see Methods). Each bar on the X axis represents a sample in the analysis, averaged for breed, sex and BioProject. The order of samples is as for . Red indicates sex unknown, green indicates female, blue indicates male and yellow indicates pooled samples of mixed/unknown sex.
Gene expression clusters from pig liver. Clusters were generated at r ≥ 0.75 and Markov clustering algorithm (MCL) inflation value 1.7. Clusters of ≥40 nodes are shown.
| Cluster number | Number of nodes | GO term enrichment |
|---|---|---|
|
| 3648 | Protein modification, primary metabolism, protein transport, protein localization, G-protein coupled receptor protein signalling |
|
| 1951 | Biosynthesis, macromolecule metabolism, neurophysiological process, response to external stimulus, extracellular exosome, translation, mitochondrion |
|
| 753 | Cytoplasm organization and biogenesis, ribosome biogenesis and assembly, RNA processing and metabolism, RNA splicing, cellular physiological process, poly(A) RNA binding, nucleolus |
|
| 599 |
|
|
| 325 | Phosphate transport, development, regulation of cellular process, anion transport, morphogenesis, cell adhesion, extracellular exosome, extracellular matrix, angiogenesis, |
|
| 248 | Organic acid metabolism, lipid metabolism, amine metabolism, steroid metabolism, mitochondrion, fatty acid beta oxidation using acyl-CoA dehydrogenase, lipid homeostasis |
|
| 169 | Immune response, defence response, response to biotic stimulus, response to wounding, inflammatory response, positive regulation of T cell proliferation |
|
| 159 | Signal transduction, cell communication, immune cell migration, integrin mediated signalling pathway, regulation of cell shape, leukocyte cell-cell adhesion |
|
| 157 | Protein transport, establishment of protein localisation, secretory pathway, Golgi vesicle transport, ER-associated protein catabolism, protein folding, endoplasmic reticulum chaperone complex, response to endoplasmic reticulum stress |
|
| 120 |
|
|
| 105 | Electron transport, generation of precursor metabolites and energy, amine metabolism, regulation of blood coagulation, extracellular exosome |
|
| 100 | Response to unfolded protein, protein folding, response to stress |
|
| 93 |
|
|
| 89 | Response to biotic stimulus, immune response, defence response, ubiquitin cycle, defence response to virus, negative regulation of viral genome replication, ISG15-protein conjugation |
|
| 89 |
|
|
| 65 | Generation of precursor metabolites and energy, electron transport, oxidative phosphorylation, ATP coupled electron transport, metabolism, cytochrome C oxidase activity, NADH dehydrogenase (ubiquinone) activity |
|
| 60 | Carboxylic acid metabolism, pigment metabolism, anti-inflammatory response, extracellular exosome, blood microparticle, extracellular matrix |
|
| 60 | Transition metal ion transport, oxidation-reduction process |
|
| 47 |
|
|
| 47 | Fatty acid metabolism pathway, mitochondrial inner membrane |
|
| 43 | Structural constituent of ribosome, translation, cytosolic large/small ribosomal subunit, focal adhesion, nucleolus |
|
| 40 |
|
Gene expression clusters from pig central nervous system. Clusters were generated at r ≥ 0.85 and Markov clustering algorithm (MCL) inflation value 1.7. Clusters of ≥40 nodes are shown.
| Cluster number | Number of nodes | Expression pattern | GO term enrichment |
|---|---|---|---|
|
| 2,175 | PRNJA40675 > developing cortex (decreases with age) | Nucleobase, nucleoside, nucleotide and nucleic acid metabolism, organismal physiological process, transcription, primary metabolism, response to external stimulus, defence response, immune response, perception of smell, biopolymer metabolism, poly(A) RNA binding, metal ion binding |
|
| 1,118 | Female pituitary > male pituitary > all others | Extracellular exosome, oligosaccharyl transferase complex |
|
| 1,113 | Pituitary > brain regions > developing cortex, hypothalamus | Immune response, defence response, response to biotic stimulus, T-cell activation, organismal physiological process, lymphocyte activation, response to wounding |
|
| 1,054 | Forebrain > cortex (decreasing with age) > others | Cell cycle, cell proliferation, DNA metabolism, M phase of mitotic cell cycle, nuclear division, mitosis, signal transduction, primary metabolism, kinetochore, nuclear chromatin, translation, DNA replication-dependent nucleosome assembly |
|
| 657 | Hypothalamus > brain regions > frontal and occipital cortex > developing cortex (decreases with age) | Nucleobase, nucleoside, nucleotide and nucleic acid metabolism, glycolysis, intracellular signalling cascade, catabolism, |
|
| 504 | Female pituitary, hypothalamus > male pituitary, brain regions | G-protein coupled receptor protein signalling metabolism, cell surface receptor linked signal transduction |
|
| 483 | Brain regions including hypothalamus > hippocampal formation, cortex > pituitary, “brain”, developing cortex, hippocampus | Cell communication, nucleobase, nucleoside, nucleotide and nucleic acid metabolism, signal transduction, response to wounding, myelin sheath |
|
| 345 | Hypothalamus | Regulation of cellular process, immunity |
|
| 298 | Brain regions > pituitary, “brain”, hippocampus | Transport, localisation, monovalent cation transport, synaptic transmission, transmission of nerve impulse, cell-cell signalling, vesicle mediated transport, myelin sheath, synapse, SNARE complex |
|
| 231 | Hippocampus, mainly ENSSSCG and LOC | Defence response, immune response, response to biotic stimulus |
|
| 174 | General, variable | Electron transport, generation of precursor metabolites and energy, mitochondrial electron transport, NADH to ubiquinone, ATP synthesis coupled electron transport, mitochondrial respiratory chain complex I |
|
| 144 | Hypothalamus > “brain” > others |
|
|
| 102 | One sample of cortex (SRS1027244) > one sample of brain (SRS2520758) > all others | Sexual reproduction, male gamete generation, spermatogenesis, anion transport |
|
| 88 | Male pituitary > female pituitary, all others very low |
|
|
| 82 | Hypothalamus (SRS497445) > > Hypothalamus (SRS497448) > all others | Cell-cell signalling, synaptic transmission, neuropeptide signalling pathway |
|
| 74 | Developing cortex, forebrain, brainstem >> all others | Coenzyme/cofactor biosynthesis, energy coupled proton transport, down electrochemical gradient, ATP biosynthesis, mitochondrion |
|
| 73 | Developing cortex decreasing with age, “brain” and brain regions moderate, hypothalamus low | Protein modification |
|
| 71 | Female pituitary, forebrain > other brain regions > hypothalamus | Histidine family amino acid catabolism, histidine metabolism |
|
| 71 | Brain, brain regions > hypothalamus, forebrain, hippocampal formation, hippocampus |
|
|
| 68 | Cortex (SRS1027246) >> other cortex, “brain” > other brain regions | Development, regulation of biological process |
|
| 63 | Brain, some cortex, corpus callosum, brain stem (SRS1027238) > others | Cell-cell signalling, ion transport, synaptic transmission, transmission of nerve impulse, postsynaptic membrane |
|
| 62 | Amygdala, hippocampal formation > cortex (SRS1027243) > others | G-protein coupled receptor protein signalling, cell communication, signal transduction, synaptic transmission, transmission of nerve impulse, dopaminergic synaptic transmission |
|
| 56 | Low in pituitary, hippocampus, forebrain | Transport, localisation, chloride transport, secretory pathway, small GTPase mediated signal transduction |
|
| 52 | Hypothalamus, “brain” > pituitary, amygdala > other brain regions |
|
|
| 47 | Forebrain, pituitary |
|
|
| 47 | Hypothalamus, “brain”, pituitary > other brain regions |
|
|
| 44 | Brainstem, cortex > “brain” > other regions |
|
|
| 44 | Forebrain, hypothalamus, pituitary > other regions |
|
|
| 43 | Cortex (one sample) > pituitary > other regions |
|
|
| 40 | Pituitary (one sample) > all other samples |
|