| Literature DB >> 32531054 |
Mark J Nolte1, Peicheng Jing1, Colin N Dewey2, Bret A Payseur1.
Abstract
Island populations repeatedly evolve extreme body sizes, but the genomic basis of this pattern remains largely unknown. To understand how organisms on islands evolve gigantism, we compared genome-wide patterns of gene expression in Gough Island mice, the largest wild house mice in the world, and mainland mice from the WSB/EiJ wild-derived inbred strain. We used RNA-seq to quantify differential gene expression in three key metabolic organs: gonadal adipose depot, hypothalamus, and liver. Between 4,000 and 8,800 genes were significantly differentially expressed across the evaluated organs, representing between 20% and 50% of detected transcripts, with 20% or more of differentially expressed transcripts in each organ exhibiting expression fold changes of at least 2×. A minimum of 73 candidate genes for extreme size evolution, including Irs1 and Lrp1, were identified by considering differential expression jointly with other data sets: 1) genomic positions of published quantitative trait loci for body weight and growth rate, 2) whole-genome sequencing of 16 wild-caught Gough Island mice that revealed fixed single-nucleotide differences between the strains, and 3) publicly available tissue-specific regulatory elements. Additionally, patterns of differential expression across three time points in the liver revealed that Arid5b potentially regulates hundreds of genes. Functional enrichment analyses pointed to cell cycling, mitochondrial function, signaling pathways, inflammatory response, and nutrient metabolism as potential causes of weight accumulation in Gough Island mice. Collectively, our results indicate that extensive gene regulatory evolution in metabolic organs accompanied the rapid evolution of gigantism during the short time house mice have inhabited Gough Island.Entities:
Keywords: body size; gene regulatory evolution; house mouse; island evolution; island rule
Mesh:
Year: 2020 PMID: 32531054 PMCID: PMC7487164 DOI: 10.1093/gbe/evaa118
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.Thousands of genes expressed in metabolic organs are transcribed at different levels in island mice relative to mainland mice. (A) Reverse cumulative distribution plots showing the distribution of magnitudes of differential expression for five different conditions. For each condition, the count of DE genes and the percentage DE among all detected transcripts (in parentheses) are provided. (B) Genes expressed throughout liver maturation that are DE (either higher or lower in island mice) at some or all evaluated time points are sorted into four log2 fold change classes (absolute value of log2 fold change): no threshold (or cutoff), 0.6, 1, and 2. For the liver, the number of genes manifesting large magnitudes of differential expression increases over time. Counts of DE genes are scaled down and given a common origin so that all magnitudes of differential expression can be viewed together. See raw gene counts in supplementary figure 9, Supplementary Material online.
. 2.A subset of DE genes residing within body size QTL (QTL-DE genes) exhibit fixed SNPs coincident with putative gene regulatory elements. (A) Established gene regulatory elements (REs)—drawn from 29 public data sets (supplementary tables 8 and 9, Supplementary Material online)—overlap with fixed SNPs in 73 QTL-DE genes. Counts of SNP-RE intersections in genic regions are not mutually exclusive: for example, if a SNP occurs in different genic regions in different transcript variants then its coincidence with an RE will be counted for each genic region. Counts are for any position between two adjacent genic regions on the x axis; for example, all SNP-RE co-occurrences between upstream position 501 and 1,500 from the transcriptional start site would be included in the up500 column. Spanning over 1 Mb, gene Dab1 bears many fixed SNPs and was removed from panel (A) for clarity of the heat map; likewise, genes with long alphanumerical names were removed for clarity. Red arrows in (A) indicate QTL-DE genes highlighted in (C), where genes Arid5b, Insl5, Lrp1, and Zbtb16 are shown with a subset of fixed SNPs that overlap with putative organ-specific REs from a wide array of public data sets (B), details of which can be found in supplementary tables 8 and 9, Supplementary Material online. Panel (C) does not provide the total number of SNP-RE coincidences, but rather indicates the types of putative REs that overlap with fixed SNPs. In all panels, “5′ introns” refers to either the first or second intron of a transcript; 5′ intronic counts are also included in the total count for the “intron” genic region. “dn” and “up” refer to down- and upstream, respectively, and the numbers refer to base pairs (k = kilobases).
DE Genes (Island vs. Mainland) in Body Size QTL: Sorted According to Magnitude of Differential Expression, SNP Status, and SNP-Regulatory Element Overlap
| 363 Total DE Genes (±2 Mb around QTL Peaks) | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Larger DE Magnitude: |log2 fold change| ≥ 0.6 ( | Smaller DE Magnitude: |log2 fold change| ≤ 0.6 ( | ||||||||
| Genes with Fixed Flanking SNPs | Genes with Fixed Flanking SNPs | ||||||||
|
| SNPs |
| Remaining Genes |
| SNPs |
| Remaining Genes | ||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
| ||||
|
|
|
|
|
| |||||
|
|
|
|
|
| |||||
|
|
|
|
|
| |||||
|
|
|
|
| ||||||
Putative published and publicly available regulatory elements listed in supplementary tables 8 and 9, Supplementary Material online.
High-frequency SNP = frequency across 16 sequenced wild-caught Gough Island mouse genomes ≥ 0.9.
In conditions where |log2 fold change| ≤ 0.6 fixed SNPs flanking these genes overlap condition-specific regulatory elements.
Longer gene names listed here: (a) BC055111, (b) F420014N23Rik, (c) Gm10384, (d) 1110002J07Rik, (e) 1700040L02Rik, (f) 3010026O09Rik, (g) A230028O05Rik, (h) A930033H14Rik, (i) BC035947c, (j) Mettl21bc, (k) Rasgef1c, (l) 1600010M07Rik, (m) 1700024P16Rik, (n) 5730419F03Rik, (o) 9830004L10Rik, (p) A630001G21Rik, (q) BC089597, (r) D830026I12Rik, (s) G530012D18Rik, (t) 9030616G12Rik, (u) D930048N14Rik, (v) 1700029J07Rik, (w) 2900052N01Rik, (x) 4933407L21Rik, (y) Cdkn2aipnl, (z) Tmem184c, (aa) RP23-32A8.1, (bb) 9530068E07Rik, (cc) Pafah1b2, and (dd) A430046D13Rik.
DE Genes (Island vs. Mainland) in Body Size QTL with Large Fold Changes, Fixed SNPs in Putative RegulatoryElements, and Annotated Mammalian Phenotypes
| Gene Name | Ensemble Gene ID |
| ±Distance Flanking QTL Peak (Mb) | QTL-Associated Trait |
| Differential Expression (log2 fold change) | Mammalian Phenotype Ontology Phenotypesd |
|---|---|---|---|---|---|---|---|
|
| ENSMUSG00000009376 | chr6 | 1 | Body weight, growth rate | wk4a, wk4h | −0.8, −0.44 | Abnormal fetal growth/weight/body size and liver development, decreased circulating insulin level, embryo size, fetal size, insulin secretion and liver weight, enlarged liver sinusoidal spaces, hepatic steatosis, impaired glucose tolerance, increased circulating glucose level, postnatal growth retardation |
|
| ENSMUSG00000015533 | chr13 | 1 | Growth rate | el, wk4l | −1.73, 2.72 | Homeostasis/metabolism phenotype |
|
| ENSMUSG00000019942 | chr10 | 1 | Body weight | wk4a, wk4l | 0.45, 1.34 | Abnormal hepatocyte morphology, decreased hepatocyte number and hepatocyte proliferation |
|
| ENSMUSG00000022512 | chr16 | 1 | Growth rate | el, wk2l, wk4a, wk4h, wk4l | −0.64, −0.32, −2.54, −0.79, −0.52 | Weight loss |
|
| ENSMUSG00000028518 | chr4 | 1 | Growth rate | el | 1.31 | Abnormal gluconeogenesis, decreased body weight, circulating insulin level, glycogen level and insulin secretion, homeostasis/metabolism phenotype, impaired glucose tolerance, increased body weight, circulating free fatty acid level, circulating glucose level, circulating insulin level, circulating leptin level, fatty acid level, susceptibility to diet-induced obesity and triglyceride level, insulin resistance |
|
| ENSMUSG00000028519 | chr4 | 1 | Growth rate | el, wk2l, wk4a, wk4l | 1.06, 2.17, 2.12, 2.17 | Decreased body size |
|
| ENSMUSG00000032026 | chr9 | 1 | Body weight | el, wk4a | 0.92, 0.41 | Increased circulating HDL cholesterol level |
|
| ENSMUSG00000032076 | chr9 | 1 | Body weight | wk2l, wk4h, wk4l | 0.83, −0.16, 0.53 | Decreased body size, body weight and fetal size |
|
| ENSMUSG00000041301 | chr6 | 1 | Body weight, growth rate | wk2l | −1.63 | Abnormal hepatocyte morphology, decreased body length, body size and body weight, homeostasis/metabolism phenotype, liver/biliary system phenotype, postnatal growth retardation, weight loss |
|
| ENSMUSG00000051359 | chr15 | 1 | Growth rate | wk4a, wk4h | 0.97, 0.14 | Decreased body length, circulating HDL cholesterol level, circulating glucose level, lean body mass and total body fat amount |
|
| ENSMUSG00000079470 | chr1 | 1 | Body weight | wk4a | 1.16 | Growth/size/body region phenotype |
|
| ENSMUSG00000019947 | chr10 | 1–2 | Body weight, growth rate | el, wk2l, wk4a | −0.6, −0.74, −1.1 | Abnormal fat cell morphology, decreased body length, body size, body weight, brown fat lipid droplet number, susceptibility to diet-induced obesity and white adipose tissue amount, increased food intake, postnatal growth retardation |
|
| ENSMUSG00000025402 | chr10 | 1–2 | Body weight, growth rate | wk2l, wk4l | 0.52, −1.09 | Decreased body length, circulating HDL cholesterol level and circulating cholesterol level |
|
| ENSMUSG00000007279 | chr7 | 2 | Body weight, growth rate | wk2l, wk4h | −2.55, 0.54 | Decreased body length, body size, and body weight |
|
| ENSMUSG00000008590 | chr9 | 2 | Body weight | el, wk2l, wk4l | −5.88, −8.6, −8.04 | Increased body length |
|
| ENSMUSG00000021765 | chr13 | 2 | Growth rate | wk2l, wk4a, wk4l | −1.63, −1.21, −1.79 | Decreased fetal size and weight, fetal growth retardation, homeostasis/metabolism phenotype |
|
| ENSMUSG00000025403 | chr10 | 2 | Body weight, growth rate | wk2l, wk4l | −0.56, −0.77 | Decreased embryo size |
|
| ENSMUSG00000034853 | chr4 | 2 | Growth rate | el, wk2l, wk4h | −0.72, −0.75, 0.62 | Abnormal brown adipose tissue thermogenesis and brown fat cell morphology, decreased circulating HDL cholesterol level, circulating cholesterol level, circulating glucose level, lean body mass, susceptibility to diet-induced obesity and susceptibility to hepatic steatosis, homeostasis/metabolism phenotype, improved glucose tolerance, increased brown adipose tissue amount, circulating triglyceride level, fatty acid level, food intake and oxygen consumption |
|
| ENSMUSG00000034926 | chr4 | 2 | Growth rate | wk2l, wk4a, wk4l | −0.44, 1.1, −0.36 | Decreased body size, body weight, circulating cholesterol level and circulating triglyceride level, postnatal growth retardation |
|
| ENSMUSG00000040249 | chr10 | 2 | Body weight, growth rate | el, wk4a, wk4l | −0.55, −0.66, −0.53 | Abnormal adipose tissue physiology, brown fat cell morphology, embryo size, fat pad morphology, lipid homeostasis, lipid level and liver physiology, decreased body weight, brown fat lipid droplet number, circulating free fatty acid level, circulating glucose level, circulating insulin level, hepatocyte number, liver triglyceride level, susceptibility to diet-induced obesity and total body fat amount, embryonic growth retardation, homeostasis/metabolism phenotype, improved glucose tolerance, increased food intake and oxygen consumption, postnatal growth retardation |
|
| ENSMUSG00000040287 | chr10 | 2 | Body weight, growth rate | el, wk2l, wk4a, wk4h, wk4l | 0.55, 1.18, 1.06, 0.7, 1.05 | Decreased fetal weight |
|
| ENSMUSG00000040502 | chr10 | 2 | Body weight, growth rate | el, wk2l, wk4l | 0.97, 1.08, 0.69 | Decreased body weight and circulating HDL cholesterol level |
|
| ENSMUSG00000044254 | chr4 | 2 | Growth rate | wk4a | 1.32 | Abnormal cholesterol homeostasis, decreased circulating HDL cholesterol level, circulating LDL cholesterol level and circulating cholesterol level, homeostasis/metabolism phenotype, increased circulating HDL cholesterol level |
|
| ENSMUSG00000055980 | chr1 | 2 | Growth rate | wk4a, wk4l | −1.57, 0.63 | Abnormal lipid level, decreased body length, body size, body weight, circulating HDL cholesterol level, circulating glucose level, fetal weight, insulin secretion and total body fat amount, fetal growth retardation, homeostasis/metabolism phenotype, impaired glucose tolerance and lipolysis, increased circulating free fatty acid level, circulating insulin level and circulating triglyceride level, insulin resistance, postnatal growth retardation |
|
| ENSMUSG00000066090 | chr4 | 2 | Growth rate | el, wk4h | 6.47, −0.57 | Decreased circulating insulin level, impaired glucose tolerance |
|
| ENSMUSG00000079465 | chr1 | 2 | Growth rate | el, wk2l, wk4l | −0.62, −2.08, −2.21 | Decreased body weight |
|
| ENSMUSG00000067158 | chr1 | 2 | Growth rate | wk2l, wk4l | −0.68, −0.85 | Decreased body size and body weight |
|
| ENSMUSG00000074657 | chr10 | 2 | Body weight, growth rate | wk4a, wk4h | 1.06, 0.27 | Decreased body size and body weight, postnatal growth retardation |
Putative published and publicly available regulatory elements listed in supplementary tables 8 and 9, Supplementary Material online.
Gray et al. (2015).
wk4a = 4-week gonadal adipose depot; wk4h = 4-week hypothalamus; el = E16.5 liver; wk2l = 2-week liver; wk4l = 4-week liver.
dReported phenotypes manually curated to remove verbal redundancy.
Significantly Enriched Mammalian Phenotype Ontology Phenotypes among QTL-DE Genes
|
|
| Count of Body Size QTL (out of 14) Harboring DE Genes |
|---|---|---|
| Abnormal circulating cholesterol level | 22 | 8 |
| Abnormal circulating HDL cholesterol level | 16–17 | 8 |
| Decreased circulating HDL cholesterol level | 13–14 | 7 |
| Abnormal lipid level | 40–41 | 11 |
| Abnormal lipid homeostasis | 40–41 | 11 |
| Abnormal cholesterol level | 24 | 8 |
| Abnormal circulating lipid level | 34–35 | 10 |
| Decreased cholesterol level | 16–17 | 7 |
| Decreased circulating cholesterol level | 15–16 | 7 |
| Abnormal cholesterol homeostasis | 24 | 8 |
| Homeostasis/metabolism phenotype | 103 | 14 |
| Abnormal lipoprotein level | 18–19 | 8 |
| Abnormal circulating lipoprotein level | 18–19 | 8 |
| Abnormal triglyceride level | 19 | 9 |
| Increased circulating triglyceride level | 11 | 7 |
| Abnormal circulating triglyceride level | 17 | 9 |
| Abnormal fatty acid level | 15 | 8 |
| Abnormal body weight | 52 | 14 |
| Abnormal brown adipose tissue thermogenesis | 3 | 2 |
| Abnormal circulating free fatty acids level | 10 | 7 |
| Abnormal free fatty acids level | 11 | 8 |
| Abnormal body length | 16 | 11 |
| Decreased body length | 13 | 9 |
| Decreased body weight | 45 | 14 |
| Decreased total tissue mass | 45 | 14 |
| Abnormal prenatal growth/weight/body size | 27 | 13 |
| Abnormal prenatal body size | 24 | 13 |
| Abnormal cell proliferation | 36 | 13 |
| Decreased cell proliferation | 25 | 12 |
Enrichment carried out in MouseMine: Motenko et al. (2015). All adjusted P values < 0.05.
The number of QTL-DE genes annotated to the phenotype in a Hypergeometric Test with background equal to all genes with phenotypic annotation for Mus mus domesticus.
Significant after multiple test correction using both Bonferroni and Benjamini–Hochberg methods, otherwise significant after Benjamini–Hochberg multiple test correction.
. 3.Transcriptional regulation of coregulated gene groups throughout liver maturation. TFs Arid5b and Tfec reside within body size-related QTL (light blue box) mapped from an intercross between island and mainland mice (Gray et al. 2015). Colored dashed lines emanating from either Arid5b or Tfec connect to coregulated gene groups whose transcription is putatively regulated by these TFs. Circles at the terminus of colored, dashed lines indicate transcriptional regulation as evidenced by enrichment of TF binding motifs in upstream sequences of coregulated group members. Except for one instance (indicated by asterisks), one representative gene is shown for each coregulated gene group displayed; the number of genes within a group is indicated by a vertical ellipsis and associated count within parentheses. Red text indicates a DE gene that also resides within a body size QTL.
. 4.Summary of GO functional enrichment for DE genes in the embryonic (E16.5) and 4-week liver. Higher and lower refer to gene expression levels that were higher or lower in island mice relative to mainland mice. Vertical GO terms in all panels are a descriptively specific subset of all significantly enriched GO terms for each time point and fold-change direction (see Materials and Methods). Colors designate GOSlim categories, broad functional categories to which individual GO terms are sorted.The GOSlim color key applies to rectangles at the top of each plot as well; rectangles provide the number of all DE genes pertaining to a GOSlim category regardless if a gene’s annotated GO term is represented in the plot. Plots show the average log2 fold change for DE genes annotated to an enriched GO term. Black diamonds designate average log2 fold change values that exceed the maximum value of the plots.
. 5.GO and KEGG functional enrichment of gonadal adipose depot and hypothalamic DE genes hint at differences in organ function between island and mainland mice. (A) Subset of enriched GO and KEGG terms among gonadal adipose depot DE genes. (B) Subset of enriched GO and KEGG terms among hypothalamic DE genes. In (A) and (B), the height of the triangles (shorter vertical axes) are arbitrary and indicate the qualitative contribution of the functions encapsulated by the displayed enriched GO and KEGG terms to the biological processes denoted along the long axes with blue arrows. Also, in both (A) and (B), green text designates significantly enriched terms among genes with higher transcription in island mice relative to mainland mice, whereas red text indicates enrichment among lower transcribed genes. In (B), the term Macrophage is used in place of Osteoclast as osteoclasts are bone-specific macrophages. Macrophages specific to the central nervous system (i.e., hypothalamus) are called microglia and although tissue-resident macrophages carry out tissue-specific functions and manifest distinctive transcriptional profiles they also have a subset of functions and gene expression profiles in common (Nataf et al. 2005; Gautier et al. 2012; Casano and Peri 2015).