| Literature DB >> 33686424 |
Jocelyn P Colella1,2,3, Anna Tigano1,2, Olga Dudchenko4,5,6, Arina D Omer4, Ruqayya Khan4,6, Ivan D Bochkov4,6, Erez L Aiden4,5,6,7,8, Matthew D MacManes1,2.
Abstract
Warming climate and increasing desertification urge the identification of genes involved in heat and dehydration tolerance to better inform and target biodiversity conservation efforts. Comparisons among extant desert-adapted species can highlight parallel or convergent patterns of genome evolution through the identification of shared signatures of selection. We generate a chromosome-level genome assembly for the canyon mouse (Peromyscus crinitus) and test for a signature of parallel evolution by comparing signatures of selective sweeps across population-level genomic resequencing data from another congeneric desert specialist (Peromyscus eremicus) and a widely distributed habitat generalist (Peromyscus maniculatus), that may be locally adapted to arid conditions. We identify few shared candidate loci involved in desert adaptation and do not find support for a shared pattern of parallel evolution. Instead, we hypothesize divergent molecular mechanisms of desert adaptation among deer mice, potentially tied to species-specific historical demography, which may limit or enhance adaptation. We identify a number of candidate loci experiencing selective sweeps in the P. crinitus genome that are implicated in osmoregulation (Trypsin, Prostasin) and metabolic tuning (Kallikrein, eIF2-alpha kinase GCN2, APPL1/2), which may be important for accommodating hot and dry environmental conditions. © The American Genetic Association. 2021.Entities:
Keywords: zzm321990 Peromyscuszzm321990 ; dehydration; desert; parallel evolution; thermoregulation
Mesh:
Year: 2021 PMID: 33686424 PMCID: PMC8141686 DOI: 10.1093/jhered/esab009
Source DB: PubMed Journal: J Hered ISSN: 0022-1503 Impact factor: 2.645
Figure 1.Geographic ranges of the 3 Peromyscus species examined in this study. Major North American deserts are denoted by diagonal hashing. Peromyscus crinitus range is in red/medium grey, Peromyscus eremicus in blue/dark grey, and Peromyscus maniculatus in yellow/light grey. Areas of sympatry are denoted by additive color overlap. Collection localities are labeled with white dots and include the Motte Rimrock Reserve (MOT), Elliot Chaparral Reserve (ELL), and Philip L. Boyd Deep Canyon Desert Research Center (DCDRC). See online version for full colors.
Assembly stats, genome size, and global Tajima’s D and pi (1 kbp windows) for each Peromyscus species
| Species |
| Scaffold N50 | Contig N50 | Size (Gb)a | Size (Gb)b | Taj. | π |
|---|---|---|---|---|---|---|---|
|
| 9 | 94 816 992 | 204 461 | 2.27 | 2.28 | −0.69 | 0.005 |
|
| 26 | 119 957 392 | 76 024 | 2.45 | 2.54 | −1.27 | 0.007 |
|
| 5 | 115 033 041 | 42 400 | 2.33 | 2.44 | −1.62 | 0.012 |
aabyss-fac estimate.
bassemblathon estimate.
Figure 2.Distributions of effective population size (Ne) through time for Peromyscus crinitus (red/top), Peromyscus eremicus (blue/bottom), and Peromyscus maniculatus (yellow/middle) based on a generation time of 6 months (0.5 years) and a general mammalian mutation rate of 2.2 × 10−9 substitutions/site/year. Note that the P. maniculatus genome was sequenced from a captive individual and therefore does not reflect natural populations’ trends of this species. CLR scores for P. crinitus based on Sweepfinder2 results. Values above the horizontal red line surpass the 99.9th percentile. The top 5 or fewer unique genes are labeled for each chromosome. See online version for full colors.
Figure 3.REVIGO plots of enriched functional groups for Peromyscus crinitus (top row) and Peromyscus eremicus (bottom row) based on functional annotation of the 2 nearest protein-coding genes to each site (dataset II) identified as the subject of a selective sweep. Darker colors indicate greater significance. MP, metabolic process, MB, membrane-bound. See online version for full colors.
Significant sweep sites localized within protein-coding genes for each Peromyscus species
| Spp. | Chr. | Pos. | Gene | Protein | General function | Dir. |
|---|---|---|---|---|---|---|
|
| 4 | 145409180 |
| Zinc finger protein 217 | DNA-binding transcription factor, transcription regulation, zinc binding | − |
| 20 | 36260251 |
| Dehydrogenase/reductase SDR family member 7B | Oxidoreductase activity | − | |
|
| 1 | 42451454 |
| Harmonin | Mechanotransduction in cochlear hair cells | − |
| 8 | 67956 |
| Meiosis-specific with OB domain-containing protein | Meiosis | + | |
|
| 3 | 52113514 |
| Trypsin-2 | Calcium ion binding | + |
|
| Kallikrein-13 | Protein processing, proteolysis, reg. of IGF | + | |||
|
| Prostasin | Sodium balance | + | |||
|
| Kallikrein-4 | Zinc ion binding, proteolysis | + | |||
|
| Myeloblastin | Degrades collagen (I, III, IV), elastin, fibronectin, laminin, vitronectin; blood coagulation, immune response | + | |||
|
| Kallikrein-14 | Varied (epidermis morphogenesis) | + | |||
| TRYP_PIGa | Trypsin | Calcium ion binding, proteolysis | + | |||
|
| Kallikrein-6 | Varied (collagen catabolism, tissue regen) | + | |||
|
| Chymotrypsin-like elastase family member 2A | Cleavage and elastin hydrolase, proteolysis | + | |||
| 4 | 57673659 |
| eIF-2-alpha kinase GCN2 | Metabolic stress sensing protein kinase, role in ISR required for adaptation to amino acid starvation, protein synthesis repression | − | |
| 6 | 66203934 |
| Nestin | Brain, eye development (neg. reg. catalytic activity) | − | |
| 9 | 23303147 |
| DENND64 | Endocytic recycling pathway component | + | |
| 23323150 |
| DENND64 | Endocytic recycling pathway component | + | ||
| 43305800 |
| NYNRIN | Nucleic acid binding | + | ||
| 22243007 |
| Poly(ADP-ribose) glycohydrolase | Prevent detrimental accumulation of poly(ADP-ribose) upon prolonged replicative stress | − | ||
| 22283012 |
| Nuclear receptor coactivator 4 | Androgen receptor (iron ion homeostasis) | − | ||
| 22303015 |
| Oxidoreductase NAD-binding domain-containing protein 1 | Oxidoreductase activity | − | ||
| 18 | 450151 |
| DCC-interacting protein 13-beta | Varied (cold acclimation, diet induced thermogen., glucose homeostasis, neg. reg. of insulin response/fatty acid oxidation/glucose import, pos. reg. of cold-induced thermogen.) | − | |
|
| DCC-interacting protein 13-alpha | Varied. (insulin receptor signaling pathway, pos. reg. of glucose import) | − | |||
| 460153 |
| DCC-interacting protein 13-beta | Varied (cold acclimation, diet induced thermogen., glucose homeostasis, neg. reg. of insulin response/fatty acid oxidation/ glucose import, pos. reg. of cold-induced thermogen.) | − | ||
|
| DCC-interacting protein 13-alpha | Varied (insulin receptor signaling pathway, pos. reg. of glucose import) | − | |||
| 23 | 28065942 |
| Tectonic-1 | Neural development | + |
Species (Spp.), chromosome (Chr.) sweep position (Pos.), gene name, protein, general function (based on UniProt database: uniprot.org), and direction (Dir.) of gene transcription. thermogen., thermogenesis; neg., negative; pos., positive; reg., regulation; IGF, insulin-like growth factor; ISR, integrated stress response.
aNo gene name alternative available.
Figure 4.CLR scores for Peromyscus crinitus based on Sweepfinder2 results. Values above the horizontal dashed line surpass the 99.9th percentile. The top 5 or fewer unique genes are labeled for each chromosome.
Functional annotation of proximal gene names and enriched GO terms associated with significant selective sweeps and shared between desert-adapted Peromyscus crinitus and Peromyscus eremicus
| Data set | Gene name / GO term | Function | Protein/class | |
|---|---|---|---|---|
| Gene names | I | none | — | — |
| II |
| Hydrolase, biotin transport/metabolism | Biotinidase | |
|
| Ribosomal protein, translation | 60S ribosomal protein L36 | ||
| III |
| Hydrolase, biotin transport/metabolism | Biotinidase | |
|
| Zn binding, virion attachment | Envelope glycoprotein | ||
|
| DNA binidng, protein heterodimerization | Putative histone H3.X | ||
|
| RNA binding, ribosome constituent, translation | 60S ribosomal protein L15 | ||
|
| Ribosomal protein, translation | 60S ribosomal protein L36 | ||
|
| Zn/Mg binding, immune response | DNA-directed RNA polymerase III subunit RPC1 | ||
|
| Ribosomal protein, enzyme binding | 40S ribosomal protein S2 | ||
|
| mRNA binding, ribosome, translation | 40S ribosomal protein S26 | ||
|
| Rapid Ca2+ release, K+ channel, ossification | Trimeric intracellular cation channel type B | ||
|
| Ca2+ binding, collagen catabolism, proteolysis, cell growth | Trypsin-2 | ||
| Enriched GO terms | I | none | — | — |
| II | GO:0005622 | Intracellular | Cellular component | |
| GO:0005840a | Ribosome | Cellular component | ||
| GO:0022626a | Cystolic ribosome | Cellular component | ||
| GO:0043226 | Organelle | Cellular component | ||
| GO:0043229 | Intracellular organelle | Cellular component | ||
| GO:0044391a | Ribosomal subunit | Cellular component | ||
| GO:1990904a | Ribonucleoprotein complex | Cellular component | ||
| III | GO:0003735a | Structural constituent of ribosome | Molecular function | |
| GO:0022626a | Cystolic ribosome | Cellular component | ||
| GO:0022625a | Cystolic large ribosomal subunit | Cellular component | ||
| GO:0044391a | Ribosomal subunit | Cellular component | ||
| GO:0005840a | Ribosome | Cellular component | ||
| GO:0015934 | Large ribosomal subunit | Cellular component | ||
| GO:1990904a | Ribonucleoprotein complex | Cellular component |
aGene names or GO terms also experiencing selective sweeps in Peromyscus maniculatus.
Figure 5.Overlap in proximal gene names (top row) and enriched GO terms (bottom row) for datasets I (left column), II (center), and III (right). * indicates significant overlap between species. See online version for full colors.