| Literature DB >> 32912160 |
Zhenhua Shang1, David J Horovitz1, Ronald H McKenzie1, Jessica L Keisler1, Michael R Felder1, Shannon W Davis2.
Abstract
BACKGROUND: Peromyscus are the most common mammalian species in North America and are widely used in both laboratory and field studies. The deer mouse, P. maniculatus and the old-field mouse, P. polionotus, are closely related and can generate viable and fertile hybrid offspring. The ability to generate hybrid offspring, coupled with developing genomic resources, enables researchers to conduct linkage analysis studies to identify genomic loci associated with specific traits.Entities:
Keywords: DNA polymorphism; Genomic; Neural crest; Peromyscus; Sox10
Mesh:
Substances:
Year: 2020 PMID: 32912160 PMCID: PMC7488232 DOI: 10.1186/s12864-020-06969-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Characterization of variants
| Variant Type | Count |
|---|---|
| SNP | 34,084,607 |
| Insertion | 2,038,754 |
| Deletion | 2,042,973 |
| Transcript | 45,591,302 |
| EXON | 769,092 |
| 5′ UTR | 96,746 |
| 3′ UTR | 490,685 |
| INTRON | 44,630,345 |
| Splice Acceptor | 1215 |
| Splice Donor | 1327 |
| Splicing Region | 87,301 |
| Intergenic | 23,148,376 |
| 5 kb Upstream | 7,302,504 |
| 5 kb Downstream | 7,337,171 |
Top 20 over and underrepresented GO terms in the list of Peromyscus genes containing a nonsynonymous SNP between P. maniculatus and P. polionotus
| cellular process | GO:0009987 | 2.03E-298 | 15,748 | 6516 | 7781 |
| cellular metabolic process | GO:0044237 | 8.06E-257 | 9331 | 3861 | 5119 |
| metabolic process | GO:0008152 | 2.96E-243 | 10,435 | 4318 | 5557 |
| primary metabolic process | GO:0044238 | 5.83E-235 | 9405 | 3891 | 5096 |
| organic substance metabolic process | GO:0071704 | 8.98E-225 | 9966 | 4124 | 5310 |
| nitrogen compound metabolic process | GO:0006807 | 2.89E-214 | 8892 | 3679 | 4819 |
| cellular component organization or biogenesis | GO:0071840 | 5.56E-214 | 6225 | 2576 | 3618 |
| cellular component organization | GO:0016043 | 9.53E-201 | 6053 | 2505 | 3505 |
| localization | GO:0051179 | 3.18E-193 | 5918 | 2449 | 3423 |
| cellular macromolecule metabolic process | GO:0044260 | 2.50E-174 | 7080 | 2929 | 3905 |
| macromolecule metabolic process | GO:0043170 | 3.99E-170 | 8461 | 3501 | 4506 |
| organonitrogen compound metabolic process | GO:1901564 | 9.62E-137 | 6215 | 2572 | 3402 |
| developmental process | GO:0032502 | 4.23E-136 | 6193 | 2562 | 3390 |
| anatomical structure development | GO:0048856 | 1.40E-130 | 5810 | 2404 | 3198 |
| establishment of localization | GO:0051234 | 1.11E-127 | 4518 | 1869 | 2588 |
| biological regulation | GO:0065007 | 1.61E-120 | 12,092 | 5003 | 5874 |
| multicellular organism development | GO:0007275 | 1.36E-119 | 5304 | 2195 | 2931 |
| transport | GO:0006810 | 4.07E-118 | 4378 | 1811 | 2494 |
| organelle organization | GO:0006996 | 2.00E-117 | 3549 | 1468 | 2095 |
| macromolecule modification | GO:0043412 | 7.62E-114 | 3623 | 1499 | 2121 |
| sensory perception of smell | GO:0007608 | 3.37E-238 | 1138 | 471 | 17 |
| sensory perception of chemical stimulus | GO:0007606 | 2.58E-228 | 1236 | 511 | 39 |
| sensory perception | GO:0007600 | 1.17E-116 | 1784 | 738 | 303 |
| G protein-coupled receptor signaling pathway | GO:0007186 | 2.04E-83 | 1921 | 795 | 410 |
| nervous system process | GO:0050877 | 8.09E-55 | 2286 | 946 | 606 |
| system process | GO:0003008 | 2.91E-25 | 2923 | 1209 | 955 |
| phagocytosis, recognition | GO:0006910 | 3.11E-23 | 151 | 62 | 9 |
| response to pheromone | GO:0019236 | 1.07E-21 | 105 | 43 | 2 |
| complement activation, classical pathway | GO:0006958 | 1.28E-19 | 165 | 68 | 16 |
| humoral immune response mediated by circulating immunoglobulin | GO:0002455 | 3.89E-17 | 180 | 74 | 23 |
| complement activation | GO:0006956 | 1.02E-15 | 187 | 77 | 27 |
| humoral immune response | GO:0006959 | 6.06E-15 | 355 | 147 | 78 |
| phagocytosis, engulfment | GO:0006911 | 4.56E-12 | 189 | 78 | 34 |
| protein activation cascade | GO:0072376 | 1.11E-11 | 200 | 83 | 38 |
| B cell receptor signaling pathway | GO:0050853 | 2.16E-11 | 181 | 75 | 33 |
| xenobiotic metabolic process | GO:0006805 | 2.19E-11 | 112 | 46 | 14 |
| plasma membrane invagination | GO:0099024 | 6.36E-11 | 198 | 82 | 39 |
| membrane invagination | GO:0010324 | 1.60E-10 | 205 | 85 | 42 |
| response to leukemia inhibitory factor | GO:1990823 | 1.06E-09 | 311 | 129 | 78 |
| cellular response to leukemia inhibitory factor | GO:1990830 | 1.06E-09 | 311 | 129 | 78 |
Polymorphism characterization in Peromyscus genes associated with autism in humans
| Gene | SFARI Score | Upstream | 5′ UTR | Silent | Missense | In-frame deletion | Splice region | Intronic | 3′ UTR | Downstream |
|---|---|---|---|---|---|---|---|---|---|---|
| 1S | 205 | 30 | 6 | 4 | 5881 | 13 | ||||
| 1S | 75 | 4 | 3 | 175 | 24 | |||||
| 1S | 205 | 16 | 12 | 3 | 1089 | 9 | 53 | |||
| 1S | 155 | 2 | 26 | 11 | 663 | 11 | 42 | |||
| 1S | 84 | 4 | 1 | 888 | 19 | 82 | ||||
| 1S | 100 | 2 | 39 | 15 | 1 | 5 | 896 | 19 | ||
| 1 | 89 | 2 | 1 | 83 | 1 |
Fig. 1ASH1L protein conservation. a Schematic representation of a protein alignment between Mus, rat, BW, PO, cow, human, and dog ASH1L proteins. Numbers across the top represent approximate amino acid locations. Block arrows represent conserved protein domains. Conservation graph at the bottom indicates amino acid percent conservation between the seven species. Asterisks indicate approximate locations for amino acid differences between BW and PO. b Regional protein alignment for ASH1L in areas surrounding an amino acid difference between BW and PO, with amino acid locations indicated across the top
Fig. 2ASH1L nucleotide conservation. a VISTA plot for Ash1l genomic loci for cow, dog, human, Mus, PO, and rat compared to BW Ash1l. Blue regions indicate exons and red regions indicate conserved non-coding regions. CNS indicates a conserved non-coding sequence in intron three across all mammals analyzed. b DNA sequence alignment for the CNS, indicated in a for the seven mammalian species. Numbers across the top represent nucleotide locations. Conservation graph at the bottom indicates percent nucleotide conservation between the seven mammalian species. Asterisks indicate approximate locations for nucleotide differences between BW and PO. c Regional nucleotide alignment within the CNS. Asterisk indicates a single nucleotide polymorphism between BW and PO. Boxed regions indicate transcription factor binding sites in BW for NKX2.1 and in PO for NBF1
Number of restriction enzyme recognition sites and average fragment size for P. maniculatus
| Restriction Enzyme | CpG | Number of Recognition sites in | Average fragment size in |
|---|---|---|---|
| ApaI | Yes | 384,617 | 6431 |
| AscI | Yes | 2460 | 1,005,504 |
| AvrII | No | 482,638 | 5125 |
| BamHI | No | 410,386 | 6027 |
| BspQI | No | 457,484 | 5406 |
| BssHII | Yes | 115,555 | 21,405 |
| DraI | No | 2,508,985 | 986 |
| EagI | Yes | 47,415 | 52,167 |
| EcoRI | No | 701,369 | 3527 |
| FseI | Yes | 13,191 | 187,517 |
| HindIII | No | 809,283 | 3056 |
| NaeI | Yes | 105,993 | 23,336 |
| NarI | Yes | 89,075 | 27,769 |
| NheI | No | 377,464 | 6553 |
| NotI | Yes | 5842 | 423,406 |
| PacI | No | 146,207 | 16,918 |
| PmeI | No | 34,572 | 71,547 |
| RsrII | Yes | 10,111 | 244,638 |
| SacI | No | 608,101 | 4067 |
| SacII | Yes | 47,658 | 51,901 |
| SaII | Yes | 33,458 | 73,930 |
| Sbfl | No | 78,368 | 31,563 |
| SgrAI | Yes | 13,574 | 182,226 |
| SmaI | Yes | 200,696 | 12,324 |
| SpeI | No | 370,097 | 6683 |
| SphI | No | 654,548 | 3779 |
| SspI | No | 1,559,294 | 1586 |
| SwaI | No | 166,068 | 14,894 |
| XbaI | No | 748,330 | 3305 |
| XhoI | Yes | 109,564 | 22,576 |
Fig. 3Embryonic lethality of BW S/S embryos. a Wild type e14.5 embryo. b resorbing e14.5 embryo. c Wild type e13.5 embryo. d e13.5 embryo with malformations, including pericardial swelling and caudal regression. Scale bar equals 1 mm for all images. Images were acquired by the authors
Fig. 4Variation in spot size for the Dominant spot trait. a Large spot size on an S/+ P. maniculatus. b Small spot size on an S/+ F1 hybrid. c Intermediate spot size on an S/+ PO N2. Images were acquired by the authors
Polymorphisms that generate RFLPs between BW and PO in candidate genes that cause spotting in Mus
| Candidate Gene | Contig | PCR Amplicon Location | Polymorphism Location | BW sequence | PO Sequence | Forward Primer | Reverse Primer |
|---|---|---|---|---|---|---|---|
| NW_006501107 | 1,645,861. .. 1,646,386 | 1,646,160 | AGGCCT (StuI) | AGTCCT | CTCGAGAACCTTGGGATTCA | AACAGGGTCTCCTGCAGTGT | |
| NW_006501134 | 1,664,283. .. 1,664,495 | 1,664,389 | CCGG (MspI) | CCAG | ATGACGCCACCCACTAAGAC | GATGATGCCTAGCACGAACA | |
| NW_006501162 | 6,362,706. .. 6,363,108 | 6,362,907. .. 6,362,919 | CCGT | CCGT//GGGA | CCCGTCCTAGCTTTGGAAC | AGCATCAGGGCAACCTTAAA | |
| NW_006501158 | 227,546. .. 228,022 | 227,668 | GGATCC (BamHI) | GAATCC | CCCAATTAGCTGCTCTTCAAAC | GGAGCCTTTGTGTCTTATCAGTA | |
| NW_006501059 | 77,663,262. .. 7,663,281 | 7,663,508 | GTATGC | GCATGC (SphI) | GGATGAGACTCAGGGTGAGG | GCTCCATCACTCGGCATTAT | |
| NW_006501055 | 3,517,274.. . 3,517,631 | 3,517,515 | TTTAAA (DraI) | TTTCAA | CCTTGCCTACTACGCTCTGA | TAATTCTGCATCCTTCCGGC | |
| NW_006501668 | 491,420. .. 491,911 | 491,673 | AGGCCT (StuI) | AGACCT | GTTTCACCCTAGGAAGTTGTGG | GCCTCAGAAGCAGCCCTC | |
| NW_006502260 | 133,647. .. 133,992 | 133,912 | TTTAAA (DraI) | TTTAAG | CCAAAGTTGAAGGCTGTTGC | AGTCCATTGCTTTCACACCT | |
| NW_006501150 | 898,573. .. 898,893 | 898,801 | CCAC | CCGC (AciI) | GGCAGACTGAGGGAGGTGTA | GGAGATCAGCCACGAGGTAA |
* Polymorphism and primers identified by and verified in our analysis
Fig. 5Linkage analysis for spotting mutation candidate genes. Genomic DNA from BW and PO control animals, along with a selection of S/+ F1 animals, was PCR amplified for a set of candidate genes (Edn3, Ednrb, Kit, Kitl, Mitf, Pax3, Ret, Snai2, and Sox10), known to cause pigmentation defects in Mus and humans. Each amplicon contained a restriction enzyme recognition site polymorphism between BW and PO (see Table 5). PCR amplicons were digested with the appropriate restriction enzyme and analyzed by agarose gel electrophoresis. Sox10 is the only candidate gene were all S/+ N2 animals have both a PO and BW allele, demonstrating linkage with the Dominant spot phenotype. Each image was cropped to remove excessive space
Fig. 6Histogram of spot size variation. This histogram displays the frequency of spot size area (mm2) for 25 BW S/+ (gray) and 46 S/+ on PO N2 (black)