| Literature DB >> 29479424 |
Theodore S Kalbfleisch1, Brenda M Murdoch2, Timothy P L Smith3, James D Murdoch4, Michael P Heaton3, Stephanie D McKay4.
Abstract
Background: Moose ( Alces alces) colonized the North American continent from Asia less than 15,000 years ago, and spread across the boreal forest regions of Canada and the northern United States (US). Contemporary populations have low genetic diversity, due either to low number of individuals in the original migration (founder effect), and/or subsequent population bottlenecks in North America. Genetic tests based on informative single nucleotide polymorphism (SNP) markers are helpful in forensic and wildlife conservation activities, but have been difficult to develop for moose, due to the lack of a reference genome assembly and whole genome sequence (WGS) data.Entities:
Keywords: Alces; DNA testing; SNP; animal identification; cattle genomes; genetic diversity; moose; parentage; sheep genomes; single nucleotide polymorphism; whole genome sequence; wildlife comparative genomics
Year: 2018 PMID: 29479424 PMCID: PMC5801567 DOI: 10.12688/f1000research.13501.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Locations in North America where the moose used in this study were collected.
Whole genome sequence information and alignment statistics for four North American moose.
| Reads mapped (%) | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Species | Animal
| BioSample
| Location or
| Sex | Estimated
| Gb
| Total
| Bt_UMD3.1 | Oar_v3.1 |
|
| HM2013
|
| Paxson,
| M | 19.2 | 64.1 | 461 | 13.0 | 10.4 |
|
| JC2001
|
| Green River
| F | 13.7 | 45.7 | 327 | 13.5 | 10.8 |
|
| R199 |
| Lowell,
| M | 23.2 | 77.3 | 653 | 8.8 | 6.8 |
|
| Clearwater06 |
| Elk River,
| M | 20.0 | 66.8 | 549 | 8.9 | 6.9 |
|
| 19969811 |
| USA
| M | 14.2 | 47.5 | 314 | 88.4 | nd
[ |
|
| 199735001 |
| USA
| M | 13.7 | 42.7 | 319 | 22.2 | 83.8 |
Key:
aBased on the observed linear relationship between aligned read depth (y) and total gigabases (Gb) of genomic sequence collected (x) with a quality score ≥ to 20, where y = 0.3x in cattle and sheep genomes [38, 39].
bNCBI BioProject number 325061
cNCBI BioProject number 324822
dNot determined
eNCBI BioProject number 324837
Figure 2. Computer screen images of species-associated nucleotide differences in moose.
Overlapping computer screen images of moose WGS data aligned to bovine and ovine reference genomes, respectively, showing two moose-associated nucleotides in the GDF9 gene. The bovine UMD3.1 and ovine Oar_v3.1 map positions for the variant sites are chr7:46,065,076 - 46,065,079 and chr5: 41,841,536 - 41,841,536,539, respectively.
North American moose SNPs identified by aligning WGS to bovine and ovine reference genomes.
| Reference assembly | |||
|---|---|---|---|
| Step | Progressively filtered
| Bt_UMD3.1 | Oar_v3.1 |
| 1 | Variants passing depth
| 1,095,371 | 813,006 |
| 2 | Moose-associated
| 1,049,080 | 775,700 |
| 2 | SNPs | 46,005 | 36,934 |
| 3 | SNPs with 0.125 MAF | 16,815 | 13,698 |
| 3 | SNPs with 0.250 MAF | 12,502 | 10,106 |
| 3 | SNPs with 0.375 MAF | 13,369 | 10,710 |
| 3 | SNPs with 0.500 MAF | 3,319 | 2,420 |
| 4 | SNPs with 0.500 MAF and
| 1,341 | 1,014 |
| 5 | SNPs with conserved
| 773 | 552 |
| 6 | SNPS with independent
| 317 | 317 |
Key:
aAutosomal chromosome alignment with minimum read depth of ten and minimum genotyping quality score of 30. There were approximately 60,600 and 58,200 additional moose SNPs in the respective UMD3.1 and Oar_v3.1 alignments that were heterozygous in all four moose. However, these were excluded from the SNP counts because this artifact is caused by sequence read misalignment.
bThese sites are difference from the reference and homozygous in all four moose.
c101 bp flanking regions of SNPs with 0.5 MAF that could be unambiguously identified by BLAT alignment to the other reference assembly ( Table S1 and Table S2). These regions also contained no SNPs among the four moose.
dSNP that were independently identified by alignment in each reference genome and manually grouped into 216 chromosomal bins for assay design ( Table S3).
Figure 3. Computer screen images of a highly informative moose SNP.
Overlapping computer screen images of moose WGS data aligned to bovine and ovine reference genomes, respectively, showing a highly informative SNP. Screenshots from IGV software showing one of 317 moose SNPs with a 0.5 MAF, both homozygous genotypes present, and aligned to genomic regions conserved in all three species. The bovine UMD3.1 and ovine Oar_v3.1 map positions are chr1:7,634,617 and chr1: 126,412,162, respectively.
Figure 4. Genome-wide distribution of highly informative moose SNPs.
The distribution of moose SNPs with 0.5 MAF relative to the cattle and sheep chromosomal locations (see Table S1 and Table S2 for marker details). The inset shows the chromosomal map positions with the cattle UMD3.1 reference assembly.
Figure 5. Heterozygosity analysis of moose genomes.
The ratio of heterozygous to homozygous genotypes from four moose was evaluated with sets of 773 and 552 markers SNPs with 0.5 MAF (see Table S1 and Table S2 for marker details). ( A) Genotype counts for each of the four animals with the 773 moose SNPs identified in the alignment to cattle ( Table S1) and the 552 moose SNPs identified in the alignment to sheep ( Table S2). ( B) The heterozygosity ratios calculated for each of the four animals from the 773 SNP set (grey circles); and the 552 SNP set (tan circles). The ratio consisted of the number of heterozygous sites divided by the combined number of homozygous sites.
Proportion of heterozygous sites shared between pairs of moose from Alaska (AK), Idaho (ID), Wyoming (WY), and Vermont (VT), USA.
| Proportion | ||||||||
|---|---|---|---|---|---|---|---|---|
| SNPs used from cattle
| SNPs used from sheep
| |||||||
| Source | AK | ID | WY | VT | AK | ID | WY | VT |
| AK | 1.000 | - | - | - | 1.000 | - | - | - |
| ID | 0.430 | 1.000 | - | - | 0.397 | 1.000 | - | - |
| WY | 0.378 | 0.323 | 1.000 | - | 0.391 | 0.317 | 1.000 | - |
| VT | 0.255 | 0.268 | 0.325 | 1.000 | 0.279 | 0.281 | 0.320 | 1.000 |
Moose gene variants identified by viewing selected genes in IGV.
| Bovine UMD3.1 | Moose genotype
[ | Allele frequency | |||||||
|---|---|---|---|---|---|---|---|---|---|
| BTA | Position | Gene | Feature | AK | WY | VT | ID | A1 | A2 |
| 5 | 45,830,842
[ |
| Intron 1 | G
[ | S | C | S | 0.50 | 0.50 |
| 13 | 47,415,079
[ |
| CDS, M217I
[ | K | K | G | T | 0.50 | 0.50 |
| 2 | 6,587,679 |
| Intron 2 | W | A | W | W | 0.63 | 0.38 |
| 2 | 6,219,619 |
| Exon 5 | G | C | S | C | 0.63 | 0.38 |
| 3 | 22,970,184 |
| Exon 16 | T | T | C | Y | 0.63 | 0.38 |
| 5 | 66,599,950 |
| CDS, I27V
[ | C | T | Y | T | 0.63 | 0.38 |
| 7 | 22,883,135 |
| Intron 1 | Y | Y | C | Y | 0.63 | 0.38 |
| 8 | 78,244,133 |
| Exon 3 | Y | Y | T | T | 0.75 | 0.25 |
| 16 | 27,304,417 |
| Exon 2 | C | G | G | S | 0.63 | 0.38 |
| 26 | 20,694,701 |
| Exon 17 | W | W | A | W | 0.63 | 0.38 |
Key:
aBased on samples from four individuals sourced from Alaska (AK), Wyoming (WY), Vermont (VT), and Idaho (ID), USA.
bHighly-informative moose parentage SNPs with 0.5 MAF and both homozygous genotypes present among the four moose.
cHomozygotes are denoted with the one-letter nucleotide code. Heterozygotes are denoted with IUPAC/IUBMB ambiguity codes: R = a/g, Y = c/t, M = a/c, K = g/t, S = c/g, W = a/t. [40].
dThe PRNP codon number 217 refers to the number system in cattle. In moose, this codon is at position 209.
eThe IGF1 codon is in exon 2 and the numbering for codon 27 is the same in cattle as in moose.