| Literature DB >> 27079378 |
Rahel Schrimpf1, Maren Gottschalk1, Julia Metzger1, Gunilla Martinsson2, Harald Sieme3, Ottmar Distl4.
Abstract
BACKGROUND: Stallion fertility is an economically important trait due to the increase of artificial insemination in horses. The availability of whole genome sequence data facilitates identification of rare high-impact variants contributing to stallion fertility. The aim of our study was to genotype rare high-impact variants retrieved from next-generation sequencing (NGS)-data of 11 horses in order to unravel harmful genetic variants in large samples of stallions.Entities:
Keywords: High-impact variants; NOTCH1; Stallion fertility; Whole genome screening
Mesh:
Year: 2016 PMID: 27079378 PMCID: PMC4832559 DOI: 10.1186/s12864-016-2608-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Comparison of SNPs with high effect on protein structure in genes for stallion fertility among breeds using whole-genome sequences of fertile stallions
| SNP ID | Chrom | Gene | Arabian stallion* | Sorraia stallion* | Hanoverian stallion* | Hanoverian stallion* | Dülmen Horse mare* | Icelandic stallion** | Przewalski stallion** | Standard-bred stallion** | Norwegian fjord mare** | Arabian mare* | ‘Twilight‘ mare** |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| g.26775767G>C | ECA1 | NEURL1 | 0/0 | 0/0 |
| 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.77472655G>C | ECA3 | KDR | 0/0 |
| 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.74610774C>T | ECA4 | CFTR | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 |
| g.56937215C>T | ECA5 | OVGP1 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.45985131A>G | ECA9 | FBXO43 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.82699661C>T | ECA9 | TSSK6 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.7083659A>T | ECA11 | SLC9A3R1 | 0/0 | 0/0 |
|
| 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 |
| g.40694339G>A | ECA13 | PKD1 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.6704968C>T | ECA16 | GHRL | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 |
| g.19034281C>T | ECA16 | FOXP1 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.21894180G>A | ECA17 | FNDC3A |
| 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 |
| g.32635273T>C | ECA20 | BTNL2 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 |
| g.35255390T>C | ECA20 | TCP11 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.4323852G>A | ECA25 | SPATA31E1 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 |
| g.37453246G>C | ECA25 | NOTCH1 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 |
| g.37455302G>A | ECA25 | NOTCH1 | 0/0 |
|
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.79813487A>T | ECA3 | ENSECAG00000020135 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.25184403G>C | ECA7 | ENSECAG00000021286 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| g.30073089G>A | ECA13 | ENSECAG00000018118 | 0/0 | 0/0 | 0/0 |
| 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
The variant ID (SNP ID), chromosomal position (ECA), gene symbol (Gene) and the horse genotypes are given. Genotypes either represent wild type variant (0/0), or mutant in heterozygote state (0/1), highlighted in bold. The animals marked with an asterisk were sequenced by Metzger et al. (2014) [52] the animals marked with two asterisks were sequenced by Orlando et al. (2013) [51]. The reference horse genome (Twilight) is derived from the Thoroughbred mare Twilight (EquCab2.70)
Identification of variants with high effect on protein structure in genes for stallion fertility
| SNP ID | Accession no. | Gene | Location in gene | High SNP effect | Substitution type |
|---|---|---|---|---|---|
| g.26775767G>C | ss1457622628 | NEURL1 | Exon 5 | Stop gained | p.S155* |
| g.77472655G>C | ss1457622630 | KDR | Intron 30 | Splice site donor | |
| g.74610774C>T | ss1457622632 | CFTR | Exon 17 | Stop gained | p.Q894* |
| g.56937215C>T | ss1457622634 | OVGP1 | Exon 6 | Stop gained | p.R145* |
| g.45985131A>G | ss1457622636 | FBXO43 | Exon 1 | Start lost | p.M1T |
| g.82699661C>T | ss1457622637 | TSSK6 | Exon 3 | Stop gained | p.W155* |
| g.7083659A>T | ss1457622638 | SLC9A3R1 | Splice site intron 9 | Splice site donor | |
| g.40694339G>A | ss1457622640 | PKD1 | Exon 17 | Stop gained | p.W339* |
| g.6704968C>T | ss1457622641 | GHRL | Splice site intron 1 | Splice site acceptor | |
| g.19034281C>T | ss1457622642 | FOXP1 | Splice site intron 4 | Splice site donor | |
| g.21894180G>A | ss1457622643 | FNDC3A | Splice site intron 27 | Splice site acceptor | |
| g.32635273T>C | ss1457622644 | BTNL2 | Exon 1 | Start lost | p.M1T |
| g.35255390T>C | ss1457622645 | TCP11 | Exon 1 | Start lost | p.M1T |
| g.4323852G>A | ss1457622646 | SPATA31E1 | Exon 3 | Stop gained | p.R423* |
| g.37453246G>C | ss1457622647 | NOTCH1 | Splice site intron 4 | Splice site acceptor | |
| g.37455302G>A | ss1457622648 | NOTCH1 | Splice site intron 14 | Splice site donor | |
| g.79813487A>T | ss1457622631 | ENSECAG00000020135 | n.a. | Splice site acceptor, | |
| Splice site donor | |||||
| g.25184403G>C | ss1457622635 | ENSECAG00000021286 | n.a. | Splice site donor | |
| g.30073089G>A | ss1457622639 | ENSECAG00000018118 | n.a. | Splice site donor |
The variant ID (SNP ID) and dbSNP accession number, gene symbol and location in the gene, type of high-impact effect and corresponding substitution type on protein level are given. Variants causing a stop codon are marked with an asterix
Validation of 17 high-impact SNPs in 337 fertile stallions
| HWE | |||||
|---|---|---|---|---|---|
| Polymorphism | Candidate gene | HET |
| DF | P-HWE |
| g.26775767G>C | NEURL | 0.116 | 22.094 | 1 | <0.0001 |
| g.77472655G>C | KDR | 0.228 | 0.324 | 1 | 0.569 |
| g.74610774C>T | CFTR | 0.004 | 0.001 | 1 | 0.976 |
| g.56937215C>T | OVGP1 | 0.111 | 0.810 | 1 | 0.368 |
| g.45985131A>G | FBXO43 | 0.032 | 0.082 | 1 | 0.775 |
| g.82699661C>T | TSSK6 | 0.068 | 0.380 | 1 | 0.538 |
| g.7083659A>T | SLC9A3R1 | 0.398 | 0.476 | 1 | 0.490 |
| g.40694339G>A | PKD1 | 0.013 | 0.013 | 1 | 0.911 |
| g.6704968C>T | GHRL | 0.019 | 18.572 | 1 | <0.001 |
| g.19034281C>T | FOXP1 | 0.104 | 0.959 | 1 | 0.327 |
| g.32635273T>C | BTNL2 | 0.168 | 120.250 | 1 | <0.0001 |
| g.35255390T>C | TCP11 | 0.022 | 0.040 | 1 | 0.842 |
| g.4323852G>A | SPATA31E1 | 0.009 | 0.007 | 1 | 0.933 |
| g.37453246G>C | NOTCH1 | 0.023 | 0.040 | 1 | 0.841 |
| g.37455302G>A | NOTCH1 | 0.226 | 0.272 | 1 | 0.602 |
| g.79813487A>T | ENSECAG00000020135 | 0 | 0 | 0 | – |
| g.25184403G>C | ENSECAG00000021286 | 0 | 0 | 0 | – |
For each polymorphism, corresponding candidate gene, heterozygosity (HET), test for Hardy Weinberg equilibrium (HWE) with the respective chi square (X 2) statistic, degrees of freedom (DF) and P-value (P-HWE) are given
Minor allele (MA), minor allele frequencies (MAF), minor genotype frequencies (MAG) for high-impact variants within candidate genes for stallion fertility validated in 337 fertile stallions
| Allele | Genotype | ||||
|---|---|---|---|---|---|
| Polymorphism | MA | MAF | AA | AB | BB |
| g.26775767G>C | C | 0.0865 | 0.855 | 0.116 | 0.028 |
| NEURL1 | 272/318 | 37/318 | 9/318 | ||
| g.77472655G>C | C | 0.1266 | 0.759 | 0.228 | 0.013 |
| KDR | 243/320 | 73/320 | 4/320 | ||
| g.74610774C>T | T | 0.0018 | 0.997 | 0.004 | 0 |
| CFTR | 283/284 | 1/284 | |||
| g.56937215C>T | T | 0.0556 | 0.889 | 0.111 | 0 |
| OVGP1 | 208/234 | 26/234 | |||
| g.45985131A>G | G | 0.0158 | 0.968 | 0.032 | 0 |
| FBXO43 | 306/316 | 10/316 | |||
| g.82699661C>T | T | 0.0338 | 0.933 | 0.068 | 0 |
| TSSK6 | 290/311 | 21/311 | |||
| g.7083659A>T | T | 0.2919 | 0.509 | 0.398 | 0.093 |
| SLC9A3R1 | 164/322 | 128/322 | 30/322 | ||
| g.40694339G>A | A | 0.0062 | 0.988 | 0.013 | 0 |
| PKD1 | 317/321 | 4/321 | |||
| g.6704968C>T | T | 0.0125 | 0.978 | 0.019 | 0.003 |
| GHRL | 314/321 | 6/321 | 1/321 | ||
| g.19034281C>T | T | 0.0522 | 0.896 | 0.104 | 0 |
| FOXP1 | 283/316 | 33/316 | |||
| g.32635273T>C | C | 0.3064 | 0.610 | 0.168 | 0.223 |
| BTNL2 | 200/328 | 55/328 | 73/328 | ||
| g.35255390T>C | C | 0.0110 | 0.978 | 0.022 | 0 |
| TCP11 | 310/317 | 7/317 | |||
| g.4323852G>A | A | 0.0047 | 0.991 | 0.009 | 0 |
| SPATA31E1 | 318/321 | 3/321 | |||
| g.37453246G>C | C | 0.0113 | 0.978 | 0.023 | 0 |
| NOTCH1 | 304/311 | 7/311 | |||
| g.37455302G>A | A | 0.1341 | 0.753 | 0.226 | 0.021 |
| NOTCH1 | 247/328 | 74/328 | 7/328 | ||
| g.79813487A>T | T | 0 | 1 | 0 | 0 |
| ENSECAG00000020135 | 74/74 | ||||
| g.25184403G>C | C | 0 | 1 | 0 | 0 |
| ENSECAG00000021286 | 74/74 | ||||
Evaluation of high-impact variants for stallion fertility
| SNP ID | GENE | EBV-PAT association | GTmissing | 2pqhigh | 2pqlow | E (q2 stallion) | Number of breeds | Private | Rare MAF < 0.01 | MAF 0.01-0.1 | MAF >0.1 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| g.26775767G>C | NEURL | 7 | + | ||||||||
| g.77472655G>C | KDR | 11 | + | ||||||||
| g.74610774C>T | CFTR | + | 0.004 | 0.001 | 2 | + | |||||
| g.56937215C>T | OVGP1 | + | 0.111 | 0.723 | 3 | + | |||||
| g.45985131A>G | FBXO43 | + | 0.032 | 0.079 | 4 | + | |||||
| g.82699661C>T | TSSK6 | + | 0.068 | 0.355 | 8 | + | |||||
| g.7083659A>T | SLC9A3R1 | 12 | + | ||||||||
| g.40694339G>A | PKD1 | + | 0.013 | 0.012 | 2 | + | |||||
| g.6704968C>T | GHRL | 5 | + | ||||||||
| g.19034281C>T | FOXP1 | + | 0.111 | 0.861 | 11 | + | |||||
| g.32635273T>C | BTNL2 | 7 | + | ||||||||
| g.35255390T>C | TCP11 | + | 0.022 | 0.038 | 2 | + | |||||
| g.4323852G>A | SPATA31E1 | + | 0.009 | 0.007 | 3 | + | |||||
| g.37453246G>C | NOTCH1 | + | 0.023 | 0.040 | 4 | + | |||||
| g.37455302G>A | NOTCH1 | 0.008 | 9 | + | |||||||
| g.79813487A>T | ENSECAG00000020135 | 1 | + | ||||||||
| g.25184403G>C | ENSECAG00000021286 | 1 | + |
Significant associations of the high-impact variants with the estimated breeding values of the paternal component for the pregnancy rate per estrus (EBV-PAT) in Hanoverian stallions (n = 216) are indicated by Bonferroni-corrected P-values. For each variant, a plus sign indicates the missing mutant homozygous genotype (GTmissing), high and low heterozygous genotype frequencies (2pqhigh, 2pqlow) and the expected number of mutant homozygous genotypes for stallions E (q2 stallion) are given. The number of breeds where the variants were observed is counted and a grouping (group I-IV) by the minor allele frequency (MAF) is given
Most promising high-impact variants in genes contributing to stallion fertility
| Loci for stallion fertility | Loci for stallion infertility | No obvious relation to stallion reproduction | |||||
|---|---|---|---|---|---|---|---|
| Class I | Class II | Class III | |||||
| SNP ID | Gene | SNP ID | Gene | SNP ID | Gene | SNP ID | Gene |
| g.37455302G>A | NOTCH1 | g.74610774C>T | CFTR | g.79813487A>T | ENSECAG00000020135 | g.32635273T>C | BTNL2 |
| g.56937215C>T | OVGP1 | g.25184403G>C | ENSECAG00000021286 | g.7083659A>T | SLC9A3R1 | ||
| g.45985131A>G | FBXO43 | g.6704968C>T | GHRL | ||||
| g.82699661C>T | TSSK6 | g.26775767G>C | NEURL1 | ||||
| g.40694339G>A | PKD1 | g.77472655G>C | KDR | ||||
| g.19034281C>T | FOXP1 | ||||||
| g.35255390 T>C | TCP11 | ||||||
| g.4323852G>A | SPATA31E1 | ||||||
| g.37453246G>C | NOTCH1 | ||||||
Three classes of high-impact variants with corresponding single nucleotide polymorphism (SNP) identities (ID) and gene are given