| Literature DB >> 28928950 |
Michael P Heaton1, Timothy P L Smith1, Bradley A Freking1, Aspen M Workman1, Gary L Bennett1, Jacky K Carnahan1, Theodore S Kalbfleisch2.
Abstract
Background: Access to sheep genome sequences significantly improves the chances of identifying genes that may influence the health, welfare, and productivity of these animals.Entities:
Keywords: BMP15; BMP1RB; Fecundity; GDF9; Litter size; Sheep; Whole genome sequence
Year: 2017 PMID: 28928950 PMCID: PMC5590088 DOI: 10.12688/f1000research.12216.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. USMARC Sheep Diversity Panel version 2.4.
This group of 96 rams was sampled from USMARC and private U.S. flocks to represent genetic diversity for traits such as fertility, prolificacy, maternal ability, growth rate, carcass leanness, wool quality, mature weight, and longevity.
Figure 2. Comparison of 163 reference SNP genotypes with those derived from WGS data.
( A) Computer screen image of one animal’s WGS data aligned to ovine reference assembly Oar_v3.1 at a reference SNP site. The heterozygous C/T genotype is shown as viewed with the IGV software [7, 8]. ( B) Linear relationship between mapped read depth and the amount (Gb) of Q20 WGS data collected. At each SNP position, the read depth and genotypes were visualized and manually recorded for 163 parentage SNPs. ( C), genotype scoring accuracy for 163 parentage SNPs in 96 sires. Consensus reference genotypes (n = 15,684) for the parentage SNPs were previously determined by multiple methods [22].
Figure 3. Comparison of WGS genotypes from 96 rams with those from bead arrays.
( A) The distribution of average WGS read depth across 45,946 SNP sites for 96 sires combined. ( B) A comparison of the average WGS read depth per animal to the average genotype concordance between 45,946 WGS and bead array genotypes.
Figure 4. Physical maps and rooted maximum parsimony phylogenetic trees of GDF9, BMP15, and BMPR1B protein variants in U.S. sheep.
( A) Physical maps of GDF9, BMP15, and BMPR1B exon and protein domains in relationship to missense variants. ( B) Maximum parsimony phylogenetic trees of haplotype-phased protein variant identified in the sheep diversity panel. For each gene analyzed, the most frequent protein isoform was defined as “variant 1” and used as the reference sequence for each tree. Each node in a tree represents a different protein isoform that varies by one amino acid compared to adjacent nodes. The areas of the circles are proportional to the variant frequency in the panel of 96 rams. The trees were rooted based on evolutionary conservation of residues in closely related species. The predicted root of GDF9 was not observed in the 96 rams.
DNA sequence information for GDF9, BMP15, and BMP15R1B missense variants identified in the sheep diversity panel (MSDPv2.4).
| Gene | Codon
| Position
| Exon | Pro.
| Codon
[ | Codon
| MAF
[ | Flanking genomic sequence |
|---|---|---|---|---|---|---|---|---|
|
| R87H | chr5:41,843,258 | 1 | PRO | c
| c
| 0.037 | ggtgggagacacagacctggtctcctttcccctctcttagaggttctgtatgatgggcacggggaaccccccaggctgcagccagatgacagagctttgc
|
| E241K | chr5:41,841,675 | 2 | PRO |
|
| 0.037 | gaaatacaaatggatggagattgatgtgacggctcctcttgagcctctggtggcctcccacaagaggaatattcacatgtctgtaaattttacatgtgcg
| |
| V332I | chr5:41,841,402 | 2 | MAT |
|
| 0.245 | atctgcctaccccgtgggagaagaagctgctgagggtgtaagatcgtcccgtcaccgcagagaccaggagagtgccagctctgaattgaagaagcctctg
| |
| V371M | chr5:41,841,285 | 2 | MAT |
|
| 0.026 | tctgagtgaatacttcaaacagtttctttttccccagaatgaatgtgagctccatgactttagacttagctttagtcagctgaagtgggacaactggatt
| |
|
| L11ΔL
[ | chrX:50,977,397 | 1 | PRO | indel | cttctt = LL
| 0.344 | gtaaaaggaaaggtttaaagcgttatcctttgggcttttatcagaacatgttgctgaacaccaagcttttcaagatggtcctcctgagcatccttagaatc
|
| R67Q | chrX:50,977,228 | 1 | PRO | c
| c
| 0.010 | cacctgcctgaggcccctaccttgcccctgattcaggagctgctagaagaagcccctggcaagcagcagaggaagccgcgggtcttagggcatcccttac
| |
| L252P | chrX:50,971,365 | 1 | MAT | c
| c
| 0.083 | ttctggtggcatggcacttcatcattggacactgtcttcttgttactgtatttcaatgacactcagagtgttcagaagaccaaacctctccctaaaggcc
| |
|
| M64I | chr6:29,401,381 | 4 | AR | at
| at
| 0.026 | acacacacacacacacacacacacacacatactttgcctgtttgatctttagcacagatggatattgtttcacgatgatagaagaagatgactctgggat
|
| T345N | chr6:29,380,795 | 9 | AS-AL | a
| a
| 0.026 | cctgtggtcacttctggatgtctaggactagaaggctcagattttcagtgtcgggtaaggaagataccttggttccactttgtaaccttttattggcaag
|
aAll variants and sequences are oriented from the sense strand perspective. However, GDF9, BMP15, and BMPR1B are oriented in the opposite direction with regards to the Oar_v3.1 reference assembly. Alphabetical abbreviations for relevant amino acids: E, glutamate; H, histidine; I, isoleucine; K, lysine; L, leucine; M, methionine; N, asparagine; P, proline; Q, glutamine; R, arginine; T, threonine; and V, valine.
bProtein domain abbreviations: PRO, propeptide; MAT, mature peptide; AR, activin receptor domain; and AS-AL, between the active site proton acceptor and the activation loop domains.
cIUPAC/IUBMB ambiguity codes used for nucleotides: R = a/g, Y = c/t, M = a/c, K = g/t, S = c/g, W = a/t [35].
dThe major allele is listed first.
eMinor allele frequency in MSDPv2.4.
fThe L11ΔL variant is an abbreviation for p.(Leu10_11delinsLeu), the recommended nomenclature for this variant by the Human Genome Variation Society [36].
Comparison of missense, nonsense, and frameshift variants in the coding sequences of ovine GDF9, BMP15, BMPR1B and their phenotypic association in sheep.
| Gene
| Coding
| Phenotype | Breed groups | Ref. |
|---|---|---|---|---|
| GDF9
| R87H | None reported | Multiple |
|
| E241K | None reported | Multiple |
| |
| R315C | Fecundity, Vacaria,
| Ile de France |
| |
| V332I | None reported | Multiple |
| |
| F345C | Fecundity, Embrapa,
| Santa Inês |
| |
| V371M | Fecundity | Finnish landrace |
| |
| S395F | Fecundity, High Fertility,
| Belclare, Cambridge |
| |
| S427R | Fecundity, Thoka,
| Icelandic |
| |
| BMP15
| L11ΔL | None reported | Multiple |
|
|
|
|
|
| |
| W154Δ17 | Fecundity, Rasa Aragonesa,
| Rasa Aragonesa |
| |
| Q239stop | Fecundity, Galway,
| Belclare, Cambridge |
| |
|
|
|
|
| |
| Q291stop | Fecundity, Hanna,
| Romney |
| |
| V299D | Fecundity, Inverdale,
| Romney |
| |
| T317I | Fecundity, Grivette,
| Grivette |
| |
| C321Y | Fecundity, Lacaune,
| Lacaune |
| |
| N337H | Fecundity, Olkuska,
| Olkuska |
| |
| S367I | Fecundity, Belclare,
| Belclare |
| |
| BMPR1B
|
|
|
|
|
| Q249R | Fecundity Booroola,
| Booroola,
|
| |
|
|
|
|
|
aBold font indicates previously unreported variants affecting the protein sequence.
Frequencies of haplotype-phased GDF9, BMP15, and BMPR1B protein variants in U.S. sheep.
| Protein | Protein variant
| Variant amino acids
[ | Sheep diversity panel
|
|---|---|---|---|
| GDF9 | 1 | R87, E241, V332, V371 | 0.693 |
| 2 | R87, E241,
| 0.245 | |
| 3 |
| 0.036 | |
| 4 | R87, E241, V332,
| 0.026 | |
| BMP15 | 1 | L11, R67, L252 | 0.656 |
| 2 |
| 0.250 | |
| 3 |
| 0.083 | |
| 4 |
| 0.010 | |
| BMPR1B | 1 | M64, T345 | 0.948 |
| 2 |
| 0.026 | |
| 3 | M64,
| 0.026 |
aThe bolded residues are those differing from “variant 1” in each gene.
bThe protein variant frequency.
Figure 5. Evolutionary comparison GDF9, BMP15, and BMPR1B protein residues at their variant sites in U.S. sheep.
Aligned protein sequences from a representative subset of 29 vertebrate species were compared. Abbreviations and symbols are as follows: TMRCA, estimated time to most recent common ancestor in millions of years [48]; letters, IUPAC/IUBMB codes for amino acids; dot, amino acid residues identical to those in sheep “variant 1”; triangle, net deletion of one leucine residue in BMP15 positions 10 and 11 where two leucine residues are commonly present; ni, a fourth protein variant was not identified for BMPR1B; nr, not in refseq_protein database and thus residues were determined by analyzing WGS data; dash, not enough sequence similarity for comparison or missing polypeptide region; nm, did not match a refseq_protein in the database for that species.
Frequency estimates of GDF9, BMP15, and BMPR1B protein variants by breed group.
| Protein variant frequency
[ | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GDF9 | BMP15 | BMPR1B | ||||||||||||
| 1 | 2 | 3 | 4
[ | 1 | 2 | 3
[ | 4
[ | 1 | 2
[ | 3
[ | ||||
|
| No. | (Ref.) | (I332) | (H87,
| (M371) | (Ref.) | (ΔL) | (P252) | (Q67) | (Ref.) | (I64) | (N345) | ||
| Dorper | 6 | 0.33 | 0.67 | -
[ | - | 0.17 | 0.50 | 0.33 | - | 1.00 | - | - | ||
| Dorper, white | 4 | 0.75 | 0.25 | - | - | - | - | 1.00 | - | 1.00 | - | - | ||
| Dorset | 11 | 0.64 | 0.36 | - | - | 0.82 | 0.09 | - | 0.09 | 1.00 | - | - | ||
| Finn | 10 | 0.55 | 0.20 | - | 0.25 | 0.70 | 0.30 | - | - | 0.95 | - | 0.05 | ||
| Katahdin | 8 | 0.75 | 0.25 | - | - | 1.00 | - | - | - | 0.81 | 0.19 | - | ||
| Navajo-Churro | 1 | 1.00 | - | - | - | 1.00 | - | - | - | 1.00 | - | - | ||
| Rambouillet | 10 | 0.85 | 0.15 | - | - | 0.20 | 0.80 | - | - | 1.00 | - | - | ||
| Romanov | 10 | 0.30 | 0.60 | 0.10 | - | 0.30 | 0.70 | - | - | 0.80 | - | 0.20 | ||
| Suffolk | 9 | 0.94 | 0.06 | - | - | 1.00 | - | - | - | 1.00 | - | - | ||
| Texel | 10 | 0.70 | 0.10 | 0.20 | - | 0.90 | 0.10 | - | - | 1.00 | - | - | ||
| Composite | 17 | 0.88 | 0.09 | 0.03 | - | 0.82 | 0.06 | 0.12 | - | 0.94 | 0.06 | - | ||
aThe variants correspond to those shown in Figure 4. The distinctive missense variant or reference isoform is indicated in parentheses.
bGDF9 protein “variant 4" contains the M371 amino acid previously associated with litter size in Finnish landrace sheep [11, 26, 37].
cBMP15 protein “variants 3 and 4" contain the previously unreported P252, and Q67 residues, respectively.
dBMPR1B protein “variants 2 and 3” contain the previously unreported I64 and N345 missense variants, respectively.
eHyphen indicates the variant was not detected in that group.