| Literature DB >> 22235840 |
Michael P Mullen1, Christopher J Creevey, Donagh P Berry, Matt S McCabe, David A Magee, Dawn J Howard, Aideen P Killeen, Stephen D Park, Paul A McGettigan, Matt C Lucy, David E Machugh, Sinead M Waters.
Abstract
BACKGROUND: The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite for the identification of genetic variants underlying complex traits and although technologies are improving rapidly, high-throughput sequencing of large numbers of complete individual genomes remains prohibitively expensive. Therefore using a pooled DNA approach coupled with target enrichment and high-throughput sequencing, the aim of this study was to identify polymorphisms and estimate allele frequency differences across 83 candidate genes of the somatotrophic axis, in 150 Holstein-Friesian dairy bulls divided into two groups divergent for genetic merit for fertility.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22235840 PMCID: PMC3315736 DOI: 10.1186/1471-2164-13-16
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Capture efficiency
| Low calving interval pool | High calving interval pool | |
|---|---|---|
| Total sequencing data generated (Gb) | 1.57 | 1.38 |
| Sequence data mapped to bovine genome (Gb) | 1.23 (78%) | 0.9 (67%) |
| Data mapped to targeted regions (Mb) | 143 (12%) | 132 (14%) |
| Data remaining after quality control (Mb) | 74 (52%) | 48 (34%) |
| Number of bases targeted (Mb) | 1.54 | 1.54 |
| Quantity of bases covered (Mb) | 1.18 (76%) | 1.20 (77%) |
| Average fold coverage per base | 56 | 32 |
Percentages in parentheses represent proportions related to the category above each field.
Variants detected across the 1.5Mb of targeted sequence
| Shared between pools | Unique in Low CIV pool2 | Unique to High CIV pool2 | ||||
|---|---|---|---|---|---|---|
| SNP | Indel | SNP | Indel | SNP | Indel | |
| 5' and 3' UTR | 547 | 86 | 93 | 50 | 118 | 58 |
| Exonic1 | 278 (153) | 29 | 57 (38) | 17 | 65 (45) | 19 |
| Intronic | 2226 | 277 | 310 | 189 | 441 | 168 |
1: Numbers in parenthesis represent the number of SNPs resulting in non synonymous substitutions; 2: Variants classified as unique represent variants undetected in the corresponding pool.
Top 20 most significant SNPs, at least 10 Mb apart, in exonic, intronic and UTR regions displaying differential frequencies between groups of cattle divergent for calving interval.
| Entrez gene/Ensembl ID | Location | Allele | Frequency in Low CIV pool1 | Frequency in High CIV pool1 | Type/Location3 | Predicted effect4 | Reference5 | ||
|---|---|---|---|---|---|---|---|---|---|
| 19 | 49749042 | G>T | 0.00 | 0.38 | Exon NSS | N/A | rs41917096 | ||
| 21 | 49258728 | C>T | 0.00 | 0.34 | 4.8 × | Exon NSS | N/A | novel | |
| 21 | 9510000 | A>G | 0.00 | 0.52 | 9.1 × | Exon NSS | N/A | novel | |
| 7 | 37630361 | T>G | 0.00 | 0.30 | 8.2 × | Exon NSS | N/A | novel | |
| 19 | 43679543 | G>T | 0.00 | 0.28 | 1.2 × | Exon NSS | N/A | novel | |
| 2 | 108855684 | C>T | 0.00 | 0.54 | 2.3 × | 5' | 2 × pTFBS | novel | |
| 7 | 871347 | C>T | 0.47 | 0.09 | 5.3 × | 5' | 1 × pTFBS | rs43495395 | |
| 11 | 74275999 | G>A | 0.00 | 0.39 | 2.2 × | 5' | 8 × pTFBS | novel | |
| 20 | 34207771 | C>A | 0.00 | 0.39 | 8.3 × | 5' | None | novel | |
| 28 | 24994609 | C>T | 0.00 | 0.18 | 2.4 × | 3' | 1 × miRNA | novel | |
| 13 | 66803046 | A>G | 0.65 | 0.25 | 3.1 × | 5' | None | novel | |
| X | 35195377 | C>G | 0.00 | 0.83 | 4.9 × | 3' | None | novel | |
| 3 | 110250920 | T>G | 0.00 | 0.38 | 5.0 × | 3' | 4 × miRNA | novel | |
| 17 | 73990477 | C>T | 0.68 | 0.18 | 9.7 × | 3' | None | rs41255339 | |
| 10 | 78593544 | C>T | 0.00 | 0.21 | 9.3 × | 3' | 1 × miRNA | novel | |
| 9 | 100136966 | C>T | 0.79 | 0.00 | 5.5 × | Intron | N/A | novel | |
| 7 | 4996360 | C>A | 0.00 | 0.55 | 2.8 × | Intron | N/A | novel | |
| 7 | 14524938 | G>A | 0.00 | 0.44 | 4.4 × | Intron | N/A | novel | |
| 4 | 78896406 | A>T | 0.00 | 0.42 | 8.8 × | Intron | N/A | novel | |
| 18 | 48205429 | T>A | 0.00 | 0.53 | 1.6 × | Intron | N/A | novel |
1: Frequency of second allele displayed; 2: Benjamini and Hochberg corrected P value; 3: NSS = SNPs resulting in non synonymous substitutions; 4: Predicted effects on transcription factor binding sites in the 5' regulatory regions using MatInspector software package [33] and on microRNA binding sites in the 3' regulatory regions using MicroInspector software [34], detailed information on predicted effects displayed in Tables 6 and 7; 5: SNPs classified according to dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/ accessed 9th September 2011).
Figure 1Histogram of the distribution of the allele frequency differentials observed between high and low calving interval (CIV) pools.
Figure 2Q-Q plot representing the distribution of the allele frequency differentials observed between high and low calving interval (CIV) pools.
Figure 3Box plot representing the quartile distribution of sequencing depth in the detected and undetected SNP groups compared to dbSNP v130.
Comparison between allele frequencies generated using Sequenom® MassARRAY and Illumina GAIIx technologies.
| EntrezGene ID | SNP name1 | Position | Allele sub.2 | Allele frequency in low CIV pool3 | Allele frequency in high CIV pool3 | Reference5 | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 33897071 | T > C | 0.19 | 0.15 | 1.00 | 0.12 | 0.10 | 1.00 | AF140284 | |||
| 33897099 | A > G | 0.34 | 0.28 | 0.96 | 0.15 | 0.18 | 1.00 | AF140284 | |||
| 33897128 | C > T | 0.13 | 0.15 | 1.00 | 0.02 | 0.00 | 1.00 | AF140284 | |||
| 33897151 | T > G | 0.21 | 0.22 | 1.00 | 0.12 | 0.10 | 1.00 | AM161140 | |||
| 33897252 | A > G | 0.13 | 0.14 | 1.00 | 0.02 | 0.00 | 1.00 | n/a | |||
| 33915503 | A > T | 0.00 | 0.00 | 1.00 | 0.09 | 0.09 | 1.00 | AM161140 | |||
| 33994639 | G > T | 0.62 | 0.65 | 1.00 | 0.47 | 0.29 | 0.23 | rs109702942 | |||
| 33995251 | G > C | 0.07 | 0.09 | 1.00 | 0.00 | 0.00 | 1.00 | n/a | |||
| 34086084 | C > T | 0.69 | 0.73 | 1.00 | 0.31 | 0.29 | 1.00 | AF126288 | |||
| 20 | 34101240 | C > T | 0.05 | 0.00 | 0.96 | 0.37 | 0.33 | 1.00 | rs110979028 | ||
| 34166627 | A > G | 0.08 | 0.12 | 1.00 | 0.43 | 0.39 | 1.00 | n/a | |||
| 34166731 | A > G | 0.09 | 0.13 | 1.00 | 0.43 | 0.31 | 0.96 | n/a | |||
| 34166898 | T > C | 0.09 | 0.00 | 0.23 | 0.42 | 0.33 | 0.96 | n/a | |||
| 34166944 | C > T | 0.09 | 0.00 | 0.22 | 0.42 | 0.38 | 1.00 | rs109825954 | |||
| 34166970 | A > G | 0.09 | 0.00 | 0.23 | 0.42 | 0.36 | 1.00 | n/a | |||
| 34166982 | A > C | 0.09 | 0.00 | 0.22 | 0.43 | 0.33 | 0.96 | n/a | |||
| 34167025 | C > T | 0.09 | 0.00 | 0.23 | 0.42 | 0.37 | 1.00 | n/a | |||
| 34167126 | G > A | 0.09 | 0.09 | 1.00 | 0.43 | 0.35 | 1.00 | n/a | |||
| 34167240 | C > T | 0.09 | 0.00 | 0.13 | 0.44 | 0.52 | 1.00 | n/a | |||
| 49657225 | G > A | 0.90 | 0.86 | 1.00 | 0.84 | 0.91 | 1.00 | rs41923481 | |||
| 49657293 | G > A | 0.92 | 0.82 | 1.00 | 0.92 | 0.90 | 1.00 | rs196003433 | |||
| 49657371 | C > G | 0.92 | 1.00 | 1.00 | 0.89 | 1.00 | 1.00 | rs41923483 | |||
| 49660125 | T > G | 0.08 | 0.00 | 0.13 | 0.12 | 0.08 | 1.00 | n/a | |||
| 49660275 | G > C | 0.33 | 0.36 | 1.00 | 0.09 | 0.11 | 1.00 | rs41923484 | |||
| 49660469 | A > C | 0.08 | 0.00 | 0.96 | 0.12 | 0.12 | 1.00 | rs196003424 | |||
| 49693278 | G > A | 0.33 | 0.21 | 1.00 | 0.59 | 0.60 | 1.00 | rs41923525 | |||
| 19 | 49693285 | C > G | 0.29 | 0.15 | 0.96 | 0.54 | 0.58 | 1.00 | rs41923524 | ||
| 49693316 | G > C | 0.29 | 0.40 | 1.00 | 0.52 | 0.38 | 1.00 | rs41923523 | |||
| 49693328 | G > A | 0.30 | 0.35 | 1.00 | 0.55 | 0.29 | 0.96 | rs41923522 | |||
| 49693374 | C > G | 0.34 | 0.33 | 1.00 | 0.61 | 0.53 | 1.00 | rs41923521 | |||
| 49693442 | A > G | 0.67 | 0.58 | 1.00 | 0.39 | 0.37 | 1.00 | rs196301608 | |||
| 49693460 | C > T | 0.67 | 0.62 | 1.00 | 0.39 | 0.28 | 1.00 | rs196003442 | |||
| 49693512 | A > G | 0.66 | 0.65 | 1.00 | 0.38 | 0.38 | 1.00 | rs196003441 | |||
| 49693686 | C > T | 0.50 | 0.26 | 0.36 | 0.50 | 0.64 | 0.96 | rs41923520 | |||
| 71150007 | C > T | 0.99 | 1.00 | 1.00 | 0.90 | 0.93 | 1.00 | rs29012855 | |||
| 71175747 | T > C | 0.17 | 0.24 | 0.95 | 0.01 | 0.00 | 1.00 | rs109557731 | |||
| 5 | 71175753 | A > G | 0.49 | 0.46 | 1.00 | 0.25 | 0.15 | 0.70 | rs109227434 | ||
| 71176219 | A > T | 0.26 | 0.16 | 1.00 | 0.37 | 0.25 | 1.00 | rs110076130 | |||
| 71198324 | G > A | 0.35 | 0.26 | 0.96 | 0.62 | 0.76 | 0.92 | AF017143 | |||
| 29 | 51250879 | A > G | 0.40 | 0.40 | 1.00 | 0.65 | 0.47 | 0.83 | rs42196909 | ||
| 100112956 | A > G | 0.87 | 0.79 | 1.00 | 0.87 | 0.87 | 1.00 | rs41623543 | |||
| 9 | 100113379 | G > A | 0.63 | 0.65 | 1.00 | 0.77 | 0.47 | 0.22 | rs41623544 | ||
| 100134604 | G > A | 0.86 | 0.76 | 0.73 | 0.86 | 1.00 | 0.62 | n/a | |||
1: SNP details according to previous studies [4,5,7,31,32]; 2: Reported from the forward strand; 3: Allele frequencies represents second allele; 4: Benjamini and Hochberg corrected P values; 5:Genbank accession or dbSNP reference (http://www.ncbi.nlm.nih.gov/projects/SNP/ accessed 9th September 2011).
Figure 4Comparison of allele frequencies. 1Actual genotype frequencies calculated from Sequenom® MassARRAY data obtained from previous studies [4,5,7,31,32].
Effects of SNPs located in the 5' UTR of IGFBP5, MAPK9 and GCK on predicted transcription factor binding sites.
| Entrez Gene ID | Position | Allele | Strand | Matrix Family | Core similarity | Matrix similarity | Site sequence | Detailed Family Information | |
|---|---|---|---|---|---|---|---|---|---|
| 2 | 108855684 | T | (-) | V$ZF07 | 1.00 | 0.94 | ggtccCTCCtctcag | C2H2 zinc finger transcription factors 7 | |
| (+) | V$PLAG | 1.00 | 0.89 | gaGAGGagggacccaggggaggg | Pleomorphic adenoma gene | ||||
| 7 | 871347 | C | (+) | V$NFKB | 1.00 | 0.93 | aacgggtgTTCCttc | Nuclear factor kappa B/c-rel | |
| (+) | V$EREF | 1.00 | 0.81 | cagggaggactgtgtGACCtggt | Estrogen response elements | ||||
| (-) | V$PPAR | 0.76 | 0.71 | aacCAGGtcacacagtcctccct | Peroxisome proliferator activated receptor homodimers | ||||
| G | (-) | V$PAX3 | 1.00 | 0.87 | caggTCACacagtcctccc | PAX-3 binding sites | |||
| 11 | 74275999 | (-) | V$EREF | 1.00 | 0.93 | aaaccagGTCAcacagtcctccc | Estrogen response elements | ||
| (-) | V$RORA | 1.00 | 0.96 | agaaaaccaGGTCacacagtcct | v-ERB and RAR-related orphan receptor alpha | ||||
| (+) | V$GREF | 0.75 | 0.86 | ggaggactgtgTGATctgg | Glucocorticoid responsive and related elements | ||||
| A | (-) | V$GATA | 0.85 | 0.90 | accaGATCacaca | GATA binding factors | |||
| (-) | V$RXRF | 1.00 | 0.79 | agcaaaGGTCagaaaaccagatcac | RXR heterodimer binding sites | ||||
The "core sequence" of a matrix is defined as the (usually 4) consecutive highest conserved positions of the matrix (marked in uppercase letters). A perfect match between the consensus bovine sequence and the matrix gets a score of 1.00 (each sequence position corresponds to the highest conserved nucleotide at that position in the matrix); a "good" match to the matrix usually has a similarity of >0.80.
Effects of SNPs located in the 5' UTR of SLC2A1, HK1 and ESR2 on predicted miRNA binding sites.
| Entrez Gene ID/Ensembl ID | Chromosome | Position | Allele | Strand | miRNA name | miRNA sequence | Free energy ΔG1 (kcal/mol) |
|---|---|---|---|---|---|---|---|
| T | (+) | bta-miR-10b | uacccuguagaaccgaauuugug | -20.7 | |||
| 3 | 110250920 | bta-miR-136 | acuccauuuguuuugaugaugga | -22.0 | |||
| G | (+) | bta-miR-1249 | acgcccuucccccccuucuuca | -21.5 | |||
| bta-miR-2284r | uuggcccaaaaguucguucggau | -21.3 | |||||
| 28 | 24994609 | T | (+) | bta-miR-2465 | ugagccacaguagagccuuggau | -21.9 | |
| 10 | 78593544 | A | (-) | bta-miR-2348 | uucgggugguguggagcggcc | -21.9 | |
1Free energy: Gibbs free energy (ΔG) of the duplex structure is indicated.