Literature DB >> 22384319

Fine Mapping of a QTL for Fertility on BTA7 and Its Association With a CNV in the Israeli Holsteins.

Giora Glick, Andrey Shirak, Eyal Seroussi, Yoel Zeron, Efraim Ezra, Joel I Weller, Micha Ron.   

Abstract

A quantitative trait locus (QTL) affecting female fertility, scored as the inverse of the number of inseminations to conception, on Bos taurus chromosome 7 was detected by a daughter design analysis of the Israeli Holstein population (P < 0.0003). Sires of five of the 10 families analyzed were heterozygous for the QTL. The 95% confidence interval of the QTL spans 27 cM from the centromere. Seven hundred and four SNP markers on the Illumina BovineSNP50 BeadChip within the QTL confidence interval were tested for concordance. A single SNP, NGS-58779, was heterozygous for all the five QTL heterozygous patriarchs, and homozygous for the remaining five QTL homozygous sires. A significant effect on fertility was associated with this marker in the sample of 900 sires genotyped (P < 10(-6)). Haplotype phase was the same for four of the five segregating sires. Thus concordance was obtained in nine of the ten families. We identified a common haplotype region associated with the rare and economically favorable allele of the SNP, spanning 270 kbp on BTA7 upstream to 4.72 Mbp. Eleven genes found in the common haplotype region should be considered as positional candidates for the identification of the causative quantitative trait nucleotide. Copy number variation was found in one of these genes, KIAA1683. Four gene variants were identified, but only the number of copies of a specific variant (V(1)) was significantly associated with breeding values of sires for fertility.

Entities:  

Keywords:  KIAA1683; copy number variation (CNV); female-fertility; gene-variants; quantitative trait locus (QTL)

Year:  2011        PMID: 22384319      PMCID: PMC3276122          DOI: 10.1534/g3.111.000299

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Poor reproductive performance is one of the most common reasons for premature culling. Low female fertility causes economic losses by increasing the need for additional inseminations, higher veterinary costs, longer than optimal lactations, less calves, and higher replacement costs. Therefore, improving female fertility is important for dairy cattle breeders. Various statistics are used to measure fertility of dairy cattle in different countries. In Israel, fertility is scored as the inversed number of inseminations to conception. Heritability for nearly all measures of fertility is only 1–5% (Gonzalez-Recio ; Kuhn ; Weller and Ezra 1997), and has negative genetic correlations with milk production traits (Gonzalez-Recio ; Veerkamp ; Weller and Ezra 1997). Weller conducted a daughter design QTL scan for economic traits on BTA7 in Israeli Holstein population. The scan included genotyping 29 microsatellites spanning the chromosome in 11 sires families. Two significant effects for female fertility QTL were identified; near the centromere with five segregating sire families, and near the end of the chromosome with two segregating sire families. The 95% confidence intervals (CI) of the QTL near the centromere spans 27 cM, which includes hundreds of genes. The most convincing methodology for identifying the causative polymorphism for segregating QTL, the quantitative trait nucleotide (QTN), is concordance (Ron and Weller 2007). Complete concordance is obtained only if:Testing for concordance requires determination of the QTL genotype of several individuals, which can be determined for family patriarchs by either a daughter or granddaughter design (Ron and Weller 2007). The first two requirements for concordance can be analyzed directly from daughter or granddaughter designs results. The third requirement for concordance requires the identification of the QTL phase of the segregating sires. Concordance can be considered as a proof for QTN detection if the probability of obtaining concordance by chance is sufficiently low for rejection of this hypothesis (Ron and Weller 2007). For a given number of patriarchs with known QTL genotype, probability of obtaining concordance by chance decreases as the fraction of heterozygous sires increases (Weller ). Therefore, we focused the analysis on the QTL near the centromeric region of BTA7, because five sires were heterozygous for this QTL. All individuals known to be homozygous for the QTL are also homozygous for the polymorphism. All individuals heterozygous for the QTL are also heterozygous for the polymorphism. The same QTL allele is associated with the same allele of the putative QTN for all the heterozygous animals. Recently, 900 Israeli Holstein sires with genetic evaluations for milk production traits and female fertility, including the 10 patriarchs of the daughter design, were genotyped for the Illumina BovineSNP50 BeadChip, which includes 54,001 SNPs. The aim of this study was to identify SNPs with significant concordance within the CI of the female fertility QTL in the centromeric region of BTA7, and to use this data to identify likely candidate genes for the QTL.

Materials and Methods

Illumina BovineSNP50 BeadChip genotyping

DNA samples of 900 Israeli Holstein bulls were genotyped for 54,001 SNPs using the Illumina BovineSNP50 BeadChip. Quality control and genotyping procedure are as shown by Weller .

Concordance testing

A total of 704 SNP markers included on the Illumina BovineSNP50 BeadChip are positioned within the first 30 Mbp of BTA7, which cover the CI of the QTL for female fertility. The genotypes of the 704 SNPs for the 10 sires with inferred genotypes for the QTL from the daughter design were tested for concordance, of which five were heterozygous and five were homozygous for the QTL. The SNP marker that showed the highest fit to the concordance model is NGS-58779. All five sires, homozygous for the QTL, were homozygous for the marker and all five patriarchs, heterozygous for the QTL were heterozygous for the marker. Nevertheless, the haplotype phase was concordant for only four of the five segregating sires. Thus concordance was obtained in 9 out of the 10 families. The probability to obtain a genetic marker with similar concordance by chance was computed assuming that the QTL allele frequency is 0.5, as half of the families segregated for the QTL. Assuming independence among the sire genotypes, the probability by chance that the five sires homozygous for the QTL will also be homozygous for the marker is 0.55, the probability that four sires will be heterozygous for the marker and the QTL with correct allele phase between the QTL and the marker is: 0.254. The probability that one out of five heterozygous sires for the QTL will be heterozygous for the marker with incorrect allele phase between the QTL and the marker is: 0.25*5. Thus, the probability to obtain this level of concordance by chance for a single SNP is: 0.55* 0.254* 0.25*5 = 0.000152. As 704 SNPs were analyzed 0.1 SNP is expected to show this level of concordance by chance.

Haplotype phase analysis

Genotypes of 20 SNP markers from the Illumina BovineSNP50 BeadChip, positioned within the flanking 0.5 Mbp upstream and downstream to NGS-58779, were analyzed for 900 Israeli Holstein sires. Haplotype phase analysis was performed using the Plink software with individuals treated as unrelated (Purcell ). The resulting haplotypes of the 10 tested sires by the daughter design were verified by LSPH software that is based on known genetic relationships between individuals (Baruch ).

Comparative mapping

Comparative mapping between the bovine common haplotype region and the human orthologous region was conducted using Ensembl biomart (http://www.ensembl.org/biomart/martview) and NCBI map-viewer (http://www.ncbi.nlm.nih.gov/projects/mapview/).

Positional cloning

A partial screen of the exons of six of the 11 genes within the identified common haplotype region (KIAA1683, PDE4C, CIST1, JUND, LSM4, and GDF15) was performed in order to identify concordant polymorphism for the QTL. Primers were designed by the Primer3plus software (http://www.bioinformatics.nl/cgi-bin/primer3plus/primer3plus.cgi) in accordance with the exon-intron borders obtained from the reference sequence of the NCBI Entrez Gene database (http://www.ncbi.nlm.nih.gov/gene/). DNA of sires was extracted from semen samples as described previously (Ma ). The bovine KIAA1683 genes (LOC618787 and LOC788637) were sequenced simultaneously, due to the very high similarity in their predicted exonic sequences. The primers that were used for the positional cloning are presented in Table 1.
Table 1 

PCR and sequencing primers

GeneForward PrimerReverse PrimerProduct Size
KIAA1683
 Exon 1GTACCTGCCAAGTGGAGGAGTGACGCCTGTAGCTGTGAAC283
 Exon 2.1TTCACAGCTACAGGCGTCACCCTTGTTGGCTCTGGAAGTT782
 Exon 2.2ACCCAATGAAGACCAAGCAAGGCCTTGGTTATTGTGACAG509
 Exon 3GCCTCGATGATCAAGTCTCCAGATGTCCGTCCTCGGATGT700
 Exon 4GAGGACGGACATCTTGAAGGCACACAGGGAGGCGAGTT794
 Intron 4GGTCCGTGATCAGTGGTGACTGCTGCTCATGGGATCTA811
 Exon 5-6.1CCCTGACTCCAGCTGACACCGCCTTGGTCTGATGAT795
 Exon 6.2CACCCCATCATCAGACCAACCCACCAGTAAGCCAAGGT232
 SP1(T)26GTAAAACGACGGCCAGTAACTGCTCCTCCCCTGACa531
 SP2CAGGAAACAGCTATGACAACTGCTCCTCCCCTGACg505
JUND
 Exon 1.1CCGGGCCGAGGCTATAAGCTCTGCCTTAATGCGCTCTT997
 Exon 1.2GTCTACGCGAACCTGAGCACCAACGTTTGTTCCCGAGTA911
 Exon 1.3CTCAGACTCGACAAGCTGGACCCTGCAGGAGAAGAAGGAG577
LSM4
 Exon 1ATTGGTTAGCCTTCCCACGAATGACACACTCGCTCTGCAT236
 Exon 2ATATGCCAGCCCTGCCTAGAAGCTGCTGCTTCGTGGAC250
GDF15
 Exon 1CGGACAAAGTCCAGGGAATATGGGGTCTCCGAAATTTTAC385
 Exon 2AGCTTTCTTTCCTCTTTTCCACGTCCATGCCATAAATGAAG700
CIST1
 Exon 1AACATGCGCCCCTCACATGCCGGATCTTACCTGTTCTT405
 Exon 2GGGTTGCATATCCTTGTTCCCCCCAATGTCCCAATACTCA664
 Exon 3-4GGCCCTGGTGTGTCTGTCGCCTCAGAGGAATGATAGGG554
NGS-58779TGAGCTGAGACTGGAAAGAGGAATCCACACCCAGAAGCAAC231
M-13GTAAAACGACGGCCAGTCAGGAAACAGCTATGAC
PCR fragments were amplified using Super-Therm Taq DNA polymerase (JMR Holding, London) according to the instructions of the manufacturer and the following conditions: 30 cycles for 30 s at 92°C, 40 s at 63°C, and 1 min at 72°C, using a DNA engine thermocycler (MJ Research Inc., Waltham, MA). PCR products were separated on agarose gels, excised from the gel, purified with DNA Montage Gel Extraction Kit (Millipore, Bedford, MA). Sequencing was preformed using the BigDye Terminator v3.1 Cycle Sequencing Kit using the 3130 Genetic Analyzer capillary electrophoresis instrument (Applied Biosystems). Sequences were assembled and compared using the GAP4 program (Staden ).

Isolation of KIAA1683 variants

Specific primers (SP) were designed to PCR amplify separately each of the A/G alleles located in KIAA1683 at position 4856941 /4896528 bp on BTA7, which was heterozygous in all of the sires (Table 1). An “A” to “T” mismatch in the fifth nucleotide from the 5′ end of the SP primers, indicated by lower case letter, was introduced to increase the specificity of the PCR amplification. Standard M-13 phage forward or reverse primer sequences were added to the 3′ core primer sequence to serve as a template for sequencing primers and to decrease primers dimer formation. An additional 26 repeats of T tail was attached to the 3′ end of the SP1 primer to allow detection of fragment size difference from the SP2 PCR product on a 2% agarose gel. The KIAA1683_exon3_R primer was used as a reverse primer (Table 1). PCR products were amplified by high-fidelity BIO-X-ACT Long DNA polymerase (Bioline, London) according to the instructions of the manufacturer and the following conditions: 30 cycles for 30 s at 92°C, 40 s at 64°C, and 1 min at 72°C, using a DNA engine thermocycler (MJ Research Inc., Waltham, MA). SP1 and SP2 PCR products were sequenced using the M-13_F and R primers, respectively. The sequences of the isolated variants of the KIAA1683 gene were Blasted against the bovine WGS traces and the HTGS sequence databases (NCBI) for sequence validation, and to identify alternative variants of the gene which were not found among the 10 sires.

Real-Time qPCR

Determination of the relative copy number of the total KIAA1683 gene (including all gene variants) was conducted using quantitative real-time PCR (qPCR) analysis. A total of 326 Israeli Holstein bulls with known breeding values for fertility including the 10 sires that were tested by the daughter design were analyzed. Primers were designed on a region of the exon 3 which was common and monomorphic to all of the identified gene variants. Relative copy numbers of V1, V2, and V3 were determined using variant specific selective primers. V1 selective primer was designed to amplify the “T” allele of the 4857096/4896683 polymorphism, which is unique for V1. V2 selective primer was designed to amplify the unique “A” allele of the 4857153/4896740 polymorphism, and the V3 selective primer was designed to amplify the unique “T” allele of the 4857126/4896713 polymorphism. M-13 phage forward standard primers sequences were attached to the 5′ of the core primers to decrease primers dimer formation. The same reverse primer was used for both V1 and V2 selective primers. qPCR primers are shown in Table 2. V4 does not possess a unique polymorphism which can be used for selective qPCR.
Table 2 

qPCR primers

Primer IDSequenceProduct Size (bp)
RPP30_FTGCTTCCATTGTTTCCTGATGA96
RPP30_RTGGGACCAGGTTCCATGATC
Total-KIAA1683_FCCACCATCCTCAAAACCCTGT101
Total-KIAA1683_RGGATGAGGTGTCGGAAATTCC
V1_FGTAAAACGACGGCCAGTTTTCCGACACCTCATCCGACAT214/155
V2_FGTAAAACGACGGCCAGTGAGACAGGCGACCTGGGA
V1/V2_RGGAAACAGCTATGACATTTTCTCGCCTTCCAAAGG
V3_FGTAAAACGACGGCCAGTAAATGCTTTCCCCGGTCA121
V3_RCAGGAAACAGCTATGACCGTGGTTTCCGTTTTCCCTT
Gene copy number was normalized to an autosomal reference gene, Ribonuclease P protein subunit p30 (RPP30, GeneID:615098, on BTA26. A fragment of 95 bp at position 12893277), RPP30, was used as a reference gene in human CNV studies (Wang ). No CNV was reported for this gene region in previous studies of CNV in cattle (Bae ; Fadista ; Liu ; Seroussi ). The qPCR analysis was performed in duplicates using the Absolute Blue SYBER Green ROX mix kit (Thermo Fisher scientific, UK) according to the instructions of the manufacturer in a 17-µl reaction volume, which included 2 µl of DNA (30 ng/µl), 1 µl of each primer (10 pmol/µl), 7.5 µl of ultra pure water, and 7.5 µl of Absolute Blue SYBER Green ROX Mix. The qPCR reaction was performed in the following conditions: 15 min at 95°C for enzyme activation followed by 40 cycles of 15 s at 95°C, 1 min at 60°C, using an ABI Prism 7000 sequence detection system. Amplification was followed by a dissociation curve analysis to confirm the presence of a single product and the absence of primer dimer. The threshold cycle number (CT) for each tested probe was used to quantify its relative abundance. The qbasePLUS software (Biogazelle, Ghent, Belgium) was used for the calculation of the relative quantities using RPP30 for normalization (D'haene et al. 2010). The qPCR primers that were designed to selectively amplify V3 failed, thus V3 was excluded from the analysis. Correlations between the CTs of the two duplicates were 0.8, 0.8, 0.7, and 0.9 for V1, V2, total KIAA1683, and RPP30, respectively. The standard curves of all qPCR probes were linear in all the tested DNA dilutions with R2 <0.98 and with slopes of −3.36, −3.30, −2.99, and −3.11, which indicate efficiency of 101, 100, 93, and 92% for V1, V2, total KIAA1683, and RPP30, respectively. The average CT of the lowest detectable DNA concentration point (LOD) in the standard curves were 30.7, 30.6, 26.8, and 25.7, while the average CTs of the nontemplate controls (NTC) were 37.1, 35.1, 32.8, and 31.9 for V1, V2, total KIAA1683, and RPP30, respectively. The differences between the average CT of the LOD and NTC were >4 cycles in all tested probes, which indicates nontemplate-specific products of <6.25%.

Association analysis

Association of the NGS58879 SNP marker genotype with the bulls’ breeding values for fertility was analyzed by the PLINK linear/logistic regression option (Purcell ). The population included 900 sires, including the 10 sires tested by the daughter design. A regression analysis of 326 sires’ breeding values for fertility on the overall CNV or specific variant copy number was performed using the “fit Y by X” option of the JMPIN 5.0.1a statistical software (SAS Institute Inc). In addition, multiple regression was performed for the sires’ breeding values for fertility with NGS58879 and V1 as independent variables.

Results

Concordance and effects associated with NGS-58779

Of the 704 SNPs in the QTL CI on BTA7 that was tested for concordance, only a single intergenic SNP, NGS-58779 positioned at 4922643 bp (Btau 4.0 genome assembly), showed apparent full concordance; all five patriarchs that were heterozygous for the QTL were heterozygous for the marker, and the remaining five sires, homozygous for the QTL, were homozygous for the marker. The “A” allele of the NGS-58779 was associated with the positive haplotype of the fertility QTL identified in the daughter design analysis in four of the five heterozygous sires, and corresponded to the effect associated with the marker in the general sire population. Thus concordance was obtained in 9 out of 10 families. The NGS-58779 marker showed a significant association to breeding values of sires for fertility in a sample of 900 sires (P ≤ 1.5*10−6). The frequency of the “A” allele, associated with increased fertility, was 31.5%. The observed allele substitution effect was 0.57 trait units.

Haplotype phasing analysis

The neighboring genomic region of the concordant SNP was analyzed in order to explore the linkage disequilibrium (LD) boundaries of the QTL region. Although the Plink haplotype phase analysis was performed without including relationships among bulls, no conflicts were found between the Plink and LSPH results. All chromosomes carrying the “A” allele of NGS-58779 shared a common haplotype extending from 4.65 to 4.92 Mbp (Table 3).
Table 3 

Phased haplotypes of the 10 analyzed sires

SireQTLaHaplotypes
(SNPs Position on BTA7 in Mbp)
4.434.504.524.594.634.654.674.724.764.824.924.965.015.035.065.095.115.145.165.20
2278+GAGGAGAAGGAGGAGGGAAG
GGCAAGAAGGGGGAGGGAAG
3070+GGGGGGAAGGAAGAAGGAGG
GGGGGGAAGGGAGAGGGGGG
3089+GGGGGGAAGGAAGGGGCGAG
GGGGGGAAGAGGGAGGGAAG
3208+GGGGGGAAGGAAGGGGCGAG
GGGGGGAAGGGGAGAGGAGA
3258+GAGGGGAAGGAGGAGGGGGG
GGGGGGAAGGGGAGGGGAGG
3099GGGGGGAAGGAAGGGGCGAG
GAGGGGAAGGAGGAGGGGGG
2357GGGGGGAAGGAAGGGGCGAG
GGGGGGAAGGAGGAGGGGGG
2283GGGGGGAAGAGGGAGGGAGG
AGGGGGAAGGGGAGGGCGAG
3241GGGGGGAAGGGGAGAGGAGG
AGGGGGAAGGGGAGGGCGGA
3274GGGGGGAAGGGGAGAGGAGA
GAGGAGGGGGGGGAGAGAGG

The NGS-58779 SNP in bold is located at 4.92 Mbp. The common sub-haplotype associated with the NGS-58779 “A” allele is in bold.

QTL segregation status of sires (+ for heterozygous and − for homozygous).

The NGS-58779 SNP in bold is located at 4.92 Mbp. The common sub-haplotype associated with the NGS-58779 “A” allele is in bold. QTL segregation status of sires (+ for heterozygous and − for homozygous).

Comparative mapping and positional cloning

A comparative map of the bovine common haplotype region and its human orthologous region (HSA19p13) is shown in Figure 1. The region contains 11 genes (CIST1, JUND, LSM4, GDF15, KIAA1683, LOC785387, PGPEP1, LRRC25, SSBP4, ISYNA1, and ELL) in the bovine gene map. Partial sequencing of the exons of four genes (CIST1, JUND, LSM4, and GDF15) did not identify a complete concordance between any of the polymorphisms and the QTL status of the patriarchs (Table 4).
Figure 1 

Bovine/human comparative mapping of the QTL region: lines connect the orthologous genes. The human KIAA1683 gene and the bovine orthologous duplication (LOC788637 and LOC618787) are marked in red. The SNP marker NGS-58879 is in the bottom of the BTA7 map and appears in red.

Table 4 

Partial sequencing of positional genes

GeneNumber of ExonsSequencing
Size (bp)Polymorphism
TypeaLocation in GeneGenomic LocationbConcordancec
TotalSequenced
GDF15221067A to T non-syn T/SExon 24,704,1373/4
LSM4524860
JUND113020
KIAA168366358637 SNPs9/10
CIST1441623T to C non-syn V/AExon 14,911,8904/5
C to T non-syn T/MExon 24,914,5459/10
C to A non-syn L/MExon 24,914,8338/10
A to G 3′ UTRExon 44,916,6720/3
T to GIntron 124,920,6703/5

Non-syn stands for nonsynonymous mutation that is capable of encoding amino acid substitution. UTR stands for untranslated region.

Genomic locations of the identified SNP on BTA7 in bp (Btau4.0).

Concordance of polymorphism with the segregation status of sires for the QTL.

Bovine/human comparative mapping of the QTL region: lines connect the orthologous genes. The human KIAA1683 gene and the bovine orthologous duplication (LOC788637 and LOC618787) are marked in red. The SNP marker NGS-58879 is in the bottom of the BTA7 map and appears in red. Non-syn stands for nonsynonymous mutation that is capable of encoding amino acid substitution. UTR stands for untranslated region. Genomic locations of the identified SNP on BTA7 in bp (Btau4.0). Concordance of polymorphism with the segregation status of sires for the QTL. The comparative map showed duplication in the bovine KIAA1683 orthologous genes (LOC788637 and LOC6181787). Alignment analysis demonstrated very high similarity in the sequence of both genes (∼95% identity). Their intron 1 contains a duplication of a section of the exon 1 sequence. LOC788637 has three copies of the exon 1 sequence, while LOC6181787 has only two. Two repetitive regions are present in intron 4 of both gene duplicates. Sequencing analysis of the exons regions revealed 37 sites of polymorphism between the gene variants, 26 of which were predicted to be due to the differences between the two paralogues in the reference genome (Btau 4.0). Figure 2 summarizes the polymorphism patterns among the 10 sires for exons 1 to 6 (the full genotyping report is summarized in supporting information, Table S1). These patterns may be clustered into three major groups: 1) “heterozygous” in >95% of the sites (sires 3099, 2357, 3089, 3208, 3070, and 2278); 2) “homozygous” in 8 sites (sire 3258) and 3) “homozygous” in the same 16 sites (sires 2283, 3241, and 3274).
Figure 2 

Representation of the polymorphic sites found among the 10 sires along exons 1 to 6 of the KIAA1683 genes: Striped sites indicate the location of differences between the two gene duplicates in the reference genome. Sites in which only a single nucleotide was found for the individual sires are marked in black. The patterns of the 10 sires are clustered into three major groups: 1) “heterozygous” in >95% of the sites (3099, 2357, 3089, 3208, 3070, and 2278); 2) “homozygous” in 8 sites (3258) and 3) “homozygous” in the same 16 sites (2283, 3241, and 3274). The red dashed line indicates the section of exon 3 which was chosen as a target for the gene variants isolation. This region distinguishes between the three cluster groups with at least two unique homozygous sites.

Representation of the polymorphic sites found among the 10 sires along exons 1 to 6 of the KIAA1683 genes: Striped sites indicate the location of differences between the two gene duplicates in the reference genome. Sites in which only a single nucleotide was found for the individual sires are marked in black. The patterns of the 10 sires are clustered into three major groups: 1) “heterozygous” in >95% of the sites (3099, 2357, 3089, 3208, 3070, and 2278); 2) “homozygous” in 8 sites (3258) and 3) “homozygous” in the same 16 sites (2283, 3241, and 3274). The red dashed line indicates the section of exon 3 which was chosen as a target for the gene variants isolation. This region distinguishes between the three cluster groups with at least two unique homozygous sites. The sequencing analysis revealed 11 novel polymorphism sites in the KIAA1683 gene that were not found in the two reported gene variants. This indicated that there are additional variants of the gene. In order to isolate theses putative variants we focused on exon 3, which showed seven polymorphic sites, including four informative sites that differentiate the three patterns found in the sample of sires (Figure 2). We used two allele specific primers that selectively amplified the A/G alleles of the 4856941/4896528 polymorphism site (SP1 and SP2, Table 1). All sires amplified both alleles, and the resulting upstream sequence polymorphism is presented in Table 5. The polymorphism of cluster group 3 (sires 2283, 3241, and 3274) was different from that of the other sires, showing only two distinct variants of the KIAA1683 exon 3 region (V2 by SP2 and V3 by SP1). The polymorphism of the other sires upstream to the A/G site was complex, indicating the existence of additional gene variants. The SP2 primer enabled us to identify two distinct variants of the gene: V2 (isolated in sires 2278, 3070, 3089, 3208, 2357, 3241, 3274, and 2283), V4 (isolated in 3258) and both variants in 3099. The SP1 primer enabled us to isolate V3 in sires 2283, 3241, and 3274. The SP1 sequence of the other sires was identical, indicating a common gene version in addition to V3. This expected gene version (V1) is identical to the LOC788637 reference genome sequence. Thus, four different variants of the bovine KIAA1683 exon 3 were identified (Table 6, Figure 3). BLAST analysis of the variants identified perfect matches for the V1 and V2 variants in the bovine reference genome. V3 was found by BLAST analysis against the bovine HTGS sequence database, while V4 is novel (Table 6, Figure 3).
Table 5 

Allele-specific amplification and polymorphism of the 10 sires

PrimerSireCluster
GroupbPolymorphism on BTA7 (in bp)aGene
Variant
485694148965284857096
48966834857126
48967134857153
48967404857165
48967524857178
48967654857214
4896801
SP12278AC/TC/TCCT/CG/AV3, V1
3070AC/TC/TCCT/CG/AV3, V1
30891AC/TC/TCCT/CG/AV3, V1
3208AC/TC/TCCT/CG/AV3, V1
2357AC/TC/TCCT/CG/AV3, V1
3099AC/TC/TCCT/CG/AV3, V1
32582AC/TC/TCCT/CG/AV3, V1
3241ACTCCCAV3
32743ACTCCCAV3
2283ACTCCCAV3
SP22278GCCATCGV2
3070GCCATCGV2
30891GCCATCGV2
3208GCCATCGV2
2357GCCATCGV2
3099GCCC/AT/CCGV2, V4
32582GCCCCCGV4
3241GCCATCGV2
32743GCCATCGV2
2283GCCATCGV2

Location of the polymorphic sites on the KIAA1683 exon 3 in BTA7 on the Btau4.0 genome assembly; positions for each polymorphism are shown for LOC788637 and LOC618787 loci, on the upper and lower lines, respectively. The SNP location targeted by the allele-specific primers is in bold.

The polymorphism-pattern cluster of the sires according to their polymorphic pattern (Figure 2).

Table 6 

Isolated KIAA1683 exon 3 variants

Gene
VariantReference IDSNP Location on BTA7 (in bp)a
4856941
48965284857096
48966834857126
48967134857153
48967404857165
48967524857178
48967654857214
4896801
V1LOC788637ATbCCCTG
V2LOC618787GCCAcTCG
V3ti109805135ACTdCCCA
V4GCCCCCG

Location of polymorphic sites on the Btau4.0 genome assembly; positions are shown for the LOC788637 and the LOC618787 loci, on the upper and lower lines, respectively.

Polymorphism site used by qPCR to selectively amplify V1.

Polymorphism site used by qPCR to selectively amplify V2.

Polymorphism site used by qPCR to selectively amplify V3.

Figure 3 

Sequencing chromatograms of four different KIAA1683 variants. A 60-bp sequence within the KIAA1683 third exon is displayed. Arrows mark specific nucleotide changes that differentiate the gene variants.

Location of the polymorphic sites on the KIAA1683 exon 3 in BTA7 on the Btau4.0 genome assembly; positions for each polymorphism are shown for LOC788637 and LOC618787 loci, on the upper and lower lines, respectively. The SNP location targeted by the allele-specific primers is in bold. The polymorphism-pattern cluster of the sires according to their polymorphic pattern (Figure 2). Location of polymorphic sites on the Btau4.0 genome assembly; positions are shown for the LOC788637 and the LOC618787 loci, on the upper and lower lines, respectively. Polymorphism site used by qPCR to selectively amplify V1. Polymorphism site used by qPCR to selectively amplify V2. Polymorphism site used by qPCR to selectively amplify V3. Sequencing chromatograms of four different KIAA1683 variants. A 60-bp sequence within the KIAA1683 third exon is displayed. Arrows mark specific nucleotide changes that differentiate the gene variants.

Copy number analysis and association with breeding values for fertility

The relative copy number of the 10 sires for KIAA1683, and its V1 and V2 variants were estimated using qPCR (Figure 4). The results confirm the genotyping results using the variant specific analysis. Sire 3258 which lacks V2, and sires 2283, 3241, and 3274 which lack V1 showed only residual copy number reflecting the absence of these gene variants. The regressions of breeding values of 326 sires for fertility on their copy number for total KIAA1683 and V2 were not significant (Figure 5, A and C). On the other hand, a highly significant regression was found for V1 (P < 0.0001; R2 = 0.047, Figure 5B) which is similar to the association of the NGS-58779 SNP marker using the same population structure (P < 0.0003; R2 = 0.037, Figure 5D). When both NGS-58779 and V1 copy number were included in the model, both factors were significantly associated with fertility (P < 0.01 and P < 0.03, respectively), increasing the R2 to 0.069.
Figure 4 

Normalized relative copy number (arbitrary units) of total KIAA1683, V1 and V2 of the 10 sires with known QTL genotypes: QTL genotypes were determined by the daughter design; dotted bars denote the heterozygous sires. Genotypes of the NGS-58779 SNP “AA”, “AG” and “GG” are indicated by striped, dotted, and blank bars, respectively. CNV was analyzed by qPCR and quantities were normalized to RPP30, an autosomal non-CNV reference gene. Values were normalized to the highest value of sire which was assigned the value of 2 (arbitrary units). Error bars indicates the standard error between the qPCR replicates.

Figure 5 

CNV association test: Linear regressions of sires’ breeding values for fertility on copy number of total KIAA1683 (A), V1 (B),V2 (C), and genotypes of NGS-58779 (D). NGS-58779 genotypes were scored as 0, 1, and 2 for homozygotes (G), heterozygotes (AG), and homozygotes (A), respectively. Probabilities are given for the significant regressions.

Normalized relative copy number (arbitrary units) of total KIAA1683, V1 and V2 of the 10 sires with known QTL genotypes: QTL genotypes were determined by the daughter design; dotted bars denote the heterozygous sires. Genotypes of the NGS-58779 SNP “AA”, “AG” and “GG” are indicated by striped, dotted, and blank bars, respectively. CNV was analyzed by qPCR and quantities were normalized to RPP30, an autosomal non-CNV reference gene. Values were normalized to the highest value of sire which was assigned the value of 2 (arbitrary units). Error bars indicates the standard error between the qPCR replicates. CNV association test: Linear regressions of sires’ breeding values for fertility on copy number of total KIAA1683 (A), V1 (B),V2 (C), and genotypes of NGS-58779 (D). NGS-58779 genotypes were scored as 0, 1, and 2 for homozygotes (G), heterozygotes (AG), and homozygotes (A), respectively. Probabilities are given for the significant regressions.

Discussion

In cattle, numerous genome scans were conducted during the past 15 years to identify QTL. These scans, based either on daughter of granddaughter designs (Weller ), led to the identification of many chromosomal regions with effects on nearly all of the economically important traits (http://www.genome.iastate.edu/cgi-bin/QTLdb/BT/index). However, the CI for QTL location is generally >20 cM, thus containing hundreds of genes. Increasing marker density beyond one marker per 10 cM has very limited effect on the length of the CI of the QTL (Ron and Weller 2007). The identification of a putative QTN requires full concordance with the QTL status of individuals. Such information can only be detected by pedigree analysis, such as daughter or granddaughter designs. The probability of obtaining full concordance by chance depends on the number of individuals with known QTL status analyzed, the number of segregating individuals (heterozygous for the QTL), and the number of markers tested (Ron and Weller 2007). The 27 cM 95% CI interval of the QTL for female fertility in the centromeric region of BTA7 (Weller ) contains 704 SNPs from the Illumina BovineSNP50 BeadChip. Of these, only a single intergenic SNP, NGS-58779, showed nearly complete concordance (9 out of 10 families). The probability to obtain such concordance by chance is 0.000152 (see Materials and Methods). By accounting for multiple testing of 704 SNPs only 0.1 SNP is expected to achieve this concordance level by chance. Furthermore, a highly significant effect on female fertility was associated with this marker in the sample of 900 sires genotyped (P < 1.5*10−6).

Common haplotype identification

Haplotype phase analysis for SNP markers within the flanking 0.5 Mbp upstream and downstream to NGS-58779 interval showed that all chromosomes carrying the “A” allele of NGS-58779 shared a common haplotype extending from 4.65 to 4.92 Mbp (Table 3). However, because the frequencies of the SNPs alleles included in this haplotype were >0.8, this haplotype was also found in some of the chromosomes carrying the “G” allele of NGS-58779. On the other hand, no common haplotype associated with the “A” allele was discernable on the other side of NGS-58779. We hypothesize that this common haplotype region contains the causative mutation for this QTL. Cohen-Zinder identified a common haplotypes among the two segregating sires for the QTL for milk protein on BTA6. The QTN was eventually identified in ABCG2, one of the five genes in this haplotype region.

Polymorphism in the KIAA1683 gene

Comparative mapping of the common haplotype region to the human orthologous region on HAS19 revealed duplication in the bovine KIAA1683 gene (LOC788637 and LOC6181787). The two gene duplicates shared high sequence similarity (∼95%). Among the genes in the region that were sequenced, only KIAA1683 revealed a high number of polymorphic sites that may indicate copy number variation. Twenty six sites were attributed to the differences between the two reference genome gene duplicates. The additional 11 sites indicated the possibility for more unidentified gene variants. The polymorphic patterns identified among the 10 sires, which clustered into three major groups, suggest that there are differences in the gene variants held by the different sires. The gene variants isolation analysis confirmed our assumption. A summary of the gene variants harbored by the 10 sires is presented in Table 7. The three sires homozygous for the NGS-58779 “G” allele—2283, 3241, and 3274—share variants V2 and V3. The two sires homozygous for the NGS-58779 “A” allele—3099 and 3257—also have variant V1. Sire 3099 also has variant V4. All four sires that are heterozygous for both the fertility QTL and the NGS-58779 marker—2278, 3070, 3089 and 3208—have variants V1, V2, and V3. Sire 3258, which is heterozygous for the QTL, but shows opposite allelic association between the NGS-58779 and the QTL alleles, has a unique pattern of variants: V1, V3, and V4.
Table 7 

Summary of the KIAA1683 gene variants harbored by the 10 sires

SireGene VariantNGS-58779aQTLb
V1V2V3V4
3099++++A/A
2357+++A/A
3089+++A/G+
3208+++A/G+
2278+++A/G+
3070+++A/G+
3258+++A/G+
3241++G/G
3274++G/G
2283++G/G

Genotypes of the NGS-58779 SNP.

QTL segregation status of the sires (+ for heterozygous and − for homozygous).

Genotypes of the NGS-58779 SNP. QTL segregation status of the sires (+ for heterozygous and − for homozygous). The gene variants’ patterns of the 10 sires may indicate the presence of a CNV, as the sires varied not only in the type of gene variants, but also in their number (from two to four). The qPCR analysis revealed a CNV not only in the total KIAA1683 but also in V1 and V2 specific gene variants. Previous work identified segmental duplication and CNV in this genomic region. Bae used the BovineSNP50 BeadChip signal intensity to identify CNV region in Korean cattle. They identified a CNV region that spans BTA7: 4.65–5.03 Mbp, which overlaps with our common haplotype region (BTA7: 4.65-4.92 Mbp). Liu performed a systematic computational genome-wide analysis of segmental duplications in cattle based on identifying paralogous sequences ≥1 kb in length with ≥90% sequence identity, and genomic regions that exhibit significant increase in depth of coverage in the whole genome shotgun sequences. The analysis identified significant peaks in both KIAA1683 genes indicating segmental duplication. In contrast to Bae , this analysis indicated that the segmental duplication in this region is limited only to the KIAA1683 genes.

KIAA1683 Copy number analysis

Of the four identified KIAA1683 gene variants identified by the allele specific PCR and sequencing, only V1 indicated an association with the NGS-58779 marker in the 10 sires analyzed. The sires that were homozygous for the NGS-58779 “G” allele (2283, 3241, and 3274) lack V1, while the sires homozygous for the “A” allele (3099 and 3257) and the heterozygous sires (2278, 3070, 3089, 3208, and 3258) all have V1 (Table 7). In recent years, CNV has been increasingly recognized as a major source of heritable variation that impacts complex traits (Redon ). Thus, we quantified the copy number of total KIAA1683 and its variants, V1 and V2. Only V1 copy number showed a significant association with fertility in the general population (P < 0.0001). It appears that copy number variation of different gene variants might be considered as independent genetic markers for association analyses. A similar phenomenon was reported for the human CYP2D locus located on chromosome 22 (HSA22q12). CYP2D is an enzyme expressed in the human liver. The CYP2D locus consists of three tandem repeats of homologous sequences of which CYP2D6 is the functional gene, while CYP2D7 and CYP2D8 are pseudogenes. Apparently only CNV of CYP2D6 was associated with drug metabolism phenotype (Abraham ; Ingelman-Sundberg 2004). Analogously, it is possible that V2 to V4 variants of KIAA1683 are pseudogenes, while V1 variant is the only active gene. This hypothesis may be tested by gene expression analysis which will identify the gene variants that are actually being transcribed to mRNA in different bovine tissues. Expression data for LOC788637 and LOC6181787 genes were searched in the bovine gene atlas but no informative results were found (Harhay , http://bovineatlas.msstate.edu/). Inclusion of both NGS-58779 and V1 copy number in the analysis model yielded statistical significance for both factors and increased the R2. It is expected that when a causal mutation is included in the analysis model with additional linked markers, only the causal mutation will be significant (Ron and Weller 2007). Thus both factors contribute to the explained variation, and apparently neither of them is the causal mutation. It should be noted that CIST1 also showed high concordance between the polymorphism and the segregation status of sires for the QTL.

Conclusions

We demonstrate a new strategy for fine mapping QTL which combines GWAS data together with daughter design results and concordance testing. We identified a single intergenic SNP, NGS-58779, which showed concordance for 9 out of 10 sires, and a highly significant association with fertility in the population. We further identified a common haplotype associated with the rare favorable allele of the marker, from 4.65 to 4.92 Mbp on BTA7. Thus, the 27 cM CI of the fertility QTL was reduced to a 270 kbp region which contains only 11 genes that should be considered as positional candidates for the identification of the causative quantitative trait nucleotide. An analysis of the CNV in the KIAA1683 gene showed that only the number of copies of a specific gene variant (V1) was significantly associated with breeding values of sires for fertility. Thus quantifying the copy number of different gene variants is of paramount importance for association analyses.
  24 in total

1.  The Staden package, 1998.

Authors:  R Staden; K F Beal; J K Bonfield
Journal:  Methods Mol Biol       Date:  2000

2.  Paternity validation and estimation of genotyping error rate for the BovineSNP50 BeadChip.

Authors:  J I Weller; G Glick; E Ezra; Y Zeron; E Seroussi; M Ron
Journal:  Anim Genet       Date:  2010-10       Impact factor: 3.169

Review 3.  Accurate and objective copy number profiling using real-time quantitative PCR.

Authors:  Barbara D'haene; Jo Vandesompele; Jan Hellemans
Journal:  Methods       Date:  2010-01-06       Impact factor: 3.608

4.  Genetic analysis of somatic cell score and female fertility of Israeli Holsteins with an individual animal model.

Authors:  J I Weller; E Ezra
Journal:  J Dairy Sci       Date:  1997-03       Impact factor: 4.034

5.  Quantifying EGFR alterations in the lung cancer genome with nanofluidic digital PCR arrays.

Authors:  Jun Wang; Ramesh Ramakrishnan; Zhe Tang; Weiwen Fan; Amy Kluge; Afshin Dowlati; Robert C Jones; Patrick C Ma
Journal:  Clin Chem       Date:  2010-03-05       Impact factor: 8.327

6.  Power of daughter and granddaughter designs for determining linkage between marker loci and quantitative trait loci in dairy cattle.

Authors:  J I Weller; Y Kashi; M Soller
Journal:  J Dairy Sci       Date:  1990-09       Impact factor: 4.034

7.  CYP2D6 gene variants: association with breast cancer specific survival in a cohort of breast cancer patients from the United Kingdom treated with adjuvant tamoxifen.

Authors:  Jean E Abraham; Mel J Maranian; Kristy E Driver; Radka Platte; Bolot Kalmyrzaev; Caroline Baynes; Craig Luccarini; Mitul Shah; Susan Ingle; David Greenberg; Helena M Earl; Alison M Dunning; Paul D P Pharoah; Carlos Caldas
Journal:  Breast Cancer Res       Date:  2010-08-23       Impact factor: 6.466

8.  Detection and analysis of quantitative trait loci affecting production and secondary traits on chromosome 7 in Israeli Holsteins.

Authors:  J I Weller; M Golik; S Reikhav; R Domochovsky; E Seroussi; M Ron
Journal:  J Dairy Sci       Date:  2008-02       Impact factor: 4.034

9.  An atlas of bovine gene expression reveals novel distinctive tissue characteristics and evidence for improving genome annotation.

Authors:  Gregory P Harhay; Timothy Pl Smith; Leeson J Alexander; Christian D Haudenschild; John W Keele; Lakshmi K Matukumalli; Steven G Schroeder; Curtis P Van Tassell; Cathy R Gresham; Susan M Bridges; Shane C Burgess; Tad S Sonstegard
Journal:  Genome Biol       Date:  2010-10-20       Impact factor: 13.583

10.  Analysis of recent segmental duplications in the bovine genome.

Authors:  George E Liu; Mario Ventura; Angelo Cellamare; Lin Chen; Ze Cheng; Bin Zhu; Congjun Li; Jiuzhou Song; Evan E Eichler
Journal:  BMC Genomics       Date:  2009-12-01       Impact factor: 3.969

View more
  9 in total

Review 1.  Adaptive potential of genomic structural variation in human and mammalian evolution.

Authors:  David W Radke; Charles Lee
Journal:  Brief Funct Genomics       Date:  2015-05-23       Impact factor: 4.241

2.  A genome-wide survey reveals a deletion polymorphism associated with resistance to gastrointestinal nematodes in Angus cattle.

Authors:  Lingyang Xu; Yali Hou; Derek M Bickhart; Jiuzhou Song; Curtis P Van Tassell; Tad S Sonstegard; George E Liu
Journal:  Funct Integr Genomics       Date:  2014-04-10       Impact factor: 3.410

3.  Genome wide CNV analysis reveals additional variants associated with milk production traits in Holsteins.

Authors:  Lingyang Xu; John B Cole; Derek M Bickhart; Yali Hou; Jiuzhou Song; Paul M VanRaden; Tad S Sonstegard; Curtis P Van Tassell; George E Liu
Journal:  BMC Genomics       Date:  2014-08-15       Impact factor: 3.969

4.  Genome-wide CNV analysis reveals variants associated with growth traits in Bos indicus.

Authors:  Yang Zhou; Yuri T Utsunomiya; Lingyang Xu; El Hamidi Abdel Hay; Derek M Bickhart; Pamela Almeida Alexandre; Benjamin D Rosen; Steven G Schroeder; Roberto Carvalheiro; Haroldo Henrique de Rezende Neves; Tad S Sonstegard; Curtis P Van Tassell; José Bento Sterman Ferraz; Heidge Fukumasu; Jose Fernando Garcia; George E Liu
Journal:  BMC Genomics       Date:  2016-06-01       Impact factor: 3.969

5.  Loss of maternal ANNEXIN A10 via a 34-kb deleted-type copy number variation is associated with embryonic mortality in Japanese Black cattle.

Authors:  Shinji Sasaki; Takayuki Ibi; Takayuki Akiyama; Moriyuki Fukushima; Yoshikazu Sugimoto
Journal:  BMC Genomics       Date:  2016-11-24       Impact factor: 3.969

6.  CNV Analysis of Host Responses to Porcine Reproductive and Respiratory Syndrome Virus Infection.

Authors:  El Hamidi A Hay; Igseo Choi; Lingyang Xu; Yang Zhou; Robert R R Rowland; Joan K Lunney; George E Liu
Journal:  J Genomics       Date:  2017-05-24

7.  A global analysis of CNVs in Chinese indigenous fine-wool sheep populations using whole-genome resequencing.

Authors:  Chao Yuan; Zengkui Lu; Tingting Guo; Yaojing Yue; Xijun Wang; Tianxiang Wang; Yajun Zhang; Fujun Hou; Chune Niu; Xiaopin Sun; Hongchang Zhao; Shaohua Zhu; Jianbin Liu; Bohui Yang
Journal:  BMC Genomics       Date:  2021-01-23       Impact factor: 3.969

8.  Combining Landscape Genomics and Ecological Modelling to Investigate Local Adaptation of Indigenous Ugandan Cattle to East Coast Fever.

Authors:  Elia Vajana; Mario Barbato; Licia Colli; Marco Milanesi; Estelle Rochat; Enrico Fabrizi; Christopher Mukasa; Marcello Del Corvo; Charles Masembe; Vincent B Muwanika; Fredrick Kabi; Tad Stewart Sonstegard; Heather Jay Huson; Riccardo Negrini; Stéphane Joost; Paolo Ajmone-Marsan
Journal:  Front Genet       Date:  2018-10-03       Impact factor: 4.599

9.  Fine mapping of genomic regions associated with female fertility in Nellore beef cattle based on sequence variants from segregating sires.

Authors:  Gerson A Oliveira Júnior; Daniel J A Santos; Aline S M Cesar; Solomon A Boison; Ricardo V Ventura; Bruno C Perez; José F Garcia; José Bento S Ferraz; Dorian J Garrick
Journal:  J Anim Sci Biotechnol       Date:  2019-12-16
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.