Literature DB >> 26369327

Genome-wide association study for female fertility in Nordic Red cattle.

Johanna K Höglund1,2, Bart Buitenhuis3, Bernt Guldbrandtsen4, Mogens S Lund5, Goutam Sahana6.   

Abstract

BACKGROUND: The Nordic Red Cattle (NRC) consists of animls belonging to the Danish Red, Finnish Ayrshire, and Swedish Red breeds. Compared to the Holstein breed, NRC animals are smaller, have a shorter calving interval, lower mastitis incidence and lower rates of stillborn calves, however they produce less milk, fat and protein. Female fertility is an important trait for the dairy cattle farmer. Selection decisions in female fertilty in NRC are based on the female fertility index (FTI). FTI is a composite index including a number of sub-indices describing aspects of female fertility in dairy cattle. The sub-traits of FTI are: number of inseminations per conception (AIS) in cows (C) and heifers (H), the length in days of the interval from calving to first insemination (ICF) in cows, days from first to last insemination (IFL) in cows and heifers, and 56-day non-return rate (NRR) in cows and heifers. The aim of this study was first to identify QTL for FTI by conducting a genome scan for variants associated with fertility index using imputed whole genome sequence data based on 4207 Nordic Red sires, and subsequently analyzing which of the sub-traits were affected by each FTI QTL by associating them with the sub-traits.
RESULTS: A total 17,388 significant SNP markers (-log10(P) > 8.25) were detected for FTI distributed over 25 chromosomes. The chromosomes with the most significant markers were tested for associations with the underlying sub-traits: BTA1 (822 SNP), BTA2 (220 SNP), BTA3 (83 SNP), BTA5 (195 SNP), two regions on BTA6 (503 SNP), BTA13 (980 SNP), BTA15 (23 SNP), BTA20 (345 SNP), and BTA24 (104 SNP). The fertility traits underlying the FTI peak area were: BTA1 (IFLC, IFLH), BTA2 (AISH, IFLH, NRRH), BTA3 (AISH, NRRH), BTA5 (AISC, AISH, IFLH), BTA6 (region 1: AISH, NRRH; region 2: AISH, IFLH), BTA13 (IFLH, IFLC), BTA15 (IFLC, NRRH), and BTA24 (AISH, IFLH). For BTA20 all sub-traits had SNP markers with a -log10(P) > 10. Furthermore the genes assigned to the most significant SNP for FTI were located on BTA6 (GPR125), BTA13 (ANKRD60), BTA15 (GRAMD1B), and BTA24 (ZNF521).
CONCLUSION: This study 1) shows that many markers within FTI QTL regions were significantly associated with both AISH and IFLH, and 2) identified candidate genes for FTI located on BTA6 (GPR125), BTA13 (ANKRD60), BTA15 (GRAMD1B), and BTA24 (ZNF521). It is not known how the genes/variants identified in this study regulate female fertility, however the majority of these genes were involved in protein binding, 3) a SNP in a QTL region for FTI on BTA20 was previously validated in three cattle breeds.

Entities:  

Mesh:

Year:  2015        PMID: 26369327      PMCID: PMC4570259          DOI: 10.1186/s12863-015-0269-x

Source DB:  PubMed          Journal:  BMC Genet        ISSN: 1471-2156            Impact factor:   2.797


Background

The Nordic Red Cattle (NRC) consists of animals belonging to the Danish Red, Finnish Ayrshire, and Swedish Red breeds. Compared to the Holstein which is the dominating dairy cattle breed in the Denmark, NRC animals are smaller, have a shorter calving interval, lower mastitis incidence and lower rates of stillborn calves. However, they also produce less milk, fat and protein [1]. Female fertility is an important trait for the dairy cattle farmer. Poor fertility leads to extra inseminations, higher veterinary costs, and higher culling rate causing higher replacement costs. Impaired female fertility is the major reason for culling in the dairy industry [2]. In the Nordic countries genetic improvement of female fertility in the NRC cattle population is achieved by including a fertility index (FTI) in the breeding goal. FTI reflects how easily a cow or a heifer conceives given the index is composed of four different female fertility traits namely: number of inseminations per conception, interval from calving to first insemination, days from first to last insemination, and 56-day non-return rate. Genetic evaluation for three NRC populations from Denmark, Sweden and Finland are carried out jointly by Nordic Cattle Genetic Evaluation (http://www.nordicebv.info). The trait definitions have been standardized across the three countries and the cattle industry has made the large NRC dataset available for cattle genetic research. Previous genetic analyses of female fertility in NRC showed that the heritability is in the range of 0.02 to 0.04 [1]. A trait with low heritability requires large progeny groups in order to attain an acceptable level of accuracy for the breeding value. International cooperation is therefore beneficial as a large number of registrations increase the accuracy of the estimated breeding values. Previous studies on the identification of DNA markers have been focusing on the individual red breeds [3, 4], however a joint association analysis including the three NRC populations is lacking. Furthermore, a limitation of these previous studies [3, 4] is that the marker panels used only represent a small fraction of the variants segregating in the population. With the availability of data from the three countries and the whole genome sequence (WGS), a joint analysis could improve the power to identify the causal mutation [5]. The power to detect QTL increases because there is an improvement of accuracy of the breeding value by pooling the data from the three countries together. Furthermore using WGS will capture the linkage disequilibrium structure of the population better due to the higher marker density and in addition WGS includes some of the causal mutations underlying FTI. The identified DNA markers influencing female fertility traits in NRC [4-6] can potentially be used for routine genomic evaluation where additional weight can be put on certain genomic regions/variants [7]. The joint association analysis of the NRC population with imputed WGS data allows the detection of QTL for FTI and the determination of which sub-traits are affected by these QTL for FTI. Therefore our first objective of this study was to identify QTL for FTI by conducting a genome scan for variants associated with FTI in the joint NRC population using imputed WGS. The second objective was to re-analyze the identified QTL region for FTI to ascertain which sub-trait(s) of FTI are influenced by these QTL.

Methods

The analysis was based on a two-step approach. In the first step the genome was screened for the fertility index only based on a sire model without considering relationship among sires. In the second step targeted regions were selected based on the genome scan and re-analyzed for the individual female fertility traits using an animal model considering relationship among all animals in the study.

Animals and phenotype data

No animal experiments were performed in this study, and, therefore, approval from the ethics committee was not required. A total of 4207 Nordic Red sires from Denmark (RDCDNK), Sweden (RDCSWE) and Finland (RDCFIN) with official breeding values for female fertility traits were used for the association study. Estimated breeding values (EBVs) were predicted by the Nordic Cattle Genetic Evaluation (NAV) for each animal as part of routine genetic evaluation of Nordic dairy cattle breeds. Predictions were based on first to third parity records. BLUP EBVs were predicted using multi-trait sire models (SM). For details regarding the phenotypes recorded and models used in routine breeding value prediction, see http://www.nordicebv.info. The reliability of the breeding value is in the range of 40 to 99 % (Additional file 1: Figure S1). Female fertility index reflects the ease with which a heifer or cow was able to conceive. FTI is a compound index and its sub-traits are: number of inseminations per conception (AIS) in cows and heifers, the length in days of the interval from calving to first insemination (ICF) in cows, and days from first to last insemination (IFL) in cows and heifers. In addition 56-day non-return rate (NRR) in cows and heifers were analyzed even though it had no weight in the FTI. The reason to include NRR in our analyses was that NRR is a trait of biological importance understanding female fertility. For AIS, IFL and NRR EBVs were predicted separately for 1st parity animals (heifers, suffixed H) and 2nd and 3rd parity animals (cows, suffixed C). For more information about the heritability and correlations between the traits please see http://www.nordicebv.info/NR/rdonlyres/5CD2E4DC-F82A-4809-A770-3022E270E205/0/PrinciplesNyeste.pdf. In step I FTI was analyzed for its association with SNPs. In step II selected QTL regions identified for FTI were targeted and analyzed for individual female fertility traits: AIS, IFL, ICF and NRR as well as for FTI with an animal model considering the relationship among all the animals in the study.

Whole genome sequence data

The 253 dairy cattle sequences originating from a combination of sequences processed at Aarhus University [8] and sequences from the 1000 Bull Genomes Project dataset Run2 [9] were available. The sequencing of Nordic bulls (Jersey, Holstein, Nordic Red Cattle) at Aarhus University, Foulum was done using Illumina sequencers at Beijing Genomics Institute, Shenzhen, China and at Aarhus University. Sequencing was shotgun paired-end sequencing. Where necessary, fastq data were converted from Illumina to Sanger quality encoding using a patched version of maq [10]. They were aligned to the UMD3.1 assembly of the cattle genome [11] using bwa [12]. They were converted to raw BAM files using samtools [10]. Quality scores were re-calibrated using the Genome Analysis Toolkit [13] following the Human 1000 Genome guidelines incorporating information from dbSNP [14]. Sequences were realigned around insertion/deletions using the Genome Analysis Toolkit. Variants were called using the Genome Analysis Toolkit’s UnifiedGenotyper. Genomes for the 1000 Bull Genomes Project were sequenced in a number of laboratories and collected at the Department of Primary Industries, Victoria, Australia [9]. The breeds included in the sequencing project can be found at: http://www.1000bullgenomes.com/. Sequences were aligned to the same reference genome as used at Aarhus University. Variant calling was done using samtools’ mpileup function. Variant Call Files from Aarhus University and the 1000 Bull Genomes project were combined using the Genome Analysis Toolkit’s CombineVariants with precedence given to calls from the Nordic dataset for animals appearing in both datasets.

Imputation HD and sequence data

Imputation of 50 K SNP to the full sequence was done in two steps. First, in another study (N.K. Kadri, pers. comm.) the 50 K genotypes for 12,322 Nordic bulls (from Nordic Holstein, RDC and Danish Jersey) were imputed to HD genotypes using the software IMPUTE2 [14]. A reference containing HD genotypes for 2036 bulls (902 Holstein, 735 Nordic Red and 399 Danish Jersey) was available. Second, the 12,322 bulls imputed to HD genotypes were further imputed to the full sequence variants. Variants from the 253 dairy bulls were used as reference to impute the imputed HD data to the whole genome level using the software Beagle [15]. For imputation from HD to sequence data, chromosomes were divided into chunks of about 20,000 consecutive markers with an overlap of 250 markers at each end to minimize imputation error at ends of the chunks. Only polymorphisms identified both at Aarhus University dataset and the 1000 Bull Genomes dataset were imputed. For positions containing both a SNP and an insertion-deletion polymorphism (INDEL), the INDEL was deleted. SNPs at positions with disagreements between alleles from sequence and HD data were deleted. The reference data was pre-phased with Beagle v3.3.2 [15] and all markers with an r2 value below 0.9 were removed. This left a total of 8,938,927 markers for chromosomes 1–29.

Statistical method for association analysis

Step I: Genome scan for FTI

A sire model was used for the initial genome screening for markers associated with FTI. A SNP-by-SNP analysis was carried out in which each SNP in turn was tested for association with the phenotype. The following model was used to estimate SNP effects: Where y was the EBV of individual k belonging to the half-sib (sire) family j from population i, μ was the general mean, p is fixed effect of i‘th population (RDCDNK, RDCSWE and RDCFIN), b was the allelic substitution effect, x was the number of copies of an allele (with an arbitrary labeling) of the SNP count in the individual k (corresponding to 0, 1, or 2 copies), and s was the random effect of the j‘th half-sib family assumed to follow a normal distribution s ~ N(0, σ2) where σ2 was the sire variance, and e was a random residual of individual k assumed to follow a normal distribution e ~ N(0, σ2). All statistical analyses were conducted using DMU [16]. The null hypothesis H0: b = 0 was tested with a t-test. A SNP was considered significant if the –log10(P) value for the SNP exceeded a genome wide Bonferroni corrected threshold of 8.25 (= −log10(0.05/8,938,927)) corresponding to a nominal type I error rate of 5 % corrected for 8,938,927 simultaneous tests.

Step II: Association analyses for the targeted regions

The results from the step I analysis were plotted in a Manhattan plot (Fig. 1). QTL peaks included in step II analyses were identified subjectively based on the Manhattan plot. The most significant SNP was the marker with the largest –log10(P) value in that peak. QTL regions were defined as a section of chromosome 0.5 Mb on both sides of the most significant SNPs for FTI in the genome. The selected regions were screened for association with the sub-indices. An animal model considering all relationships among individuals was used.
Fig. 1

Manhattan plot of the association results of Fertility index in Nordic Red Cattle. On the x-axis the chromosomes are represented. On the y-axis the –log10(P) is presented. The horizontal blue line indicates Bonferroni corrected P-value at the 5 % level

Manhattan plot of the association results of Fertility index in Nordic Red Cattle. On the x-axis the chromosomes are represented. On the y-axis the –log10(P) is presented. The horizontal blue line indicates Bonferroni corrected P-value at the 5 % level The association between the SNP and the phenotype was assessed by a single-locus regression analysis for each SNP separately, using a linear mixed model [17]. The model was as follows:where y was the EBV for j‘th bull from i‘th population, μ was the overall mean, p was a fixed effect of i‘th population (RDCDNK, RDCSWE and RDCFIN), b was the allelic substitution effect, x was the number of copies of an allele (with an arbitrary labeling) of the SNP count in the individual j (corresponding to 0, 1, or 2 copies), u was the random polygenic effects. The vector , whose elements were the u was assumed to follow a multivariate normal distribution  ~ N (0,  σ) where was the additive relationship matrix between sires and σ was the polygenic variance, and e was the random residual of individual j with a normal distribution N(0,σ). The model was fitted by REML using the software DMU [16]. The estimates and standard errors of the fixed effects estimates were obtained from DMU. Testing for the presence of an effect of a marker was done using a t-test against a null hypothesis of H0: b = 0. A SNP was considered significant if the –log(p) value for the SNP exceeded a genome wide Bonferroni corrected threshold of 8.25 (= −log10(0.05/8,938,927)) corresponding to a nominal type I error rate of 5 % corrected for 8,938,927 simultaneous tests.

Annotation of the top SNP

All positions provided in this text refer to the UMD3.1 assembly. The annotations of the most significant SNP markers in the selected regions were extracted from the ENSEMBL data base (http://www.ensembl.org, [18]): ENSEMBL Bos taurus 75.31, UMD3.1.

Results

Step I

The results of the step I analysis for FTI are presented Fig. 1 and in Additional file 2: Table S1. In total 17,388 significant SNP markers (−log10(P) > 8.25) were detected for FTI distributed over 25 different chromosomes. The most significant QTL was identified on BTA12 with 12,860 significantly associated SNP. For this QTL the causal variant, a 660-Kb deletion around 20.5 Mb at BTA12, has previously been described [5]. The other QTL were located on BTA1 (822 SNP), BTA2 (219 SNP), BTA3 (83 SNP), BTA5 (195 SNP), BTA6 (503 SNP), BTA7 (146 SNP), BTA8 (117 SNP), BTA10 (294 SNP), BTA11 (215 SNP), BTA13 (980 SNP), BTA14 (43 SNP), BTA15 (23 SNP), BTA17 (1 SNP), BTA19 (47 SNP), BTA20 (345 SNP), BTA22 (2 SNP), BTA23 (38 SNP), BTA24 (104 SNP), BTA25 (53 SNP), BTA26 (224 SNP), BTA27 (15 SNP), BTA28 (9 SNP) and BTA29 (46 SNP). The most significant QTL peaks are presented in Table 1.
Table 1

Overview of the genome-wide significant QTL for fertility index in Nordic Red cattle analyzed with a sire model

ChromosomeStart position (bp)End position (bp)Position of the most significantly associated SNP (bp)Marker name (rsname)MAF−log10(P)Allele substitution effectStandard errors
1128,128,973129,128,826128,628,880rs1334291710.3112.351.700.23
2132,612,052133,611,970133,111,977rs1094084370.4411.10−1.460.21
332,891,18433,888,41733,388,881rs433365590.4811.31−1.490.22
561,407,64462,407,44361,907,474rs1103697590.2811.501.700.24
643,012,00343,998,19043,511,992rs419832840.4012.751.630.22
697,617,38198,613,54798,115,824rs2088940940.3611.53−1.590.23
1356,548,49557,548,17457,048,185rs1335754830.2615.82−2.020.24
1534,177,50435,177,12434,677,367rs417632610.2112.37−1.910.26
2066,626,28667,624,45167,124,570rs424671550.3910.811.550.23
2431,321,19432,320,53531,820,659rs1371348410.4314.18−1.730.22
Overview of the genome-wide significant QTL for fertility index in Nordic Red cattle analyzed with a sire model

Step II

The regions selected based on association results for FTI (Table 1) were reanalyzed for the individual female fertility traits using sequence data in an animal model (Table 2). The QTL shown in Table 2 showed significant association with fertility sub-traits, however the magnitude of test statistics varied for individual sub-traits.
Table 2

Most significant SNP marker for FTI and the underlying female fertility traits

Trait namea Marker namersname−log10(P)Annotationb EnsembleGene nameGene descriptionProportion of additive genetic variance explained by the SNP
BTA1
FTIChr1:128614529rs13774844411.49intergenic_variant1.17
AISCChr1:127295226rs432716319.89intron_variantENSBTAG00000002232TRPC1 Bos taurus transient receptor potential cation channel, subfamily C, member 10.89
AISHChr1:128189301rs13623177011.75upstream_gene_variantENSBTAG00000021886GRK7 Bos taurus G protein-coupled receptor kinase 71.21
ICFChr1:128663781Chr1:128663781c 7.32intergenic_variant0.67
IFLCChr1:127295226rs4327163112.24intron_variantENSBTAG00000002232TRPC1 Bos taurus transient receptor potential cation channel, subfamily C, member 11.26
IFLHChr1:128189301rs13623177012.49upstream_gene_variantENSBTAG00000021886GRK7 Bos taurus G protein-coupled receptor kinase 71.32
NRRCChr1:127295226rs432716317.37intron_variantENSBTAG00000002232TRPC1 Bos taurus transient receptor potential cation channel, subfamily C, member 10.72
NRRHChr1:128106801rs432621998.36intergenic_variant0.93
BTA2
FTIChr2:133075293rs427611908.51intergenic_variant0.90
AISCChr2:133079214rs427611836.17intergenic_variant0.57
AISHChr2:132321329rs20924216813.30intron_variantENSBTAG00000040215EIF4G3Uncharacterized protein1.44
ICFChr2:133126588rs1092457949.98upstream_gene_variantENSBTAG00000012217PLA2G2F Bos taurus phospholipase A2, group IIF0.95
IFLCChr2:133077462rs427611866.00intergenic_variant0.57
IFLHChr2:132321329rs20924216810.50intron_variantENSBTAG00000040215EIF4G3Uncharacterized protein1.13
NRRCChr2:131498393rs1100829976.92intergenic_variant0.74
NRRHChr2:132321329rs20924216813.01intron_variantENSBTAG00000040215EIF4G3Uncharacterized protein1.51
BTA3
FTIChr3:33388881rs4333655914.22intergenic_variant1.56
AISCChr3:33388881rs4333655915.78intergenic_variant1.47
AISHChr3:30770425rs13466437823.07intron_variantENSBTAG00000014299RHOC Bos taurus ras homolog gene family, member C2.48
ICFChr3:31574900rs433349718.05intron_variant0.80
IFLCChr3:33388881rs4333655911.99intergenic_variant1.24
IFLHChr3:35629755rs10915885015.86intron_variantENSBTAG00000031575VAV3vav 3 guanine nucleotide exchange factor1.69
NRRCChr3:33388483rs38120770318.34intergenic_variant1.87
NRRHChr3:32024752rs4333293421.55upstream_gene_variantENSBTAG00000014102WDR77 Bos taurus WD repeat domain 772.41
BTA5
FTIChr5:62781359rs13509968210.79intergenic_variant1.15
AISCChr5:62781359rs1350996829.99intergenic_variant0.89
AISHChr5:62781359rs13509968213.87intergenic_variant1.37
ICFChr5:61879166rs2077549468.01intergenic_variant0.76
IFLCChr5:61625072rs43450537810.16intergenic_variant0.97
IFLHChr5:62781359rs13509968213.45intergenic_variant1.34
NRRCChr5:64863598rs3831022676.74intergenic_variant0.59
NRRHChr5:61836879rs2116815356.95intergenic_variant0.69
BTA6
FTIChr6:43511992rs4198328413.59intron_variantENSBTAG00000004653GPR125G protein-coupled receptor 1251.54
AISCChr6:43223916rs13325817512.17intergenic_variant1.15
AISHChr6:44642153rs20991867420.98intergenic_variant1.99
ICFChr6:42542483rs1103636067.77intron_variantENSBTAG00000047743KCNIP4Bos taurus Kv channel interacting protein 40.71
IFLCChr6:45794025rs10980819812.17intergenic_variant1.26
IFLHChr6:41455222rs4361045316.21intron_variantENSBTAG00000005108SLIT2Slit homolog 2 (Drosophila)1.59
NRRCChr6:42245672rs10951820012.48intron_variantENSBTAG00000047743KCNIP4 Bos taurus Kv channel interacting protein 41.27
NRRHChr6:44642153rs20991867417.63intergenic_variant1.80
BTA6
FTIChr6:98115824rs20889409411.29intergenic_variant1.26
AISCChr6:98115824rs2088940946.38intergenic_variant0.58
AISHChr6:99400480rs37990898714.72intron_variantENSBTAG00000022449SCD5 Bos taurus stearoyl-CoA desaturase 51.51
ICFChr6:98048673rs13335708610.91intergenic_variant1.06
IFLCChr6:98115824rs2088940948.88intergenic_variant0.92
IFLHChr6:99400480rs37990898713.58intron_variantENSBTAG00000022449SCD5 Bos taurus stearoyl-CoA desaturase 51.42
NRRCChr6:98385129rs2107636294.43intergenic_variant0.45
NRRHChr6:96953202rs11037902312.73intron_variantENSBTAG00000035776C4orf22Chromosome 4 open reading frame 221.19
BTA13
FTIChr13:58664049rs37899862514.85intron_variantENSBTAG00000013574ANKRD60Ankyrin repeat domain 601.75
AISCChr13:56125951rs10918570612.69intergenic_variant1.18
AISHChr13:59923876rs4170095616.72intergenic_variant2.28
ICFChr13:59112800rs4255567212.19downstream_gene_variantENSBTAG00000013574ANKRD60Ankyrin repeat domain 601.26
IFLCChr13:57020976rs10948418013.17intergenic_variant1.37
IFLHChr13:59923876rs4170095616.77intergenic_variant2.29
NRRCChr13:60559073rs416995466.44intergenic_variant0.60
NRRHChr13:57048934rs11028039812.14intergenic_variant1.23
BTA15
FTIChr15:34677367rs4176326115.90intron_variantENSBTAG00000001410GRAMD1BGRAM domain containing 1B1.68
AISCChr15:34737376rs11067059010.87intron_variantENSBTAG00000001410GRAMD1BGRAM domain containing 1B0.99
AISHChr15:33428419rs3822783629.99intergenic_variant0.97
ICFChr15:34702074rs417633269.86intron_variantENSBTAG00000001410GRAMD1BGRAM domain containing 1B0.89
IFLCChr15:34677367rs4176326115.55intron_variantENSBTAG00000001410GRAMD1BGRAM domain containing 1B1.55
IFLHChr15:33428419rs38227836210.81intergenic_variant1.08
NRRCChr15:34829707rs37892722211.19intergenic_variant1.67
NRRHChr15:34802991rs20857557711.65intergenic_variant1.50
BTA20
FTIChr20:67116858rs13348850011.44intergenic_variant1.39
AISCChr20:67623217rs11004569013.52intergenic_variant1.22
AISHChr20:68086468rs20893647915.87intergenic_variant1.68
ICFChr20:66806800rs38436343010.20intergenic_variant0.93
IFLCChr20:67124570rs4246715513.09intergenic_variant1.50
IFLHChr20:68084517rs38359001315.71intergenic_variant1.71
NRRCChr20:67623217rs11004569010.79intergenic_variant1.05
NRRHChr20:68086468rs20893647912.82intergenic_variant1.46
BTA24
FTIChr24:31820659rs13713484118.50intron_variantENSBTAG00000007383ZNF521Zinc finger protein 5212.17
AISCChr24:31820659rs13713484115.00intron_variantENSBTAG00000007383ZNF521Zinc finger protein 5211.48
AISHChr24:31817915rs38117489725.32intron_variantENSBTAG00000007383ZNF521Zinc finger protein 5212.80
ICFChr24:31442858rs437379727.94intergenic_variant0.70
IFLCChr24:31820659rs13713484114.49intron_variantENSBTAG00000007383ZNF521Zinc finger protein 5211.58
IFLHChr24:31817915rs38117489722.20intron_variantENSBTAG00000007383ZNF521Zinc finger protein 5212.50
NRRCChr24:30565598rs4244696311.49intergenic_variant1.16
NRRHChr24:31817915rs38117489718.03intron_variantENSBTAG00000007383ZNF521Zinc finger protein 5212.13

A marker is significant if the –log10(P) > 8.25

aFTI: fertility index, AIS: number of inseminations per conception, ICF: length in days of the interval from calving to first insemination, IFL: days from first to last insemination, and NRR: 56-day non-return rate (Suffix h: heifers and suffix c: cows)

bIn case the SNP marker is annotated as a downstream_gene_variant or an upstream_gene_variant the gene closest located to this SNP is mentioned

cSNP marker did not have an rs id

Most significant SNP marker for FTI and the underlying female fertility traits A marker is significant if the –log10(P) > 8.25 aFTI: fertility index, AIS: number of inseminations per conception, ICF: length in days of the interval from calving to first insemination, IFL: days from first to last insemination, and NRR: 56-day non-return rate (Suffix h: heifers and suffix c: cows) bIn case the SNP marker is annotated as a downstream_gene_variant or an upstream_gene_variant the gene closest located to this SNP is mentioned cSNP marker did not have an rs id On BTA1 the associations with the highest test statistic were found for IFLC and IFLH. The most significant SNP marker for IFLC was located in the TRPC1 gene, while the most significant SNP marker for IFLH was annotated as an upstream gene variant of the GRK7 gene. On BTA2 the associations with the highest test statistic were found for AISH, IFLH, and NRRH. The SNP BTA2:132321329, (rs209242168) was the SNP with the highest test statistics for all three traits. This SNP is located within the EIF4G3 gene. On BTA3 the associations with the highest test statistic were found for AISH and NRRH. For the QTL for AISH the most significant SNP was located within the RHOC gene, while for the QTL for NRRH the most significant SNP was l annotated as an upstream gene variant of the WDR77 gene. On BTA5 the SNP with the highest test statistic for FTI was BTA5:62781359 (rs135099682) with a –log10(P) of 10.79. The same marker was also the most significant marker for the traits AISC, AISH and IFLH with –log10(P) values of 9.99, 13.87, and 13.45 respectively. On BTA6 the associations with the highest test statistic were found for AISH and NRRH. Both SNPs (Chr6:43223916, Chr6:44642153) were located in an intergenic region (Table 2). However the most significant SNP marker (BTA6:43511992, rs41983284) for FTI was located within the GPR125 gene. A second QTL on BTA6 was detected and showed the strongest association with AISH and IFLH. The SNP BTA6:99400480 (rs379908987) was the most significant SNP for both AISH and IFLH and were located within the SCD5 gene. The SNP with the highest test statistics for NRRH (BTA6:96953202, rs110379023) was located within the C4orf22 gene. Furthermore, FTI was most strongly associated with the SNP BTA6:98115824 (rs208894094). This SNP was also the most significant marker for AISC and IFLC. On BTA13 the associations with the highest test statistic were found for IFLC and IFLH, whereas the marker with the highest test statistic for FTI was located in the ANKRD60 gene. On BTA15 the associations with the highest test statistic were found for IFLC and NRRH. The most significant SNP marker for IFLC is located within the GRAMD1B gene, while the most significant SNP marker for NRRH is located in an intergenic region. On BTA20 all individual female fertility traits had a –log10(P) above 10. All SNP markers detected were located in an intergenic region. On BTA24 the associations with the highest test statistic were found for AISH and IFLH. The significant SNPs were located within the ZNF521 gene.

Discussion

In this study we have applied a two-step approach to screen the cattle genome for QTL for the female fertility index. In the first step the cattle genome was screened for QTL for the FTI based on the whole genome sequence variants using a sire model without considering the relationship among all animals in the study except the sire-son relationship. This was done to pre-select markers to analyze with a linear mixed model. Based on the results in step I, targeted QTL regions for the FTI were reanalyzed for the underlying female fertility traits using sequence data and an animal model taking the relationship among sires included in the study. Below we discuss the findings from individual targeted regions.

BTA1

In this region for the FTI QTL AISC, IFLC and NRRC have their most significant SNP (BTA1:127295226, rs43271631) in common. AISC and IFLC are phenotypes which are highly connected as they measure the same i.e., AIS reflects the number of inseminations, whereas IFL measures the time this has taken. BTA1:127295226 is located in intronic region of TRPC1. TRPC1 has functions related to protein binding (GO:0005515). In mice it has been shown that osteoblast formation and function is regulated by a TRPC1 protein-dependent pathway [19]. Furthermore, AISH and IFLH share their most significant SNP. It is located up-stream of GRK7. Höglund et al. [6] validated two SNP in Nordic Holstein, Jersey and Nordic Red for NRRH at 124.8 Mb and 135.6 Mb, which is 4 to 5 Mb away from the QTL peak for FTI detected in this study. In Danish and Swedish Holstein significant SNPs were detected at 72.0 and 92.4 Mb respectively [20].

BTA2

The most significant marker (BTA2:132321329, rs209242168) was in common for the traits AISH, IFLH and NRRH and located in the intron of EIF4G3. The function of EIF4G3 is related to protein binding (GO:0005515). Previously a SNP on BTA2 has been detected for NRRC at 12.4 Mb in Finnish Ayrshire [4]. Even though the Finnish Ayrshire is part of the Nordic Red cattle population the SNP at 12.4 Mb for NRRC did not show significant effect in our study.

BTA3

The most significant SNP for FTI (BTA3:33388881, rs43336559) was also the most significant SNP for AISC and IFLC. This SNP is intergenic close to SLC6A17 gene (BTA3: 33,337,960-33,373,481). SLC6A17 gene functions as a vesicular transporter selective for proline, glycine, leucine, and alanine [21]. Complex Vertebral Malformation (CVM), which is characterized by misshapen and fused vertebrae around the cervico-thoracic junction is caused by a mutation in the Golgi-resident nucleotide-sugar transporter encoded by SLC35A3 located a 43.4 Mb on BTA3 [22]. This QTL on BTA3 has most significant effect on number of insemination, non-return rate and interval from first to last insemination indicating that it could be related to early embryonic death. Therefore SLC6A17 is a strong candidate gene for this QTL. Sahana et al. [20] detected significant SNP markers for FTI at 81.3 Mb in Danish and Swedish Holstein. Höglund et al. [6] has validated 10 SNPs on BTA3 for FTI, AISH, NRRC, and NRRH. The closest marker is for FTI at 36.9 Mb, the other 9 markers were not in the range of 32.8 to 33.8 Mb.

BTA5

The most significant SNP for FTI (BTA5:62781359, rs135099682) was also the most significant SNP for AISC, AISH, and IFLC. This SNP was not assigned to any gene. Sahana et al. [20] detected a significant SNP at 116.3 Mb in Nordic Holstein for FTI, AISC, and IFLC. Höglund et al. [6] have validated SNP for ICF at 88.1 Mb and at 15.2, 15.5, and 37.2 Mb for NRRH. None of these SNPs were located in the QTL region detected in this study.

BTA6

BTA6 harbors 2 QTL for fertility traits 54 Mb apart. In the first QTL defined in the region 43.0 to 43.9 Mb the most significant marker (BTA6:43511992, rs41983284) was located in an intron in GPR125. GPR125 has functions related to protein binding (GO:0005515). Even though many female fertility traits were highly significant, none of the traits shared their most significant SNP (Table 2). The most significant SNPs for ICF and NRRC were located in intron regions of KCNIP4. The function of KCNIP4 is among others related to protein binding (GO:0005515). In the region from 97.6 to 98.6 Mb the most significant SNP marker (BTA6:98115824, rs208894094) for FTI was also the most significant marker for AISC and IFLC. Furthermore AISH and IFLH had their most significant marker (BTA6:99400480, rs379908987) located in SDS5 gene. The function of SDS5 is related to stearoyl-CoA desaturase activity (GO:0004768). Sahana et al. [20] identified a SNP marker for ICF on BTA6 in Danish and Swedish Holstein around 89.7 Mb. Pimentel et al. [23] identified a candidate gene (IGFBP7) located at BTA6 influencing NRRH and IFLH. However this gene is not located within any of the QTL areas.

BTA13

The most significant SNP maker for FTI on BTA13 (BTA13:58664049, rs378998625) was located in the gene ANKRD60. This gene is related to protein binding (GO:0005515). In a window of 1 Mb around BTA13:58664049 the traits AISH, AISC, ICF, IFLH, IFLC and NRRH showed highly significant markers (Table 2). None of the top associated marker for the individual traits was in common except for AISH and IFLH (BTA13:59923876, rs41700956). The traits AISH and IFLH share the underlying biology as AISH reflect the number of inseminations and IFLH measures the time between first and last insemination. Previously, Schulman et al. [4] detected a significant SNP on BTA13 for ICF in Finnish Ayrshire which was located at 18.1 Mb. In Nordic Holstein (NH) two SNP markers were detected close to the region of 56.5 to 57.5 Mb [20] on BTA13 for ICF, but the major QTL for ICF in NH was in the region 33.9 to 34.0 Mb. Furthermore, a QTL affecting ICF on BTA13 in Nordic Holstein was validated in Jersey and NRC by Höglund et al. [6]. The 4 validated SNP markers were in the range of 28 to 45 Mb. However, none of the validated SNPs across three breeds for ICF were significantly associated with FTI in this study. As the peak detected for FTI as well as AISH, AISC, ICF, IFLH, IFLC and NRRH in NRC does not show overlap with QTL detected for fertility traits in other cattle breeds indicates that this QTL may be private to the NRC population.

BTA15

In the region 34.1 to 35.1 Mb FTI, AISC, ICF, and IFLC did not have their most significant SNP in common, but each of their significant SNP was located in the intron region of GRAMD1B. This gene’s involvement with female fertility is unknown. Höglund et al. [6] confirmed one SNP marker influencing IFLH in three cattle breeds (Nordic Holstein, Jersey and Nordic Red), however this marker was located at 23.6 Mb and did not overlap with the QTL detected for FTI in this study.

BTA20

The most significant SNP for FTI is BTA20:67116858 (rs133488500). All the female fertility traits have high –log10(P) of over 10 in the region of 500 kbp around SNP marker BTA20:67116858 (Table 2). None of these SNP markers were annotated to a gene. One marker (BTB-01617409) located at 66,926,929 bp on BTA20 has been validated in Nordic Holstein, Jersey and Nordic Red for FTI [6]. This marker is only 100 kbp away from the most significant SNP marker for ICF (BTA20:66806800, rs384363430). Previously a significant SNP was detected on BTA20 for ICF in Finnish Ayrshire [4]. However with the position of this marker around 4.9 Mb this is not overlapping with the region for FTI identified in this study.

BTA24

The most significant SNP for FTI (BTA24:31820659, rs137134841) coincided with the most significant SNP for AISC and IFLC. In the same region ranging from 31.1 to 32.1 Mb AISH, IFLH and NRRH shared their most significant SNP (BTA24:31817915, rs381174897). The two SNPs are close to each other in the genome and are located in an intron in the gene ZNF521. ZNF521 is among others related to protein domain specific binding (GO:0019904). This gene has not been reported to be associated with fertility before, but has been reported as a candidate gene for the trait, rear side view, in Chinese Holstein cattle [24]. Previously significant SNP associations on BTA24 have been identified for ICF on BTA24 in Finnish Ayrshire [4]. These SNPs however are not located in the QTL region detected in this study.

Intergenic regions

All the significant SNPs on BTA5 and BTA20 were located in intergenic regions. With the improvement of the Bovine annotation of the genes and localization of the set sites of gene transcription, initiation, termination as well as differential splicing one would expect the identification of the causal mutation to improve. To date information on genomic structure of organisms which are better annotated like mouse and human are used as an information source. However it has been shown that areas which contain mainly regulatory elements and non-protein coding regions the annotation are more challenging [25, 26]. Though we have used the “full” sequence level variants in our analysis half of the total genetic variants identified in the WGS were filtered out for various quality control reasons. All the variants which were not bi-allelic were dropped due to limitations in the imputation software. Therefore the actual causal polymorphism may be missing from the data analyzed here.

Cow versus heifer traits

The genetic correlation between cow and heifer fertility is generally low [27, 28]. This indicates that little overlap in genes for cows and heifers influencing female fertility are expected. The results of this study show that if female fertility traits had their most significant SNP marker in common with the fertility index these traits were either cow traits or heifer traits. This is in agreement with previous QTL mapping studies [6, 29].

Conclusion

The results of this study showed that 1) the traits AISH and IFLH had in many cases the same significant SNP marker in the QTL detected for FTI, 2) the majority of the candidate genes assigned to the most significant SNP regulating female fertility were involved in protein binding, and 3) a SNP marker on BTA20 located in the QTL region for FTI has previously been validated for female fertility in three different breeds.

Availability of data

Genome assembly data were taken from publicly available sources. The assembly is available for download (ftp://ftp.ncbi.nlm.nih.gov/genomes/Bos_taurus/). All whole genome sequencing data for RDC animals are now part of the 1000 Bull Genomes Project. All variations detected by the 1000 Bull Genomes Project have been or will be submitted by the 1000 Bull Genomes Project to dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/). Whole genome sequence data for the 253 individuals included in Run2 of the 1000 Bull Genomes Project are available at NCBI using SRA no. SRP039339 (http://www.ncbi.nlm.nih.gov/bioproject/PRJNA238491). All annotation information was obtained from a publicly available source (http://www.ensembl.org).
  26 in total

1.  Genome-wide association study using high-density single nucleotide polymorphism arrays and whole-genome sequences for clinical mastitis traits in dairy cattle.

Authors:  G Sahana; B Guldbrandtsen; B Thomsen; L-E Holm; F Panitz; R F Brøndum; C Bendixen; M S Lund
Journal:  J Dairy Sci       Date:  2014-08-22       Impact factor: 4.034

2.  A missense mutation in the bovine SLC35A3 gene, encoding a UDP-N-acetylglucosamine transporter, causes complex vertebral malformation.

Authors:  Bo Thomsen; Per Horn; Frank Panitz; Emøke Bendixen; Anette H Petersen; Lars-Erik Holm; Vivi H Nielsen; Jørgen S Agerholm; Jens Arnbjerg; Christian Bendixen
Journal:  Genome Res       Date:  2005-12-12       Impact factor: 9.043

3.  A TRPC1 protein-dependent pathway regulates osteoclast formation and function.

Authors:  E-Ching Ong; Vasyl Nesin; Courtney L Long; Chang-Xi Bai; Jan L Guz; Ivaylo P Ivanov; Joel Abramowitz; Lutz Birnbaumer; Mary Beth Humphrey; Leonidas Tsiokas
Journal:  J Biol Chem       Date:  2013-06-14       Impact factor: 5.157

4.  Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle.

Authors:  Hans D Daetwyler; Aurélien Capitan; Hubert Pausch; Paul Stothard; Rianne van Binsbergen; Rasmus F Brøndum; Xiaoping Liao; Anis Djari; Sabrina C Rodriguez; Cécile Grohs; Diane Esquerré; Olivier Bouchez; Marie-Noëlle Rossignol; Christophe Klopp; Dominique Rocha; Sébastien Fritz; André Eggen; Phil J Bowman; David Coote; Amanda J Chamberlain; Charlotte Anderson; Curt P VanTassell; Ina Hulsegge; Mike E Goddard; Bernt Guldbrandtsen; Mogens S Lund; Roel F Veerkamp; Didier A Boichard; Ruedi Fries; Ben J Hayes
Journal:  Nat Genet       Date:  2014-07-13       Impact factor: 38.330

5.  Association of heifer fertility with cow fertility and yield in dairy cattle.

Authors:  L B Hansen; A E Freeman; P J Berger
Journal:  J Dairy Sci       Date:  1983-02       Impact factor: 4.034

6.  Ensembl 2014.

Authors:  Paul Flicek; M Ridwan Amode; Daniel Barrell; Kathryn Beal; Konstantinos Billis; Simon Brent; Denise Carvalho-Silva; Peter Clapham; Guy Coates; Stephen Fitzgerald; Laurent Gil; Carlos García Girón; Leo Gordon; Thibaut Hourlier; Sarah Hunt; Nathan Johnson; Thomas Juettemann; Andreas K Kähäri; Stephen Keenan; Eugene Kulesha; Fergal J Martin; Thomas Maurel; William M McLaren; Daniel N Murphy; Rishi Nag; Bert Overduin; Miguel Pignatelli; Bethan Pritchard; Emily Pritchard; Harpreet S Riat; Magali Ruffier; Daniel Sheppard; Kieron Taylor; Anja Thormann; Stephen J Trevanion; Alessandro Vullo; Steven P Wilder; Mark Wilson; Amonida Zadissa; Bronwen L Aken; Ewan Birney; Fiona Cunningham; Jennifer Harrow; Javier Herrero; Tim J P Hubbard; Rhoda Kinsella; Matthieu Muffato; Anne Parker; Giulietta Spudich; Andy Yates; Daniel R Zerbino; Stephen M J Searle
Journal:  Nucleic Acids Res       Date:  2013-12-06       Impact factor: 16.971

7.  A 660-Kb deletion with antagonistic effects on fertility and milk production segregates at high frequency in Nordic Red cattle: additional evidence for the common occurrence of balancing selection in livestock.

Authors:  Naveen Kumar Kadri; Goutam Sahana; Carole Charlier; Terhi Iso-Touru; Bernt Guldbrandtsen; Latifa Karim; Ulrik Sander Nielsen; Frank Panitz; Gert Pedersen Aamand; Nina Schulman; Michel Georges; Johanna Vilkki; Mogens Sandø Lund; Tom Druet
Journal:  PLoS Genet       Date:  2014-01-02       Impact factor: 5.917

8.  Genome wide association studies for body conformation traits in the Chinese Holstein cattle population.

Authors:  Xiaoping Wu; Ming Fang; Lin Liu; Sheng Wang; Jianfeng Liu; Xiangdong Ding; Shengli Zhang; Qin Zhang; Yuan Zhang; Lv Qiao; Mogens Sandø Lund; Guosheng Su; Dongxiao Sun
Journal:  BMC Genomics       Date:  2013-12-17       Impact factor: 3.969

9.  Validation of associations for female fertility traits in Nordic Holstein, Nordic Red and Jersey dairy cattle.

Authors:  Johanna K Höglund; Goutam Sahana; Bernt Guldbrandtsen; Mogens S Lund
Journal:  BMC Genet       Date:  2014-01-15       Impact factor: 2.797

10.  Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle.

Authors:  Rasmus Froberg Brøndum; Bernt Guldbrandtsen; Goutam Sahana; Mogens Sandø Lund; Guosheng Su
Journal:  BMC Genomics       Date:  2014-08-27       Impact factor: 3.969

View more
  13 in total

1.  Reduced representation approach for identification of genome-wide SNPs and their annotation for economically important traits in Indian Tharparkar cattle.

Authors:  M Joel Devadasan; D Ravi Kumar; M R Vineeth; Anjali Choudhary; T Surya; S K Niranjan; Archana Verma; Jayakumar Sivalingam
Journal:  3 Biotech       Date:  2020-06-16       Impact factor: 2.406

2.  Association analysis of loci implied in "buffering" epistasis.

Authors:  Antonio Reverter; Zulma G Vitezica; Marina Naval-Sánchez; John Henshall; Fernanda S S Raidan; Yutao Li; Karin Meyer; Nicholas J Hudson; Laercio R Porto-Neto; Andrés Legarra
Journal:  J Anim Sci       Date:  2020-03-01       Impact factor: 3.159

Review 3.  Synergies between assisted reproduction technologies and functional genomics.

Authors:  Pasqualino Loi; Paola Toschi; Federica Zacchini; Grazyna Ptak; Pier A Scapolo; Emanuele Capra; Alessandra Stella; Paolo Ajmone Marsan; John L Williams
Journal:  Genet Sel Evol       Date:  2016-08-01       Impact factor: 4.297

4.  Meta-analysis of sequence-based association studies across three cattle breeds reveals 25 QTL for fat and protein percentages in milk at nucleotide resolution.

Authors:  Hubert Pausch; Reiner Emmerling; Birgit Gredler-Grandl; Ruedi Fries; Hans D Daetwyler; Michael E Goddard
Journal:  BMC Genomics       Date:  2017-11-09       Impact factor: 3.969

5.  New insights into the associations among feed efficiency, metabolizable efficiency traits and related QTL regions in broiler chickens.

Authors:  Wei Li; Ranran Liu; Maiqing Zheng; Furong Feng; Dawei Liu; Yuming Guo; Guiping Zhao; Jie Wen
Journal:  J Anim Sci Biotechnol       Date:  2020-06-26

6.  Human-Mediated Introgression of Haplotypes in a Modern Dairy Cattle Breed.

Authors:  Qianqian Zhang; Mario P L Calus; Mirte Bosse; Goutam Sahana; Mogens Sandø Lund; Bernt Guldbrandtsen
Journal:  Genetics       Date:  2018-05-30       Impact factor: 4.562

7.  Assessing the genetic background and genomic relatedness of red cattle populations originating from Northern Europe.

Authors:  Christin Schmidtmann; Anna Schönherz; Bernt Guldbrandtsen; Jovana Marjanovic; Mario Calus; Dirk Hinrichs; Georg Thaller
Journal:  Genet Sel Evol       Date:  2021-03-06       Impact factor: 4.297

8.  Meta-Analysis of Heifer Traits Identified Reproductive Pathways in Bos indicus Cattle.

Authors:  Muhammad S Tahir; Laercio R Porto-Neto; Cedric Gondro; Olasege B Shittu; Kimberley Wockner; Andre W L Tan; Hugo R Smith; Gabriela C Gouveia; Jagish Kour; Marina R S Fortes
Journal:  Genes (Basel)       Date:  2021-05-18       Impact factor: 4.096

9.  Fertility-Associated Polymorphism within Bovine ITGβ5 and Its Significant Correlations with Ovarian and Luteal Traits.

Authors:  Jianing Zhao; Jie Li; Fugui Jiang; Enliang Song; Xianyong Lan; Haiyu Zhao
Journal:  Animals (Basel)       Date:  2021-05-28       Impact factor: 2.752

10.  Genome-wide association analysis of milk yield traits in Nordic Red Cattle using imputed whole genome sequence variants.

Authors:  T Iso-Touru; G Sahana; B Guldbrandtsen; M S Lund; J Vilkki
Journal:  BMC Genet       Date:  2016-03-22       Impact factor: 2.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.