| Literature DB >> 26181055 |
ShiGang Yu1, WeiWei Chu1, LiFan Zhang1, HouMing Han2, RongXue Zhao2, Wei Wu2, JiangNing Zhu2, Michael V Dodson3, Wei Wei1, HongLin Liu1, Jie Chen1.
Abstract
Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the candidate laying-related SNP, including membrane associated guanylate kinase (MAGI-1), KIAA1462, Rho GTPase activating protein 21 (ARHGAP21), acyl-CoA synthetase family member 2 (ACSF2), astrotactin 2 (ASTN2). Collectively, our data suggests that 8 SNP and 5 genes might be promising candidate markers or targets for marker-assisted selection of egg numbers in geese.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26181055 PMCID: PMC4504669 DOI: 10.1371/journal.pone.0131572
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Egg numbers of the experimental population in 34-weekegg-laying period.
| Population | Number of geese | EBVs | Average egg number |
|---|---|---|---|
| All geese | 492 | 74.08±8.01 | 73.95±19.73 |
| LEBV | 10 | 58.87±3.97 | 34.9±12.73 |
| HEBV | 10 | 87.42±4.18 | 105.6±3.98 |
Restriction-site associated DNA (RAD) sequencing statistics of EcoRI library from two DNA pools.
| Sample | Clean Data | Clean Reads Number | RAD-tag number | Average Depth |
|---|---|---|---|---|
| LEBV | 1,727.02Mb | 18,771,943 | 884,827 | 17.33 |
| HEBV | 2,163.79Mb | 23,519,413 | 942,117 | 20.47 |
Allelic frequencies distribution between LEBV and HEBV cohorts.
| SNP | primer | genotype | HEBV (Allele frequencies) | LEBV (Allele frequencies) | P value |
|---|---|---|---|---|---|
| Record-106975 | AATTCTTGCCTAAATAAC(A/G) | G/A | 0.10/0.90 | 1.0/0 | 4.19×10−8 |
| ACACATTGATGCTGCAAATT | |||||
| Record-134172 | GCTGACAGCTCATTTGAT(A/T) | A/T | 1.0/0 | 0.30/0.70 | 3.34×10−6 |
| CAGGATCACGTCCTCAAC | |||||
| Record-112359 | GAAGCGCCTGGTGGCCTCACC(T/G) | T/G | 0/1.0 | 0.80/0.20 | 7.71×10−7 |
| TACCTGCCTGAGAGATGGAGTGTTGG | |||||
| Record-106582 | TTCAGGGAACTCAAACTATA(C/A) | C/A | 0.20/0.80 | 0.90 /0.10 | 1.66×10−5 |
| CTCACGTGGCCTCTACAA | |||||
| Record-111407 | TTTTGGCCGTGGTTTCC(T/A) | T/A | 0.40/0.60 | 1.0/0 | 4.51×10−5 |
| ATCAGACCCGTGGTGGAAC | |||||
| Record-135849 | CTGTGTTGATATTTTCTGATAGTA | T/C | 0.70/0.30 | 0.10/0.90 | 2.44×10−4 |
| TGTTTAGCAGGAGCACAA(T/C) | |||||
| Record-88247 | AGAGAATTTAGTCATATTTGGGT | G/A | 0.21/0.79 | 0.85/0.15 | 2.20×10−5 |
| TAATTTGGACTCAATCAAAA(C/T) | |||||
| Record-135057 | CAGGGGGATTCTAGTAGTTTCCT | G/A | 0.14/0.86 | 0.80/0.20 | 6.34×10−6 |
| TCACTGCTGCAGACACGCA(T/C) | |||||
| Record-130652 | AGTTACATTTAAAAGTGTCTGGTC | G/A | 0.75/0.25 | 0.05/0.95 | 1.02×10−6 |
| CGATCAAAACAAATGGTAGAC(C/T) | |||||
| Record-130775 | CAGGGGGATTCTAGTAGTTTCCT | A/G | 0.96/0.04 | 0.10/0.90 | 4.63×10−10 |
| TCACTGCTGCAGACACGCA(T/C) |
Fig 1Genotyping of the 10 SNP by allele specific PCR in geese.
Electropherosis of AS-PCR revealed different genotypes of 10 SNPs. Eight individuals were randomly selected to show three genotypes of each SNP.
SNP Identification and Genotyping by allele specific PCR: genotype and allele frequencies, polymorphism information content, and chi-square tests of goodness-of-fit for Hardy-Weinberg equilibrium in the experimental population.
| SNP | Sample size | Genotype frequencies (%) | Allele frequencies(%) | He | H0 | P valve (χ2,HWE) | PIC | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Record-106975 | 449 | GG25.4 | GA30.1 | AA44.5 | G40.4 | A59.6 | 0.4817 | 0.3007 | 0.0000 | 0.3657 |
| Record-134172 | 332 | AA46.7 | AT33.7 | TT19.6 | A63.6 | T36.4 | 0.4633 | 0.3373 | 0.0000 | 0.3560 |
| Record-112359 | 330 | TT35.4 | TG39.7 | GG24.9 | T55.3 | G44.7 | 0.4944 | 0.3970 | 0.0001 | 0.3722 |
| Record-106582 | 472 | CC32.6 | CA42.6 | AA24.8 | C53.9 | A46.1 | 0.4969 | 0.4258 | 0.0018 | 0.3735 |
| Record-111407 | 457 | TT43.5 | TA38.7 | AA17.8 | T62.9 | A37.1 | 0.4669 | 0.3868 | 0.0002 | 0.3579 |
| Record-135849 | 193 | TT21.8 | TC52.3 | CC25.9 | T47.9 | C52.1 | 0.4991 | 0.5233 | 0.5544 | 0.3746 |
| Record-88247 | 260 | GG53.9 | GA38.4 | AA7.70 | G73.1 | A26.9 | 0.3935 | 0.3846 | 0.6344 | 0.3161 |
| Record-135057 | 341 | GG47.5 | GA41.6 | AA10.9 | G68.3 | A31.7 | 0.4328 | 0.4164 | 0.5060 | 0.3391 |
| Record-130652 | 342 | GG12.6 | GA32.2 | AA55.2 | G28.7 | A71.4 | 0.4089 | 0.3216 | 0.0002 | 0.3253 |
| Record-130775 | 446 | AA53.4 | AG8.30 | GG38.3 | A57.5 | G42.5 | 0.4887 | 0.0830 | 0.0000 | 0.3693 |
Note: PIC means polymorphism information content.
*means significance at the p< 0.01 level.
Least squares means and standard deviations of the association analysis between different genotypes on egg number.
| SNP | Genotype | Number | Egg production | P value |
|---|---|---|---|---|
| Record-106975 | GG | 115 | 77.83±24.28 | 2.0×10−4 |
| GA | 133 | 77.64±23.20 | ||
| AA | 201 | 67.24±20.76 | ||
| Record-134172 | AA | 155 | 74.48±24.99 | 0.001 |
| AT | 112 | 64.11±21.29 | ||
| TT | 65 | 69.78±18.22 | ||
| Record-112359 | TT | 117 | 76.41±22.31 | 4×10−4 |
| TG | 131 | 69.17±24.46 | ||
| GG | 82 | 62.50±19.07 | ||
| Record-106582 | AA | 154 | 78.66±22.18 | 4×10−4 |
| CA | 201 | 75.91±21.74 | ||
| CC | 117 | 64.41±22.19 | ||
| Record-111407 | AA | 82 | 81.05±23.02 | 4×10−6 |
| TA | 177 | 75.99±21.45 | ||
| TT | 198 | 66.64±222.14 | ||
| Record-135849 | TT | 42 | 69.46±32.02 | 0.054 |
| TC | 101 | 72.20±21.25 | ||
| CC | 50 | 61.46±25.83 | ||
| Record-88247 | AA | 20 | 82.75±26.54 | 5×10−5 |
| GA | 100 | 70.05±22.73 | ||
| GG | 140 | 64.36±23.22 | ||
| Record-135057 | GG | 162 | 65.76±21.67 | 9×10−6 |
| GA | 142 | 73.61±24.33 | ||
| AA | 37 | 78.70±28.13 | ||
| Record-130652 | GG | 43 | 74.19±23.85 | 0.059 |
| GA | 110 | 73.81±26.64 | ||
| AA | 189 | 67.73±22.08 | ||
| Record-130775 | AA | 238 | 77.26±23.48 | 2×10−6 |
| AG | 37 | 74.86±18.83 | ||
| GG | 171 | 65.64±21.30 |
Note: Multiple comparisons were performed using the Duncan multiple-range test
a, b means with different superscripts in the same column are different with p<0.05
A, B means with different superscripts in the same column are different with p<0.01.
Fig 2Genetic networks with multiple makers established by the linear regression analysis for egg number in geese.
The numbers in arrows represent substitution effects of one type of genotypes or allele for another one. Each combined genotype(s) between different makers has two means of performance: predicted (left in the rectangle) and actual (right in the rectangle) values. A) Genetic networks with two markers established by the linear regression analysis for egg number in geese. B) Genetic networks with three markers established by the linear regression analysis for egg number in geese.
Nucleotide BLAST and gene cloning.
| SNP | DUCK | CHICKEN | ||||||
|---|---|---|---|---|---|---|---|---|
| Accession number | Query cover (%) | Identity (%) | Accession number | Query cover (%) | Identity (%) | Chromosome | Gene | |
| Record-106975 | ADON01015296.1 | 99 | 89 | AADN03006395.1 | 87 | 65 | 12 |
|
| Record-134172 | ADON01027705.1 | 99 | 94 | AADN03002000.1 | 99 | 79 | 2 |
|
| Record-112359 | ADON01005944.1 | 93 | 93 | AADN03002597.1 | 100 | 79 | 2 |
|
| Record-106582 | ADON01048494.1 | 100 | 81 | AADN03007512.1 | 40 | 65 | 18 |
|
| Record-111407 | ADON01049731.1 | 51 | 80 | AADN03007334.1 | 65 | 72 | 17 |
|
| Record-88247 | ADON01074474.1 | 97 | 90 | AADN03004436.1 | 83 | 78 | 5 | None |
| Record-135057 | ADON01024818.1 | 99 | 93 | AADN03000109.1 | 99 | 82 | 1 | None |
| Record-130775 | ADON01018463.1 | 100 | 92 | None | None | None | None | None |