| Literature DB >> 22574143 |
Mohamed Salem1, Roger L Vallejo, Timothy D Leeds, Yniv Palti, Sixin Liu, Annas Sabbagh, Caird E Rexroad, Jianbo Yao.
Abstract
Fast growth is an important and highly desired trait, which affects the profitability of food animal production, with feed costs accounting for the largest proportion of production costs. Traditional phenotype-based selection is typically used to select for growth traits; however, genetic improvement is slow over generations. Single nucleotide polymorphisms (SNPs) explain 90% of the genetic differences between individuals; therefore, they are most suitable for genetic evaluation and strategies that employ molecular genetics for selective breeding. SNPs found within or near a coding sequence are of particular interest because they are more likely to alter the biological function of a protein. We aimed to use SNPs to identify markers and genes associated with genetic variation in growth. RNA-Seq whole-transcriptome analysis of pooled cDNA samples from a population of rainbow trout selected for improved growth versus unselected genetic cohorts (10 fish from 1 full-sib family each) identified SNP markers associated with growth-rate. The allelic imbalances (the ratio between the allele frequencies of the fast growing sample and that of the slow growing sample) were considered at scores >5.0 as an amplification and <0.2 as loss of heterozygosity. A subset of SNPs (n = 54) were validated and evaluated for association with growth traits in 778 individuals of a three-generation parent/offspring panel representing 40 families. Twenty-two SNP markers and one mitochondrial haplotype were significantly associated with growth traits. Polymorphism of 48 of the markers was confirmed in other commercially important aquaculture stocks. Many markers were clustered into genes of metabolic energy production pathways and are suitable candidates for genetic selection. The study demonstrates that RNA-Seq at low sequence coverage of divergent populations is a fast and effective means of identifying SNPs, with allelic imbalances between phenotypes. This technique is suitable for marker development in non-model species lacking complete and well-annotated genome reference sequences.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22574143 PMCID: PMC3344853 DOI: 10.1371/journal.pone.0036264
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Workflow used for discovery of SNPs associated with growth traits in the rainbow trout transcriptome.
SNPs identified in RNA-Seq reads were called and filtered using Alpheus pipeline. The SNP detection stringency conditions include at least 4 reads calling the variant, >20% reads calling the variant and >20 Quality score. SNP putatively associated with fast growth were considered at allelic imbalances scores >5.0 as an amplification and <0.2 as loss of heterozygosity. SNPs were validated by individually genotyping the discovery panel. Putative SNPs were genotyped for association analysis on 778 fish (40 families).
Figure 2Variation in average family body weight (BW) measured in grams at approximately 6, 7, 9 and 12 months post-hatching (Weight1, Weight2, Weight3 and Weight4). CV (SD/mean) indicates the phenotypic coefficients of variation.
Color intensities (green, blue and red) reflect changes in mean of BWs of different families at Weight1, Weigh2 and Weight3, respectively. Up/right/down arrows indicate families’ mean BWs lie within top, middle and bottom 33% of the population at each age, respectively.
Association of nuclear SNPs with growth traits1 using family-based association analysis2.
| SNP | Weight2 | Weight3 | Weight4 | |||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| |
| nuSNP1 | −10.3 | −3.3 | 0.00096 | 0.096 | −25.6 | −2.8 | 0.00431 | 0.112 | −48.8 | −2.6 | 0.00782 | 0.084 |
| nuSNP7 | 30.6 | 3.5 | 0.00049 | 0.075 | 83.6 | 3.2 | 0.00124 | 0.082 | 144.8 | 2.8 | 0.00468 | 0.07 |
| nuSNP9 | −12.8 | −2.6 | 0.00781 | 0.141 | −24.8 | −1.8 | 0.06913 | 0.272 | −119 | −4.3 | 1.52e−05 | 0.002 |
| nuSNP17 | 10.8 | 3.1 | 0.00179 | 0.136 | 27.5 | 2.8 | 0.00517 | 0.133 | 47.5 | 2.3 | 0.02046 | 0.152 |
| nuSNP22 | −15.4 | −3.7 | 0.00022 | 0.058 | −32.9 | −2.7 | 0.00581 | 0.116 | −54.4 | −2.2 | 0.02665 | 0.144 |
| nuSNP23 | −16.9 | −4.2 | 0.995e−05 | 0.032 | −38.4 | −3.3 | 0.00078 | 0.059 | −58.1 | −2.3 | 0.01821 | 0.126 |
| nuSNP24 | −15.7 | −3.7 | 0.00018 | 0.051 | −33.1 | −2.7 | 0.00576 | 0.112 | −54.4 | −2.2 | 0.02721 | 0.147 |
| nuSNP25 | 9.4 | 2.6 | 0.00775 | 0.221 | 22.2 | 2.1 | 0.03076 | 0.258 | 61.2 | 2.7 | 0.00566 | 0.088 |
Body weight was recorded on each animal at approximately 7 (Weight2), 9 (Weight3) and 12 (Weight4) months post-hatching.
Family-based association analysis was performed with program PLINK version 1.07 [22]. Here, t is the t-statistic for regression of phenotype on allele count (), P is the asymptotic P-value for t-statistic, and the empirical P-value was estimated using 20,000 permutations.
indicates significance at P<0.01.
indicates significance at P<0.05.
Family-based association analysis of nuclear SNPs with growth traits1 using the R package GWAF2.
| SNP | Weight2 | Weight3 | Weight4 | ||||||||||||
| χ | DF |
| Model | h | χ | DF |
| Model | h | χ | DF |
| Model | h | |
| nuSNP7 | 233.08 | 1 | 1.27E−52 | D | 0 | 81.86 | 1 | 1.46E−19 | D | 0 | 12.45 | 1 | 0.00041 | D | 0 |
| nuSNP8 | 2.38 | 1 | 0.12291 | D | 0 | 33.25 | 1 | 8.10E−09 | D | 0 | 0.56 | 1 | 0.4539 | D | 0 |
| nuSNP12 | 8.43 | 2 | 0.01478 | G | 0.01 | 21.21 | 2 | 0.00002 | G | 0.01 | 17.27 | 2 | 0.00017 | G | 0.01 |
| nuSNP20 | 279.6 | 1 | 9.20E−63 | D | 0 | 101.75 | 1 | 6.31E−24 | D | 0 | 15.91 | 1 | 6.65E−05 | D | 0 |
| nuSNP21 | 245.88 | 1 | 2.05E−55 | D | 0 | 101.14 | 1 | 8.59E−24 | D | 0 | 12.22 | 1 | 0.00047 | D | 0 |
Body weight was recorded on each animal at approximately 7 (Weight2), 9 (Weight3) and 12 (Weight4) months post-hatching.
The genome-wide association analysis with family data (GWAF) R package [23] was used in the association analysis. The analyzed sample included 40 full-sib families each with ∼17 progeny. Here, we show the asymptotic P-value for the test statistic distributed as a χ2 with 1 and 2 DF for dominant and general model, respectively. The is the proportion of phenotypic variance explained by the tested SNP.
The general (G) and dominant (D) models had the highest likelihood in the association analysis of Weight2, Weight3 and Weight4.
indicates SNPs strongly associated with BW (P-value = 1.5E−19 to 9.2E−63).
indicates SNPs with weaker association (P-value = 2.0E−5 to 8.1E−09).
Association of nuclear SNPs with weight1 using family-based quantitative trait linkage disequilibrium (QTLD) analysis, the measured genotype test and the quantitative disequilibrium test (QTDT) 2.
| SNP |
|
|
| |||||||||
| Stratifi-cation | Measured genotype | QTDT | QTLD | Stratifi-cation | Measured genotype | QTDT | QTLD | Stratifi-cation | Measured genotype | QTDT | QTLD | |
| nuSNP7 | 0.486 | 0.042 | 0.032 | 0.032 | 0.639 | 0.030 | 0.040 | 0.040 | 0.328 | 0.157 | 0.105 | 0.105 |
| nuSNP12 | 0.124 | 0.149 | 0.065 | 0.065 | 0.368 | 0.009 | 0.023 | 0.023 | 0.105 | 0.003 | 0.016 | 0.016 |
| nuSNP21 | 0.000 | 1 | 0.125 | 0.125 | 0.000 | 0.136 | 0.019 | 0.019 | 0.020 | 0.047 | 0.019 | 0.019 |
| nuSNP25 | 0.111 | 0.029 | 0.084 | 0.084 | 0.145 | 0.024 | 0.064 | 0.064 | 0.559 | 0.198 | 0.265 | 0.265 |
Body weight was recorded on each animal at approximately 7 (Weight2), 9 (Weight3) and 12 (Weight4) months post-hatching.
Family-based QTLD analysis was performed with software SOLAR version 4.0 [24]. The sample included 40 FS families each with ∼17 progeny. Here, we show the asymptotic P-value for the test statistic distributed as a with 1 DF; the effective number of tests and multiple testing adjusted P-value was P = 0.00165 [65].
QTDT stands for quantitative trait disequilibrium test [21].
indicates significance at P<0.01.
indicates significance at P<0.05.
Summary of nuclear markers significantly associated/linked1 to growth traits and their annotations.
| SNP | MAF | HW | QTDT | Statistical Test(s) | Annotation | Location/amino acid change |
| nuSNP7 | 0.106 | NA | NA | 5,6,7,8 | Glucose phosphate isomerase b | 5′UTR |
| nuSNP1 | 0.354 | 1.00E−03 | NA | 5 | Enolase 3-1 | ORF/SYN |
| nuSNP8 | 0,142 | NA | NA | 6 | ATP2A1 calcium ATPase 3 | ORF/SYN |
| nuSNP17 | 0.313 | 1.20E−03 | NA | 5 | Myosin binding protein C | ORF/SYN |
| nuSNP20 | 0.033 | NA | NA | 6 | Myosin binding protein C | ORF/SYN |
| nuSNP21 | 0.036 | NA | <0.01 | 6,7 | Myosin binding protein C | ORF/SYN |
| nuSNP25 | 0.232 | NA | NA | 6,7,8 | Fast myotomal muscle actin 2 | ORF/SYN |
| nuSNP22 | 0.199 | NA | NA | 5 | Troponin C | 3′UTR |
| nuSNP23 | 0.189 | NA | NA | 5 | Troponin C | 3′UTR |
| nuSNP24 | 0.199 | NA | NA | 5 | Troponin C | 3′UTR |
| nuSNP27 | 0.138 | NA | <0.05 | 5 | Fast myotomal muscle troponin-T-2 | ORF/N→K |
| nuSNP29 | 0.235 | NA | NA | 5 | Taxilin beta muscle-derived protein 77 | ORF/G→A |
| nuSNP9 | 0.189 | 7.20E−03 | <0.05 | 5 | 60 S ribosomal protein L4-A | ORF/SYN |
| nuSNP12 | 0.337 | NA | NA | 6,7 | Unknown | Unknown |
Body weight was recorded on each animal at approximately 7 (Weight2), 9 (Weight3) and 12 (Weight4) months post-hatching. A family-based sample that included 40 full-sib families each with 17 progeny were genotyped with 30 SNPs. Summary statistics were obtained with program PLINK version 1.07 [22].
SNPs minor allele frequency (MAF).
SNPs showing deviation from Hardy-Weinberg equilibrium. Exact P-value estimated using 20,000 permutations.
QTDT population stratification test.
t-statistic for regression of phenotype on allele count P is the asymptotic P-value for t-statistic, was estimated using 20,000 permutations.
Genome-wide association analysis with family data (GWAF) R package [23].
Family-based Measured Genotype and QTLD analysis was performed with software SOLAR version 4.0 [24] or QTDT quantitative trait disequilibrium test [21].
Family-based Bayesian quantitative trait nucleotide (BQTN) analysis was performed with software SOLAR version 4.0 [24].
SNP annotation; gene name and SNP location (ORF/5′UTR/3′UTR), SYN = Synonymous, NON-SYN = Non-synonymous.
NA indicates statistically insignificant estimate.
Association of mitochondrial SNPs with weight1 using population-based association analysis2.
| Weight | SNP | Set1 | Set2 | Set3 | ||||||||||||
|
|
|
|
| FDR–BH |
|
|
|
| FDR–BH |
|
|
|
| FDR–BH | ||
| Weight2 | mtSNP6 | 0.05 | −1.3 | 0.177 | 0.181 | 0.21 | 0.12 | −2.1 | 0.041 | 0.0417 | 0.11 | 0.05 | −1.4 | 0.151 | 0.15 | 0.22 |
| mtSNP8 | 0.11 | −2.1 | 0.039 | 0.039 | 0.21 | 0.16 | −2.6 | 0.013 | 0.0134 | 0.11 | 0.03 | −1 | 0.302 | 0.3 | 0.31 | |
| mtSNP21 | 0.11 | −2.1 | 0.039 | 0.039 | 0.21 | 0.16 | −2.6 | 0.013 | 0.013 | 0.11 | 0 | −0.29 | 0.772 | 0.773 | 0.77 | |
| Weight3 | mtSNP8 | 0.1 | −2 | 0.052 | 0.053 | 0.34 | 0.09 | −1.9 | 0.055 | 0.057 | 0.11 | 0.21 | −3.1 | 0.004 | 0.004 | 0.05 |
| mtSNP21 | 0.02 | −0.8 | 0.393 | 0.39 | 0.43 | 0.11 | −1.9 | 0.062 | 0.063 | 0.11 | 0.25 | −3.4 | 0.002 | 0.002 | 0.04 | |
| Weight4 | mtSNP1 | 0.01 | −0.5 | 0.57 | 0.568 | 0.93 | 0.07 | −1.6 | 0.106 | 0.105 | 0.18 | 0.12 | −2 | 0.045 | 0.044 | 0.09 |
| mtSNP4 | 0.01 | −0.5 | 0.57 | 0.568 | 0.93 | 0.07 | −1.6 | 0.106 | 0.105 | 0.18 | 0.12 | −2 | 0.045 | 0.044 | 0.09 | |
| mtSNP7 | 0.01 | −0.5 | 0.57 | 0.568 | 0.93 | 0.06 | −1.4 | 0.145 | 0.143 | 0.2 | 0.13 | −2.1 | 0.041 | 0.040 | 0.09 | |
| mtSNP8 | 0.13 | −2.3 | 0.022 | 0.022 | 0.53 | 0.22 | −3.1 | 0.003 | 0.003 | 0.03 | 0.12 | −2.1 | 0.041 | 0.041 | 0.09 | |
| mtSNP15 | 0.01 | −0.5 | 0.57 | 0.568 | 0.93 | 0.07 | −1.6 | 0.106 | 0.105 | 0.18 | 0.12 | −2.1 | 0.045 | 0.044 | 0.09 | |
| mtSNP16 | 0 | −0.3 | 0.766 | 0.763 | 0.93 | 0.04 | −1.3 | 0.192 | 0.191 | 0.2 | 0.14 | −2.2 | 0.038 | 0.038 | 0.09 | |
| mtSNP21 | 0.07 | −1.6 | 0.114 | 0.113 | 0.93 | 0.27 | −3.4 | 0.001 | 0.002 | 0.03 | 0.13 | −2.2 | 0.037 | 0.036 | 0.09 | |
Body weight was recorded on each animal at approximately 7 (Weight2), 9 (Weight3) and 12 (Weight4) months post-hatching.
Population-based association analysis was performed with program PLINK version 1.07 [22]. From 40 full-sib families each with ∼17 progeny, a sibling was randomly sampled from each family to generate a population-based sample of n = 40 unrelated individuals; we repeated the random sampling to develop three sets of unrelated individuals. Here, t is the t-statistic for regression of phenotype on allele count (); R2 is the square of the multiple correlation coefficient which measures the proportion of total variation explained by the regression ; P is the asymptotic P-value for t-statistic; the empirical P-value was estimated using 20,000 permutations; and FDR-BH is the false discovery rate [27].
indicates significance at P<0.01.
indicates significance at P<0.05.
Summary of mitochondrial markers significantly associated 1 with growth traits and their annotations1.
| SNP | Physical position (bp) | MAF | Annotation4 | Location/amino acid change |
| mtSNP1 | 4116 | 0.49 | NADH dehydrogenase subunit 1 | ORF/SYN |
| mtSNP2 | 4323 | 0.49 | NADH dehydrogenase subunit 1 | ORF/SYN |
| mtSNP3 | 4647 | 0.48 | NADH dehydrogenase subunit 1 | ORF/SYN |
| mtSNP4 | 5212 | 0.48 | NADH dehydrogenase subunit 2 | ORF/SYN |
| mtSNP5 | 5275 | 0.48 | NADH dehydrogenase subunit 2 | ORF/SYN |
| mtSNP6 | 5530 | 0.48 | NADH dehydrogenase subunit 2 | ORF/SYN |
| mtSNP7 | 5740 | 0.49 | NADH dehydrogenase subunit 2 | ORF/SYN |
| mtSNP16 | 12423 | 0.47 | NADH dehydrogenase subunit 4 | ORF/V→ M |
| mtSNP17 | 13231 | 0.49 | NADH dehydrogenase subunit 5 | ORF/SYN |
| mtSNP18 | 13795 | 0.48 | NADH dehydrogenase subunit 5 | ORF/SYN |
| mtSNP19 | 14077 | 0.48 | NADH dehydrogenase subunit 5 | ORF/SYN |
| mtSNP20 | 14626 | 0.49 | NADH dehydrogenase subunit 5 | ORF/SYN |
| mtSNP21 | 15591 | 0.23 | Cytochrome b | ORF/SYN |
| mtSNP22 | 15822 | 0.49 | Cytochrome b | ORF/SYN |
| mtSNP23 | 16305 | 0.49 | Cytochrome b | ORF/SYN |
| mtSNP24 | 16317 | 0.48 | Cytochrome b | ORF/SYN |
| mtSNP8 | 7052 | 0.23 | Cytochrome c oxidase subunit 1 | ORF/SYN |
| mtSNP9 | 7193 | 0.50 | Cytochrome c oxidase subunit 1 | ORF/SYN |
| mtSNP10 | 8774 | 0.48 | Cytochrome c oxidase subunit 2 | ORF/SYN |
| mtSNP11 | 8804 | 0.48 | Cytochrome c oxidase subunit 2 | ORF/SYN |
| mtSNP14 | 9410 | 0.48 | ATPase 6 | ORF/SYN |
| mtSNP15 | 9656 | 0.49 | ATPase 6 | ORF/SYN |
| mtSNP12 | 9084 | 0.48 | ATPase 8 | ORF/SYN |
| mtSNP13 | 9087 | 0.49 | ATPase 8 | ORF/SYN |
Body weight was recorded on each animal at approximately 7 (Weight2), 9 (Weight3) and 12 (Weight4) months post-hatching. Population-based association analysis was performed with program PLINK version 1.07 [22]. From 40 full-sib families each with ∼17 progeny, a sibling was randomly sampled from each family to generate a population-based sample of n = 40 unrelated individuals; we repeated the random sampling to develop three sets of unrelated individuals.
Markers were positioned on mitochondrial genome by BLASTing sequences flanking markers against a rainbow trout mitochondrial reference sequence [33].
SNPs minor allele frequency (MAF). 4SNP annotation; gene name and SNP location (ORF/5′UTR/3′UTR), SYN = Synonymous.
Polymorphism of significantly associated/linked markers to growth traits in three aquaculture broodstocks.
| SNP | TL | CS | HF | ||||||
| A1 | A2 | MAF | A1 | A2 | MAF | A1 | A2 | MAF | |
| nuSNP1 | Assay failed | ||||||||
| nuSNP7 | C | A | 0.02 | C | A | 0.03 | C | A | 0.03 |
| nuSNP8 | Assay failed | ||||||||
| nuSNP9 | A | T | 0.3 | T | A | 0.37 | T | A | 0.23 |
| nuSNP12 | C | T | 0.08 | C | T | 0.25 | C | T | 0.1 |
| nuSNP17 | T | C | 0.38 | T | C | 0.1 | T | C | 0.06 |
| nuSNP20 | G | A | 0.25 | G | A | 0.15 | G | A | 0.31 |
| nuSNP21 | G | A | 0.25 | G | A | 0.15 | G | A | 0.3 |
| nuSNP22 | G | A | 0.31 | A | G | 0.17 | A | G | 0.19 |
| nuSNP23 | C | T | 0.31 | T | C | 0.18 | T | C | 0.19 |
| nuSNP24 | C | T | 0.31 | T | C | 0.17 | T | C | 0.19 |
| nuSNP25 | A | G | 0.13 | A | G | 0.35 | A | G | 0.16 |
| nuSNP27 | C | G | 0.06 | C | G | 0.33 | C | G | 0.47 |
| nuSNP29 | Monomorphic | ||||||||
| mtSNP1 | G | A | 0.45 | A | G | 0.23 | G | A | 0.31 |
| mtSNP4 | A | G | 0.42 | A | G | 0.23 | G | A | 0.31 |
| mtSNP6 | T | C | 0.44 | T | C | 0.17 | C | T | 0.37 |
| mtSNP7 | G | A | 0.43 | A | G | 0.27 | A | G | 0.25 |
| mtSNP8 | C | T | 0.14 | C | T | 0.07 | C | T | 0.06 |
| mtSNP15 | A | G | 0.42 | A | G | 0.21 | G | A | 0.31 |
| mtSNP16 | A | G | 0.45 | A | G | 0.23 | A | G | 0.25 |
| mtSNP21 | G | A | 0.11 | 0 | A | 0 | G | A | 0.15 |
markers were genotyped in 12 broodstocks (8 unrelated fish/stock); TL (Troutlodge, Inc), CS (Clear Springs Foods) and HF (Hagerman Fish Culture Experiment Station), 48 markers were polymorphic, the average MAF is 0.24.
indicate different minor alleles between different populations.
Figure 3NCCCWA genetic/physical map with positions of 19 nuSNPs polymorphic markers. nuSNPs were genotyped on mapping families from the NCCCWA.
Linkage groups were determined and nuSNPs were added to the NCCCWA genetic map [32]. Closest markers from the published map were determined using two-point linkage analyses.