| Literature DB >> 31447879 |
Dimitrios Kyriakis1,2,3, Alexandros Kanterakis2, Tereza Manousaki3, Alexandros Tsakogiannis3, Michalis Tsagris4, Ioannis Tsamardinos5, Leonidas Papaharisis6, Dimitris Chatziplis7, George Potamias2, Costas S Tsigenopoulos3.
Abstract
Gilthead sea bream (Sparus aurata) is a teleost of considerable economic importance in Southern European aquaculture. The aquaculture industry shows a growing interest in the application of genetic methods that can locate phenotype-genotype associations with high economic impact. Through selective breeding, the aquaculture industry can exploit this information to maximize the financial yield. Here, we present a Genome Wide Association Study (GWAS) of 112 samples belonging to seven different sea bream families collected from a Greek commercial aquaculture company. Through double digest Random Amplified DNA (ddRAD) Sequencing, we generated a per-sample genetic profile consisting of 2,258 high-quality Single Nucleotide Polymorphisms (SNPs). These profiles were tested for association with four phenotypes of major financial importance: Fat, Weight, Tag Weight, and the Length to Width ratio. We applied two methods of association analysis. The first is the typical single-SNP to phenotype test, and the second is a feature selection (FS) method through two novel algorithms that are employed for the first time in aquaculture genomics and produce groups with multiple SNPs associated to a phenotype. In total, we identified 9 single SNPs and 6 groups of SNPs associated with weight-related phenotypes (Weight and Tag Weight), 2 groups associated with Fat, and 16 groups associated with the Length to Width ratio. Six identified loci (Chr4:23265532, Chr6:12617755, Chr:8:11613979, Chr13:1098152, Chr15:3260819, and Chr22:14483563) were present in genes associated with growth in other teleosts or even mammals, such as semaphorin-3A and neurotrophin-3. These loci are strong candidates for future studies that will help us unveil the genetic mechanisms underlying growth and improve the sea bream aquaculture productivity by providing genomic anchors for selection programs.Entities:
Keywords: Genome Wide Association Study; Sparus aurata; aquaculture; double digest random amplified DNA; feature selection
Year: 2019 PMID: 31447879 PMCID: PMC6691846 DOI: 10.3389/fgene.2019.00675
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Selected SNPs from GWAs analysis using linear mixed models, with significance threshold equal to 10– 4.
| Position | Gene | P-value | Beta coefficient | Conserved | Position |
|---|---|---|---|---|---|
| Weight | |||||
| Chr1:6970078 | Lymphoid enhancer-binding factor 1 isoform X1 | 3.265E−5 | 174.721 | – | Downstream |
| Chr1:16636968 | Ethanolamine-phosphate cytidylyltransferase-like | 5.059E−5 | 189.556 | ✓ | 3’UTR |
| Chr1:20827142 | Mucin-5AC-like isoform X1 | 4.976E−5 | −161.835 | ✓ | Upstream |
| Chr16:2232897 | Acetylserotonin O-methyltransferase-like, LBH-like isoform X1 | 4.648E−5 | −338.149 | ✓ | 3’UTR |
| Chr6:12617755 | Transmembrane 199 | 3.838E−5 | 205.210 | ✓ | Upstream |
| myosin-7 like | Downstream | ||||
| Tag Weight | |||||
| Chr13:20975921 | RNA-binding 27 isoform X1 | 3.168E−5 | 4.748 | – | Intron |
| Chr13:20975924 | RNA-binding 27 isoform X1 | 3.168E−5 | 4.748 | – | Intron |
| Chr2:2623351 | Tetratricopeptide repeat 36 | 2.823E−5 | 6.183 | – | Upstream |
| Chr22:18343985 | tectonin beta-propeller repeat-containing 2 | 5.405E−5 | −5.139 | – | Upstream |
Figure 1(A) Distribution of each examined trait in our samples. (B) Manhattan plot demonstrating the locations across the chromosomes of the sea bream genome (horizontal axis) versus the –log (p-values) of the association between the genetic variants and phenotype (vertical axis). The higher the dots, the stronger the genetic association. The significance threshold was set to 10–4, in order to correct for multiple testing (dashed line). The different colors represent the different chromosomes. (C) Quantile–quantile (QQ) plot of the data shown in the Manhattan plot. The grey area represents the 95% simultaneous confidence bands. Red line is the diagonal (Y = X) or else how the observed data should be placed if they were normally distributed.
Figure 2Comparison of different algorithms predicting the traits of interest, based on median squared error, after leave one out cross validation. SES algorithm tested for different thresholds (Threshold equal to 0.01 or 0.05) and for different numbers of SNPs as condition set (k = 2, 3, 4, 5). OMP algorithm tested for different thresholds as stop criterion (Threshold = 2 or 4 units in BIC score).
Selected SNPs from SES algorithm with significance threshold equal to 0.05 (best method based on median squared error score).
| Variables | Locus | P-value | Beta coefficient | Threshold | GWAS | Conserved | Position |
|---|---|---|---|---|---|---|---|
| Fat | |||||||
| Chr13:1098152 | Rho-related GTP-binding -like | 0.007 | 1.60 | 0.01 | – | – | 3’ UTR |
| Chr21:19924408 | – | 0.006 | −1.238 | 0.01 | – | – | – |
| Chr8: 1385781 | Protection of telomeres 1 | 0.0024 | 1.55 | 0.01 | – | ✓ | Intron |
| Scaffol8147:18634 | Death-associated kinase 3-like | 0.015 | 0.7 | 0.05 | – | ✓ | Intron |
| Chr7:2453106 | Solute carrier family 41 member 1-like isoform X1-2 | 0.046 | 0.86 | 0.05 | – | – | Intron |
| Chr4:23265532 | NT-3 growth factor receptor isoform X1 | 0.017 | −2.25 | 0.05 | – | – | Upstream |
Selected SNPs from SES algorithm with significance threshold equal to 0.05 (best method based on median squared error score).
| Variables | Locus | P-value | Beta coefficient | Threshold | GWAS | Conserved | Position |
|---|---|---|---|---|---|---|---|
| Length/Width | |||||||
| Chr6:23799286 | Phosphatase 1 regulatory subunit 3D-like | 0.0052 | 0.0397 | 0.01 | – | ✓ | 3d |
| Chr1:20827142 | Upstream: mucin-5AC-like isoform X1 | 0.049 | 0.026 | 0.01 | – | ✓ | Upstream |
| Chr13:9665394 | ATP-dependent RNA helicase DHX33 | 0.0211 | 0.048 | 0.01 | – | ✓ | 3d |
| Chr3:9671223 | A-kinase anchor 9 isoform X3 | 0.0144 | −0.0597 | 0.01 | – | ✓ | 2nd |
| Scaffold13177:8369 | Phosphatase 1 regulatory subunit 3C | 0.015 | 0.057 | 0.01 | – | ✓ | Downstream |
| Chr8:11613979 | Semaphorin-3A | 0.0193 | −0.025 | 0.05 | – | ✓ | Intron |
| Chr22:2545133 | Neurexin-3b isoform X3 | 0.049 | −0.029 | 0.05 | – | – | Intron |
| Scaffold5661:35982 | – | 0.049 | 0.031 | 0.05 | – | ✓ | – |
Figure 3The effect of each of the selected SES SNPs associated with fat content. (A-C) Boxplots of selected SNPs. (A) chr8:1385781, (B) chr13:1098152, (C) chr21:19924408.
Figure 6The effect of each of the selected SES SNPs associated with length/width. (A-E) Boxplots of selected SNPs. (A) chr1:20827142, (B) chr3:9671223, (C) chr6:23799286, (D) chr13:9665394, (E) scaffold5661:35982.
Selected SNPs from SES algorithm with significance threshold equal to 0.05 (best method based on median squared error).
| Variables | Locus | P-value | Beta coefficient | Threshold | GWAS | Conserved | Position |
|---|---|---|---|---|---|---|---|
| Weight | |||||||
| Chr1:16636968 | Ethanolamine-phosphate, cytidylyltransferase-like | 0.0006 | 121.84 | 0.01 | ✓ | ✓ | 3’ UTR |
| Chr6:12617755 | Myosin-7-like isoform X1,short transient receptor potential channel 4-associated | 0.0024 | 138.07 | 0.01 | ✓ | ✓ | Upstream |
| Chr8:11613979 | Semaphorin-3A | 0.0114 | 99 | 0.01 | – | ✓ | Intron |
| Chr16:2232897 | Acetylserotonin O-methyltransferase-like, LBH-like isoform X1 | 0.0022 | −193 | 0.01 | ✓ | – | 3’ UTR |
| Scaffold29:195838 | Mitogen-activated kinase-binding 1-like | 0.0285 | 64.669 | 0.05 | – | – | Intron |
| Chr24:8282385 | STE20-related kinase adapter beta | 0.0022 | 160.80 | 0.05 | – | ✓ | Downstream |
| Trafficking kinesin-binding 2 isoform X1 | Upstream | ||||||
Selected SNPs from SES algorithm with significance threshold equal to 0.05 (best method based on median squared error score).
| Variables | Locus | P-value | Beta coefficient | Threshold | GWAS | Conserved | Position |
|---|---|---|---|---|---|---|---|
| Tag Weight | |||||||
| Chr2:2623351 | Tetratricopeptide repeat 36 | 0.0019 | 4.577 | 0.01 | ✓ | – | Upstream |
| Chr13:20883924 | DNA repair RAD50 | 0.0127 | 2.678 | 0.01 | – | ✓ | Intron |
| Chr13:20975921 | RNA-binding 27 isoform X1 | 0.0073 | 1.810 | 0.01 | ✓ | – | Intron |
| Chr22:18343985 | Zinc finger BED domain-containing 4-like | 0.0117 | −1.967 | 0.01 | ✓ | – | Upstream |
| Midasin isoform X2 | Downstream | ||||||
| Scaffold4139:36071 | Predicted uncharacterized protein LOC106518831, partial | 0.033 | −0.634 | 0.01 | – | ✓ | Upstream |
| Chr15:3260819 | Follistatin-related 1-like | 0.0124 | 3.106 | 0.05 | – | – | Downstream |
| Chr20:6671436 | UBA-like domain-containing 1 | 0.021 | 2.665 | 0.05 | – | ✓ | 2nd |
| Chr22:14483563 | Exostosin-1-like | 0.0448 | 4.898 | 0.05 | – | – | Intron |
| Scaffold14083:12192 | – | 0.042 | −1.349 | 0.05 | – | – | - |