| Literature DB >> 33077766 |
Clémence Fraslin1,2, Florence Phocas3, Anastasia Bestin4, Mathieu Charles5,6, Maria Bernard5,6, Francine Krieg5, Nicolas Dechamp5, Céline Ciobotaru5, Chris Hozé5,7, Florent Petitprez8, Marine Milhes9, Jérôme Lluch9, Olivier Bouchez9, Charles Poncet10, Philippe Hocdé11, Pierrick Haffray4, Yann Guiguen12, Edwige Quillet5.
Abstract
Rainbow trout has a male heterogametic (XY) sex determination system controlled by a major sex-determining gene, sdY. Unexpectedly, a few phenotypically masculinised fish are regularly observed in all-female farmed trout stocks. To better understand the genetic determinism underlying spontaneous maleness in XX-rainbow trout, we recorded the phenotypic sex of 20,210 XX-rainbow trout from a French farm population at 10 and 15 months post-hatching. The overall masculinisation rate was 1.45%. We performed two genome-wide association studies (GWAS) on a subsample of 1139 individuals classified as females, intersex or males using either medium-throughput genotyping (31,811 SNPs) or whole-genome sequencing (WGS, 8.7 million SNPs). The genomic heritability of maleness ranged between 0.48 and 0.62 depending on the method and the number of SNPs used for the estimation. At the 31K SNPs level, we detected four QTL on three chromosomes (Omy1, Omy12 and Omy20). Using WGS information, we narrowed down the positions of the two QTL detected on Omy1 to 96 kb and 347 kb respectively, with the second QTL explaining up to 14% of the total genetic variance of maleness. Within this QTL, we detected three putative candidate genes, fgfa8, cyp17a1 and an uncharacterised protein (LOC110527930), which might be involved in spontaneous maleness of XX-female rainbow trout.Entities:
Mesh:
Year: 2020 PMID: 33077766 PMCID: PMC7573577 DOI: 10.1038/s41598-020-74757-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of GWAS analyses used to detect QTLs associated with spontaneous maleness in XX-rainbow trout.
| Name | Software | Number of SNPs in the analysis | Number of SNPs in GRM | π |
|---|---|---|---|---|
| GCTA_chip | GCTA | 30,811 | 30,811 | – |
| GCTA_seq | GCTA | 8,765,613 | 275,283 | – |
| BCπ-chip | BESSiE | 30,811 | 275,283 | 1% |
| BCπ-seq | BESSiE | 21,149 | 275,283 | 0.02% |
π is the proportion of SNPs with a non-zero-effect included at each cycle k of the MCMC algorithm.
GRM genomic relationship matrix.
Figure 1Total number of genotyped offspring (females, males and intersex) for each of the 50 dams (AA to BX) used in the mating design. The 22 dams with more than 10 genotyped offspring and extreme masculinisation rate selected for in-depth analysis of their genotypes are identified by a black underlying line.
Phenotyping and genotyping dataset from the XX-rainbow trout produced at “Les fils de Charles Murgat” farm.
| Temperature treatment (°C) and age at phenotyping | 12° | 12° | Total 12° | 18° | 18° | Total 18° | Total |
|---|---|---|---|---|---|---|---|
| Total number of phenotyped fish | |||||||
| Total number of genotyped fish | |||||||
| Phenotyped and genotyped females | 6435 (305) | 3336 (258) | 6616 (0) | 3445 (295) | |||
| Phenotyped and genotyped intersex | 45 | 33 | 24 | 30 | |||
| Phenotyped and genotyped males | 65 | 62 | 17 | 18 | |||
| Phenotyped undetermined individuals | 15 | 25 | 24 | 20 | |||
| Proportion of masculinised individuals (intersex + males) | 1.68% | 2.75% | 0.61% | 1.37% |
mpf months post-fertilisation.
Total number of fish are highlighted in bold.
Estimates of genetic and genomic parameters for spontaneous maleness under the different statistical models.
| Analysis | ||||||
|---|---|---|---|---|---|---|
| GCTA-chip | 0.48 ± 0.04 | 0.43 ± 0.02 | 0.22 ± 0.01 | 0.21 ± 0.03 | - | - |
| BCπ-chip* | 0.56 ± 0.05 | 0.43 ± 0.02 | 0.19 ± 0.02 | 0.24 ± 0.03 | 0.10 ± 0.03 | 0.14 ± 0.03 |
| GCTA-seq | 0.62 ± 0.06 | 0.42 ± 0.02 | 0.16 ± 0.02 | 0.26 ± 0.03 | - | - |
| BCπ-seq* | 0.59 ± 0.05 | 0.44 ± 0.02 | 0.17 ± 0.02 | 0.25 ± 0.03 | 0.21 ± 0.03 | 0.04 ± 0.01 |
h2G: genomic heritability, calculated as . : total genetic variance ), : polygenic variance, : genetic variance explained by SNPs, : residual variance, : phenotypic variance ).
*Value of one MCMC chain among the two used for GWAS, the chain with the closest final π to the 1% or 0.02% was chosen.
Detection of QTLs associated with spontaneous maleness in XX-rainbow trout with GCTA-chip and BCπ-chip methods.
| Chromosome | QTL start (Mb) | QTL end (Mb) | Peak SNP name | Peak SNP position (Mb) | Peak − log10(P) or logBF | % | Method |
|---|---|---|---|---|---|---|---|
| 1 | 62.93206 | 63.73809 | Affx-88953453 | 63.53900 | 7.9 | – | GCTA-chip |
| 1 | 64.42011 | 64.63252 | Affx-88916383 | 64.42011 | 11.1 | – | GCTA-chip |
| 1 | 63.45244 | 65.16399 | Affx-88950822 | 64.63252 | 11.3 | 3.86% | BCπ-chip |
| 12 | 6.10355 | 6.79990 | Affx-88940013 | 6.79990 | 8.1 | 0.62% | BCπ-chip |
| 20 | 31.35239 | 31.35239 | Affx-88916019 | 31.35239 | 8.2 | 0.40% | BCπ-chip |
The QTL start and end positions correspond to the confidence and credibility intervals for GCTA-chip and BCπ-chip respectively. For BCπ-chip, the logBF is calculated as twice the natural logarithm of Bayes Factor. % : proportion of total genetic variance explained by the QTL calculated as the sum of the variance estimated by the BCπ-chip analysis of all SNPs within the QTL credibility interval.
Summary of the characteristics of the two QTLs associated with maleness in XX-rainbow trout detected using GCTA-seq and BCπ-seq methods.
| Chromosome | QTL start (Mb) | QTL end (Mb) | QTL size (kb) | Peak SNP name | Peak SNP position (Mb) | Peak SNP − log10(P) or logBF | % | Method |
|---|---|---|---|---|---|---|---|---|
| 1 | 63.229379 | 63.774478 | 545.1 | Omy1-63493395 | 63.493395 | 10.61 | – | GCTA-seq |
| 1 | 64.360291 | 64.707100 | 341.6 | Omy1-64597800 | 64.597800 | 13.74 | – | GCTA-seq |
| 1 | 63.459553 | 63.555809 | 96.27 | Omy1-63520900 | 63.542090 | 12.97 | 0.56% | BCπ-seq1 |
| 1 | 64.266321 | 64.649580 | 383.26 | Omy1-64607018 | 64.607018 | 14.67 | 13.55% | BCπ-seq1 |
| 1 | 63.459545 | 63.727071 | 267.53 | Omy1-63520900 | 63.542090 | 11.55 | 0.51% | BCπ-seq2 |
| 1 | 64.087947 | 64.632497 | 544.55 | Omy1-64610233 | 64.610233 | 17.32 | 14.56% | BCπ-seq2 |
For GCTA-seq, the start and stop positions of the first QTL correspond to a CI estimated with a drop-off method and the start and stop positions of the second QTL correspond to the credibility interval estimated as the union of confidence intervals of four QTLs. The BCπ-seq1 and BCπ-seq2 correspond to the same analysis run with two different seeds for MCMC initialization. Significance of peak SNP is calculated as the –log10(P-value) for GCTA-seq, and as logBF estimated as twice the natural logarithm of Bayes Factor for BCπ-seq. % : proportion of total genetic variance explained by the QTL calculated as the sum of the variance estimated by BCπ-seq analyses of all SNPs within the QTL credibility interval.
Figure 2Annotated SNPs located within the first QTL region from 63.549 to 63.556 Mb on Omy1. The significance of SNP effect (dots) is given by the value of – log10(p-value) estimated with the GCTA-seq method. The dark blue line corresponds to the 1% threshold at the genome wide level. Only SNPs with at least low putative effects on genes (estimated by SNPEff) are represented. The positions of the five genes located within the QTL region are figured by rectangles: banf1 (barrier-to-autointegration factor-like), pygb (glycogen phosphorylase, brain form), abhd12 (alpha/beta-Hydrolase domain containing 12), gins1 (DNA replication complex GINS protein PSF1), ninl (ninein-like protein).
Figure 3Annotated SNPs located within the second QTL region, from 64.360 to 64.633 Mb on Omy1. The significance of SNP effect (dots) is given by the – log10(p-value) estimated with the GCTA-seq method. The dark blue line corresponds to the 1% threshold at the genome wide level. Only SNPs with at least low putative effects on genes (estimated by SNPEff) are represented. Genes located within this QTL are represented by rectangle: slc2a15a (solute carrier family 2, facilitated glucose transporter member 9-like), fgf8 (fibroblast growth factor 8), fbxw4 (F-box/WD repeat-containing protein 4), wbp1l (WW domain binding protein 1-like), cyp17a1 (Cytochrome P450 Family 17 Subfamily A Member 1), borcs7 (BLOC-1-reltaed complex subunit 7), nt5c2 (cytosolic purine 5′-nucleotidase), ina (alpha-internexin), pcgf6 (polycomb group RING finger protein 6), zgc:175214 (RING finger protein 122-like), calhm2 (calcium homeostasis modulator protein 2), hells (lymphocyte-specific helicase-like), LOC110527930 (uncharacterized protein).