| Literature DB >> 32872430 |
Cécile Triay1,2, Matthew A Conte3, Jean-François Baroiller1,2, Etienne Bezault4, Frances E Clark3, David J Penman5, Thomas D Kocher3, Helena D'Cotta1,2.
Abstract
In domesticated strains of the Nile tilapia, phenotypic sex has been linked to genetic variants on linkage groups 1, 20 and 23. This diversity of sex-loci might reflect a naturally polymorphic sex determination system in Nile tilapia, or it might be an artefact arising from the process of domestication. Here, we searched for sex-determiners in wild populations from Kpandu, Lake Volta (Ghana-West Africa), and from Lake Koka (Ethiopia-East Africa) that have not been subjected to any genetic manipulation. We analysed lab-reared families using double-digest Restriction Associated DNA sequencing (ddRAD) and analysed wild-caught males and females with pooled whole-genome sequencing (WGS). Strong sex-linked signals were found on LG23 in both populations, and sex-linked signals with LG3 were observed in Kpandu samples. WGS uncovered blocks of high sequence coverage, suggesting the presence of B chromosomes. We confirmed the existence of a tandem amh duplication in LG23 in both populations and determined its breakpoints between the oaz1 and dot1l genes. We found two common deletions of ~5 kb in males and confirmed the presence of both amhY and amh∆Y genes. Males from Lake Koka lack both the previously reported 234 bp deletion and the 5 bp frameshift-insertion that creates a premature stop codon in amh∆Y.Entities:
Keywords: Nile tilapia; Y-chromosome; amh; male duplication; sex-determinant
Mesh:
Substances:
Year: 2020 PMID: 32872430 PMCID: PMC7563666 DOI: 10.3390/genes11091017
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Map of Africa showing Ghana in West Africa with the location of the Kpandu population in Lake Volta (with medium to hot temperatures) and Lake Koka (a cold lake) located in the Ethiopian highlands of the East African Rift.
Fish samples were used in this study.
| Approach | Population | Family | Sire | Dam | Sex-Ratio | Sampled Males | Sampled Females | Total Analyzed |
|---|---|---|---|---|---|---|---|---|
| ddRAD sequencing | Kpandu | Kp-20 | KpM17 | KpF32 | 49% (NS) | 15 | 14 | 31 |
| Koka | Kk-6 | KkM39 | KkF4 | 55% (NS) | 15 | 14 | 31 | |
| Whole Genome Sequencing | Kpandu | 27 | 27 | 54 | ||||
| Koka | 20 | 14 | 34 |
Chi-square tests of observed vs. expected (1:1). NS for non-significant.
Figure 2Amh genotyping of the Kpandu and Koka breeders used in ddRAD and a subset of individuals used in the whole-genome sequencing (WGS) pools. The complete genotyping of all individuals is shown in Figures S1 and S2.
Sex chromosomic LG23 amh markers analysed in the Kpandu and Koka wild-caught fish to predict the sexual genotype. The Manzala strain with known sex genotypes was used to validate the amplification.
| Population or Strain | Phenotype | N Analyzed | Genotype | Predicted Genotype | ||||
|---|---|---|---|---|---|---|---|---|
| Domestic strain | Manzala | Females | 1 | XX |
|
|
| XX |
| Males | 1 | XY |
|
|
| XY | ||
| 1 | YY |
|
|
| YY | |||
| Wild populations | Kpandu | Females | 27 |
|
|
| XX | |
| Males | 24 |
|
|
| XY | |||
| 3 |
|
|
| XX | ||||
| Koka | Females | 14 |
|
|
| XX | ||
| Males | 8 |
|
|
| XY | |||
| 9 |
|
|
| ND | ||||
Figure 3Manhattan plots of log10-transformed p-values from Fisher’s Exact Test (FET) and AMOVA F of the association of single nucleotide polymorphisms (SNPs) with phenotypic sex. (A) Plots for the Kpandu and the Koka families for all LGs. Unanchored contigs/scaffolds are grouped under the label “Unanch”. (B) Plots for the LG23 region showing SNP sites that were associated with sex. Most of the stronger sex-signals are found between 24 and 40 Mb. (C) Plots for Kpandu family with a close up of the LG3 region showing a very localised signal at 20 Mb due to a RAD locus in the fcgr2b gene. The rest of the signal observed on LG3 comes from the “long arm” of the chromosome known to carry repeated elements. Red lines correspond to p-values of 0.01 in the FET plots while blue lines correspond to mean AMOVA F between males and females of each family.
Figure 4Genotypes of the offspring compared to their parents inferred from ddRAD FET significant sites following an XX/XY pattern. Pink cells indicate an individual having the same genotype as their mother at this site whereas blue cells indicate an individual having the same genotype as their father at this site (dashed cells indicate homozygous YY sites). White cells are missing data. Only sites having both parents and at least half males and females of the offspring sequenced were kept here. Those tested for a ZZ/ZW pattern are shown in Figure S3.
Figure 5SNPs associated with phenotypic sex obtained from the WGS. (A) Manhattan plots of FST for Kpandu and Koka populations obtained with Sex SNP Finder. High FST peaks are observed on every LG meaning there is a signal associated to sex on all chromosomes; (B) Close up of FST on LG23 for Koka showing three high Fst regions in grey and green boxes; (C) Integrative Genomics Viewer (IGV) plot of sequence coverage at ~34.14 Mb on LG23 corresponding to the green box above for both sexes of the two wild populations. High FST values in this region are due to a very high sequence coverage (>10 times the mean coverage) likely corresponding to repetitive sequences on the B chromosome. SNPs are represented by colour traits with a bicolour trait being a polymorphic site, with the size of the colour band proportional to the nucleotide frequency. Frequencies of SNPs are conserved between Kpandu males and Koka females.
Figure 6Schematic representation of the duplicated region of LG23 and its structural variants and polymorphic sites found with DELLY and FreeBayes. The tandem duplication includes the amh gene and part of the oaz1 and dot1l genes. The duplicated regions in each population (in green) are shown with the corresponding deletions and suggested size (in red). The shared male SNPs are shown in blue bars while the population male-specific SNPs are shown in light grey bars. (A) Schematic view of mapping results of the Y chromosome by WGS of the Kpandu and Koka males against the female XX reference genome. (B) Schematic representation of the two possible places of insertion of the tandem duplication in oaz1 and dot1l genes. Both large deletions found with DELLY are likely to be placed in the oaz1-dot1l chimera as the genes are already truncated on the Y haplotype. (C) Schematic representation of the inferred Y haplotype of Kpandu and Koka males.
Structural variants located in the oaz1-dot1l region of LG23 in wild populations of Nile tilapia found with DELLY on whole-genome sequencing data.
| Populations | Start Position | End Position | Confidence Interval around Start & End | Type | Size of Structural Variant | Shared between Populations |
|---|---|---|---|---|---|---|
| Koka | 34491225 | 34512737 | −50,50 | Duplication | 21512 | Yes |
| 34493315 | 34498588 | −390,390 | Deletion | 5273 | Yes | |
| 34503117 | 34509103 | −852,852 | Deletion | 5986 | Yes | |
| Kpandu | 34491225 | 34512741 | −50,50 | Duplication | 21516 | Yes |
| 34491681 | 34491957 | −2,2 | Deletion | 276 | No | |
| 34493315 | 34498548 | −150,150 | Deletion | 5233 | Yes | |
| 34499596 | 34499830 | −3,3 | Deletion | 234 | No | |
| 34503494 | 34509103 | −2,2 | Deletion | 5609 | Yes |
SNPs and small indels found in the sex-determining oaz1-dot1l region on the Y haplotype between males and females of both populations.
| Position | Reference | Alternative | Length | Type | Gene | Intron/Exon | Change of Amino Acid | Found in Literature |
|---|---|---|---|---|---|---|---|---|
| 34491252 | C | G | 1 | snp |
| Exon | ||
| 34491407 | A | C | 1 | snp |
| Exon | Leu → Gly | |
| 34491477 | C | T | 1 | snp |
| Intron | ||
| 34491574 | T | C | 1 | snp |
| Exon | Lys → Glu | Cáceres et al., 2019 [ |
| 34492052 | A | G | 1 | snp |
| Intron | ||
| 34492058 | T | C | 1 | snp |
| Intron | ||
| 34492381 | A | G | 1 | snp |
| Intron | ||
| 34493190 | G | C | 1 | snp |
| Intron | ||
| 34493277 | A | G | 1 | snp |
| Intron | ||
| 34498917 | A | G | 1 | snp | LOC100707471 | Intron | ||
| 34499201 | CTAT | TTAC | 4 | complex |
| 3’ UTR | ||
| 34499475 | A | G | 1 | snp |
| 3’ UTR | ||
| 34499597 | C | T | 1 | snp |
| Exon 7 | ||
| 34499645 | A | C | 1 | snp |
| Exon 7 | ||
| 34499675 | A | G | 1 | snp |
| Exon 7 | ||
| 34499706 | G | C | 1 | snp |
| Exon 7 | Thr → Ser | |
| 34499810 | A | G | 1 | snp |
| Exon 7 | ||
| 34499839 | C | T | 1 | snp |
| Exon 7 | Ala → Thr | |
| 34499987 | C | T | 1 | snp |
| Intron | ||
| 34499994 | A | C | 1 | snp |
| Intron | ||
| 34500307 | C | T | 1 | snp |
| Intron | ||
| 34500348 | T | C | 1 | snp |
| Intron | ||
| 34500364 | G | C | 1 | snp |
| Intron | ||
| 34500464 | G | A | 1 | snp |
| Intron | ||
| 34500668 | C | A | 1 | snp |
| Intron | ||
| 34500750 | G | A | 1 | snp |
| Intron | ||
| 34500773 | C | T | 1 | snp |
| Exon 6 | Glu → Lys | |
| 34501082 | T | C | 1 | snp |
| Exon 6 | Thr → Ala | Cáceres et al., 2019 [ |
| 34501555 | C | T | 1 | snp |
| Intron | ||
| 34502034 | T | C | 1 | snp |
| Exon 3 | Asn → Ser | Cáceres et al., 2019 [ |
| 34502075 | T | A | 1 | snp |
| Exon 3 | Li et al., 2015 [ | |
| 34502169 | ATG | AGTG | 1 | insertion |
| Intron | ||
| 34502196 | G | A | 1 | snp |
| Intron | ||
| 34502353 | A | C | 1 | snp |
| Exon 2 | Asp → Glu | Li et al., 2015 [ |
| 34502501 | T | C | 1 | snp |
| Intron | ||
| 34502686 | T | C | 1 | snp |
| 5’ UTR | ||
| 34502748 | T | C | 1 | snp |
| 5’ UTR | Cáceres et al., 2019 [ | |
| 34502756 | C | T | 1 | snp |
| 5’ UTR | Cáceres et al., 2019 [ | |
| 34502954 | G | A | 1 | snp | Non Coding DNA | |||
| 34509735 | T | C | 1 | snp |
| 3’ UTR | ||
| 34509898 | T | A | 1 | snp |
| 3’ UTR | ||
| 34509976 | T | C | 1 | snp |
| 3’ UTR | ||
| 34510100 | G | T | 1 | snp |
| 3’ UTR | ||
| 34510131 | C | G | 1 | snp |
| 3’ UTR | ||
| 34510239 | TTTAACT | TT | 5 | deletion |
| 3’ UTR | ||
| 34510250 | A | C | 1 | snp |
| 3’ UTR | ||
| 34510260 | A | G | 1 | snp |
| 3’ UTR | ||
| 34510309 | A | G | 1 | snp |
| 3’ UTR | ||
| 34510392 | A | G | 1 | snp |
| 3’ UTR | ||
| 34510428 | GACA | GA | 2 | deletion |
| 3’ UTR | ||
| 34510495 | T | G | 1 | snp |
| 3’ UTR | ||
| 34510529 | C | T | 1 | snp |
| 3’ UTR | Cáceres et al., 2019 [ | |
| 34510747 | G | C | 1 | snp |
| 3’ UTR | ||
| 34510768 | TGTGTGCG | TG | 6 | deletion |
| 3’ UTR | ||
| 34510842 | CTTTTTTTTTTTTTTC | CTTTTTTTTTC, CTTTTTTTTC | 5,6 | deletion |
| 3’ UTR | ||
| 34511324 | TAAATG | TAATG | 1 | deletion |
| 3’ UTR | ||
| 34511937 | T | C | 1 | snp |
| Exon | ||
| 34511950 | A | T | 1 | snp |
| Exon | Leu → His | |
| 34511973 | T | C | 1 | snp |
| Exon | ||
| 34512585 | ACAA | GCAG | 4 | complex |
| Intron |