| Literature DB >> 32293027 |
Sunnvør Í Kongsstovu1,2,3, Hans Atli Dahl1, Hannes Gislason2, Eydna Homrum4, Jan Arge Jacobsen4, Paul Flicek3, Svein-Ole Mikalsen2.
Abstract
The sex determination system of Atlantic herring Clupea harengus L., a commercially important fish, was investigated. Low coverage whole-genome sequencing of 48 females and 55 males and a genome-wide association study revealed two regions on chromosomes 8 and 21 associated with sex. The genotyping data of the single nucleotide polymorphisms associated with sex showed that 99.4% of the available female genotypes were homozygous, whereas 68.6% of the available male genotypes were heterozygous. This is close to the theoretical expectation of homo/heterozygous distribution at low sequencing coverage when the males are factually heterozygous. This suggested a male heterogametic sex determination system in C. harengus, consistent with other species within the Clupeiformes group. There were 76 protein coding genes on the sex regions but none of these genes were previously reported master sex regulation genes, or obviously related to sex determination. However, many of these genes are expressed in testis or ovary in other species, but the exact genes controlling sex determination in C. harengus could not be identified.Entities:
Keywords: zzm321990Clupea harengus; Atlantic herring; genome-wide association study; sex determination
Mesh:
Year: 2020 PMID: 32293027 PMCID: PMC7115899 DOI: 10.1111/jfb.14349
Source DB: PubMed Journal: J Fish Biol ISSN: 0022-1112 Impact factor: 2.051
Number of reads generated by low coverage sequencing and coverage of the 850 Mb Atlantic herring C. harengus genome
| No. of reads | Coverage | |||||||
|---|---|---|---|---|---|---|---|---|
| Pre QC |
| Post QC |
| Pre QC |
| Post QC |
| |
| Total | 2,094,755,946 | 19,577,158.4 | 1,549,740,080 | 14,483,552.2 | 394.3 | 3.8 | 267.1 | 2.6 |
| Female | 945,494,924 | 19,295,814.8 | 708,052,772 | 14,450,056.6 | 178.0 | 3.7 | 122.5 | 2.6 |
| Male | 1,149,261,022 | 20,162,474.1 | 841,687,308 | 14,766,444.0 | 216.3 | 3.9 | 144.7 | 2.6 |
Note: Quality control consisted of trimming of low‐quality sequences and adapter sequences (see method). QC, quality control; , average per individual.
Regions of the Atlantic herring C. harengus genome and number of SNPs associated with sex that were identified in the GWAS
| Chromosome | Position | No. of SNPs |
|---|---|---|
| 8 | 21,063,400–22,268,779 | 488 |
| 21 | 17,047,390–17,055,230 | 41 |
FIGURE 1Manhattan plot showing –log of the P values from the GWAS investigating sex determination regions on the Atlantic herring Clupea harengus genome. The horizontal line indicates the genome‐wide significance threshold [−log10(P) = 8.2]
FIGURE 2Genotypes for the SNPs significantly associated with sex in Atlantic herring Clupea harengus. The dark blue and red vertical lines represent male and female individuals, respectively. The homozygous reference allele genotypes are light blue (). The homozygous alternative allele genotypes are orange (). The heterozygous genotypes are green (). No genotyping data available is black ()
Genotype count for the 529 SNPs associated with sex in Atlantic herring C. harengus
| Genotype | Females | Males | Total |
|---|---|---|---|
| Homozygous (reference + alternative) | 17,161 (16,418 + 743) | 6694 (3522 + 3172) | 23,855 |
| Heterozygous | 106 | 14,639 | 14,745 |
| Total | 17,267 | 21,333 | 38,600 |
Average coverage for the individual SNPs associated with sex in Atlantic herring C. harengus
| Genotype | Females | Males | ||||
|---|---|---|---|---|---|---|
| Average | S.D. |
| Average | S.D. |
| |
| Homozygous reference allele | 4.40 | 2.91 | 16,418 | 3.36 | 1.28 | 3522 |
| Homozygous alternative allele | 4.06 | 0.80 | 743 | 2.94 | 1.02 | 3172 |
| Heterozygous | 5.17 | 0.39 | 106 | 5.25 | 3.26 | 14,639 |
Note: n, number of samples; S.D., standard deviation.
FIGURE 3The experimentally observed proportions of homozygous female and male genotypes of SNPs associated with sex in Atlantic herring Clupea harengus versus read coverage (x) and the corresponding theoretically expected probabilities and . Error bars correspond to 95% confidence intervals from the binomial test. Observed: () female, () male; Expected: () female, () male
Atlantic herring C. harengus genes on the genomic regions associated with sex, together with their orthologs and possible link to sex determination or sex‐related functions
|
| Orthologous gene | Orthologous species | Reason | Reference |
|---|---|---|---|---|
|
|
|
| Highest expression levels in testis | (Bastian |
|
|
|
| Highest expression levels in testis | (Bastian |
|
|
|
| Highest expression levels in mature ovarian follicle | (Bastian |
|
|
|
| Highest expression levels in mature ovarian follicle | (Bastian |
|
|
|
| X‐linked | (Cason |
|
|
|
| X‐linked | (Cantagrel |
|
|
|
| Specifically expressed in the testis | (Taira |
|
|
| Expressed in 29 organs, with the highest expression level in mature ovarian follicles | (Bastian | |
|
|
|
| Expressed in the testis and plays a role in fertilization | (Zhang |
|
|
|
| Plays a role in oocyte maturation | (Wu |
|
| Plays a role in sexual maturation in male sea lamprey | (Bryan | ||
|
|
|
|
| (Kagermeier‐Schenk |
|
|
|
| When mutated, fish have enlarged testes and accumulation of immature oocytes | (Neumann |
Location of the SNPs associated with sex in Atlantic herring C. harengus
| Location of SNPs associated with sex | Number of SNPs |
|---|---|
| Intergenic regions | 151 |
| Promoter regions | 105 |
| 5′ untranslated regions | 10 |
| 3′ untranslated regions | 39 |
| Splice sites | 0 |
| Introns | 167 |
| Coding regions | 57 |
| Synonymous SNPs | 27 |
| Nonsynonymous SNPs | 30 |
| Conservative amino acid substitutions | 12 |
| Nonconservative amino acid substitutions | 18 |
2000 bp upstream and 200 bp downstream of genes.
Details of these nonconservative nonsynonymous substitutions are listed in Table 7.
Nonconservative nonsynonymous substitutions in genes on the Atlantic herring C. harengus genome caused by SNPs significantly associated with sex
| Gene | Chr | Pos | AA substitution | Changes to AA |
|---|---|---|---|---|
|
| 8 | 21,072,591 | Q‐ > H | Polar to positively charged |
| 8 | 21,077,772 | S‐ > P | Polar to nonpolar | |
|
| 8 | 21,086,634 | Q‐ > K | Polar to positively charged |
| 8 | 21,088,137 | S‐ > F | Polar to nonpolar | |
|
| 8 | 21,115,969 | P‐ > S | Nonpolar to polar |
| 8 | 21,116,347 | Q‐ > E | Nonpolar to negatively charged | |
| 8 | 21,116,916 | C‐ > Y | Nonpolar to polar | |
| 8 | 21,117,352 | N‐ > D | Polar to negatively charged | |
| 8 | 21,117,963 | S‐ > L | Nonpolar to polar | |
|
| 8 | 21,162,489 | P‐ > T | Nonpolar to polar |
|
| 8 | 21,176,434 | R‐ > S | Positively charged to polar |
|
| 21 | 17,049,306 | G‐ > S | Nonpolar to polar |
| 21 | 17,049,310 | S‐ > L | Polar to nonpolar | |
| 21 | 17,049,450 | E‐ > K | Negatively charged to positively charged | |
| 21 | 17,049,451 | E‐ > A | Negatively charged to nonpolar | |
| 21 | 17,049,466 | Q‐ > L | Polar to nonpolar | |
| 21 | 17,049,504 | K‐ > E | Positively charged to negatively charged | |
| 21 | 17,051,213 | S‐ > A | polar to nonpolar |
Note: Gene name abbreviations in parentheses are the Ensembl abbreviations and are only given if no abbreviations were available in GeneBank. AA, amino acid.
Sex‐specific deletions and insertions in genomic regions associated with sex in Atlantic herring C. harengus
| Indel no. | Position | Male (M) or female (F) specific | Type | Indel size | Genotype counts (A/H) |
|---|---|---|---|---|---|
| 1 | CHR8:21,128,155 | M | Deletion | 1 | 34/6 |
| 2 | CHR8:21,128,178 | M | Deletion | 1 | 38/2 |
| 3 | CHR8:21,131,541 | F | Insertion | 2 | 30/4 |
| 4 | CHR8:21,265,567 | M | Insertion | 3 | 34/5 |
| 5 | CHR8:21,408,287 | F | Insertion | 1 | 32/2 |
| 6 | CHR8:21,545,424 | M | Deletion | 1 | 34/7 |
| 7 | CHR8:21,545,788 | M | Deletion | 1 | 41/1 |
| 8 | CHR8:21,548,262 | M | Deletion | 1 | 34/6 |
| 9 | CHR8:21,549,148 | M | Insertion | 1 | 39/6 |
| 10 | CHR8:21,549,352 | F | Insertion | 1 | 34/2 |
| 11 | CHR8:21,603,143 | F | Deletion | 1 | 29/6 |
| 12 | CHR8:21,603,205 | M | Insertion | 1 | 37/7 |
| 13 | CHR8:21,638,324 | M | Deletion | 2 | 38/1 |
| 14 | CHR8:21,677,099 | F | Insertion | 1 | 32/5 |
| 15 | CHR8:21,682,470 | M | Insertion | 2 | 33/6 |
| 16 | CHR8:21,721,091 | M | Deletion | 1 | 37/6 |
| 17 | CHR8:21,878,176 | M | Insertion | 2 | 38/10 |
| 18 | CHR8:22,011,043 | F | Insertion | 2 | 29/7 |
Note: There were 55 males and 48 females but not all individuals had data for all variations because of the low sequencing coverage. Only insertions and deletions present in all individuals (with data) of the same sex were included. A, homozygous alternative allele; H, heterozygous.