| Literature DB >> 27499266 |
Bastiaan Star1, Ole K Tørresen1, Alexander J Nederbragt1,2, Kjetill S Jakobsen1, Christophe Pampoulie3, Sissel Jentoft1,4.
Abstract
A variety of sex determination mechanisms can be observed in evolutionary divergent teleosts. Sex determination is genetic in Atlantic cod (Gadus morhua), however the genomic location or size of its sex-locus is unknown. Here, we characterize the sex-locus of Atlantic cod using whole genome sequence (WGS) data of 227 wild-caught specimens. Analyzing more than 55 million polymorphic loci, we identify 166 loci that are associated with sex. These loci are located in six distinct regions on five different linkage groups (LG) in the genome. The largest of these regions, an approximately 55 Kb region on LG11, contains the majority of genotypes that segregate closely according to a XX-XY system. Genotypes in this region can be used genetically determine sex, whereas those in the other regions are inconsistently sex-linked. The identified region on LG11 and its surrounding genes have no clear sequence homology with genes or regulatory elements associated with sex-determination or differentiation in other species. The functionality of this sex-locus therefore remains unknown. The WGS strategy used here proved adequate for detecting the small regions associated with sex in this species. Our results highlight the evolutionary flexibility in genomic architecture underlying teleost sex-determination and allow practical applications to genetically sex Atlantic cod.Entities:
Mesh:
Year: 2016 PMID: 27499266 PMCID: PMC4976360 DOI: 10.1038/srep31235
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Polymorphisms with sex-linked genotypic segregation in 48 Atlantic cod specimens (21 ♂ and 27 ♀).
| LG | Position (bp) | Allele | Genotype (Ref/Het/Alt | ||
|---|---|---|---|---|---|
| Reference | Alternative | Females | Males | ||
| 11 | 11885753 | G | T | 0/2/24 | 1/20/0 |
| 11 | 11886873 | T | A | 0/0/27 | 1/20/0 |
| 11 | 11888434 | T | C | 27/0/0 | 0/21/0 |
| 11 | 11893118 | T | A | 27/0/0 | 1/18/1 |
| 11 | 11897471 | G | GTGT | 0/0/27 | 0/21/0 |
| 11 | 11897513 | C | T | 0/0/27 | 0/20/1 |
| 11 | 11897519 | A | T | 0/0/27 | 0/21/0 |
| 11 | 11897566 | AATCC | A | 0/1/25 | 1/19/0 |
| 11 | 11899188 | A | G | 0/0/27 | 0/20/1 |
| 11 | 11899196 | G | T | 0/0/27 | 0/21/0 |
| 11 | 11899391 | C | CT | 0/0/27 | 0/21/0 |
| 11 | 11899539 | G | T | 0/2/25 | 0/20/0 |
| 11 | 11899548 | TA | T | 0/2/25 | 0/20/0 |
Limited numbers of polymorphisms (3 out of 1,573,340 in the filtered, 13 out of 55,160,622 in the unfiltered dataset) are identified, which have highly homozygous genotypes for females and heterozygous genotypes for males. All polymorphisms are co-located within a 15 Kb region on linkage group (LG) 11.
1Ref = homozygous reference, Het = heterozygous, Alt = homozygous alternative.
2Identified in the filtered SNP dataset.
Figure 1Sex linked segregation of genotypes in Atlantic cod.
Over 55 million polymorphisms are compared in 110 males and 116 females of Atlantic cod using Fishers’ exact test. Six distinct regions are identified with a significant increase in p-values, i.e. above 6SD of the mean. Highest numbers of genotypes with most extreme p-values are found on LG11 (a), whereas reduced numbers with lower p-values are found on LG08, LG14, LG15 and LG17 (b). P-values were calculated using PLINK (v.1.90p) and −log transformed. Unplaced scaffolds have been concatenated for visualization.
Linkage groups and regions that contain loci with sex-linked segregation in 226 Atlantic cod specimens.
| LG | Start (bp) | Stop (bp) | Size (bp) | Sex-linked ( | Total ( | Max- |
|---|---|---|---|---|---|---|
| 08a | 1784400 | 1784813 | 413 | 11 | 23 | 47 |
| 08b | 24118524 | 24118864 | 340 | 8 | 14 | 19 |
| 11 | 11864114 | 11918378 | 54264 | 127 | 472 | 67 |
| 14 | 27062221 | 27062388 | 167 | 3 | 8 | 22 |
| 15 | 3678047 | 3679605 | 1558 | 14 | 64 | 41 |
| 17 | 18769676 | 18769822 | 146 | 3 | 6 | 26 |
Start and Stop refer to the location of the first and last locus with a high (greater than six standard deviation from the mean) Fisher’s exact test p-value in a particular region. Size is calculated as the difference between Start and Stop. For each region, the number of sex-linked loci, total number of loci (including those that are not linked to sex, using a MAF of 0.05) and the maximum p-value (Max-p) observed in that region are given.
Figure 2PacBio read alignments of a female Atlantic cod specimen.
Long read alignments (grey) towards the male gadMor2 reference genome overlap those regions containing sex-associated genotypes (red box) on LG08 (a,b) and LG11 (c). Note the different genomic scale (in Kb) for each of the sub-panels. Small indels (black dots) within the alignments are a typical feature of PacBio read data. Read alignments are visualized using the Integrative Genomics Viewer.
Names and genbank ID of sex-determining genes from various teleosts and genes known to be involved in sex-differentiation in Atlantic cod.
| Gene name | Genbank ID | Species | LG | Start | Stop | Alignment score |
|---|---|---|---|---|---|---|
| akap11 | XM_011473624.1 | Oryzias latipes | 20 | 10120287 | 10118045 | 1859 |
| amh | JN802292.1 | Gadus morhua | 12 | 24019786 | 24018572 | 1610 |
| amhy | HM153803.1 | Odontesthes hatcheri | 12 | 24019643 | 24018570 | 473 |
| amhr2 | NM_001280009.1 | Takifugu rubripes | 13 | 9249009 | 9250493 | 574 |
| ar | FJ268742.1 | Gadus morhua | 10 | 19432117 | 19442119 | 3743 |
| cyp19a | DQ402370.1 | Gadus morhua | 14 | 16165860 | 16169755 | 2851 |
| cyp19b | JN802291.1 | Gadus morhua | 9 | 2257039 | 2260649 | 2648 |
| dmrt | AJ506094.1 | Gadus morhua | 6 | 19949821 | 19940091 | 855 |
| dmrt2a | JN802284 | Gadus morhua | 6 | 19911488 | 19908495 | 1650 |
| dmrt3 | JN802285 | Gadus morhua | 6 | 19922948 | 19919278 | 2020 |
| dmrt4 | JN802286 | Gadus morhua | 17 | 14093750 | 14092035 | 2546 |
| dmrt5 | JN802287 | Gadus morhua | 12 | 20988494 | 20990196 | 2537 |
| dmy | NM_001104680.1 | Oryzias latipes | 6 | 19948867 | 19940028 | 382 |
| esr1 | JX178935.1 | Gadus morhua | 21 | 18435483 | 18415167 | 2611 |
| esr2a | JX178936.1 | Gadus morhua | 21 | 8193571 | 8213026 | 3711 |
| esr2b | JK993476.1 | Gadus morhua | 5 | 20297385 | 20297568 | 334 |
| foxl2 | NM_001104888.1 | Oryzias latipes | 1 | 27838167 | 27837834 | 470 |
| gsdf | KC204828.1 | Gadus morhua | 4 | 25085824 | 25083577 | 1040 |
| sdY | NM_001281416.1 | Oncorhynchus mykiss | 7 | 1898620 | 1898410 | 136 |
| sox3 | AB775143.1 | Oryzias dancena | 10 | 17937590 | 17936636 | 1482 |
| sox9a | JN802288.1 | Gadus morhua | 18 | 7659675 | 7661680 | 1722 |
| sox9b | JN802289.1 | Gadus morhua | 2 | 17478629 | 17480857 | 2714 |
| vasa | HM451456.1 | Gadus morhua | 7 | 3034506 | 3016733 | 3812 |
Location (in base pair) on the various linkage groups (LD) was determined by aligning the protein coding sequence to the genome assembly using exonerate 2.2.0 50 with the option –model coding2genome. Alignments with the highest score (–bestn) were selected as the most likely genomic location.
Figure 3Annotation surrounding the sex-associated region of Atlantic cod on linkage group 11.
Nine gene models (the arrow shows transcriptional direction) have been annotated within a 200 Kb window around the loci with sex-linked genotypic segregation. The histogram (orange) shows the number of sex associated loci (with p-values more than 6 SD from the mean) in windows of 1 Kb. Gene models with cDNA evidence from Atlantic cod (blue) have been annotated with gene names and are in conserved synteny (order and transcriptional direction) with those in three-spined stickleback and spotted gar. Three genes (ptpn2a, cep76 and id4) are in conserved synteny with medaka. Gene models without cDNA evidence (grey) do not have obvious homology with known genes. Average read depth (grey line) is calculated in 1 Kb windows and can be compared to the genome-wide average coverage (red dashed line).