| Literature DB >> 34956335 |
Wenjing Tao1, Jianmeng Cao2, Hesheng Xiao1, Xi Zhu1, Junjian Dong2, Thomas D Kocher3, Maixin Lu2, Deshou Wang1.
Abstract
The Mozambique tilapia (Oreochromis mossambicus) is a fascinating taxon for evolutionary and ecological research. It is an important food fish and one of the most widely distributed tilapias. Because males grow faster than females, genetically male tilapia are preferred in aquaculture. However, studies of sex determination and sex control in O. mossambicus have been hindered by the limited characterization of the genome. To address this gap, we assembled a high-quality genome of O. mossambicus, using a combination of high coverage of Illumina and Nanopore reads, coupled with Hi-C and RNA-Seq data. Our genome assembly spans 1,007 Mb with a scaffold N50 of 11.38 Mb. We successfully anchored and oriented 98.6% of the genome on 22 linkage groups (LGs). Based on re-sequencing data for male and female fishes from three families, O. mossambicus segregates both an XY system on LG14 and a ZW system on LG3. The sex-patterned SNPs shared by two XY families narrowed the sex determining regions to ∼3 Mb on LG14. The shared sex-patterned SNPs included two deleterious missense mutations in ahnak and rhbdd1, indicating the possible roles of these two genes in sex determination. This annotated chromosome-level genome assembly and identification of sex determining regions represents a valuable resource to help understand the evolution of genetic sex determination in tilapias.Entities:
Keywords: Mozambique tilapia; genome; sex chromosome; sex determining region; transition
Year: 2021 PMID: 34956335 PMCID: PMC8692795 DOI: 10.3389/fgene.2021.796211
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Summary of representative cichlid genomes at chromosome level.
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| Sequencing platform | Nanopore, Hi-C | PacBio | Nanopore, Hi-C | Nanopore, Hi-C | PacBio | PacBio, BioNano, Hi-C | PacBio, BioNano, Hi-C | PacBio, BioNano, Hi-C |
| Chromosome number | 22 | 22 | 22 | 22 | 22 | 24 | 22 | 22 |
| Assembly size (Mb) | 1,007 | 1,001 | 1,005 | 1,006 | 957.49 | 961.23 | 903.07 | 880.44 |
| GC% | 42.28 | 40.72 | 41.23 | 40.65 | 41.15 | 39.79 | 40.59 | 41.08 |
| Identified genes | 28,902 | 29,537 | 25,264 | 25,467 | 25,898 | 29,275 | - | 26,070 |
| Contig | 2.78 | 2.93 | 2.65 | 4.40 | 1.41 | 2.15 | 3.84 | 4.44 |
| N50 (Mb) | ||||||||
| Scaffold | 42.17 | 38.84 | 40.35 | 40.72 | 32.66 | 35.59 | 37.15 | 38,67 |
| N50(Mb) | ||||||||
| References | This study | GCA_001858045.3 | GCA_013350305.1 | GCA_013358895.1 | GCA_000238955.5 | GCA_007364235.1 | GCA_013435755.1 | GCA_900246225.3 |
Note: A. centrachus indicates Archocentrus centrarchus (flier cichlid), A. citrinellus indicates Amphilophus citrinellus (Midas cichlid), A. calliptera indicates Astatotilapia calliptera (eastern happy), M. zebra indicates Maylandia zebra.
FIGURE 1(A) Genome-wide chromosomal contact matrix of O. mossambicus based on chromatin interaction data generated by Hi-C. The low to high interaction frequency distribution of Hi-C links among chromosomes is shown from light yellow to dark red in the heatmap. (B) Comparison of the genome of O. mossambicus and A. centrarchus. A Circos atlas presents details of the pseudo-chromosome information from outside to inside: (I) the length of each chromosome, (II) GC content of 100-Kb genomic intervals, (III) density of gene distribution in each 100-Kb genomic interval, (IV) density of repeats in each 100-Kb genomic interval. A synteny comparison between O. mossambicus and A. centrarchus genomes (Chr represents A. centrarchus and lg represents O. mossambicus) revealed high accuracy of our assembled O. mossambicus genome.
FIGURE 2Phylogenetic analysis of O. mossambicus and other representative teleost fish species. The species divergence time is shown at the branches of the phylogenetic tree, and the confidence intervals are given in parentheses.
FIGURE 3Whole genome survey of sex chromosomes in O. mossambicus. (A) Sex-patterned variants of Family 1, intermediate frequency SNPs in males that are fixed or nearly fixed in females. (B) FST comparison of female pool versus male pool from Family 1. (C) Sex-patterned variants of Family 2. (D) FST comparison of female pool versus male pool from Family 2. The genome re-sequencing data are retrieved from NCBI SRA database (Gammerdinger et al., 2019).
FIGURE 4Genome-wide identification of SNPs from 5 male and 5 female individuals from Zhujiang population of O. mossambicus. (A) Principal component analysis of 10 individuals using SNPs. (B) Phylogenetic tree showing relationships of male (M1-M5) and female (F1-F5) SNPs. (C) Sex-linked SNPs of 10 individuals indicate LG14 is the sex chromosome of the Zhujiang population. (D) Density of sex-linked in each 10-Kb genomic interval on LG14.
FIGURE 5Characterization of sex-determining regions. (A) Syntenic relationships of sex-determining regions on LG14 among O. niloticus, O. aureus and O. mossambicus. (B) 5 shared non-synonymous substitutions, and the other 23 non-synonymous substitutions are only found in the Zhujiang population. (C) The predicted protein structures of Y-linked ahnak. (D) The predicted protein structures of X-linked ahnak. (E) The predicted protein structures of Y-linked rhbdd1. (F) The predicted protein structures of X-linked rhbdd1.