| Literature DB >> 31988161 |
Yusuke Takehana1, Margot Zahm2, Cédric Cabau3, Christophe Klopp2,3, Céline Roques4, Olivier Bouchez4, Cécile Donnadieu4, Celia Barrachina5, Laurent Journot5, Mari Kawaguchi6, Shigeki Yasumasu6, Satoshi Ansai7, Kiyoshi Naruse7, Koji Inoue8, Chuya Shinzato8, Manfred Schartl9,10,11, Yann Guiguen12, Amaury Herpin13,14.
Abstract
The genus Oryzias consists of 35 medaka-fish species each exhibiting various ecological, morphological and physiological peculiarities and adaptations. Beyond of being a comprehensive phylogenetic group for studying intra-genus evolution of several traits like sex determination, behavior, morphology or adaptation through comparative genomic approaches, all medaka species share many advantages of experimental model organisms including small size and short generation time, transparent embryos and genome editing tools for reverse and forward genetic studies. The Java medaka, Oryzias javanicus, is one of the two species of medaka perfectly adapted for living in brackish/sea-waters. Being an important component of the mangrove ecosystem, O. javanicus is also used as a valuable marine test-fish for ecotoxicology studies. Here, we sequenced and assembled the whole genome of O. javanicus, and anticipate this resource will be catalytic for a wide range of comparative genomic, phylogenetic and functional studies. Complementary sequencing approaches including long-read technology and data integration with a genetic map allowed the final assembly of 908 Mbp of the O. javanicus genome. Further analyses estimate that the O. javanicus genome contains 33% of repeat sequences and has a heterozygosity of 0.96%. The achieved draft assembly contains 525 scaffolds with a total length of 809.7 Mbp, a N50 of 6,3 Mbp and a L50 of 37 scaffolds. We identified 21454 predicted transcripts for a total transcriptome size of 57, 146, 583 bps. We provide here a high-quality chromosome scale draft genome assembly of the euryhaline Javafish medaka (321 scaffolds anchored on 24 chromosomes (representing 97.7% of the total bases)), and give emphasis on the evolutionary adaptation to salinity.Entities:
Keywords: Medaka; adaptation; evolution; genetic map; long reads; salinity; transcriptome; whole genome sequencing
Mesh:
Year: 2020 PMID: 31988161 PMCID: PMC7056978 DOI: 10.1534/g3.119.400725
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1A couple of Java medakas, Oryzias javanicus. Picture from K. Naruse, NBRP Medaka stock center (https://shigen.nig.ac.jp/medaka/top/top.jsp).
Figure 2Oryzias javanicus assembly pipeline. Sequencing data are represented by colored rectangles with waved bases. Tools used are in gray rectangles. Assembly metrics are in gray and white rectangles. This pipeline is divided in stages symbolized by the frame.
GenomeScope outputs on O. javanicus genome statistics
| Property | min | max |
|---|---|---|
| 0.960% | 0.964% | |
| 908,146,324 bp | 908,641,143 bp | |
| 303,610,795 bp | 303,776,222 bp | |
| 604,535,529 bp | 604,864,921 bp | |
| 95.95% | 99.72% | |
| 1.50% | 1.50% |
Java medaka assembly and annotation statistics
| Gene annotation | |
|---|---|
| Number of genes | 21,454 |
| Number of transcripts | 21,454 |
| Transcriptome size | 57,146,583 bp |
| Mean transcript length | 2,663 bp |
| Longest transcript | 42,733 bp |
| Number of genes with significant hit against NCBI NR | 17,412 (81.2%) |
| Complete BUSCOs | 4,289 (93.6%) |
| Fragmented BUSCOs | 187 (4.1%) |
| Missing BUSCOs | 108 (2.3%) |
Figure 3Phylogenetic position of O. javanicus. Maximum likelihood tree was inferred from the concatenated codon-alignment of 10,852 single-copy genes among 5 reference assemblies of Oryzias species with Southern platyfish (Xiphophorus maculatus) as outgroup. All nodes were supported by 100% bootstrap values.
Figure 4Genome-wide comparison of the Oryzias javanicus and Oryzias latipes genome assemblies. This dot-plot alignment of the O. javanicus genome with the O. latipes reference genome [Ensembl version ASM223467v1 (GCA_002234675.1)] was produced with the D-GENIES software (Cabanettes and Klopp 2018) available online at http://dgenies.toulouse.inra.fr/ with default visualization parameters and a slight filtering of the small matches. The different ranges of percentage of identity are shown in different colors (legend panel with the identity percentage color range).
Figure 5Hatching enzyme of Oryzias javanicus. (A) hce gene cluster of O. latipes (mhce1-5) and O. javanicus (Ojhce1-5). Arrowheads indicate direction of transcription. (B) Salt dependency of O. javanicus hce (black circle) and O. latipes (white circle). Activities are shown as % of relative activity with respect to highest activity, which is considered as 100% in each species.