Literature DB >> 31530635

Construction of High-Resolution RAD-Seq Based Linkage Map, Anchoring Reference Genome, and QTL Mapping of the Sex Chromosome in the Marine Medaka Oryzias melastigma.

Bo-Young Lee1, Min-Sub Kim1, Beom-Soon Choi2, Atsushi J Nagano3, Doris Wai Ting Au4,5, Rudolf Shiu Sun Wu6, Yusuke Takehana7, Jae-Seong Lee8.   

Abstract

Medaka (Oryzias sp.) is an important fish species in ecotoxicology and considered as a model species due to its biological features including small body size and short generation time. Since Japanese medaka Oryzias latipes is a freshwater species with access to an excellent genome resource, the marine medaka Oryzias melastigma is also applicable for the marine ecotoxicology. In genome era, a high-density genetic linkage map is a very useful resource in genomic research, providing a means for comparative genomic analysis and verification of de novo genome assembly. In this study, we developed a high-density genetic linkage map for O. melastigma using restriction-site associated DNA sequencing (RAD-seq). The genetic map consisted of 24 linkage groups with 2,481 single nucleotide polymorphism (SNP) markers. The total map length was 1,784 cM with an average marker space of 0.72 cM. The genetic map was integrated with the reference-assisted chromosome assembly (RACA) of O. melastigma, which anchored 90.7% of the assembled sequence onto the linkage map. The values of complete Benchmarking Universal Single-Copy Orthologs were similar to RACA assembly but N50 (23.74 Mb; total genome length 779.4 Mb; gap 5.29%) increased to 29.99 Mb (total genome length 778.7 Mb; gap 5.2%). Using MapQTL analysis with SNP markers, we identified a major quantitative trait locus for sex traits on the Om10. The integration of the genetic map with the reference genome of marine medaka will serve as a good resource for studies in molecular toxicology, genomics, CRISPR/Cas9, and epigenetics.
Copyright © 2019 Lee et al.

Entities:  

Keywords:  Oryzias melastigma; linkage map; marine medaka; quantitative trait locus; reference genome; sex

Mesh:

Year:  2019        PMID: 31530635      PMCID: PMC6829124          DOI: 10.1534/g3.119.400708

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Many fish species are useful for ecotoxicological research, as they indicate an early warning of environmental contamination caused by various aquatic pollutants (Arellano-Aguilar ). Recently, an increase in the contamination levels of estuaries and coastal water due to anthropogenic pollutants emphasizes the need for marine sentinel model fish species. Medaka (Oryzias sp.) is an important fish species in ecotoxicology and considered a model species, as its biological features include certain advantages such as small body size and short generation time (Kim ). Japanese medaka Oryzias latipes is important and widely used model species for the studies of genetics, evolution, and ecotoxicology, with abundant genomic resources. Since O. latipes is a freshwater species, their responses to environmental toxicants can be different in those of marine fish (Shi ; Wu ; Wheeler ). Marine medaka Oryzias melastigma inhabits brackish water in Asian regions including Pakistan, India, Burma, and Thailand (Naruse 1996). O. melastigma has been acknowledged as a potential model fish for marine ecotoxicological studies and is useful for the evaluation of acute and/or chronic toxicity, and embryo toxicity testing (Chen ; Dong ; Kim ; Kong ; Shen ; Wang ; Kim ). The genus Oryzias has been divided into three major monophyletic species groups; the latipes, the javanicus, and the celebenesis groups, while O. dancena, a closely related species of O. melastigma, has been phylogenetically placed in the javanicus group (Takehana ). Phenotypic distinction between male and female in medaka is distinguished by a number of secondary sex characters including shape and size of dorsal and anal fins due to morphologically indistinguishable sex chromosomes (Schartl 2004). Moreover, Oryzias species shows the both XY and ZW sex-determining systems. O. latipes, O. dancena, and O. minutillus have an XX/XY sex-determining system and O. hubbsi has a ZZ/ZW (Matsuda ; Takehana ). For example, sex-determining gene dmrt1bY has been identified in a few species of the O. latipes group and sox3 in O. dancena and some species of the celebenesis group (Kondo ; Matsuda ; Myosho ). There is a lack of studies on sex-determining genes in O. melastigma. Therefore, more genomic resources specifically devoted for the marine medaka O. melastigma are required. A genetic linkage map is a very useful tool to understand genetic architecture such as chromosome structure, segregation distortion regions, recombination rate, and recombination hotspots (Meyer and Van de Peer 2005; Zhu ; Guo ; Shifman ). Furthermore, it provides a framework for mapping the chromosomal location of single-gene traits and quantitative traits of interest, and helps to facilitate candidate gene cloning, and comparative genomic analysis with some genome information together (Lee ; Amores ; Feng ; Zhu ; Li ; Shao ; Wang ; Xiao ; Kanamori ). A high-density genetic map also plays an important role in assembling whole genome sequences by examining the accuracy of de novo genome assembly (Dukić ; Rhee ; Wang ). Indeed, the importance of a high-density genetic map has been demonstrated during the de novo genome assembly in teleost fish, as it validated the presence of additional genome duplication (Meyer and Schartl 1999; Meyer and Van de Peer 2005; Volff 2005). In addition, restriction-site associated DNA (RAD) sequencing based on next-generation sequencing (NGS) enables the rapid discovery of genome-wide genetic markers and high-throughput single nucleotide polymorphism (SNP) genotyping in mapping families and facilitates the construction of high-density genetic linkage maps in both model and non-model organisms (Baird ; Davey and Blaxter 2010; Amores ; Davey ; Etter ). We have recently published a reference genome assembly (total genome length 779.4 Mb) of O. melastigma as a model species in environmental toxicology (Kim ). In this study, we constructed a high-density genetic linkage map of O. melastigma using an F1 full-sib family. Using the genetic linkage map, sex QTL has been mapped in the genome of O. melastigma and the previous genome assembly was anchored onto the linkage map to improve the contiguity of the assembly. The development of a high-density genetic map is imperative to facilitate both genetic and genomic studies in O. melastigma. The present study will assist in a better understanding of genome-based research in molecular toxicology, genomics, CRISPR/Cas9, and epigenetics.

Materials and methods

Mapping cross

The marine medaka O. melastigma used in this study were obtained from the City University of Hong Kong (kindly provided by Dr. Doris W.T. Au) and maintained at Nagahama Institute of Bio-Science and Technology, Nagahama, Japan. A male and a female fish were bred to produce F1 progenies. In total, 58 F1 individuals were used to create a linkage group (LG).

RAD sequencing

Genomic DNA was extracted from muscle tissue using DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. The size and quality of DNA isolated was checked on 1% agarose gel by electrophoresis and the concentration was measured using Qubit florometer (Thermo Fisher Scientific, Waltham, MA, USA). Genomic DNA (40 ng) of each sample was digested with BglII (5 Unit) and EcoRI (5 Unit), ligated with a Y-shaped adaptor, amplified by polymerase chain reaction (PCR) with KAPA HiFi HS ReadyMix (Kapa Biosystems, Wilmington, MA, USA), and fragments were selected with E-Gel Size Select (Life Technologies, Carlsbad, CA, USA). The mean size of the selected fragments was 333 bp (CV 16.4%). Further details of the library preparation method are described in a previous study by Sakaguchi . RAD sequencing (RAD-seq) was performed using 58 F1 individuals and both parents with HiSeq2500 (Illumina, San Diego, CA, USA) with eight cycles for index read and 51 cycles for the reads of interest. For each parental sample, the same amounts were aliquoted in four different reaction tubes and sequencing of each reaction was carried out to reduce PCR amplification bias. All procedures related to RAD-seq including the library construction were performed by Clockmics Inc. (Osaka, Japan).

Extracting RAD-tags and SNP genotyping by Stacks

Quality filtration of sequence reads was performed using Trimmomatic v.0.33 (Bolger ) with parameter options of -0.33.jar SE -phred33 TOPHRED33 ILLUMINACLIP TruSeq3-PE-2.fa:2:30:10 LEADING:19 TRAILING:19 SLIDINGWINDOW:30:20 AVGQUAL:20 MINLEN:51. RAD-tag extraction and genotyping were performed with Stacks v.1.47 software (http://creskolab.uoregon.edu/stacks/) (Catchen ). The sequence reads were aligned to the available reference genome (GCF_002922805.1, Kim ) using GSnap (https://github.com/juliangehring/GMAP-GSNAP) with default parameters (-t 30 –n 1 –m 5 –i 2), which converted to BAM files. All RAD-tags catalog from the parental samples were extracted by Stacks using the ref_map.pl pipeline with the parameters –m10 and -P 3 and genotyping was called by the parameters of minimum number of 5 reads to call a homozygous genotype, a minimum minor allele frequency of 0.1 to call a heterozygote, and a maximum minor allele frequency of 0.05 to call a homozygote. Among RAD-tags, single nucleotide polymorphism (SNP) markers with maximum likelihood of 0 were selected for mapping and the SNP markers with genotypes of at least 53 F1 offsprings (> 90%) were collected for map construction using the command genotypes –r 53 of Stacks v.1.47. Data of raw sequences were deposited in the Sequence Read Archive (SRA) (http://www.ncbi.nlm.nih.gov/sra) under the accession number PRJNA514812.

Linkage map construction

Linkage analysis of genetic markers (SNPs) was performed using JoinMap 5.0 (Wageningen, Netherlands, Van Ooijen 2018). The SNP markers with a significant segregation distortion (χ2 test, P < 0.01) were removed from the analysis of linkage map construction. Linkage groups were identified by the grouping parameters of independent Likelihood Odds Ratio (LOD) threshold of 5 provided in JoinMap 5.0. Map distances were calculated by the Kosambi’s mapping function and mapping algorithm used for building linkage map was regression mapping based on the default parameters. The regression mapping adds loci one by one starting from the most informative pair of loci. The best position of each added locus is searched by comparing the goodness-of-fit of the calculated map for each tested position. JoinMap performed three rounds of marker positioning with a jump threshold of 5 and we took second round of map as a final map as recommended by the manual. The linkage map was visualized using MapCart 2.32 (Voorrips 2002). The name of the linkage groups was matched with the homologous chromosomes of Japanese medaka.

Anchoring the reference genome on to linkage map

Genetic markers in the linkage map were anchored to the reference genome (GCF_002922805.1) of the marine medaka O. melastigma using Chromonomer v. 1.08 (http://catchenlab.life.illinois.edu/chromonomer/). The integrated genome assembly based on the genetic map was re-assessed with benchmarking universal single-copy orthologs (BUSCO) v.3.0 (Simão ) using the vertebrate database (OrthoDB v.9.0; https://www.orthodb.org/?page=filelist). The gene annotation of the final assembly was carried out using MAKER v.2.31.8 pipeline with manual curation (Suppl. Fig. S1) (Holt and Yandell 2011).

Comparative analysis of two medaka genomes

The final linkage map based genome assembly of marine medaka was compared with the genome of Japanese medaka (Hd-rR strain) to compare the similarity between two medaka genomes using Mummer v.3.0 (http://mummer.sourceforge.net/manual/#coords).

Sex linkage analysis

The mapping panel consisted of 27 males and 31 females. In order to examine sex determining mechanism in O. melastigma by linkage analysis, two completely linked sex markers (sex_xy and sex_zw) were additionally added in the genotype files. For sex_xy and sex_zw genotypes, male individuals were coded as heterozygotes and homozygotes, respectively, while female individuals were coded as homozygotes and heterozygotes, respectively. The linkage analysis of sex markers was performed with JoinMap 5.0 (Wageningen, Netherlands, Van Ooijen 2018). Since the sex trait is qualitative, the phenotype for sex was converted into binary code; male 1 and female 0. For QTL analysis, standard interval mapping was performed and significance was determined by permutation test (n = 1000) using MapQTL 6.0 (van Ooijen et al. 2002).

Data availability

Sequenced species O. melastigma is available upon request. Suppl. Data contains all supplemental tables and figures. Suppl. File S1 contains the names of 2481 SNP, genotype, map positions, and tag sequences in Map info tab and phenotypes for sex of all individuals were in Phenotype tab. Suppl. File S2 contains the re-scaffolding process and results by Chromonomer. Suppl. File S3 includes the gene list annotated in the sex-determining region. Raw data of RAD seq were deposited to GenBank under the accession number PRJNA514812. The genome sequence data are available in GenBank with accession number PRJNA401159. The genome JBrowse is available at http://rotifer.skku.edu:8080/Om2. Supplemental material available at FigShare: https://doi.org/10.25387/g3.7811978.

Results

Constructing genetic linkage map of the marine medaka Oryzias melastigma

Stacks software extracted 113,367 RAD-tags from O. melastigma genome. Among them, the number of putative SNP markers was 34,040. To distinguish polymorphisms from sequencing errors, we collected 24,441 SNP markers with Likelihood Ratio of 0, including 8,518 SNP markers that have been located in the same RAD-tags more than twice. Among the remaining 15,922 loci, 4,497 SNP loci were successfully genotyped in at least 53 F1 individuals (>90%). After removing the markers that showed a segregation distortion (P < 0.01), 3,732 were finally used for building a genetic map. The 3,730 markers were grouped into 24 LGs with a LOD ≥ 5.0 by independence LOD and two remaining markers were not linked with any of those groups. The regression mapping function by JoinMap positioned 2,481 SNPs in the second rounds of mapping by comparing the goodness-of-fit (Suppl. File S1). The 24 LGs were consistent with the number of chromosomes (n = 24) in O. melastigma (Uwa ). The total map length was 1,784 cM and each LG included 57-173 markers with an average marker interval of 0.72 cM (Figure 1 and Table 1).
Figure 1

A linkage map of the marine medaka Oryzias melastigma. The map consists of 24 linkage groups and the bars on each linkage group represents single nucleotide polymorphism (SNP) markers. Colors of bars indicate the reference-assisted chromosome assembly scaffolds that SNP was extracted. (Name, sequences, and position of SNP are included in the Suppl. File 1.)

Table 1

Summary of the genetic linkage map of Oryzias melastigma

LGGroup IDNo. markers mappedMap Length (cM)Oryzias latipes Chromosome
Om01Group 79177.921
Om02Group 219864.9352
Om03Group 510789.1843
Om04Group 610868.3134
Om05Group 1310284.1325
Om06Group 417389.9726
Om07Group 1211880.0817
Om08Group 245753.5848
Om09Group 1711185.6719
Om10Group 1611768.29710
Om11Group 110379.83911
Om12Group 812378.2512
Om13Group 157471.65713
Om14Group 1414281.0314
Om15Group 1810657.11815
Om16Group 198853.32416
Om17Group 312877.58917
Om18Group 211772.04318
Om19Group 229972.17819
Om20Group 238393.80220
Om21Group 209366.78521
Om22Group 97972.19722
Om23Group 117172.723
Om24Group 109373.36624
Total24811783.967
A linkage map of the marine medaka Oryzias melastigma. The map consists of 24 linkage groups and the bars on each linkage group represents single nucleotide polymorphism (SNP) markers. Colors of bars indicate the reference-assisted chromosome assembly scaffolds that SNP was extracted. (Name, sequences, and position of SNP are included in the Suppl. File 1.)

Re-scaffolding of the reference genome with the genetic map

Using the SNP marker information in the high-density genetic linkage map, 810 markers were anchored to 134 scaffolds and among them, 35 were split into 1 to 9 positions, producing a total of 260 integrated scaffolds (Tables 2, Suppl. Table S2, and Suppl. Fig. S2). After integration, the length of the genome scaffolds aligned on the map was 712,537,413 bp (Table 2). Out of the 260 integrated scaffolds, the orientation was determined in 160 scaffolds spanning 670,530,120 bp, which accounted for 94% of the total scaffold length in the linkage map (Table 2). Among 40 reference-assisted chromosome assembly (RACA) scaffolds previously published (Kim ), 20 RACA scaffolds (RACA3, 5, 6, 7, 8, 9, 11, 12, 14, 15, 18, 21, 22, 23, 25, 26, 27, 29, 30, 40) were aligned to the 13 linkage groups (Om23, Om21, Om20, Om19, Om17, Om16, Om14, Om 12, Om11, Om10, Om09, Om08, Om01) without any modification (Suppl. Table S1). Other RACA scaffolds showed the major alignment of one linkage group with the partial alignment of another linkage groups (Suppl. Table S1). Among them, RACA 31 showed the most frequent rearrangement during the anchoring process, which was a major alignment with Om 7 and partial alignments with another 4 linkage groups (Om20, Om18, Om17, and Om02) (Suppl. Table S1). Four linkage groups (Om03, Om05, Om15, and Om22) were completely aligned by each of 4 RACA scaffolds (RACA4, RACA16, RACA33, and RACA36) respectively (Table 2), although some parts of sequences aligned in the linkage groups were inserted from small scaffolds (Suppl. File S2) and the parts of RACA scaffolds were located in parts of another linkage groups (Supp. Table S1). Overall, the final genetic map based genome assembly consisted of 8,493 scaffolds; 24 linkage map-based scaffolds (90.7%) and 8,469 unanchored scaffolds (9.3%). The total genome length was 779 Mb with an N50 value of 29,978,720 bp (Table 3). BUSCO analysis indicated that the final genome assembly of O. melastigma represented 96.8% of the complete copy in the vertebrate gene model (Table 4). The genome annotation pipeline in the final assembly was defined as 24,507 genes (http://rotifer.skku.edu:8080/Om2), ranging from 661 to 1,216 genes per LG (Table 3 and Suppl. Table S3).
Table 2

Physical lengths of linkage map anchored with the reference genome assembly in Oryzias melastigma

LGPhysical Length (bp)No. of anchorsNo. of scaffoldsNo. of oriented scaffoldsLength of Oriented scaffolds (bp)RACA scaffolds anchored to linkage groups
Om0132,649,675378631,960,1211, 33, 39, 40
Om0222,231,40437221118,905,50817, 21, 31,33, 35, 37, 38
Om0335,272,6513411733,734,27736
Om0431,689,6222610831,276,4844, 34, 35, 38
Om0537,216,9972711836,448,57133
Om0635,155,25750141134,854,58232, 35, 36
Om0733,183,4123513832,581,82513, 31, 37
Om0825,074,844169524,678,1011, 2, 30
Om0933,864,5063415828,264,70919, 28, 29, 34, 37
Om1029,394,5984413521,895,78717, 26, 27, 33
Om1127,364,9413313926,207,85323, 24, 25, 36, 39
Om1227,293,507489626,239,88421, 22
Om1333,844,7312310531,595,51010, 19, 20, 24, 32
Om1429,986,920419627,349,87217, 18,
Om1531,267,159239730,446,10016
Om1631,579,375289627,524,06314, 15, 28, 32, 38
Om1734,593,98036221333,270,27710, 11, 12, 13, 16, 31
Om1825,749,4895016723,208,88210, 15, 31
Om1924,827,3263911823,944,2488, 9, 20, 36
Om2026,018,873354223,810,4797, 31
Om2128,918,309303328,918,3095, 6
Om2227,453,564283227,180,0604
Om2323,210,987339522,600,8312, 3,
Om2423,210,987237423,633,7871, 39
Total712,537,413810260160670,530,120

Bold numbers were mainly anchored scaffolds on the linkage map.

Table 3

Statistics of the final genome assembly before and after anchoring in Oryzias melastigma

                             Reference genomes
StatisticsRACALinkage map based
Number of scaffolds8,6028,493
Length of scaffolds (bp)779,456,607778,703,520
N50 (bp)23,737,18729,987,720
Largest scaffold (bp)37,948,42137,217,997
Gap (%)5.295.2
GC content (%)39.0437.02
Number of unanchored scaffolds8,469
Length of unanchored scaffolds (bp)66,142,507
Number of genes23,52824,506
Total genes length (bp)51,834,19624,784,506
Average genes length (bp)2,5861,011
Maximum gene length (bp)80,77526,364
GC content (%)54.3454.27
Table 4

Assessment of LG-based assembly completeness

Vertebrata DB%No. of genes (n = 2586)
Complete BUSCOs (C)96.82504
Complete and single-copy BUSCOs (S)95.82477
Complete and duplicated BUSCOs (D)127
Fragmented BUSCOs (F)1.232
Missing BUSCOs (M)250
Bold numbers were mainly anchored scaffolds on the linkage map.

Comparative genomic analysis of two medaka genomes

The final genome assembly integrated with a genetic map provides an efficient resource for comparative genomic analysis with other medaka genomes such as a Japanese medaka (O. latipes). The 24 genetic map-based scaffolds showed good homology in gene contents and sequences similarity with chromosomes in O. latipes, Most LG-based scaffolds of marine medaka showed collinear relationships completely or in a majority with the counterpart chromosomes of O. latipes (Figure 2 and Suppl. Fig. S3). Other LG/chromosomes in O. melastigma showed disrupted collinearity due to intrachromosomal rearrangements or possible errors in linkage map. Reversely matched parts in the collinearly related scaffolds were most likely caused by the undetermined orientation of scaffolds involved in the anchoring process. In addition, some LG-based scaffolds showed in/del regions (Om05, Om08, Om11, Om14, Om15, Om16, and Om17) compared to Japanese medaka chromosomes.
Figure 2

Genome-wide comparison of the genomic sequences between Oryzias melastigma and Oryzias latipes using Mummer. Red and blue dots represent forward and reverse match, respectively.

Genome-wide comparison of the genomic sequences between Oryzias melastigma and Oryzias latipes using Mummer. Red and blue dots represent forward and reverse match, respectively.

Mapping of sex-determining regions

The linkage analysis of two sex markers (sex_xy and sex_zw) showed that both markers were more or less linked to markers in the Om10. Sex_xy showed the highest LOD scores with 17018 (LOD = 17.06) and strong linkages with other markers in this linkage group (Suppl. Table S4). Although sex_zw showed the highest LOD value (7.42) with 16660, which also had the same LOD with sex_xy, the LOD scores of sex_zw with other markers indicated weak or suspect linkage (Suppl. Table S4). This investigation suggested that the O. melastigma has XY sex-determining system. A significant QTL for sex was detected in Om10 with the LOD significance threshold of 5.3 based on the permutation test (Figure 3A). Among 2,481 markers mapped in the Om linkage groups, 58 markers showed the significant association with sex (Suppl. Table S5), most of which were markers from the RACA27. The most significant two markers associated with sex were 17040 (24.3 cM) and 17018 (24.4 cM) with LOD of 35 (Suppl. Table S5). The genotypes of those markers were completely linked with the sex phenotype of all individuals, except for one animal (d004), which could be caused by wrong phenotyping or sex-reversal. When the animal was removed from the QTL analysis, LOD scores of 17018 and 17040 increased to 881.12 and 299.4, respectively (Figure 3B). Physical location of sex-determining region was mapped between 10,353,256 bp and 10,538,878 bp in Om10, spanning approximately 186 kb, and 10 genes were located in the regions without any obvious candidate gene (Figure 3C). However, the sox3 was located on the 10,625,885 bp, which was 87 kb apart from the mapped region (Suppl. Table S3; Figure 3C).
Figure 3

Quantitative trait locus mapping for sex traits on the Om10 scaffold of Oryzias melastigma. (A) Standard interval mapping. Scales on the left represents the map position (cM) of linkage group Om10; Scales on the top of the graph represents the value of the Likelihood Odds Ratio (LOD) scores. The red dotted line indicates the threshold of significance (LOD = 5.3). (B) Standard interval mapping without d004 individual. Scales on the left represents the physical (Mbp) of Om10 scaffold; Scales on the top of the graph represents the value of the LOD scores. The red dotted line indicates the threshold of significance (LOD = 5.3). (C) Gene lists in the sex-determining region of Om10 scaffold.

Quantitative trait locus mapping for sex traits on the Om10 scaffold of Oryzias melastigma. (A) Standard interval mapping. Scales on the left represents the map position (cM) of linkage group Om10; Scales on the top of the graph represents the value of the Likelihood Odds Ratio (LOD) scores. The red dotted line indicates the threshold of significance (LOD = 5.3). (B) Standard interval mapping without d004 individual. Scales on the left represents the physical (Mbp) of Om10 scaffold; Scales on the top of the graph represents the value of the LOD scores. The red dotted line indicates the threshold of significance (LOD = 5.3). (C) Gene lists in the sex-determining region of Om10 scaffold.

Discussion

A high-resolution genetic map is very useful in diverse genomic research and has been applied to many fish species (Amores ; Li ; Shao ; Xiao ; Wang ; Kanamori ). In this study, a high-density genetic map was constructed in O. melastigma using RAD sequencing and was used for verifying the previously published marine medaka reference genome and for aligning the scaffolds at the chromosomal level. The genetic map of O. melastigma consists of 24 LGs with 2,481 SNP markers, which cover 24 chromosomes (Uwa ). The 810 genetic markers were anchored to the genetic map (Table 2 and Suppl. Figure 2), which mapped 90.7% (713 Mb) of the reference genome assembly sequence onto the genetic map (Table 3). Although all SNP markers were extracted by the alignment of RAD sequences against the reference genome of marine medaka (Kim ), the number of anchored markers were lower than anticipated. Many markers in the LGs were excluded from the integrating process due to inconsistency between the genetic map and reference assembly, indicating possible errors in marker order and/or de novo assembly. Indeed, in most cases the markers tended to be highly concentrated in the narrow regions, suggesting that more recombinant individuals will be required to obtain a more precise marker order. Also, it is likely that the size of the mapping family was not big enough for the marker concentrated regions, compared with the number of markers analyzed, resulting in relatively low number of anchored markers. Despite this shortage, the marine medaka genetic map was still integrated with the reference assembly, considering other fish species had a similar or slightly lower level of scaffolds mapping onto the genetic map (Tine ; Li ; Wang ). Previously, we have developed a reference genome for the marine medaka O. melastigma in several steps (Kim ). De novo assembly was performed using the combination of Platanus and Haplomerger2 assemblers (Kajitani ; Huang ). The contiguity of de novo genome assembly was further increased by RACA (Kim et al. 2013), which assisted in the construction of highly ordered and oriented scaffolds of de novo assembly and reassembled scaffolds into longer chromosomal fragments using the comparative genome information of closely related species. RACA assembly of O. melastigma generated 40 gigantic scaffolds (Suppl. Table 1), which account for 674 Mb (86.5%) of the total genome sequence (Kim ). Taken together, the genetic map-integrated assembly improved the reference sequences by mapping 90.7% of genome sequences onto 24 LGs with 94% determined orientation of mapped scaffolds (Table 2). The BUSCO and N50 demonstrated a better quality of integrated final genome assembly with values increased to 96.8% and 29,987,720 bp, respectively (Tables 3 and 4). Quantitative trait locus (QTL) analysis identified that Om10 was strongly associated with sex in O. melastigma and two markers, 17040 and 10718, showed the most significant association for sex, without any recombinant between genotype and phenotype. LOD scores of 17040 (LOD = 34.99) is little higher than that of 10718 (LOD = 34.98), which is likely associated with the missing genotype in one individual for 10718 (Suppl. Table S5). Thus, sex determining locus should be located between 17018 and 17040. Although no obvious candidate gene for sex was found in the mapped sex-determining regions, we noticed sox3 outside the mapped region (Figure 3C). It was intriguing because a cis-regulatory element in the downstream region of sox3 on Y chromosome plays a key signal to sex determination in O. dancena (Takehana ), suggesting that the same cis-regulatory element might be present in the mapped regions of O. melastigma. For further analysis, fine mapping with additional recombinants and the functional analysis of candidate genes or regulatory elements need to be performed in the mapped sex-determining region. In summary, a high-density genetic map is a very useful resource to determine the accuracy of de novo genome assembly, especially with massively parallel short sequencing reads. In addition, the integration of a genetic map with reference genomes will be a useful resource in various genomic studies including comparative genomic analyses, fine mapping of QTL, positional cloning of candidate genes, and CRISPR/Cas9 studies.
  48 in total

1.  PFOS induced precocious hatching of Oryzias melastigma--from molecular level to individual level.

Authors:  Xinlong Wu; Qiansheng Huang; Chao Fang; Ting Ye; Ling Qiu; Sijun Dong
Journal:  Chemosphere       Date:  2012-01-23       Impact factor: 7.086

2.  SNP discovery and genotyping for evolutionary genetics using RAD sequencing.

Authors:  Paul D Etter; Susan Bassham; Paul A Hohenlohe; Eric A Johnson; William A Cresko
Journal:  Methods Mol Biol       Date:  2011

3.  Molecular staging of marine medaka: a model organism for marine ecotoxicity study.

Authors:  Xueping Chen; Li Li; Jinping Cheng; Leo Lai Chan; Da-Zhi Wang; Ke-Jian Wang; Michael E Baker; Gary Hardiman; Daniel Schlenk; Shuk Han Cheng
Journal:  Mar Pollut Bull       Date:  2011-06-25       Impact factor: 5.553

4.  DMY is a Y-specific DM-domain gene required for male development in the medaka fish.

Authors:  Masaru Matsuda; Yoshitaka Nagahama; Ai Shinomiya; Tadashi Sato; Chika Matsuda; Tohru Kobayashi; Craig E Morrey; Naoki Shibata; Shuichi Asakawa; Nobuyoshi Shimizu; Hiroshi Hori; Satoshi Hamaguchi; Mitsuru Sakaizumi
Journal:  Nature       Date:  2002-05-12       Impact factor: 49.962

5.  Identification of the sex chromosomes of the medaka, Oryzias latipes, by fluorescence in situ hybridization.

Authors:  M Matsuda; C Matsuda; S Hamaguchi; M Sakaizumi
Journal:  Cytogenet Cell Genet       Date:  1998

6.  Reference-assisted chromosome assembly.

Authors:  Jaebum Kim; Denis M Larkin; Qingle Cai; Yongfen Zhang; Ri-Li Ge; Loretta Auvil; Boris Capitanu; Guojie Zhang; Harris A Lewin; Jian Ma
Journal:  Proc Natl Acad Sci U S A       Date:  2013-01-10       Impact factor: 11.205

7.  Stacks: building and genotyping Loci de novo from short-read sequences.

Authors:  Julian M Catchen; Angel Amores; Paul Hohenlohe; William Cresko; John H Postlethwait
Journal:  G3 (Bethesda)       Date:  2011-08-01       Impact factor: 3.154

8.  HaploMerger2: rebuilding both haploid sub-assemblies from high-heterozygosity diploid genome assembly.

Authors:  Shengfeng Huang; Mingjing Kang; Anlong Xu
Journal:  Bioinformatics       Date:  2017-08-15       Impact factor: 6.937

9.  Genome-wide SNP identification for the construction of a high-resolution genetic map of Japanese flounder (Paralichthys olivaceus): applications to QTL mapping of Vibrio anguillarum disease resistance and comparative genomic analysis.

Authors:  Changwei Shao; Yongchao Niu; Pasi Rastas; Yang Liu; Zhiyuan Xie; Hengde Li; Lei Wang; Yong Jiang; Shuaishuai Tai; Yongsheng Tian; Takashi Sakamoto; Songlin Chen
Journal:  DNA Res       Date:  2015-03-11       Impact factor: 4.458

10.  Development of a promising fish model (Oryzias melastigma) for assessing multiple responses to stresses in the marine environment.

Authors:  Sijun Dong; Mei Kang; Xinlong Wu; Ting Ye
Journal:  Biomed Res Int       Date:  2014-03-03       Impact factor: 3.411

View more
  3 in total

1.  RAD-Seq-Based High-Density Linkage Maps Construction and Quantitative Trait Loci Mapping of Flowering Time Trait in Alfalfa (Medicago sativa L.).

Authors:  Xueqian Jiang; Tianhui Yang; Fan Zhang; Xijiang Yang; Changfu Yang; Fei He; Ruicai Long; Ting Gao; Yiwei Jiang; Qingchuan Yang; Zhen Wang; Junmei Kang
Journal:  Front Plant Sci       Date:  2022-05-26       Impact factor: 6.627

2.  Mapping of Quantitative Trait Loci Controlling Egg-Quality and -Production Traits in Japanese Quail (Coturnix japonica) Using Restriction-Site Associated DNA Sequencing.

Authors:  Mohammad Ibrahim Haqani; Shigeru Nomura; Michiharu Nakano; Tatsuhiko Goto; Atsushi J Nagano; Atsushi Takenouchi; Yoshiaki Nakamura; Akira Ishikawa; Masaoki Tsudzuki
Journal:  Genes (Basel)       Date:  2021-05-13       Impact factor: 4.096

3.  Chromonomer: A Tool Set for Repairing and Enhancing Assembled Genomes Through Integration of Genetic Maps and Conserved Synteny.

Authors:  Julian Catchen; Angel Amores; Susan Bassham
Journal:  G3 (Bethesda)       Date:  2020-11-05       Impact factor: 3.154

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.