| Literature DB >> 28369350 |
Jihoon Jo1, Jooseong Oh1, Hyun-Gwan Lee2, Hyun-Hee Hong1, Sung-Gwon Lee1, Seongmin Cheon1, Elizabeth M A Kern3, Soyeong Jin3, Sung-Jin Cho4, Joong-Ki Park3, Chungoo Park1.
Abstract
The Japanese sea cucumber (Apostichopus japonicus Selenka 1867) is an economically important species as a source of seafood and ingredient in traditional medicine. It is mainly found off the coasts of northeast Asia. Recently, substantial exploitation and widespread biotic diseases in A. japonicus have generated increasing conservation concern. However, the genomic knowledge base and resources available for researchers to use in managing this natural resource and to establish genetically based breeding systems for sea cucumber aquaculture are still in a nascent stage. A total of 312 Gb of raw sequences were generated using the Illumina HiSeq 2000 platform and assembled to a final size of 0.66 Gb, which is about 80.5% of the estimated genome size (0.82 Gb). We observed nucleotide-level heterozygosity within the assembled genome to be 0.986%. The resulting draft genome assembly comprising 132 607 scaffolds with an N50 value of 10.5 kb contains a total of 21 771 predicted protein-coding genes. We identified 6.6-14.5 million heterozygous single nucleotide polymorphisms in the assembled genome of the three natural color variants (green, red, and black), resulting in an estimated nucleotide diversity of 0.00146. We report the first draft genome of A. japonicus and provide a general overview of the genetic variation in the three major color variants of A. japonicus. These data will help provide a comprehensive view of the genetic, physiological, and evolutionary relationships among color variants in A. japonicus, and will be invaluable resources for sea cucumber genomic research.Entities:
Keywords: Apostichopus japonicas; Color variants; Genetic variation; Population genomics; Sea cucumber genome
Mesh:
Year: 2017 PMID: 28369350 PMCID: PMC5437941 DOI: 10.1093/gigascience/giw006
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1.Three color variants of A. japonicus (green, red, and black).
Statistics on total reads of the A. japonicus genome
| Variants | Insertion size (bp) | Total readsa (raw data) | Total readsa (w/o adaptor) | Total readsa (error corrected) | % error corrected |
|---|---|---|---|---|---|
| Green | 180 | 498 608 646 | 474 117 288 | 466 062 920 | 1.70 |
| 400 | 897 432 174 | 842 766 704 | 831 964 242 | 1.28 | |
| 2000 (v1) | 293 701 464 | 270 513 434 | 268 573 812 | 0.72 | |
| 2000 (v2) | 538 359 438 | 496 446 984 | 493 387 418 | 0.62 | |
| Total | 2228 101 722 | 2083 844 410 | 2059 988 392 | 1.14 | |
| Red | 400 | 397 799 042 | 394 984 810 | 383 734 440 | 2.85 |
| Black | 400 | 460 597 940 | 423 543 558 | 416 007 614 | 1.78 |
aThe length of each read is 101 bp.
Figure 2.K-mer distribution of the A. japonicus genome.
Statistics on Apostichopus japonicus genome assembly
| Statistics | Values |
|---|---|
| Total assembled bases (bp) | 664 375 288 |
| Average length of scaffolds (bp) | 5010 |
| Number of scaffolds | 132 607 |
| Number of contigs | 197 146 |
| Length of longest scaffold (bp) | 131 537 |
| GC content (%) | 35.92 |
| Scaffold N50 (bp) | 10 488 |
| Contig N50 (bp) | 5525 |
| Number of genes | 21 771 |
| Number of exons per gene | 4.67 |
| Average exon length (bp) | 209 |
| Number of introns per gene | 4.21 |
| Average intron length (bp) | 1048 |
SNP and small indel statistics among three color variants
| Variants | Percent heterozygous SNP loci | Percent small indel loci |
|---|---|---|
| Green | 6 550 122 | 1 662 708 |
| Red | 14 509 713 | 3 681 007 |
| Black | 12 627 560 | 3 198 584 |
Figure 3.Schematic workflow of A. japonicus genome assembly and annotation. The left side represents the genome assembly and the right side represents the transcriptome assembly that was performed in previous publications. To achieve suitable gene prediction, we integrated these two assembly results.