| Literature DB >> 32010195 |
Boping Tang1, Zhongkai Wang2, Qiuning Liu1, Huabin Zhang1, Senhao Jiang1, Xinzheng Li3, Zhengfei Wang1, Yue Sun1, Zhongli Sha3, Hui Jiang4,5, Xugan Wu6, Yandong Ren2, Haorong Li2, Fujun Xuan1, Baoming Ge1, Wei Jiang3, Shusheng She7, Hongying Sun8, Qiang Qiu2, Wen Wang2, Qun Wang9, Gaofeng Qiu6, Daizhen Zhang1, Yongxin Li2.
Abstract
As an important freshwater aquaculture species in China, the Chinese mitten crab (Eriocheir japonica sinensis) has high economic and nutritional value. However, limited genomic information is currently available for studying its basic development and genetic diversity. Here, we performed whole-genome sequencing on Oxford Nanopore Technologies Limited's platform using promethION. The assembled size of E. j.sinensis genome was approximately 1.27 Gb, which is close to the estimated size (1.19 Gb). Furthermore, based on assessment using Benchmarking Universal Single-Copy Orthologs (BUSCO) (Simao et al., 2015), 94.00% of the expected eukaryotic genes were completely present in the genome assembly. In addition, repetitive sequences accounted for ~61.42% of the assembled genome, and 22,619 protein-coding genes were annotated. Comparative genomics analysis demonstrated that the Chinese mitten crab diverged from Penaeus vannamei ~373.6 million years ago, with a faster evolution rate than shrimp. We anticipate that the annotated high-quality genome of E. j. sinensis will promote research on its basic development and evolution and make substantial contributions to comparative genomic analyses of crustaceans.Entities:
Keywords: Eriocheir japonica sinensis; crab; evolution; genome assembly and annotation; nanopore
Year: 2020 PMID: 32010195 PMCID: PMC6979310 DOI: 10.3389/fgene.2019.01340
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Chinese mitten crab, Eriocheir japonica sinensis.
Statistics on assembled Chinese mitten crab genome and previously published Chinese mitten crab genome.
| Term | Our genome | Previously published genome | ||
|---|---|---|---|---|
| Size (bp) | Number | Size (bp) | Number | |
| N90 | 80,260 | 1,368 | 144 | 772,162 |
| N80 | 305,684 | 511 | 405 | 220,601 |
| N70 | 864,694 | 252 | 1,123 | 43,328 |
| N60 | 1,666,934 | 149 | 42,141 | 3,605 |
| N50 | 3,185,988 | 94 | 111,755 | 2,066 |
| Max length | 16,811,200 | – | 2,002,076 | – |
| Total length | 1,270,960,592 | – | 1,118,179,523 | – |
| Number ≥ 100 bp | – | 6,657 | – | 1,768,649 |
| Number ≥ 10 kb | – | 5,141 | – | 5,093 |
Statistics on genome quality evaluation by BUSCO software.
| Library | Eukaryota | Metazoa |
|---|---|---|
| Complete BUSCO (C) | 285 | 909 |
| Complete and single-copy BUSCO (S) | 281 | 905 |
| Complete and duplicated BUSCO (D) | 4 | 4 |
| Fragmented BUSCO (F) | 5 | 25 |
| Missing BUSCO (M) | 13 | 44 |
| Total BUSCO groups searched | 303 | 978 |
| Summary | 94.00% | 92.90% |
Statistics on Illumina short-read mapping ratio for assembled Chinese mitten crab genome.
| Term | Length (bp)/Percentage |
|---|---|
| No. of total reads | 1,177,643,075 |
| No. of mapped reads | 1,164,904,878 |
| Mapped ratio (%) | 98.82 |
| No. of PE-mapped reads | 836,906,878 |
| PE-mapped ratio (%) | 74.88 |
| No. of SE-mapped reads | 6,091,591 |
| SE-mapped ratio (%) | 0.55 |
PE-mapped represents reads mapped to genome as read pairs. SE-mapped represents reads mapped to genome as single reads.
Figure 2Guanine–cytosine (GC) content of Chinese mitten crab and other genomes. The GC content values were calculated using a sliding window method and the window size is 2 Kb. The whole-genome sequences including the coding region and non-coding region were used for analysis.
Statistics on repetitive sequence annotation in Chinese mitten crab genome.
| Type | Repeat size | % of genome |
|---|---|---|
| Tandem Repeat Finder | 303,992,509 | 23.918327 |
| RepeatMasker | 167,854,132 | 13.206871 |
| ProteinMask | 111,728,104 | 8.790839 |
|
| 620,811,245 | 48.84583 |
| Total | 780,678,689 | 61.424303 |
Statistics on transposable elements (TEs) in Chinese mitten crab genome.
| Type |
|
|
|
| ||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| DNA | 92,713,892 | 7.29 | 3,569,566 | 0.28 | 58,584,466 | 4.61 | 149,193,019 | 11.74 |
| LINE | 12,182,661 | 0.96 | 97,531,450 | 7.67 | 164,787,780 | 12.97 | 201,524,402 | 15.86 |
| SINE | 70,066 | 0.0055 | – | – | 4,609,943 | 0.36 | 4,618,193 | 0.36 |
| LTR | 18,466,448 | 1.45 | 10,647,352 | 0.84 | 27,904,457 | 2.20 | 49,744,359 | 3.91 |
| Other | 78,097,956 | 6.14 | 3,723 | 0.00029 | 198,684,468 | 15.63 | 216,761,658 | 17.05 |
| Unknown | 96,240 | 0.0076 | – | – | 174,272,660 | 13.71 | 174,365,180 | 13.72 |
| Total | 167,854,132 | 13.21 | 111,728,104 | 8.79 | 620,811,245 | 48.85 | 691,775,509 | 54.43 |
Statistics on functional annotation of protein-coding genes in Chinese mitten crab genome.
| Term | Annotation number | Annotation percentage |
|---|---|---|
| GO | 10,360 | 45.80 |
| InterPro | 14,698 | 64.98 |
| KEGG | 12,256 | 54.18 |
| SwissProt | 13,828 | 61.13 |
| TrEMBL | 20,492 | 90.60 |
| Annotated | 20,907 | 92.43 |
| Unannotated | 1,712 | 7.57 |
| Total | 22,619 | – |
Figure 3Quality comparison of protein-coding genes. The distribution of mRNA, CDS, exon, and intron among these species were shown. The species include Eriocheir japonica sinensis, Aedes aegypti, Stegodyphus mimosarum, Drosophila melanogaster, and Penaeus vannamei.
Figure 4Gene family analysis of Chinese mitten crab. (A) Orthologous genes among species. (B) Unique and shared gene families in species.
Figure 5Phylogenetic relationships, divergence time, and evolution rate. (A) Phylogenetic relationship and divergence time of species. Red dot represents fossil record used here. (B) Relative evolution rate of species.