| Literature DB >> 31289832 |
Chang-Ming Bai1, Lu-Sheng Xin1, Umberto Rosani2,3, Biao Wu1, Qing-Chen Wang1, Xiao-Ke Duan4, Zhi-Hong Liu1, Chong-Ming Wang1.
Abstract
BACKGROUND: The blood clam, Scapharca (Anadara) broughtonii, is an economically and ecologically important marine bivalve of the family Arcidae. Efforts to study their population genetics, breeding, cultivation, and stock enrichment have been somewhat hindered by the lack of a reference genome. Herein, we report the complete genome sequence of S. broughtonii, a first reference genome of the family Arcidae.Entities:
Keywords: Hi-C; PacBio; ark shell; chromosomal assembly; genomic
Mesh:
Year: 2019 PMID: 31289832 PMCID: PMC6615981 DOI: 10.1093/gigascience/giz067
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Example of a Scapharca (Anadara) broughtonii, the blood clam.
Summary of sequencing data generated for blood clam genome assembly and annotation
| Library type | Platform | Library size (bp) | Data size (Gb) | Application |
|---|---|---|---|---|
| Short reads | HiSeq X Ten | 350 | 53.06 | Genome survey, correction, and evaluation |
| Long reads | PacBio SEQUEL | 20,000 | 63.33 | Genome assembly |
| PacBio RS II | 20,000 | 3.99 | ||
| Nanopore Minion | 20,000 | 8.47 | ||
| Hi-C | HiSeq X Ten | 350 | 52.16 | Chromosome construction |
Figure 2:Hi-C interaction heat map for Scapharca (Anadara) broughtonii.
Statistics of the final genome assembly of Scapharca (Anadara) broughtonii
| Types | Number | Length (bp) | N50 (bp) | N90 (bp) | Maximum (bp) | Guanine-cytosine content (%) | Gap (bp) |
|---|---|---|---|---|---|---|---|
| Scaffold | 1,026 | 884,566,040 | 44,995,656 | 25,444,477 | 55,667,740 | 33.70 | 65,100 |
| Contig | 1,677 | 884,500,940 | 1,797,717 | 305,905 | 7,852,409 | 33.70 | 0 |
Statistics of gene annotation to different databases
| Annotation database | Annotated number | Percentage (%) |
|---|---|---|
| GO_Annotation | 5,766 | 23.98 |
| KEGG_Annotation | 9,174 | 38.15 |
| KOG_Annotation | 13,626 | 56.67 |
| Pfam_Annotation | 17,321 | 72.04 |
| Swissprot_Annotation | 12,866 | 53.51 |
| TrEMBL_Annotation | 21,887 | 91.03 |
| nr_Annotation | 21,897 | 91.07 |
| nt_Annotation | 12,786 | 53.18 |
| All_Annotated | 22,267 | 92.61 |
Figure 3:Gene ontology (GO) annotation of the predicted genes. The horizontal axis indicates classes of the second-level GO annotation. The vertical axis indicates the number and percentage of genes in each class.
Figure 4:Eukaryotic Orthologous Groups (KOG) classification of the predicted genes. Results are summarized in 24 function classes according to their functions. The horizontal axis represents each class, and the vertical axis represents the frequency of the classes.