| Literature DB >> 32241258 |
Vasanthan Jayakumar1, Hiromi Ishii1, Misato Seki1, Wakako Kumita2, Takashi Inoue2, Sumitaka Hase1, Kengo Sato1, Hideyuki Okano3,4, Erika Sasaki2, Yasubumi Sakakibara5.
Abstract
BACKGROUND: The common marmoset (Callithrix jacchus) is one of the most studied primate model organisms. However, the marmoset genomes available in the public databases are highly fragmented and filled with sequence gaps, hindering research advances related to marmoset genomics and transcriptomics.Entities:
Keywords: Callithrix jacchus; Chromosome-scale scaffolds; Common marmoset; De novo assembly; Non-human primate genomics
Year: 2020 PMID: 32241258 PMCID: PMC7114785 DOI: 10.1186/s12864-020-6657-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1The de novo assembly workflow
The common marmoset genome assembly statistics
| Contigs | Scaffolds | |
|---|---|---|
| # sequences | 1788 | 65 |
| Total assembled sequence | 2.79 Gb | 2.79 Gb |
| Longest sequence length | 46.03 Mb | 213.27 Mb |
| N50 | 6.38 Mb | 143.89 Mb |
| N75 | 2.58 Mb | 115.91 Mb |
| L50 | 117 | 8 |
| L75 | 289 | 14 |
A survey of contig N50s obtained in recent studies, involving improvement of genomes, primarily using long read sequence technologies
| Species | Common name | Assembled genome size | Contig N50 | Additional technologies* |
|---|---|---|---|---|
| Mosquito | 1.28 Gb | 11.76 Mb [ | Yes | |
| Brassica | 0.35 Gb | 1.44 Mb [ | Yes | |
| Water buffalo | 2.6 Gb | 22.4 Mb [ | Yes | |
| Anna’s hummingbird | 1.01 Gb | 5.36 Mb [ | No | |
| Ant | 0.28 Gb | 1.22 Mb [ | Yes | |
| Domestic goat | 2.92 Gb | 18.7 Mb [ | Yes | |
| Rock pigeon | 1.10 Gb | 0.02 Mb [ | Yes | |
| Woodland strawberry | 0.22 Gb | 7.9 Mb [ | Yes | |
| Chicken | 1.21 Gb | 2.9 Mb [ | Yes | |
| Gorilla | 3.08 Gb | 10.02 Mb [ | Yes | |
| Ant | 0.34 Gb | 0.88 Mb [ | Yes | |
| Tibetan hulless barley | 4.00 Gb | 1.56 Mb [ | Yes | |
| Chimpanzee | 2.99 Gb | 12.42 Mb [ | Yes | |
| Orangutan | 3.04 Gb | 11.07 Mb [ | Yes | |
| Black raspberry | 0.29 Gb | 5.1 Mb [ | Yes | |
| Monk fruit | 0.46 Gb | 0.43 Mb [ | No | |
| Corkwing wrasse | 0.61 Gb | 0.46 Mb [ | No | |
| Zebra finch | 1.14 Gb | 5.80 Mb [ | No | |
| Maize | 2.10 Gb | 1.18 Mb [ | Yes |
*Additional technologies include Hi-C, CHICAGO, BioNano optical maps, and others
Fig. 2Contiguity of all the marmoset genome assemblies (CJ2010, CJ2015, CJ2017, and CJ2019). a) N(X) plot of scaffolds, b) N(X) plot of contigs
Statistics of the gaps in the published marmoset genome assemblies
| # Gaps | Improvements in CJ2019 | ||||
|---|---|---|---|---|---|
| Chromosome | CJ2010 | CJ2015 | CJ2019 | Vs CJ2010 | Vs CJ2015 |
| 1 | 13,245 | 7859 | 177 | 98.66 | 97.75 |
| 2 | 11,874 | 6677 | 84 | 99.29 | 98.74 |
| 3 | 9897 | 5404 | 48 | 99.52 | 99.11 |
| 4 | 9532 | 5271 | 83 | 99.13 | 98.43 |
| 5 | 11,928 | 7264 | 166 | 98.61 | 97.71 |
| 6 | 8890 | 4993 | 116 | 98.70 | 97.68 |
| 7 | 10,453 | 6137 | 64 | 99.39 | 98.96 |
| 8 | 7048 | 3956 | 26 | 99.63 | 99.34 |
| 9 | 8252 | 4781 | 66 | 99.20 | 98.62 |
| 10 | 8328 | 4673 | 57 | 99.32 | 98.78 |
| 11 | 8419 | 5131 | 113 | 98.66 | 97.80 |
| 12 | 8444 | 5023 | 72 | 99.15 | 98.57 |
| 13 | 6469 | 3549 | 45 | 99.30 | 98.73 |
| 14 | 6444 | 3702 | 75 | 98.84 | 97.97 |
| 15 | 5580 | 3193 | 36 | 99.35 | 98.87 |
| 16 | 5409 | 3114 | 24 | 99.56 | 99.23 |
| 17 | 3879 | 2086 | 13 | 99.66 | 99.38 |
| 18 | 3031 | 1762 | 39 | 98.71 | 97.79 |
| 19 | 3128 | 1861 | 31 | 99.01 | 98.33 |
| 20 | 3159 | 1899 | 32 | 98.99 | 98.31 |
| 21 | 2872 | 1713 | 25 | 99.13 | 98.54 |
| 22 | 6638 | 4783 | 67 | 98.99 | 98.60 |
| X | 10,542 | 7314 | 300 | 97.15 | 95.90 |
| Y | 269 | 184 | 12 | 95.54 | 93.48 |
Fig. 3Improved mapping rates of all the marmoset genome assemblies (CJ2010, CJ2015, CJ2017, and CJ2019). a) Box plot of the mapping rates of RNAseq data from different brain samples, b) Bar plot of the mapping rates of the common marmoset’s BAC-end data
Fig. 4Dot plot of the alignment between CJ2017 and CJ2019. a) Whole genome view, b) Focused view of chromosome 16. The X-axis represents CJ2017 scaffolds and Y-axis represent CJ2019 scaffolds. A diagonal straight line indicates synteny among the genomes