| Literature DB >> 30513700 |
Gregory A Taylor1, Heather Kirk2, Lauren Coombe3, Shaun D Jackman4, Justin Chu5, Kane Tse6, Dean Cheng7, Eric Chuah8, Pawan Pandoh9, Rebecca Carlsen10, Yongjun Zhao11, Andrew J Mungall12, Richard Moore13, Inanc Birol14,15, Maria Franke16, Marco A Marra17,18, Christopher Dutton19, Steven J M Jones20,21,22.
Abstract
The grizzly bear (Ursus arctos ssp. horribilis) represents the largest population of brown bears in North America. Its genome was sequenced using a microfluidic partitioning library construction technique, and these data were supplemented with sequencing from a nanopore-based long read platform. The final assembly was 2.33 Gb with a scaffold N50 of 36.7 Mb, and the genome is of comparable size to that of its close relative the polar bear (2.30 Gb). An analysis using 4104 highly conserved mammalian genes indicated that 96.1% were found to be complete within the assembly. An automated annotation of the genome identified 19,848 protein coding genes. Our study shows that the combination of the two sequencing modalities that we used is sufficient for the construction of highly contiguous reference quality mammalian genomes. The assembled genome sequence and the supporting raw sequence reads are available from the NCBI (National Center for Biotechnology Information) under the bioproject identifier PRJNA493656, and the assembly described in this paper is version QXTK01000000.Entities:
Keywords: Ursus arctos ssp. Horribilis; genome; grizzly bear; microfluidic partitioning; nanopore
Year: 2018 PMID: 30513700 PMCID: PMC6315469 DOI: 10.3390/genes9120598
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Assembly statistics and gene content for the genome sequences reported in this study. Busco: Benchmarking Universal Single-Copy Orthologs.
| Assembly | # of Scaffolds | Gaps within Scaffolds | Scaffold | Longest Scaffold (bp) | BUSCO |
|---|---|---|---|---|---|
| Supernova | 8474 | 21,957 | 33.78 × 106 | 105.9 × 106 | 3943 (96.1%) |
| Tigmint | 8728 | 21,947 | 26.32 × 106 | 92.41 × 106 | 3943 (96.1%) |
| ARCS | 8679 | 21,996 | 27.77 × 106 | 92.41 × 106 | 3943 (96.1%) |
| LINKS1 | 8350 | 22,219 | 27.77 × 106 | 92.41 × 106 | 3943 (96.1%) |
| LINKS8 | 6673 | 23,947 | 36.71 × 106 | 92.42 × 106 | 3943 (96.1%) |
| Sealer | 6673 | 15,572 | 36.71 × 106 | 92.43 × 106 | 3943 (96.1%) |
Assembly statistics of the grizzly bear and its closest sequenced relative, the polar bear.
| Assembly | # of Scaffolds | Scaffold N50 (bp) | Scaffold | # of Contigs | Contig N50 (bp) | Contig L50 | BUSCO |
|---|---|---|---|---|---|---|---|
| Grizzly Bear | 6673 | 36.71 × 106 | 21 | 22,245 | 314 × 103 | 2191 | 3943 (96.1%) |
| Polar Bear | 23,819 | 15.94 × 106 | 46 | 134,162 | 46 × 103 | 14,124 | 3890 (94.7%) |
Figure 1Jupiter plot. A global genome alignment, using BWA-MEM, of the grizzly genome (left side of circle) to the polar bear genome (right side). Connections show the aligned regions of each assembly. The grizzly scaffolds are limited to those over 10 Mb in length (>85% of the assembly). The longest polar bear scaffolds were selected to sum to the same amount of sequence (2 Gb). Only alignments over 10 kb in length are displayed.