| Literature DB >> 32547856 |
Jiawei Wang1, Weizhen Liu2, Dongzi Zhu1, Xiang Zhou3, Po Hong1, Hongjun Zhao1, Yue Tan1, Xin Chen1, Xiaojuan Zong1, Li Xu1, Lisi Zhang1, Hairong Wei1, Qingzhong Liu1.
Abstract
The sweet cherry (Prunus avium) is one of the most economically important fruit species in the world. However, there is a limited amount of genetic information available for this species, which hinders breeding efforts at a molecular level. We were able to describe a high-quality reference genome assembly and annotation of the diploid sweet cherry (2n = 2x = 16) cv. Tieton using linked-read sequencing technology. We generated over 750 million clean reads, representing 112.63 GB of raw sequencing data. The Supernova assembler produced a more highly-ordered and continuous genome sequence than the current P. avium draft genome, with a contig N50 of 63.65 KB and a scaffold N50 of 2.48 MB. The final scaffold assembly was 280.33 MB in length, representing 82.12% of the estimated Tieton genome. Eight chromosome-scale pseudomolecules were constructed, completing a 214 MB sequence of the final scaffold assembly. De novo, homology-based, and RNA-seq methods were used together to predict 30,975 protein-coding loci. 98.39% of core eukaryotic genes and 97.43% of single copy orthologues were identified in the embryo plant, indicating the completeness of the assembly. Linked-read sequencing technology was effective in constructing a high-quality reference genome of the sweet cherry, which will benefit the molecular breeding and cultivar identification in this species.Entities:
Keywords: 10× Genomics chromium; Genome assembly; Genome sequencing; Linked reads; Sweet cherry
Year: 2020 PMID: 32547856 PMCID: PMC7278891 DOI: 10.7717/peerj.9114
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Raw data and valid data statistics of sequencing for linked-read libraries of sweet cherry (Prunus avium) cv. Tieton.
| Parameter | Value | Parameter | Value |
|---|---|---|---|
| Raw bases (Gb) | 121.61 | Clean bases (Gb) | 112.63 |
| Q20 (%) | 97.52 | Clean reads | 750,890,534 |
| Q30 (%) | 94.24 | Clean ratio (%) | 92.62 |
| GC content (%) | 40.8 | Low ratio (%) | 5.51 |
| N ratio (%) | 0.01 | Adapter ratio (%) | 1.86 |
Comparison of sweet cherry (Prunus avium) genome assemblies of cv. Tieton and cv. Satonishiki.
| Assembly parameters | cv. Tieton | cv. Satonishiki |
|---|---|---|
| Assembled genome size (Mb) | 280.33 | 272.36 |
| Scaffold N50 (Mb) | 2.48 | 0.22 |
| Number of scaffold | 14,344 | 10,148 |
| Longest of scaffold (Mb) | 17.96 | 1.46 |
| Contig N50 (kb) | 63.65 | 28.779 |
| Number of contig | 19,420 | 32,301 |
| Longest of contig (kb) | 670.29 | 19.97 |
| Total contig length (Mb) | 237.92 | 246.8 |
| GC content (%) | 37.86 | 37.7 |
| Ns (%) | 15.12 | 9.34 |
Note:
Mb, Megabase; kb, Kilobase; GC, Guanine-cytosine; Ns, Ambiguous bases.
Figure 1Pseudomolecule construction of sweet cherry (Prunus avium) by assigning scaffolds to seven genetic maps.
Chr 1–8 represents constructed pseudomolecules by merging seven genetic maps. AC, VG, CK, CP, KS, JWF, and JWF1 denote the sweet cherry genetic maps reported in Calle et al. (2018), Guajardo et al. (2015), Klagges et al. (2013), Peace et al. (2012), Shirasawa et al. (2017) and Wang et al. (2015), respectively.
Summary of detected repeat elements of sweet cherry (Prunus avium) cv. Tieton genome.
| Repeat type | Number | Total length (bp) | Percent (%) |
|---|---|---|---|
| LTR | 22,244 | 17,899,535 | 6.39 |
| DNA elements | 11,927 | 7,198,678 | 2.57 |
| LINE | 4,700 | 1,900,833 | 0.68 |
| SINE | 1 | 84 | 0 |
| Simple repeat | 6,266 | 4,736,127 | 1.69 |
| Low complexity | 141 | 23,252 | 0.01 |
| Unknown | 228,932 | 59,943,002 | 21.38 |
| Total | 274,211 | 91,701,511 | 32.71 |
Note:
LTR, Long terminal retrotransposon; SINE, Short interspersed nuclear elements; LINE, Long interspersed nuclear elements.
Summary of noncoding-RNAs prediction in sweet cherry (Prunus avium) cv. Tieton genome.
| Non-coding RNA type | Non-coding RNA number | Total length (bp) | Percentage (%) |
|---|---|---|---|
| miRNA | 21,673 | 1,703,848 | 0.61 |
| rRNA | 35 | 51,780 | 0.02 |
| snoRNA | 86,993 | 5,560,365 | 1.98 |
| tRNA | 521 | 39,227 | 0.01 |
| tRNA-pseudogene | 48 | 3,585 | 0 |
| Total | 109,277 | 7,358,805 | 2.63 |
Notes:
miRNA, micro-RNA; rRNA, ribosomal RNA; snoRNA, small nucleolar RNA; tRNA, transfer RNA
.
Statistics for protein-coding gene prediction of sweet cherry (Prunus avium) cv. Tieton genome.
| Prediction method or software | Number of genes | mRNA number | Average RNA length | Exon number | Average exon length | Intron number | Average intron length |
|---|---|---|---|---|---|---|---|
| De novo | 47,866 | 47,866 | 2118.8 | 179,067 | 302.9 | 131,201 | 359.5 |
| RNA-seq | 16,512 | 16,512 | 4032.3 | 91,646 | 228.5 | 75,134 | 344.6 |
| EVM | 30,455 | 30,455 | 2433.3 | 139,225 | 275.8 | 108,770 | 328.3 |
| PASA | 30,439 | 30,975 | 2720.6 | 140,185 | 277 | 109,210 | 329.2 |
Note:
EVM, EVidenceModeler; PASA, Program to Assemble Spliced Alignments.
Statistics of functional annotated genes of sweet cherry (Prunus avium) cv. Tieton genome.
| Functional database | Number of annotated genes | Percentage (%) |
|---|---|---|
| InterPro | 30,300 | 97.8 |
| NR | 30,882 | 99.7 |
| GO | 16,433 | 53.05 |
| Uniprot | 29,444 | 95.05 |
| KEGG | 9,202 | 29.7 |
| Total | 30,973 | 99.99 |
Note:
NR, NCBI Non-redundant protein; GO, Gene ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Figure 2Species tree and gene family expansion analysis of 13 plant species.
The species tree were constructed using STRIDE. Gene family expansions are indicated in red, and gene family contractions are indicated in green.
Figure 3Synteny analysis between sweet cherry (Prunus avium) cv. Tieton genome and cv. Satonishiki genome.
(A) Scaffold level synteny dot plot. (B) Chromosome-scale synteny dot plot. Sequence identity is indicated by colors.
Statistics of orthogroups analysis between sweet cherry (Prunus avium) cv. Tieton and cv. Satonishiki genome annotations.
| Annotation summary | cv. Tieton | cv. Satonishiki | |
|---|---|---|---|
| NCBI version | Original version | ||
| Number of genes | 30,975 | 35,009 | 43,673 |
| Number of genes in orthogroups | 26,730 | 31,314 | 25,388 |
| Number of unassigned genes | 4,245 | 3,695 | 18,285 |
| Percentage of genes in orthogroups | 86.3% | 89.4% | 58.1% |
| Percentage of unassigned genes | 13.7% | 10.6% | 41.9% |
| Number of orthogroups containing species | 21511 | 21258 | 20738 |
| Percentage of orthogroups containing species | 92.4% | 91.3% | 89% |
| Number of species-specific orthogroups | 14 | 1 | 48 |
| Number of genes in species-specific orthogroups | 67 | 2 | 349 |
Notes:
NCBI version is the improved assembly annotation of sweet cherry cv. Satonishiki released by National Center for Biotechnology Information (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Prunus_avium/100/).
Original version is the assembly annotation of sweet cherry cv. Satonishiki genome documented in (Shirasawa et al., 2017).