| Literature DB >> 36147512 |
Rui Sun1,2,3, Shuangtao Li1,2,3, Linlin Chang1,2,3, Jing Dong1,2,3, Chuanfei Zhong1,2,3, Hongli Zhang1,2,3, Lingzhi Wei1,2,3, Yongshun Gao1,2,3, Guixia Wang1,2,3, Yuntao Zhang1,2,3, Jian Sun1,2,3.
Abstract
Fragaria pentaphylla, a wild diploid quinquefoliolate species of Fragaria, is native to Southwest China. It has two morphs of red and white fruit color in nature and has characteristics of unique fragrance and resistance, which made it not only a valuable breeding material but also a potential model plant for molecular function researches. Here, we generate a high-quality chromosome-level genome assembly of a F. pentaphylla accession, BAAFS-FP039 employing a combination of PacBio Long-Read Sequencing, Illumina Short-Read Sequencing, and Hi-C Sequencing. The assembled genome contained 256.74 Mb and a contig N50 length of 32.38 Mb, accounting for 99.9% of the estimated genome (256.77 Mb). Based on Hi-C data, seven pseudo-chromosomes of F. pentaphylla-FP039 genome were assembled, covering 99.39% of the genome assembly. The genome was composed of 44.61% repetitive sequences and 29,623 protein-coding genes, 97.62% of protein-coding genes could be functionally annotated. Phylogenetic and chromosome syntenic analysis revealed that F. pentaphylla-FP039 was closely related to F. nubicola. This high-quality genome could provides fundamental molecular resources for evolutionary studies, breeding efforts, and exploring the unique biological characteristics of F. pentaphylla.Entities:
Keywords: Fragaria pentaphylla; Hi-C; PacBio; chromosomal assembly; gene annotation
Year: 2022 PMID: 36147512 PMCID: PMC9485601 DOI: 10.3389/fgene.2022.873711
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Summary statistics of the sequencing and assembly of the Fragaria pentaphylla-FP039 genome.
| Library type | Sequencing mode | Clean data (Gb) | Application |
|---|---|---|---|
| Illumina | Pair end 150 bp | 15.15 | Genome survey and correction |
| PacBio | Sequel II HiFi | 33.05 | Genome assembly |
| Hi-C | Pair end 150 bp | 33.47 | Assisted assembly at the chromosomal level |
| Genome assembly and scaffolding at chromosomal level | |||
| Contig number | 44 | ||
| Contig length (bp) | 256,736,466 | ||
| Contig N50 (bp) | 32,376,794 | ||
| Contig N90 (bp) | 11,400,560 | ||
| Scaffold number | 41 | ||
| Scaffold length (bp) | 256,736,766 | ||
| Scaffold N50 (bp) | 34,602,923 | ||
| Scaffold N90 (bp) | 27,601,762 | ||
| Anchored chromosomes size (bp) | 255,168,039 | ||
FIGURE 1Genome assembly of Fragaria pentaphylla-FP039. (A,B): Pictures of Fragaria pentaphylla-FP039 plant grown in the laboratory. (C) Pictures of Fragaria pentaphylla-FP039 plant grown in the wild where it was collected. (D) Circos plot of chromosome-level scaffolds showing the genomic features. Tracks from outside to the inner correspond to a, chromosome ideograms; b, TE (transposable elements) density; c, gene density; d, GC content. (E) Hi-C interaction heatmap showing interactions among seven chromosomes (the color bar on the right showing the interaction intensity).
FIGURE 2Comparative genomics analysis of Fragaria pentaphylla-FP039 genome and other representative plant genomes. (A): Distribution of genes with different copy number in analyzed genomes. (B): Venn diagram of the shared and unique orthogroups calculated among the fourteen genomes. The number of shared orthogroups was showed in the center. And numbers of unique orthogroups of corresponding species were indicated outside. (C): Phylogenetic tree of 14 plant genomes with numbers of expanded (+)/contracted (−) orthogroups at each branch. The ratio of expansion and contraction orthogroups are indicated in the pie diagram with green and purple color, respectively. (D–M): Synteny analysis between F. pentaphylla-FP039 genome and the other ten genomes of diploid Fragaria species.