Hang Yu1, Kun Guo2, Kunlong Lai1, Muhammad Ali Shah1, Zijian Xu1, Na Cui3, Haifeng Wang4.
Abstract
Lonicera japonica (honeysuckle) is one of the most important medicinal plants and is widely used in traditional Chinese medicine. Many honeysuckle varieties are currently cultivated, among which the Sijihua variety is widely grown because of its wide adaptability, stress resistance, early flowering and high yield. In this study, we assembled the genome of Sijihua, which was approximately 886.04 Mb in size with a scaffold N50 of 79.5 Mb. Using PacBio long reads and Hi-C sequencing data, 93.28% of the assembled sequence was anchored to 9 pseudo-chromosomes. We predicted 39,320 protein-coding genes, 92.87% of which could be annotated in the NR, GO, KOG, KEGG and other databases. In addition, we identified 644 tRNAs, 2,156 rRNAs, 109 miRNAs and 5,502 pseudogenes in the genome. The chromosome-scale genome of Sijihua will be a significant resource for understanding the genetic basis of high stress resistance, and will facilitate further study of genetic diversity and accelerate the genetic improvement and breeding of L. japonica.
Year: 2022 PMID: 35610245 PMCID: PMC9130202 DOI: 10.1038/s41597-022-01385-4
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Genome assembly and assessment of Sijihua and Lj10107428 genomes.
| Assembly | Sijihua | Lj10107428 |
|---|---|---|
| PacBio sequencing depth (X) | 98.88 | 90 (ONT) |
| Illumina sequencing depth (X) | 61.48 | 56.91 |
| Hi-C sequencing depth (X) | 103.65 | 94.86 |
| Estimated genome size (Mb) | 817.45 | 887.15 |
| Estimated heterozygosity (%) | 0.74 | 1.27 |
| Number of scaffolds | 967 | 145 |
| Total length of scaffolds (bp) | 886,131,823 | 903,813,648 |
| Scaffolds N50 (bp) | 79,566,881 | 84,431,753 |
| Longest scaffold (bp) | 116,908,140 | 125,163,164 |
| Number of contigs | 1,519 | 919 |
| Total length of contigs (bp) | 886,040,423 | 903,735,777 |
| Contigs N50 (bp) | 1,578,755 | 2,148,893 |
| Longest contig (bp) | 12,449,837 | 19,544,413 |
| GC content (%) | 34.32 | 43.5 |
| Mapping with Illumina reads (%) | 99.75 | NA |
| CEGMA assessment (%) | 95.85 | NA |
| Completeness BUSCOs (%) | 97.03 | 97 |
| Complete single-copy BUSCOs (%) | 91.33 | 92.6 |
| Complete duplicated BUSCOs (%) | 5.70 | 4.4 |
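The scaffold and contig N50 values in the table above follow the usual definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The sketch below illustrates that definition only; the function name `n50` and the toy lengths are assumptions for illustration and are not taken from the study's pipeline or data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp)."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence downwards until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical toy assembly, not the Sijihua scaffolds.
toy_scaffolds = [100, 80, 60, 40, 20]
toy_n50 = n50(toy_scaffolds)
```

With real data, the lengths would be read from the assembly FASTA or its index rather than typed in by hand.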
Genome annotation of Sijihua and Lj10107428 genomes.
| Annotation | Sijihua | Lj10107428 |
|---|---|---|
| Number of predicted protein-coding genes | 39,320 | 33,961 |
| Average gene length (bp) | 4,640 | 3,527 |
| Average exon length (bp) | 1,480 | 1,118 |
| Average exon number per gene | 4.87 | 4.63 |
| Average intron length (bp) | 3,160 | 2,407 |
| miRNAs | 109 | 33 |
| rRNAs | 2,156 | 138 |
| tRNAs | 644 | 104 |
| Percentage of repeat sequence (%) | 64.76 | 58.21 |
| Copia (%) | 15.93 | 8.98 |
| Gypsy (%) | 19.14 | 13.77 |
| LINE (%) | 2.61 | 2.33 |
| SINE (%) | 0.34 | 0.17 |
| DNA transposons (%) | 5.82 | 7.67 |
| Pseudogenes | 5,502 | 18 |
| Percentage of Functional annotation genes | 92.87 | NA |
SSRs annotation of Sijihua and Lj10107428 genomes.
| SSR type/species | Sijihua | Lj10107428 | polySSRs |
|---|---|---|---|
| Di-nucleotide | 192,362 | 144,713 | 34,140 |
| Tri-nucleotide | 54,009 | 31,248 | 5,135 |
| Tetra-nucleotide | 6,395 | 4,580 | 654 |
| Penta-nucleotide | 1,526 | 1,100 | 181 |
| Hexa-nucleotide | 972 | 566 | 142 |
| Total | 255,264 | 182,207 | 40,252 |
Fig. 1 The five growth stages of honeysuckle. (a) The juvenile bud stage. (b) The third green stage. (c) The complete white stage. (d) The silver flowering stage. (e) The gold flowering stage.
Fig. 2 19-kmer distribution in the honeysuckle genome.
Fig. 3 Comparative genomic analysis between the Sijihua and Lj10107428 varieties of honeysuckle. (a) Landscape of genomic features in the Sijihua genome. Densities of genes, TEs, SNPs, indels, PAVs, inversions and translocations were calculated in a 500 Kb sliding window. (b) Gene collinearity between the Sijihua and Lj10107428 varieties. NBS-LRR genes are marked as yellow dots across the genome. (c) Venn diagram of the genes shared between the Sijihua and Lj10107428 genomes. (d) Comparison of expression levels of shared genes in the Sijihua and Lj10107428 varieties.
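The per-window densities in panel (a) can be pictured with a small counting routine. This is a minimal sketch under stated assumptions: the caption gives the window size (500 Kb) but not the step, so the windows here are non-overlapping; positions are treated as 0-based; and the function name `window_counts` and the toy coordinates are hypothetical, not the authors' code or data.

```python
def window_counts(positions, chrom_length, window=500_000):
    """Count features per fixed, non-overlapping window on one chromosome.

    positions: 0-based feature start coordinates (assumption).
    chrom_length: chromosome length in bp.
    window: window size in bp (500 Kb, as in the figure caption).
    """
    if window <= 0 or chrom_length <= 0:
        raise ValueError("window and chrom_length must be positive")
    n_windows = -(-chrom_length // window)  # ceiling division
    counts = [0] * n_windows
    for pos in positions:
        if not 0 <= pos < chrom_length:
            raise ValueError(f"position {pos} lies outside the chromosome")
        counts[pos // window] += 1
    return counts


# Hypothetical gene start positions on a 1.3 Mb toy chromosome.
toy_counts = window_counts([10, 499_999, 500_000, 1_200_000], 1_300_000)
```

A true sliding window with a step smaller than the window would count some features in more than one window; the non-overlapping form is used here only because it is the simplest case.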
Fig. 4 Hi-C contact map of the chromosome-scale assembly of Sijihua. The Hi-C interaction matrix shows the pairwise correlations among the 9 pseudomolecules. The intensity of the dark color is scaled to the strength of the correlation.
| Measurement(s) | Lonicera japonica • RNA sequencing • genome assembly • sequence annotation |
| Technology Type(s) | SMRT Sequencing • RNA sequencing • Hi-C • biomolecular annotation design |
| Factor Type(s) | Genotype |
| Sample Characteristic - Organism | Lonicera japonica |
| Sample Characteristic - Environment | occurrence |
| Sample Characteristic - Location | Shandong Province |