| Literature DB >> 32495827 |
Xiao-Jun Zhou1, Meng-Xue Liu1, Xiao-Yu Lu1, Shan-Shan Sun1, Yan-Wei Cheng1, Hui-Yuan Ya1.
Abstract
Rhododendron micranthum is an evergreen shrub species widely distributed in China that has high ornamental and medicinal value. However, there is a lack of molecular and genomic data for this plant, which severely restricts the development of its relevant research. The objective of the present study was to conduct a first genomic survey of R. micranthum and determine its whole-genome sequencing scheme. Next-generation sequencing (Illumina Hi-Seq Xten) was used to measure the genome size of R. micranthum, K-mer analysis were employed to investigate its genomic profile. Finally, we conducted bioinformatics methods to performed SSR (simple sequence repeat) prediction based on the genomic data. The genome size of R. micranthum was estimated to be 554.22 Mb. The heterozygosity ratio was 0.93%, and the sequence repeat ratio was calculated to be 49.17%. The clean reads of R. micranthum were assembled into 2281551 scaffolds with a N50 value of 916 bp. A total of 479724 SSR molecular markers were identified in the R. micranthum genome, and 871656 pairs of primers designed for application. Among of them, 100 primer pairs were validated, and 71 primer pairs were successfully amplified. In summary, the R. micranthum genome is complex with high heterozygosity and low repeated sequences. In future whole-genome research in R. micranthum, higher-depth '2+3' (Illumina+PacBio) sequencing may yield better assembly results.Entities:
Keywords: Rhododendron micranthum; genomic characteristics; microsatellite molecular markers; next-generation sequencing
Mesh:
Substances:
Year: 2020 PMID: 32495827 PMCID: PMC7303352 DOI: 10.1042/BSR20200988
Source DB: PubMed Journal: Biosci Rep ISSN: 0144-8463 Impact factor: 3.840
Figure 1Leaves and inflorescences of R. micranthum in the Longyuwan National Forest Park of China (33°42′28″ N, 111°45′20″ E)
Figure 2Distribution figure of AT and GC content
Figure 3Distribution curve of K-mer (k = 17) of R. micranthum
Genome estimation based on K-mer statistics
| Sample | K-mer number | K-mer depth | Genome size (Mb) | Revised genome size (Mb) | Heterozygous ratio (%) | Repeat (%) |
|---|---|---|---|---|---|---|
| 49015508228 | 86× | 570.17 | 554.22 | 0.93 | 49.17 |
Statistics of assembled genome sequences for R. micranthum
| Total length (bp) | Total number | Total number (≥2 kb) | Max length (bp) | N50 (bp) | N90 (bp) | |
|---|---|---|---|---|---|---|
| Contig | 845872500 | 2307871 | 65029 | 46257 | 794 | 114 |
| Scaffold | 757117087 | 2281551 | 61736 | 46157 | 916 | 124 |
Figure 4Average sequencing depth and GC content of R. micranthum genome data
For the bar graphs, the x-axis is sequencing depth distribution and the y-axis is GC content distribution. For the spot graphs, the x-axis is GC content and the y-axis is sequencing depth.
Characteristics of SSR markers for R. micranthum genome
| Repeat number | SSR type | ||||||
|---|---|---|---|---|---|---|---|
| Mononucleotide | Dinucleotide | Trinucleotide | Tetranucleotide | Pentanucleotide | Hexanucleotide | Total | |
| 5 | – | – | 13681 | 1797 | 808 | 557 | 16843 |
| 6 | – | 40777 | 5189 | 554 | 174 | 167 | 46861 |
| 7 | – | 27838 | 2462 | 197 | 45 | 66 | 30608 |
| 8 | – | 21967 | 1172 | 70 | 11 | 27 | 23247 |
| 9 | – | 17040 | 679 | 32 | 3 | 12 | 17766 |
| 10 | 86706 | 13808 | 445 | 7 | – | 5 | 100971 |
| 11 | 49577 | 11103 | 256 | 10 | 1 | 5 | 60952 |
| 12 | 32902 | 9324 | 158 | 4 | – | 6 | 42394 |
| 13 | 23281 | 8297 | 118 | – | – | 1 | 31697 |
| 14 | 17659 | 7726 | 68 | – | – | – | 25453 |
| 15 | 14670 | 6649 | 37 | – | – | 3 | 21359 |
| ≥16 | 36530 | 24925 | 104 | 4 | – | 9 | 61572 |
| Total | 261325 | 189454 | 24369 | 2675 | 1043 | 858 | 479724 |
Figure 5SSR-PCR amplification profile of R. micranthum
M is the DNA marker pBR322 DNA/MspI. RM-58, RM-59, RM-60, etc. are primer numbers.