| Literature DB >> 34991463 |
Jiaojun Yu1,2, Jun Fu1,2, Yuanping Fang1,2, Jun Xiang1,2, Hongjin Dong3,4.
Abstract
BACKGROUND: Rubus is the largest genus of the family Rosaceae and is valued as medicinal, edible, and ornamental plants. Here, we sequenced and assembled eight chloroplast (cp) genomes of Rubus from the Dabie Mountains in Central China. Fifty-one Rubus species were comparatively analyzed for the cp genomes including the eight newly discovered genomes and forty-three previously reported in GenBank database (NCBI).Entities:
Keywords: Chloroplast genome; Compare analysis; Phylogeny; Rubus
Mesh:
Year: 2022 PMID: 34991463 PMCID: PMC8740016 DOI: 10.1186/s12864-021-08225-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Accession numbers and features of the eight Rubus plastomes in the present study
| Species | Accession no. | Clean reads | Reads used in assembly | Mean coverage of base (X) | Complete | LSC | SSC | IR | CDS | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Length (bp) | GC (%) | Length (bp) | GC (%) | Length (bp) | GC (%) | Length (bp) | GC (%) | Length (bp) | GC (%) | |||||
| MT478113 | 8,969,158,800 | 8,367,934 | 600 | 156,217 | 37.2 | 85,805 | 35.1 | 18,830 | 31.2 | 25,791 | 42.8 | 78,876 | 38 | |
| MT478114 | 8,545,216,200 | 8,077,463 | 620 | 155,785 | 37.3 | 85,028 | 35.2 | 18,769 | 31.3 | 25,994 | 42.8 | 77,404 | 38 | |
| MT478115 | 9,832,178,700 | 7,522,556 | 1229 | 155,546 | 37.0 | 85,305 | 34.9 | 18,719 | 30.9 | 25,761 | 42.8 | 78,838 | 37.8 | |
| OK127886 | 3,497,807,100 | 20,149,705 | 914 | 156,321 | 37.2 | 85,883 | 35.1 | 18,874 | 31.2 | 25,782 | 42.8 | 78,870 | 38 | |
| OK127882 | 3,625,125,600 | 13,222,835 | 823 | 156,020 | 37.3 | 85,784 | 34.8 | 18,710 | 31 | 25,763 | 42.9 | 79,212 | 37.8 | |
| OK127884 | 3,611,559,900 | 11,723,986 | 915 | 155,906 | 37.3 | 85,125 | 35.2 | 18,749 | 31.4 | 26,016 | 42.8 | 78,960 | 38 | |
| OK127885 | 3,521,878,500 | 8,434,742 | 865 | 156,217 | 37.2 | 85,806 | 35.1 | 18,831 | 31.2 | 25,790 | 42.8 | 78,369 | 38 | |
| OK127883 | 3,408,285,600 | 12,384,841 | 831 | 155,874 | 37.0 | 85,094 | 35.2 | 18,795 | 31.3 | 25,993 | 42.8 | 78,966 | 38 | |
Note: GC guanine-cytosine, LSC large single copy, SSC small single copy, IR inverted repeat, CDS coding sequence
Fig. 1Gene maps of the complete cp genome of eight species of Rubus. Genes on the inside of the circle are transcribed clockwise, while that outside are transcribed counter clockwise. Genes were colored according to their functional groups. The darker gray in the inner circle corresponds to GC content, whereas the lighter gray corresponds to AT content
Gene contents in the cp genomes of Rubus species
| No. | Group of Genes | Genes Names | Amount |
|---|---|---|---|
| 1 | Photosystems I | 5 | |
| 2 | Photosystems II | 16 | |
| 3 | Cytochrome b/f complex | 6 | |
| 4 | ATP synthase | 6 | |
| 5 | NADH dehydrogenase | 12 | |
| 6 | Rubisco large subunit | 1 | |
| 7 | RNA polymerase | 4 | |
| 8 | Ribosomal proteins (SSU) | 15 | |
| 9 | Ribosomal proteins (LSU) | 11 | |
| 10 | Assembly/stability of photosystem I | 1 | |
| 11 | Transfer RNAs | 37 | |
| 12 | Ribosomal RNAs | 8 | |
| 13 | RNA processing | 1 | |
| 14 | Carbon metabolism | 1 | |
| 15 | Cytochrome c synthesis | 1 | |
| 16 | Proteins of unknown function | 6 | |
| 17 | Other genes | 3 |
Note: * Gene contains one intron; ** gene contains two introns; (×2) indicates the number of the repeat unit is 2; (×3) indicates the number of the repeat unit is 3. infA were only annotated in Rubus tephrodes, R. coreanus, R. trianthus, R. hirsutus
Fig. 2The comparison of four regions (LSC, SSC and two IRs) among twelve cp genome. Numbers above or near the colored genes indicated distances between the gene and the edge of borders. The figure is not in scale for length
Fig. 3Mauve alignment of eight Rubus cp genome revealing no interspecific rearrangements
Fig. 4Visualized alignment of the Rubus cp genome sequences with annotated R. tephorodes as reference, using mVISTA. The x-axis represents the base sequence of the alignment and the y-axis represents the pairwise percent identity within 50–100%. Grey arrows represent positions and directions of the genes
Fig. 5Sliding window analysis of the entire cp genome of eleven Rubus species (window length: 600 bp; step size: 200 bp). X-axis: position of the midpoint of a window; Y-axis: nucleotide diversity of each window
Fig. 6Ka/Ks values of protein-coding genes of the seven comparative combinations. Ka, nonsynonymous; Ks, synonymous
Fig. 7Number and type of simple sequence repeats in the 46 Rubus cp genome. A, Total number of SSRs detected in each species. B, Frequencies of identified SSRs in the large single-copy (LSC), small single-copy (SSC), and inverted repeat (IR) regions. C, Numbers and types of SSRs detected in each species
Fig. 8Number and type of long repeat sequences in the 46 Rubus cp genome. A, Numbers and types of longer repeats in each species. B, Frequency of each repeat type. C, Presence of longer repeats in coding regions and non-coding regions
Fig. 9The maximum likelihood (ML) and Bayesian inference (BI) phylogenetic tree based on whole chloroplast genome data. The support values (bootstrap value [BS]/posterior probability [PP]) are indicated at the branches. BS and PP of 100% are indicated by an asterisk
Sampled species and their voucher specimens used in this study
| Species | Voucher Specimen | Coordinate | Location |
|---|---|---|---|
HGTC HGNU-0024 | E 115°47′55.77″, N 31°05′32.68″ | China, Hubei, Yingshan | |
HGTC HGNU-0194 | E 114°47′05.06″, N 29°58′58.17″ | China, Hubei, Daye | |
HGTC HGNU-0301 | E 115°51′09.30″, N 30°14′10.36″ | China, Hubei, Huangmei | |
HGTC HJD1379 | E 115°48′59.05″, N 31°6′54.67″ | China, Hubei, Yingshan | |
HGTC HJD1113 | E 114°36′51.64″, N 31°34′16.25” | China, Hubei, Hong’an | |
HGTC HJD1006 | E 115°4′52.01″, N 30°29′46.26” | China, Hubei, Xishui | |
HGTC HJD1069 | E 116°1′7.42″, N 30°57′51.44” | China, Hubei, Yingshan | |
HGTC HJD1054 | E 116°2′52.85″, N 30°58′20.62” | China, Hubei, Yingshan |