| Literature DB >> 29352182 |
Yu Bi1,2,3, Ming-Fang Zhang1,2, Jing Xue1,2, Ran Dong3, Yun-Peng Du4,5, Xiu-Hai Zhang6,7.
Abstract
The genus Fritillaria comprises approximately 130 perennial herbaceous species. In the Pharmacopoeia of the People's Republic of China, the bulbs of 11 Fritillaria species are used in Chinese herbal medicines. However, the traditional methods of morphological classification cannot accurately identify closely related species of Fritillaria. Previous studies have attempted to identify these species with universal molecular markers, but insufficient phylogenetic signal was available. In this study, the complete chloroplast genomes of eight Fritillaria species were compared. The length of the eight Fritillaria chloroplast genomes ranges from 151,009 bp to 152,224 bp. A total of 136 SSR loci were identified, including 124 polymorphic SSR loci. For large repeat sequences, 108 repeat loci and four types of repeats were observed. Ten highly variable regions were identified as potential molecular markers. These SSRs, large repeat sequences and highly variable regions provide important information for the development of genetic markers and DNA fingerprints. Phylogenetic analyses showed that the topological structures of all data sets (except the IR regions) were in complete agreement and well resolved. Overall, this study provides comprehensive chloroplast genomic resources, which will be valuable for future studies of evolution and species identification in Fritillaria.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29352182 PMCID: PMC5775360 DOI: 10.1038/s41598-018-19591-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the sequencing data for four Fritillaria species.
| Species | Raw data no. | Mapped read no. | Mapped to reference genome (%) | cp gemome coverage (X) | cp gemome length (bp) | Accession number in Genbank |
|---|---|---|---|---|---|---|
|
| 7,101,661 | 50,995 | 0.72% | 50.25 | 152,224 | MF947708 |
|
| 8,110,794 | 81,422 | 1.00% | 80.29 | 152,118 | KX354691 |
|
| 56,287,190 | 133,071 | 0.24% | 131.45 | 151,846 | MF947710 |
|
| 3,629,318 | 60,458 | 1.67% | 59.74 | 151,803 | MF947709 |
Figure 1Gene map of the Fritillaria chloroplast genome. The genes inside and outside of the circle are transcribed in the clockwise and counterclockwise directions, respectively. Genes belonging to different functional groups are shown in different colors. The thick lines indicate the extent of the inverted repeats (IRa and IRb) that separate the genomes into small single copy (SSC) and large single copy (LSC) regions.
Summary of complete chloroplast genomes of Fritillaria species.
| Species | Total | LSC | IR | SSC | Total | Protein coding genes | tRNA | rRNA | AT% | Accession number in Genbank |
|---|---|---|---|---|---|---|---|---|---|---|
|
| 151,991 | 81,769 | 26,343 | 17,535 | 128 | 82 | 38 | 8 | 63.05% | KF769143 |
|
| 152,224 | 81,991 | 26,353 | 17,527 | 128 | 82 | 38 | 8 | 63.01% | MF947708 |
|
| 152,145 | 81,894 | 26,349 | 17,553 | 128 | 82 | 38 | 8 | 63.03% | KF712486 |
|
| 152,118 | 81,875 | 26,387 | 17,469 | 128 | 82 | 38 | 8 | 63.05% | KX354691 |
|
| 151,846 | 82,130 | 26,377 | 16,962 | 128 | 82 | 38 | 8 | 63.04% | MF947710 |
|
| 151,803 | 81,634 | 26,330 | 17,509 | 128 | 82 | 38 | 8 | 63.00% | MF947709 |
|
| 151,691 | 81,437 | 26,352 | 17,550 | 128 | 82 | 38 | 8 | 63.03% | KF769144 |
| 151,009 | 81,286 | 25,887 | 17,949 | 128 | 82 | 38 | 8 | 63.03% | KF769142 |
Figure 2Comparisons of LSC, SSC, and IR region borders among the eight Fritillaria chloroplast genomes.
Variable site analyses in Fritillaria chloroplast genomes.
| Number of sites | Number of variable sites | Number of parsimony informative sites | Nucleotide Diversity | |
|---|---|---|---|---|
| LSC | 84,180 | 1919 | 545 | 0.00770 |
| SSC | 18,289 | 569 | 191 | 0.01122 |
| IR | 26,486 | 121 | 27 | 0.00142 |
| Complete cp genome | 154,988 | 2744 | 796 | 0.00592 |
Numbers of nucleotide substitutions and sequence distance in eight complete cp genomes. The upper triangle shows the number of nucleotide substitutions and the lower triangle indicates the number of sequence distance in complete cp genomes.
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|
|
| 0.0053 | 0.0025 | 0.0082 | 0.0081 | 0.0056 | 0.0019 | 0.0019 | |
|
| 787 | 0.0050 | 0.0082 | 0.0083 | 0.0042 | 0.0054 | 0.0054 | |
|
| 373 | 737 | 0.0079 | 0.0077 | 0.0052 | 0.0026 | 0.0026 | |
|
| 1215 | 1217 | 1165 | 0.0071 | 0.0083 | 0.0083 | 0.0083 | |
|
| 1200 | 1218 | 1141 | 1052 | 0.0084 | 0.0082 | 0.0082 | |
|
| 827 | 626 | 771 | 1221 | 1234 | 0.0057 | 0.0056 | |
|
| 283 | 799 | 380 | 1230 | 1205 | 836 | 0.0016 | |
| 277 | 793 | 391 | 1221 | 1208 | 825 | 243 |
Simple sequence repeats (SSRs) in the eight Fritillaria cp genomes.
| Species | SSR loci no. | PolyM. loci no. | PolyM. loci (100%) | mono- | di- | tri- | tetra- | penta- | Location | Region | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| IGS | Intron | CDS | LSC | IR | SSC | |||||||||
|
| 74 | 62 | 83.78% | 53 | 13 | 2 | 6 | 0 | 46 | 13 | 15 | 57 | 4 | 13 |
|
| 60 | 48 | 80.00% | 39 | 12 | 1 | 8 | 0 | 35 | 11 | 14 | 42 | 4 | 14 |
|
| 85 | 73 | 85.88% | 58 | 15 | 2 | 8 | 2 | 53 | 18 | 14 | 63 | 6 | 16 |
|
| 61 | 49 | 80.33% | 40 | 11 | 1 | 9 | 0 | 34 | 14 | 13 | 44 | 4 | 13 |
|
| 82 | 70 | 85.37% | 55 | 16 | 2 | 8 | 1 | 50 | 17 | 15 | 63 | 6 | 13 |
|
| 62 | 50 | 80.65% | 40 | 10 | 3 | 9 | 0 | 34 | 12 | 16 | 42 | 4 | 16 |
|
| 78 | 66 | 84.62% | 54 | 14 | 2 | 7 | 1 | 47 | 17 | 14 | 58 | 4 | 16 |
| 74 | 62 | 83.78% | 54 | 12 | 1 | 7 | 0 | 45 | 15 | 14 | 55 | 4 | 15 | |
| Total Loci | 136 | 124 | 91.18% | 89 | 24 | 6 | 12 | 5 | 92 | 24 | 20 | 103 | 6 | 27 |
Figure 3Analysis of large repeat sequences in the eight Fritillaria chloroplast genomes. (A) Frequency of repeats by length; (B) Frequency of repeat type.
The large repeated sequences in the eight Fritillaria cp genomes with different hamming distance.
| Species | Hamming Distance = 3 | Hamming Distance = 2 | Hamming Distance = 1 | Hamming Distance = 0 | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| F | R | C | P | all | F | R | C | P | all | F | R | C | P | all | F | P | all | |
|
| 17 | 3 | 1 | 19 | 40 | 11 | 3 | 0 | 14 | 28 | 4 | 0 | 0 | 9 | 13 | 1 | 3 | 4 |
|
| 15 | 2 | 1 | 18 | 36 | 11 | 1 | 0 | 15 | 27 | 9 | 0 | 0 | 11 | 20 | 5 | 6 | 11 |
|
| 19 | 1 | 1 | 18 | 39 | 12 | 1 | 0 | 13 | 26 | 6 | 0 | 0 | 9 | 15 | 0 | 4 | 4 |
|
| 15 | 4 | 0 | 18 | 37 | 9 | 3 | 0 | 11 | 23 | 6 | 1 | 0 | 7 | 15 | 3 | 6 | 9 |
|
| 22 | 9 | 4 | 18 | 53 | 11 | 4 | 2 | 12 | 29 | 8 | 0 | 1 | 10 | 19 | 5 | 8 | 13 |
|
| 11 | 1 | 1 | 15 | 28 | 5 | 1 | 0 | 10 | 16 | 3 | 0 | 0 | 7 | 10 | 0 | 4 | 5 |
|
| 16 | 4 | 1 | 21 | 42 | 10 | 2 | 0 | 16 | 28 | 4 | 0 | 0 | 10 | 14 | 0 | 4 | 4 |
| 17 | 5 | 2 | 20 | 44 | 10 | 3 | 0 | 15 | 28 | 4 | 0 | 0 | 10 | 14 | 0 | 4 | 4 | |
| Total repeated sequences | 132 | 29 | 11 | 147 | 319 | 79 | 18 | 2 | 106 | 205 | 44 | 1 | 1 | 73 | 120 | 14 | 39 | 54 |
F: forward (direct) matching; R: reverse matching; C: complement matching; P: palindromic (inverted) matching.
Figure 4Sliding window analysis of the entire chloroplast genome of eight Fritillaria species (window length: 600 bp; step size: 200 bp). X-axis: position of the midpoint of a window; Y-axis: nucleotide diversity of each window.
Ten regions of highly variable sequences of Fritillaria.
| No. | High variable marker | Length | Variable sites | Parsimony informative sites | Nucleotide diversity |
|---|---|---|---|---|---|
| 1 |
| 877 | 31 | 10 | 0.01557 |
| 2 |
| 661 | 30 | 8 | 0.01519 |
| 3 |
| 889 | 56 | 11 | 0.02260 |
| 4 |
| 965 | 42 | 11 | 0.01851 |
| 5 |
| 1276 | 42 | 14 | 0.01786 |
| 6 |
| 809 | 43 | 8 | 0.01634 |
| 7 |
| 582 | 29 | 8 | 0.01643 |
| 8 |
| 816 | 48 | 17 | 0.02432 |
| 9 |
| 1010 | 62 | 26 | 0.02571 |
| 10 |
| 1001 | 64 | 29 | 0.02229 |
| 11 | Combine | 8886 | 447 | 142 | 0.01951 |
Figure 5Phylogenetic relationships of the eight Fritillaria species inferred from maximum likelihood (ML), maximum parsimony (MP) and Bayesian (BI) analyses of different data partitions. (A) Whole chloroplast genome. (B) LSC region. (C) IR region. (D) SSC region. Numbers above nodes are support values with ML bootstrap values on the left, MP bootstrap values in the middle, and Bayesian posterior probabilities (PP) values on the right.
Figure 6Phylogenetic relationships of the eight Fritillaria species inferred from maximum likelihood (ML), maximum parsimony (MP) and Bayesian (BI) analyses of highly variable regions. (A) Cardiocrinum giganteum as outgroup. (B) Alstroemeria aurea as outgroup. Numbers above nodes are support values with ML bootstrap values on the left, MP bootstrap values in the middle, and Bayesian posterior probabilities (PP) values on the right.