| Literature DB >> 31752332 |
Liping Nie1,2, Yingxian Cui1,2, Liwei Wu1,2, Jianguo Zhou1,2, Zhichao Xu1,2, Yonghua Li3, Xiwen Li4, Yu Wang1,2, Hui Yao1,2.
Abstract
Macrosolen plants are parasitic shrubs, several of which are important medicinal plants, that are used as folk medicine in some provinces of China. However, reports on Macrosolen are limited. In this study, the complete chloroplast genome sequences of Macrosolen cochinchinensis, Macrosolen tricolor and Macrosolen bibracteolatus are reported. The chloroplast genomes were sequenced by Illumina HiSeq X. The length of the chloroplast genomes ranged from 129,570 bp (M. cochinchinensis) to 126,621 bp (M. tricolor), with a total of 113 genes, including 35 tRNA, eight rRNA, 68 protein-coding genes, and two pseudogenes (ycf1 and rpl2). The simple sequence repeats are mainly comprised of A/T mononucleotide repeats. Comparative genome analyses of the three species detected the most divergent regions in the non-coding spacers. Phylogenetic analyses using maximum parsimony and maximum likelihood strongly supported the idea that Loranthaceae and Viscaceae are monophyletic clades. The data obtained in this study are beneficial for further investigations of Macrosolen in respect to evolution and molecular identification.Entities:
Keywords: Macrosolen; Macrosolen bibracteolatus; Macrosolen cochinchinensis; Macrosolen tricolor; Santalales; chloroplast genome; gene loss; phylogenetic relationship
Mesh:
Substances:
Year: 2019 PMID: 31752332 PMCID: PMC6888684 DOI: 10.3390/ijms20225812
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Plant materials of three Macrosolen species. ① Macrosolen cochinchinensis; ② Macrosolen tricolor; and ③ Macrosolen bibracteolatus.
Figure 2Gene map of the complete chloroplast genome of three Macrosolen species. Genes outside the large ring circle are transcribed in a counter-clockwise direction, and genes inside the circle are transcribed clockwise. The same color represents the same category of genes. Deep grey in the inner circle represents GC content, and lighter grey represents A/T content.
Length of chloroplast genome of three Macrosolen species and their base composition.
| Species |
|
|
|
|---|---|---|---|
| Accession No. | MH161424 | MH161425 | MH161423 |
| Genome size (bp) | 129,570 | 126,621 | 127,169 |
| LSC length (bp) | 73,052 | 71,895 | 70,692 |
| SSC length (bp) | 5724 | 5320 | 5587 |
| IRs length (bp) | 25,397 | 24,703 | 25,445 |
| GC content (%) | 37.3 | 37.7 | 37.9 |
| Number of genes | 113 | 113 | 113 |
| Number of protein-coding genes | 68 | 68 | 68 |
| Number of tRNAs | 35 | 35 | 35 |
| Number of rRNAs | 8 | 8 | 8 |
| Number of pseudogenes | 2 | 2 | 2 |
Gene list of chloroplast genome of three Macrosolen species.
| No. | Group of Genes | Gene Names | Number |
|---|---|---|---|
| 1 | Photosystem I | 5 | |
| 2 | Photosystem II | 15 | |
| 3 | Cytochrome b/f complex | 6 | |
| 4 | ATP synthase | 6 | |
| 5 | NADH dehydrogenase | - | 0 |
| 6 | RubisCO large subuni |
| 1 |
| 7 | RNA polymerase | 4 | |
| 8 | Ribosomal proteins (SSU) | 12(2) | |
| 9 | Ribosomal proteins (LSU) | 10(2) | |
| 10 | Proteins of unknown function | 6(2) | |
| 11 | Transfer RNAs | 35(7) | |
| 12 | Ribosomal RNAs | 8(4) | |
| 13 | Other genes | 5 |
* One or two asterisks following genes indicate one or two contained introns, respectively. (×2) indicates that the number of the repeat unit is two. The numbers in parenthesis at the line of ‘Number’ indicate the total number of repeated genes.
Figure 3Codon content of 20 amino acids and stop codons in all of the protein-coding genes of the chloroplast genomes of three Macrosolen species.
Figure 4Comparison of the borders of the large single-copy (LSC), small single-copy (SSC), and inverted repeats (IR) regions among the chloroplast genomes of three Macrosolen species. The number above the gene features means the distance between the ends of genes and the borders sites. These features are not to scale. Ψ: pseudogenes.
Types and amounts of simple sequence repeats (SSRs) in the chloroplast genomes of three Macrosolen species.
| SSR Type | Repeat Unit | Amount | Ratio (%) | ||||
|---|---|---|---|---|---|---|---|
| ① | ② | ③ | ① | ② | ③ | ||
| mono | A/T | 161 | 159 | 153 | 95.3 | 95.8 | 94.4 |
| C/G | 8 | 7 | 9 | 4.7 | 4.2 | 5.6 | |
| di | AC/GT | 5 | 4 | 4 | 9.6 | 8.5 | 9.3 |
| AG/CT | 13 | 14 | 13 | 25 | 29.8 | 30.2 | |
| AT/TA | 34 | 29 | 26 | 64.4 | 61.7 | 60.5 | |
| tri | AAT/ATT | 4 | 4 | 0 | 66.7 | 66.7 | 0 |
| ATC/ATG | 2 | 2 | 2 | 33.3 | 33.3 | 100 | |
| tetra | AAAG/CTTT | 3 | 3 | 3 | 33.3 | 42.9 | 30 |
| AATC/ATTG | 1 | 1 | 0 | 11.1 | 14.3 | 0 | |
| ACAG/CTGT | 1 | 1 | 1 | 11.1 | 14.3 | 10 | |
| AAAT/ATTT | 3 | 1 | 3 | 33.3 | 14.3 | 30 | |
| AATG/ATTC | 1 | 0 | 1 | 11.1 | 0 | 10 | |
| AGAT/ATCT | 0 | 1 | 1 | 0 | 14.3 | 10 | |
| ACAT/ATGT | 0 | 0 | 1 | 0 | 0 | 10 | |
| penta | AATAT/ATATT | 1 | 0 | 0 | 100 | 0 | 0 |
| hexa | ATATCC/ATATGG | 1 | 0 | 0 | 100 | 0 | 0 |
① M. cochinchinensis; ② M. tricolor; and ③ M. bibracteolatus.
Figure 5Repeat sequences in the chloroplast genomes of three Macrosolen species. F, P, R, and C indicate the repeat types F (forward), P (palindrome), R (reverse) and C (complement), respectively. Repeats with different lengths are indicated in different colors.
Figure 6Sequence identity plot comparing the three chloroplast genomes with M. cochinchinensis as a reference by using mVISTA. Grey arrows and thick black lines above the alignment indicate genes with their orientation and the position of their IRs, respectively. A cut-off of 70% identity was used for the plots, and the Y-scale represents the percent identity ranging from 50% to 100%.
Figure 7Sliding window analyses of the three whole chloroplast genomes. X-axis: position of a window. Y-axis: sequence divergence between species of each window. K(a): K values between M. bibracteolatus and M. tricolor; K(b): K values between M. bibracteolatus and M. cochinchinensis; K(c): K values between M. tricolor and M. cochinchinensis.
Figure 8Phylogenetic trees constructed with the matK genes of 15 species by using the maximum parsimony (MP) (A) and maximum likelihood (ML) (B) methods. Phylogenetic trees constructed with 58 common protein-coding genes of 16 species using the MP (C) and ML (D) methods. Numbers at nodes are bootstrap values.
Figure 9Comparison of the chloroplast genome gene content of six parasitic plants and one model plant (Nicotiana tabacum). The common existing genes in the complete chloroplast genome of the seven species are not listed. Red boxes indicate each gene present, and green boxes indicate that each gene is considered as a pseudogene. The yellow boxes indicate an absent gene.