| Literature DB >> 32042041 |
Pingping Li1, Gongli Lou2, Xiaoran Cai2, Bin Zhang2, Yueqin Cheng3, Hongwei Wang4,5.
Abstract
Paulownia species are important ecological, economic and ornamental species, but their phylogenetic relationship remains unclear, which seriously affects the development and utilization of these important resources. The complete chloroplast genomes of six Paulownia species were assembled by next-generation sequencing data. By adding two known Paulownia chloroplast genomes to these six assembled genomes, we performed the comparative analysis and phylogenetic tree reconstruction of Paulownia. The results indicated that the chloroplast genomes of Paulownia species ranged in size from 154,107 to 154,694 bp. These chloroplast genomes contained 117 unique functional genes, including 80 protein-coding genes, four rRNA genes, and 33 tRNA genes. Twelve hotspot regions, five protein-coding genes and seven noncoding regions, were identified in the chloroplast genomes that showed high levels of sequence variation. Additionally, positive selection was observed in three genes, rps2, rbcL and ndhG. The maximum likelihood (ML) and Bayesian (BI) analysis strongly supported the monophyletic origin of Paulownia species, which clustered into two major clades: One clade included P. coreana, P. tomentosa and P. kawakamii, while the other clade comprised the 5 other species including P. fargesii and P. australis. This study provides useful genetic information for phylogenetic reconstruction, taxonomic discrepancies, and studying species evolution and phylogeography in Paulownia.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32042041 PMCID: PMC7010769 DOI: 10.1038/s41598-020-59204-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Map of the chloroplast genome of the six Paulownia species. Genes drawn inside the circle are transcribed clockwise, and those outside the circle are transcribed counterclockwise. The dashed gray area in the inner circle shows the percent GC content of the corresponding genes. LSC, SSC, and IR denote large single copy, small single copy, and inverted repeats, respectively. The circular map of the chloroplast genome was drawn using OGDRAW 1.3.1 (http://ogdraw.mpimp-golm.mpg.de/).
Comparison of the chloroplast genome features of the eight Paulownia species.
| Species | ||||||||
|---|---|---|---|---|---|---|---|---|
| Accession number | MK618176 | MK618177 | MK618178 | MK618179 | MK618180 | MK618181 | KP718622 | KP718624 |
| Total chloroplast genome size (bp) | 154,688 | 154,247 | 154,107 | 154,692 | 154,694 | 154,676 | 154,545 | 154,540 |
| LSC (bp) | 85,415 | 84,972 | 84,807 | 85,418 | 85,420 | 85,400 | 85,241 | 85,236 |
| IR (bp) | 51,540 | 51,544 | 51,560 | 51,540 | 51,540 | 51,540 | 51,568 | 51,568 |
| SSC (bp) | 17,733 | 17,731 | 17,740 | 17,734 | 17,734 | 17,736 | 17,736 | 17,736 |
| Total number of genes | 134 | 134 | 134 | 134 | 134 | 134 | 134 | 134 |
| Protein-coding genes | 86 | 86 | 86 | 86 | 86 | 86 | 86 | 86 |
| rRNAs | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 |
| tRNAs | 40 | 40 | 40 | 40 | 40 | 40 | 40 | 40 |
| GC content (%) | 37.97% | 37.99% | 37.99% | 37.96% | 37.97% | 37.97% | 38.00% | 38.00% |
Nucleotide mutation type.
| Nucleotide mutation | Number of mutations | |
|---|---|---|
| Conversion | A-G | 48 |
| T-C | 49 | |
| Transversion | A-C | 29 |
| A-T | 61 | |
| T-G | 24 | |
| C-G | 5 | |
| Total | 216 |
The values of nucleotide diversity (Pi) in different regions among the eight Paulownia species.
| Structural region | Noncoding region | Coding region | |||
|---|---|---|---|---|---|
| LSC | IRa | SSC | |||
| Total number of sites | 83,984 | 25,763 | 17,716 | 75,487 | 79,429 |
| Number of polymorphic sites | 165 | 6 | 38 | 161 | 55 |
| Pi values | 0.00089 | 0.00012 | 0.00106 | 0.00102 | 0.00032 |
| Theta-W | 0.00076 | 0.00009 | 0.00083 | 0.00084 | 0.00027 |
Figure 2The nucleotide diversity of coding regions among the eight Paulownia species.
Figure 3The nucleotide diversity of nocoding regions among the eight Paulownia species.
Figure 4Analyses of repeated sequences in the chloroplast genomes of the eight Paulownia species. (A) The number of repeats in the eight Paulownia chloroplast genomes. p1: Mono. p2: Di. p3: Tri. p4: Tetra. p5: Penta. p6: Hexa. c: Imperfect repeat. (B) Distribution of SSRs in the eight Paulownia chloroplast genomes.
Figure 5Phylogenetic tree based on whole plastomes with three positively selected genes being removed. The numbers to the left of the slashes on the braches show the bootstrap values obtained by maximum likelihood analyses, and those to the right show the posterior probabilities according to Bayesian inference.