| Literature DB >> 29932433 |
Deng-Feng Xie1, Yan Yu2, Yi-Qi Deng3, Juan Li4, Hai-Ying Liu5, Song-Dong Zhou6, Xing-Jin He7.
Abstract
Urophysa is a Chinese endemic genus comprising two species, Urophysa rockii and Urophysa henryi. In this study, we sequenced the complete chloroplast (cp) genomes of these two species and of their relative Semiquilegia adoxoides. Illumina sequencing technology was used to compare sequences, elucidate the intra- and interspecies variations, and infer the phylogeny relationship with other Ranunculaceae family species. A typical quadripartite structure was detected, with a genome size from 158,473 to 158,512 bp, consisting of a pair of inverted repeats separated by a small single-copy region and a large single-copy region. We analyzed the nucleotide diversity and repeated sequences components and conducted a positive selection analysis by the codon-based substitution on single-copy coding sequence (CDS). Seven regions were found to possess relatively high nucleotide diversity, and numerous variable repeats and simple sequence repeats (SSR) markers were detected. Six single-copy genes (atpA, rpl20, psaA, atpB, ndhI, and rbcL) resulted to have high posterior probabilities of codon sites in the positive selection analysis, which means that the six genes may be under a great selection pressure. The visualization results of the six genes showed that the amino acid properties across each column of all species are variable in different genera. All these regions with high nucleotide diversity, abundant repeats, and under positive selection will provide potential plastid markers for further taxonomic, phylogenetic, and population genetics studies in Urophysa and its relatives. Phylogenetic analyses based on the 79 single-copy genes, the whole complete genome sequences, and all CDS sequences showed same topologies with high support, and U. rockii was closely clustered with U. henryi within the Urophysa genus, with S. adoxoides as their closest relative. Therefore, the complete cp genomes in Urophysa species provide interesting insights and valuable information that can be used to identify related species and reconstruct their phylogeny.Entities:
Keywords: SSRs; Semiaquilegia adoxoides; Urophysa; cp genome; phylogeny; positive selection analysis; repeat analysis
Mesh:
Substances:
Year: 2018 PMID: 29932433 PMCID: PMC6073864 DOI: 10.3390/ijms19071847
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Gene maps of the Urophysa rockii, Urophysa henryi and Semiquilegia adoxoides chloroplast (cp) genomes. Genes shown inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. Genes belonging to different functional groups are color-coded. The darker gray color in the inner circle corresponds to the GC content, and the lighter gray color corresponds to the AT content. SSU: small subunit; LSU: large subunit; ORF: open reading frame.
Summary of complete chloroplast genomes. LSC, large single-copy; SSC, small single-copy; IR, inverted repeat
| Species | LSC | SSC | IR | Total | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Length (bp) | GC% | Length (%) | Length (bp) | GC% | Length (%) | Length (bp) | GC% | Length (%) | Length (bp) | GC% | |
|
| 87,128 | 37.2 | 55.0 | 18,216 | 32.5 | 11.5 | 26,584 | 43.7 | 16.8 | 158,512 | 38.8 |
|
| 87,031 | 37.2 | 55.0 | 18,260 | 32.6 | 11.5 | 26,506 | 43.6 | 16.7 | 158,303 | 38.8 |
|
| 87,202 | 37.2 | 55.1 | 18,192 | 32.5 | 11.5 | 26,473 | 43.7 | 16.7 | 158,340 | 38.9 |
|
| 88,522 | 36.3 | 55.3 | 18,405 | 32.0 | 11.5 | 26,632 | 43.1 | 16.6 | 160,191 | 38.1 |
|
| 86,362 | 36.2 | 55.4 | 16,948 | 32.7 | 10.9 | 26,291 | 43.0 | 16.9 | 155,892 | 38.1 |
|
| 86,335 | 36.2 | 55.4 | 16,945 | 32.7 | 10.9 | 26,291 | 43.0 | 16.9 | 155,862 | 38.1 |
|
| 86,348 | 36.2 | 55.4 | 16,944 | 32.6 | 10.9 | 26,290 | 43.0 | 16.9 | 155,872 | 38.1 |
|
| 84,637 | 36.0 | 54.6 | 18,909 | 31.0 | 12.2 | 25,791 | 43.5 | 16.6 | 155,129 | 37.9 |
|
| 83,532 | 35.9 | 54.1 | 21,269 | 31.6 | 13.8 | 24,831 | 43.6 | 16.1 | 154,474 | 37.8 |
|
| 83,582 | 35.9 | 54.1 | 21,249 | 31.6 | 13.8 | 24,831 | 43.6 | 16.1 | 154,493 | 37.8 |
|
| 79,328 | 36.3 | 49.7 | 18,110 | 31.4 | 11.4 | 31,045 | 42.0 | 19.5 | 159,528 | 38.0 |
|
| 84,567 | 36.4 | 54.4 | 17,376 | 32.1 | 11.2 | 26,762 | 43.0 | 17.2 | 155,484 | 38.2 |
Comparison of the sizes of coding and non-coding regions among species.
| Species | Protein-Coding | tRNA | rRNA | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Length (bp) | GC% | Length (%) | Length (bp) | GC% | Length (%) | Length (bp) | GC% | Length (%) | |
|
| 78,867 | 39.2 | 49.8 | 2687 | 53.2 | 1.7 | 8602 | 55.8 | 5.4 |
|
| 78,769 | 39.2 | 49.8 | 2695 | 53.3 | 1.7 | 8602 | 55.8 | 5.4 |
|
| 78,498 | 39.3 | 49.6 | 2706 | 53.6 | 1.7 | 8602 | 55.8 | 5.4 |
|
| 78,903 | 38.4 | 49.3 | 2716 | 53.1 | 1.7 | 9050 | 55.4 | 5.6 |
|
| 79,575 | 38.3 | 51.0 | 2810 | 53.0 | 1.8 | 9050 | 55.4 | 5.8 |
|
| 78,294 | 38.4 | 50.2 | 2813 | 52.9 | 1.8 | 9046 | 55.3 | 5.8 |
|
| 79,560 | 38.3 | 51.0 | 2810 | 53.0 | 1.8 | 9050 | 55.5 | 5.8 |
|
| 78,615 | 38.2 | 50.7 | 2738 | 53.1 | 1.8 | 7559 | 55.2 | 4.9 |
|
| 69,294 | 38.6 | 44.9 | 2717 | 53.1 | 1.8 | 9050 | 55.4 | 5.9 |
|
| 74,355 | 38.1 | 48.1 | 2796 | 52.9 | 1.8 | 9050 | 55.4 | 5.9 |
|
| 81,819 | 38.3 | 51.3 | 2718 | 53.4 | 1.7 | 9050 | 55.4 | 5.7 |
|
| 71,637 | 39.0 | 46.1 | 2716 | 53.2 | 1.7 | 9050 | 55.5 | 5.8 |
List of genes encoded in two Urophysa species and S. adoxoides.
| Category for Genes | Group of Genes | Name of Genes |
|---|---|---|
| Self-replication | transfer RNAs | |
| ribosomal RNAs | ||
| RNA polymerase | ||
| Small subunit of ribosomal proteins (SSU) | ||
| Large subunit of ribosomal proteins (LSU) | ||
| Genes for photosynthesis | Subunits of NADH-dehydrogenase | |
| Subunits of photosystem I | ||
| Subunits of photosystem II | ||
| Subunits of cytochrome b/f complex | ||
| Subunits of ATP synthase | ||
| Large subunit of rubisco |
| |
| Other genes | Tanslational initiation factor |
|
| Protease |
| |
| Maturase |
| |
| Subunit of Acetyl-CoA-carboxylase |
| |
| Envelope membrane protein |
| |
| C-type cytochrome synthesis gene |
| |
| Genes of unknown function | hypothetical chloroplast reading frames (ycf) |
* Gene with two copies.
Figure 2Analysis of repeated sequences in U. rockii, U. henryi, and S. adoxoides chloroplast genomes. (A) Total of four repeat types; (B) Frequency of the palindromic repeat by length; (C) Frequency of the forward repeat by length; (D) Frequency of the reverse repeat by length.
Figure 3Analysis of simple sequence repeats (SSRs) in chloroplast genomes of the three species. (A) Number of different SSR types detected in each species; (B) type and frequency of each identified SSR.
Figure 4The nucleotide diversity of the whole chloroplast genomes of the three species. LSC: large single-copy region; IRs: inverted repeats region; SSC: small single-copy region.
Figure 5Comparison of the borders of the LSC, SSC, and IR regions of the chloroplast genomes of the three species. LR: junction line between LSC and IRb; RS: junction line between IRb and SSC; SR: junction line between SSC and IRa; RL: junction line between IRa and LSC.
Figure 6Phylogenetic relationship of Urophysa with related species based on 79 single-copy genes shared by all cp genomes. Tree constructed by (A) maximum likelihood (ML) with the bootstrap values of ML above the branches; (B) maximum parsimony (MP) and Bayesian inference (BI) with bootstrap values of MP and posterior probabilities of BI above the branches, respectively.
The potential positive selection test based on the branch-site model.
| Gene Name | Null Hypothesis | Alternative Hypothesis | Significance Test | ||||||
|---|---|---|---|---|---|---|---|---|---|
| lnL | df | Omega ( | lnL | df | Omega ( | BEB | NEB | ||
|
| −188.6475 | 26 | 1 | −188.6475 | 27 | 3.40383 | NA | NA | 1 |
|
| −164.11693 | 26 | 1 | −164.1169 | 27 | 3.40719 | NA | NA | 1 |
|
| −621.64162 | 26 | 1 | −621.6416 | 27 | 3.40833 | NA | NA | 1 |
|
| −214.67663 | 26 | 1 | −214.6766 | 27 | 3.38764 | NA | NA | 1 |
|
| −434.45059 | 26 | 1 | −434.4506 | 27 | 3.35869 | NA | NA | 1 |
|
| −318.52192 | 26 | 1 | −318.5219 | 27 | 3.4089 | NA | NA | 1 |
|
| −868.20243 | 26 | 1 | −868.2024 | 27 | 3.40891 | NA | NA | 1 |
|
| −3297.629 | 26 | 1 | −3297.41 | 27 | 69.43581 | 220, E, 0.794 | NA | 5.04 × 10-1 |
|
| −126.25816 | 26 | 1 | −126.2582 | 27 | 3.40693 | NA | NA | 1 |
|
| −920.92455 | 26 | 1 | −920.9246 | 27 | 1 | NA | NA | 1 |
|
| −216.52331 | 26 | 1 | −216.5233 | 27 | 1 | NA | NA | 1 |
|
| −1238.1161 | 26 | 1 | −1238.116 | 27 | 3.33667 | NA | NA | 9.99 × 10-1 |
|
| −1275.4093 | 26 | 1 | −1275.409 | 27 | 3.40886 | NA | NA | 1 |
|
| −567.98294 | 26 | 1 | −567.9829 | 27 | 3.39414 | NA | NA | 1 |
|
| −1274.0507 | 26 | 1 | −1274.051 | 27 | 3.403 | NA | NA | 1 |
|
| −1000.285 | 26 | 1 | −999.941 | 27 | 112.30316 | 88, R, 0.683 | NA | 4.07 × 10-1 |
|
| −223.7602 | 26 | 1 | −223.7602 | 27 | 3.40292 | NA | NA | 1 |
|
| −198.46733 | 26 | 1 | −198.4673 | 27 | 3.38407 | NA | NA | 1 |
|
| −206.74878 | 26 | 1 | −206.7488 | 27 | 3.42095 | NA | NA | 1 |
|
| −375.13705 | 26 | 1 | −375.1371 | 27 | 3.4063 | NA | NA | 1 |
|
| −267.8099 | 26 | 1 | −267.8099 | 27 | 1 | NA | NA | 1 |
|
| −1620.734 | 26 | 1 | −1620.734 | 27 | 3.40891 | NA | NA | 1 |
|
| −179.71897 | 26 | 1 | −179.719 | 27 | 3.4064 | NA | NA | 1 |
|
| −6830.0894 | 26 | 1 | −6830.089 | 27 | 3.40847 | NA | NA | 9.99 × 10-1 |
|
| −4245.754 | 26 | 1 | −4245.49 | 27 | 63.47379 | 28, R, 0.778 | NA | 4.66 × 10-1 |
|
| −540.92362 | 26 | 1 | −540.9236 | 27 | 3.40123 | NA | NA | 1 |
|
| −616.75534 | 26 | 1 | −616.7553 | 27 | 3.40218 | NA | NA | 1 |
|
| −3133.747 | 26 | 1 | −3133.75 | 27 | 1 | 115, N, 0.828 | NA | 1 |
|
| −1307.986 | 26 | 1 | −1307.68 | 27 | 575.22179 | 174, S, 0.696 | NA | 4.35 × 10-1 |
|
| −1787.561 | 26 | 1 | −1787.561 | 27 | 3.40891 | NA | NA | 1 |
|
| −1001.4075 | 26 | 1 | −1001.407 | 27 | 1 | NA | NA | 1 |
|
| −209.10513 | 26 | 1 | −209.1051 | 27 | 3.38566 | NA | NA | 1 |
|
| −1331.3789 | 26 | 1 | −1331.379 | 27 | 3.4089 | NA | NA | 1 |
|
| −2760.6743 | 26 | 1 | −2760.674 | 27 | 1 | NA | NA | 1 |
|
| −2643.2896 | 26 | 1 | −2643.29 | 27 | 1 | NA | NA | 9.98 × 10-1 |
|
| −2937.477 | 26 | 1 | −2937.41 | 27 | 5.22178 | 440, E, 0.736 | NA | 7.20 × 10-1 |
|
| −1301.1173 | 26 | 1 | −1301.117 | 27 | 3.40876 | NA | NA | 1 |
|
| −731.03212 | 26 | 1 | −731.0321 | 27 | 3.33544 | NA | NA | 1 |
|
| −935.76375 | 26 | 1 | −935.7638 | 27 | 3.40891 | NA | NA | 1 |
|
| −1922.7755 | 26 | 1 | −1922.775 | 27 | 3.38592 | NA | NA | 1 |
|
| −1960.3785 | 26 | 1 | −1960.379 | 27 | 3.39639 | NA | NA | 1 |
|
| −172.24809 | 26 | 1 | −172.2481 | 27 | 3.40087 | NA | NA | 1 |
|
| −413.59385 | 26 | 1 | −413.5939 | 27 | 3.4089 | NA | NA | 1 |
|
| −435.90511 | 26 | 1 | −435.9051 | 27 | 3.40785 | NA | NA | 1 |
|
| −498.98549 | 26 | 1 | −498.9855 | 27 | 3.408 | NA | NA | 1 |
|
| −1445.5558 | 26 | 1 | −1445.556 | 27 | 3.39588 | NA | NA | 1 |
|
| −4069.2947 | 26 | 1 | −4069.295 | 27 | 3.41513 | NA | NA | 1 |
Bold types are positively selected sites. BEB: Bayesian Empirical Bayes; NEB: Naïve Empirical Bayes; Amino acid: (E: Glu; R: Arg; N: Asn; S: Ser).
Figure 7Two of the amino acids sequences that showed positive selection in the branch-site model test. (A) Amino acids sequences of the rpl20 gene; (B) amino acids sequences of the ndhI gene. The red blocks represent the different amino acids.