| Literature DB >> 28676809 |
Yuan Huang1,2, Jun Wang3, Yongping Yang4, Chuanzhu Fan3, Jiahui Chen1,4.
Abstract
Chloroplast genomes of plants are highly conserved in both gene order and gene content. Analysis of the whole chloroplast genome is known to provide much more informative DNA sites and thus generates high resolution for plant phylogenies. Here, we report the complete chloroplast genomes of three Salix species in family Salicaceae. Phylogeny of Salicaceae inferred from complete chloroplast genomes is generally consistent with previous studies but resolved with higher statistical support. Incongruences of phylogeny, however, are observed in genus Populus, which most likely results from homoplasy. By comparing three Salix chloroplast genomes with the published chloroplast genomes of other Salicaceae species, we demonstrate that the synteny and length of chloroplast genomes in Salicaceae are highly conserved but experienced dynamic evolution among species. We identify seven positively selected chloroplast genes in Salicaceae, which might be related to the adaptive evolution of Salicaceae species. Comparative chloroplast genome analysis within the family also indicates that some chloroplast genes are lost or became pseudogenes, infer that the chloroplast genes horizontally transferred to the nucleus genome. Based on the complete nucleus genome sequences from two Salicaceae species, we remarkably identify that the entire chloroplast genome is indeed transferred and integrated to the nucleus genome in the individual of the reference genome of P. trichocarpa at least once. This observation, along with presence of the large nuclear plastid DNA (NUPTs) and NUPTs-containing multiple chloroplast genes in their original order in the chloroplast genome, favors the DNA-mediated hypothesis of organelle to nucleus DNA transfer. Overall, the phylogenomic analysis using chloroplast complete genomes clearly elucidates the phylogeny of Salicaceae. The identification of positively selected chloroplast genes and dynamic chloroplast-to-nucleus gene transfers in Salicaceae provide resources to better understand the successful adaptation of Salicaceae species.Entities:
Keywords: NUPT; Salicaceae; chloroplast genome; evolution; organellar horizontal gene transfer; phylogenetic incongruence; phylogenomics
Year: 2017 PMID: 28676809 PMCID: PMC5476734 DOI: 10.3389/fpls.2017.01050
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Features of chloroplast complete genomes in Salicaceae.
| Length of | Length of | Length of | No. of | No. of | No. of | No. of | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Genome | Length | Length | Length | Length | coding | protein coding | non-coding | GC | unique | unique | unique | unique | |
| Species | size | of LSC | of SSC | of IRA | of IRB | sequence (%) | sequence (%) | sequence | (%) | gene | CDS | tRNA | rRNA |
| 155671 | 84584 | 16291 | 27398 | 27398 | 91593 (58.84) | 80115 (51.47) | 64078 | 36.7 | 113 | 77 | 30 | 4 | |
| 155246 | 84516 | 15830 | 27450 | 27450 | 90483 (58.28) | 79005 (50.89) | 64763 | 36.6 | 112 | 76 | 30 | 4 | |
| 155531 | 84470 | 16213 | 27424 | 27424 | 92026 (59.17) | 80187 (51.56) | 63505 | 36.7 | 113 | 77 | 30 | 4 | |
| 156620 | 85980 | 16308 | 27166 | 27166 | 91738 (58.57) | 80079 (51.13) | 64882 | 37.0 | 113 | 77 | 30 | 4 | |
| 155590 | 84452 | 16220 | 27459 | 27459 | 92014 (59.14) | 80175 (51.53) | 63576 | 36.7 | 113 | 77 | 30 | 4 | |
| 155214 | 84077 | 16221 | 27458 | 27458 | 92014 (59.28) | 80175 (51.65) | 63200 | 36.7 | 113 | 77 | 30 | 4 | |
| 156505 | 84618 | 16567 | 27660 | 27660 | 92467 (59.08) | 80628 (51.52) | 64038 | 36.7 | 113 | 77 | 30 | 4 | |
| 157094 | 84922 | 16499 | 27846 | 27827 | 92542 (58.91) | 80703 (51.37) | 64552 | 36.7 | 113 | 77 | 30 | 4 | |
| 156766 | 84887 | 16589 | 27644 | 27646 | 92406 (58.95) | 80568 (51.39) | 64360 | 36.7 | 113 | 77 | 30 | 4 | |
| 157446 | 85454 | 16318 | 27837 | 27837 | 92575 (58.80) | 80736 (51.28) | 64871 | 36.7 | 113 | 77 | 30 | 4 | |
| 156067 | 84377 | 16490 | 27600 | 27600 | 92473 (59.25) | 80634 (51.67) | 63594 | 36.8 | 113 | 77 | 30 | 4 | |
| 155776 | 83955 | 16549 | 27636 | 27636 | 91404 (58.68) | 79565 (51.08) | 64372 | 36.8 | 113 | 77 | 30 | 4 | |
| 155449 | 83911 | 16488 | 27525 | 27525 | 92423 (59.46) | 80584 (51.84) | 63026 | 36.9 | 113 | 77 | 30 | 4 | |
| 157033 | 85129 | 16600 | 27652 | 27652 | 92440 (58.87) | 80601 (51.33) | 64593 | 36.7 | 113 | 77 | 30 | 4 |
Positive selection sites identified using Codeml under branch-site model.
| Gene | Branch | Null | Alternative | Putative sites under positive selection, amino acid, and corresponding posterior probability | |
|---|---|---|---|---|---|
| -578.123 | -566.714 | 1.78 | 130 S 0.970∗ | ||
| -445.250 | -437.826 | 1.17 | 37 S 0.997∗∗; 38 L 0.992∗∗; 39 A 0.955∗; 40 Y 0.995∗∗; 41 Q 0.997∗∗; 42 I 0.909; 43 L 0.996∗∗; 44 Y 0.999∗∗; 45 R 0.986∗; 46 A 0.987∗; 47 M 0.961∗; 48 K 0.952∗; 49 K 0.989∗; 52 Q 0.978∗; 54 T 0.965∗ | ||
| -9664.014 | -9628.861 | 5.07 | 2271 M 0.982∗; 2272 A 0.982∗; 2275 G 0.969∗ | ||
| -1585.549 | -1581.119 | 2.92 | 319 I 0.959∗ | ||
| -499.511 | -491.763 | 8.27 | 116 N 0.988∗; 117 V 0.989∗ | ||
| -872.644 | -829.745 | 1.99 | 101 E 1.000∗∗; 102 V 1.000∗∗; 103 I 0.999∗∗; 104 D 0.997∗∗; 105 T 0.979∗; 106 F 0.998∗∗; 107 P 0.998∗∗; 108 Y 0.978∗; 109 F 0.974∗; 110 V 0.999∗∗; 111 S 1.000∗∗; 112 G 0.999∗∗; 113 V 1.000∗∗; 114 L 0.980∗; 115 H 1.000∗∗; 116 L 1.000∗∗; 117 I 0.974∗; 119 S 0.998∗∗; 120 A 1.000∗∗; 121 V 0.996∗∗; 122 L 0.995∗∗; 123 G 0.979∗; 124 F 0.999∗∗; 125 G 0.998∗∗; 127 I 0.978∗; 128 Y 0.998∗∗; 129 H 0.996∗∗; 130 A 1.000∗∗; 131 L 0.992∗∗; 132 L 0.996∗∗; 133 G 0.982∗; 134 P 0.980∗; 135 E 1.000∗∗; 136 T 0.997∗∗; 137 L 0.999∗∗; 138 E 0.998∗∗; 139 E 0.974∗ | ||
| -196.515 | -192.834 | 6.66 | 20 P 0.981∗ |
Sizes of chloroplast genome, of nuclear genomes, and of NUPTs detected by BlastZ.
| Category | ||
|---|---|---|
| Nuclear genome size (Mb) | 481 | 450 |
| cp genome size (Kb) | 157,033 | 155,590 |
| Total number of NUPTs | 574 | 713 |
| Total length of NUPTs (bp) | 2,869,374 | 491,553 |
| Total gap length of NUPTs (bp) | 2,333,068 | 298,700 |
| Query aligned length (bp) | 536,306 | 192,853 |
| Query gap (bp) | 1,204,611 | 565,676 |
| Number of NUPTs genes in nuclear gene region | 314 | 41 |
| Number of NUPTs CDS in nuclear CDS regions | 233 | 25 |
| Length of cp CDS transferred in nuclear CDS region (bp)/proportion to NUPT length | 126,219/23.5% | 8,312/4.3% |
| Number of transferable genes | 75 | 31 |
| NUPTs in proportional of cp genome | 341.50% | 123.90% |
| NUPTs in proportional of nuclear genome | 0.111% | 0.043% |
| Number of transferable genes | 75 | 31 |