| Literature DB >> 29026168 |
Ying Li1, Jian-Guo Zhou1, Xin-Lian Chen1, Ying-Xian Cui1, Zhi-Chao Xu1, Yong-Hua Li2, Jing-Yuan Song1, Bao-Zhong Duan3, Hui Yao4.
Abstract
Numerous variations are known to occur in the chloroplast genomes of parasitic plants. We determined the complete chloroplast genome sequences of two hemiparasitic species, Taxillus chinensis and T. sutchuenensis, using Illumina and PacBio sequencing technologies. These species are the first members of the family Loranthaceae to be sequenced. The complete chloroplast genomes of T. chinensis and T. sutchuenensis comprise circular 121,363 and 122,562 bp-long molecules with quadripartite structures, respectively. Compared with the chloroplast genomes of Nicotiana tabacum and Osyris alba, all ndh genes as well as three ribosomal protein genes, seven tRNA genes, four ycf genes, and the infA gene of these two species have been lost. The results of the maximum likelihood and neighbor-joining phylogenetic trees strongly support the theory that Loranthaceae and Viscaceae are monophyletic clades. This research reveals the effect of a parasitic lifestyle on the chloroplast structure and genome content of T. chinensis and T. sutchuenensis, and enhances our understanding of the discrepancies in terms of assembly results between Illumina and PacBio.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29026168 PMCID: PMC5638910 DOI: 10.1038/s41598-017-13401-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Gene map of the complete chloroplast genome of T. chinensis. Genes on the inside of the circle are transcribed clockwise, while those outside are transcribed counter clockwise. The darker gray in the inner circle corresponds to GC content, whereas the lighter gray corresponds to AT content.
Figure 2Gene map of the complete chloroplast genome of T. sutchuenensis. Genes on the inside of the circle are transcribed clockwise, while those outside are transcribed counter clockwise. The darker gray in the inner circle corresponds to GC content, whereas the lighter gray corresponds to AT content.
Comparisons among the chloroplast genome characteristics of T. chinensis, T. sutchuenensis, and other five species.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Family | Loranthaceae | Loranthaceae | Viscaceae | Santalaceae | Olacaceae | Orobanchaceae | Solanaceae |
| Accession No. | KY996492 | KY996493 | KJ512176 | KT070882 | KX775962 | M81884 | Z00044 |
| Genome size(bp) | 121,363 | 122,562 | 131,016 | 147,253 | 118,743 | 70,028 | 155,844 |
| LSC length(bp) | 70,357 | 70,630 | 75,814 | 84,601 | 84,168 | 19,799 | 86,684 |
| SSC length(bp) | 6,082 | 6,102 | 9,014 | 13,972 | 9,763 | 4,759 | 18,482 |
| IR length(bp) | 22,462 | 22,915 | 23,094 | 24,340 | 12,406 | 22,735 | 25,339 |
| GC content(%) | 37.3 | 37.3 | 36.2 | 37.7 | 38.1 | 37.5 | 37.8 |
| Number of genes | 106 | 106 | 104 | 114 | 112 | 53 | 151 |
| Number of protein-coding genes | 66 | 66 | 66 | 67 | 69 | 10 | 112 |
| Number of tRNAs | 28 | 28 | 29 | 30 | 35 | 17 | 30 |
| Number of rRNAs | 8 | 8 | 8 | 8 | 8 | 8 | 8 |
| Number of pseudogenes | 4 | 4 | 1 | 9 | 5 | 18 | 1 |
Genes with introns in the chloroplast genomes of T. chinensis and T. sutchuenensis as well as the lengths of the exons and introns.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
|
|
| LSC | 150 | 779 | 375 | ||
|
| LSC | 335 | 621 | 229 | |||
|
| LSC | 6 | 755 | 642 | |||
|
| LSC | 9 | 718 | 483 | |||
|
| LSC; IR | 394 | 645 | 437 | |||
|
| LSC | 10 | 924 | 397 | |||
|
| LSC, IR | 114 | — | 232 | 543 | 26 | |
|
| LSC | 456 | 752 | 1617 | |||
|
| LSC | 127 | 730 | 230 | 771 | 153 | |
|
|
| LSC | 163 | 753 | 410 | ||
|
| LSC | 332 | 634 | 229 | |||
|
| LSC | 6 | 799 | 642 | |||
|
| LSC | 6 | 715 | 483 | |||
|
| LSC; IR | 399 | 712 | 369 | |||
|
| LSC | 9 | 924 | 389 | |||
|
| LSC, IR | 114 | — | 232 | 539 | 26 | |
|
| LSC | 450 | 756 | 1602 | |||
|
| LSC | 127 | 759 | 230 | 785 | 153 |
Figure 3Comparison of the complete chloroplast genomes of six species using the MAUVE algorithm. Local collinear blocks are colored in this figure to indicate syntenic regions, while histograms within each block represent the degree of sequence similarity.
Figure 4Codon content of 20 amino acid and stop codons in all protein-coding genes of the chloroplast genomes of two species. The histogram on the left-hand side of each amino acid shows codon usage within the T. chinensis chloroplast genome, while the right-hand side illustrates the genome of T. sutchuenensis.
Types and amounts of SSRs in the T. chinensis and T. sutchuenensis chloroplast genomes.
|
|
|
|
| ||
|---|---|---|---|---|---|
|
|
|
|
| ||
| mono | A/T | 138 | 134 | 93.9 | 96.4 |
| C/G | 9 | 5 | 6.1 | 3.6 | |
| di | AC/GT | 3 | 4 | 7.2 | 7.7 |
| AG/CT | 14 | 13 | 33.3 | 25 | |
| AT/TA | 25 | 35 | 59.5 | 67.3 | |
| tri | AAG/CTT | 2 | 2 | 100 | 50 |
| AAT/ATT | 0 | 2 | 0 | 50 | |
| tetra | AAAC/GTTT | 1 | 0 | 25 | 0 |
| AAAG/CTTT | 1 | 0 | 25 | 0 | |
| AATC/ATTG | 1 | 0 | 25 | 0 | |
| ACAG/CTGT | 1 | 1 | 25 | 50 | |
| AAGT/ACTT | 0 | 1 | 0 | 50 | |
| penta | AATAT/ATATT | 1 | 1 | 100 | 100 |
Figure 5Phylogenetic trees constructed using two methods based on two datasets from different species. (A) ML tree based on 54 protein-coding genes; (B) ML tree based on matK genes; (C) NJ tree based on 54 protein-coding genes; (D) NJ tree based on matK genes. Number at nodes are values for bootstrap support.
Discrepancies in assembly results obtained using Illumina and PacBio.
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
|
| ||||
| 1 | 4615 | C | 10 | 9 | intergenic region |
| 2 | 17326 | C | 9 | 8 | introns |
| 3 | 17952 | C | 7 | 6 | introns |
| 4 | 27768 | G | 7 | 6 |
|
| 5 | 32700 | C | 6 | 5 |
|
| 6 | 41040 | C | 10 | 9 | intergenic region |
| 7 | 45020 | G | 8 | 7 |
|
| 8 | 51046 | T | 6 | 7 | intergenic region |
| 9 | 55514 | C | 10 | 9 | intergenic region |
| 10 | 58056 | A | 9 | 8 | intergenic region |
| 11 | 87056 | G | 7 | 6 | intergenic region |
| 12 | 90138 | C | 8 | 7 | intergenic region |
| 13 | 101590 | G | 8 | 7 | intergenic region |
| 14 | 104671 | C | 7 | 6 | intergenic region |