| Literature DB >> 27917320 |
Xin Yao1, Ying-Ying Liu2, Yun-Hong Tan3, Yu Song1, Richard T Corlett3.
Abstract
Complete chloroplast genome sequences have been very useful for understanding phylogenetic relationships in angiosperms at the family level and above, but there are currently large gaps in coverage. We report the chloroplast genome for Helwingia himalaica, the first in the distinctive family Helwingiaceae and only the second genus to be sequenced in the order Aquifoliales. We then combine this with 36 published sequences in the large (c. 35,000 species) subclass Campanulidae in order to investigate relationships at the order and family levels. The Helwingia genome consists of 158,362 bp containing a pair of inverted repeat (IR) regions of 25,996 bp separated by a large single-copy (LSC) region and a small single-copy (SSC) region which are 87,810 and 18,560 bp, respectively. There are 142 known genes, including 94 protein-coding genes, eight ribosomal RNA genes, and 40 tRNA genes. The topology of the phylogenetic relationships between Apiales, Asterales, and Dipsacales differed between analyses based on complete genome sequences and on 36 shared protein-coding genes, showing that further studies of campanulid phylogeny are needed.Entities:
Keywords: Asterids; Campanulidae; Phylogeny; Plastomes; Yunnan
Year: 2016 PMID: 27917320 PMCID: PMC5131622 DOI: 10.7717/peerj.2734
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
List of campanulid species (and the outgroup, Coffea arabica) and their accession numbers in GenBank included in the phylogenetic analyses of whole chloroplast genomes.
| Species | Accession number in NCBI | Family | Order | Length (bp) | Coding gene | tRNA | rRNA | GC (%) | LSC (bp) | SSC (bp) | IRs (bp) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Apiaceae | Apiales | 147,074 | 85 | 35 | 8 | 37.5 | 93,367 (63.48) | 17,573 (11.95) | 36,134 (24.57) | ||
| Apiaceae | Apiales | 154,719 | 85 | 37 | 8 | 37.4 | 84,768 (54.79) | 17,551 (11.34) | 52,400 (33.87) | ||
| Apiaceae | Apiales | 155,989 | 84 | 37 | 8 | 37.7 | 85,870 (55.05) | 17,518 (11.23) | 52,601 (33.72) | ||
| Apiaceae | Apiales | 158,355 | 88 | 37 | 8 | 37.6 | 85,230 (53.82) | 27,993 (17.68) | 55,986 (35.35) | ||
| Apiaceae | Apiales | 155,911 | 85 | 43 | 8 | 37.7 | 84,244 (54.03) | 17,571 (11.27) | 54,096 (34.70) | ||
| Apiaceae | Apiales | 153,628 | 85 | 37 | 8 | 37.6 | 86,659 (56.41) | 17,470 (11.37) | 49,499 (32.22) | ||
| Apiaceae | Apiales | 158,500 | 88 | 37 | 8 | 37.6 | 84,875 (53.55) | 17,661 (11.14) | 55,964 (35.31) | ||
| Apiaceae | Apiales | 147,282 | 83 | 36 | 8 | 37.5 | 93,185 (63.27) | 17,663 (11.99) | 36,434 (24.74) | ||
| Apiaceae | Apiales | 152,890 | 84 | 37 | 8 | 37.8 | 86,116 (56.33) | 17,508 (11.45) | 49,266 (32.22) | ||
| Apiaceae | Apiales | 154,737 | 85 | 37 | 8 | 37.3 | 84,585 (54.66) | 17,140 (11.08) | 53,012 (34.26) | ||
| Araliaceae | Apiales | 156,687 | 87 | 37 | 8 | 38.0 | 86,680 (55.32) | 18,247 (11.65) | 51,760 (33.03) | ||
| Araliaceae | Apiales | 153,207 | 85 | 37 | 8 | 37.6 | 84,352 (55.06) | 18,739 (12.23) | 50,116 (32.71) | ||
| Araliaceae | Apiales | 156,413 | 87 | 37 | 8 | 37.9 | 86,467 (55.28) | 18,118 (11.58) | 51,828 (33.14) | ||
| Araliaceae | Apiales | 156,318 | 87 | 37 | 8 | 38.1 | 86,114 (55.09) | 18,070 (11.56) | 52,134 (33.35) | ||
| Aquifoliaceae | Aquifoliales | 157,671 | 95 | 40 | 8 | 37.6 | 87,000 (55.18) | 18,436 (11.69) | 52,234 (33.13) | ||
| Aquifoliaceae | Aquifoliales | 157,610 | 95 | 40 | 8 | 37.6 | 86,952 (55.17) | 18,429 (11.69) | 52,228 (33.14) | ||
| Aquifoliaceae | Aquifoliales | 157,611 | 95 | 40 | 8 | 37.6 | 86,948 (55.17) | 18,434 (11.70) | 52,227 (33.14) | ||
| Aquifoliaceae | Aquifoliales | 157,621 | 95 | 40 | 8 | 37.6 | 87,064 (55.24) | 18,435 (11.70) | 52,122 (33.07) | ||
| Aquifoliaceae | Aquifoliales | 157,741 | 95 | 40 | 8 | 37.6 | 87,109 (55.22) | 18,436 (11.69) | 52,238 (33.12) | ||
| Aquifoliaceae | Aquifoliales | 157,822 | 95 | 40 | 8 | 37.7 | 87,204 (55.25) | 18,513 (11.73) | 52,182 (33.06) | ||
| Aquifoliaceae | Aquifoliales | 157,918 | 95 | 40 | 8 | 37.6 | 87,266 (55.26) | 18,432 (11.67) | 52,222 (33.07) | ||
| Helwingiaceae | Aquifoliales | 158,362 | 94 | 40 | 8 | 37.7 | 87,810 (55.45) | 18,560 (11.72) | 51,991 (32.83) | ||
| Asteraceae | Asterales | 151,076 | 87 | 37 | 8 | 37.5 | 82,740 (54.77) | 18,392 (12.17) | 49,944 (33.06) | ||
| Asteraceae | Asterales | 149,510 | 87 | 37 | 8 | 37.7 | 81,961 (54.82) | 17,972 (12.02) | 49,577 (33.16) | ||
| Asteraceae | Asterales | 152,559 | 90 | 36 | 8 | 37.7 | 83,596 (54.80) | 18,487 (12.12) | 50,476 (33.09) | ||
| Asteraceae | Asterales | 151,129 | 85 | 35 | 8 | 37.4 | 82,885 (54.84) | 18,376 (12.16) | 49,868 (33.00) | ||
| Asteraceae | Asterales | 152,550 | 87 | 37 | 8 | 37.7 | 83,580 (54.79) | 18,660 (12.23) | 50,310 (32.98) | ||
| Asteraceae | Asterales | 152,772 | 86 | 44 | 8 | 37.5 | 84,105 (55.05) | 18,599 (12.17) | 50,068 (32.77) | ||
| Asteraceae | Asterales | 150,944 | 67 | 25 | 7 | 37.4 | 82,193 (54.45) | 18,271 (12.10) | 50,480 (33.44) | ||
| Asteraceae | Asterales | 152,803 | 57 | 17 | 8 | 37.6 | 84,593 (55.36) | 18,900 (12.37) | 49,310 (32.27) | ||
| Asteraceae | Asterales | 151,410 | 84 | 32 | 8 | 37.2 | 85,311 (56.34) | 18,559 (12.26) | 47,540 (31.40) | ||
| Campanulaceae | Asterales | 171,724 | 82 | 37 | 8 | 38.8 | 105,555 (61.47) | 11,295 (6.58) | 54,874 (31.95) | ||
| Campanulaceae | Asterales | 169,551 | 83 | 36 | 8 | 38.8 | 102,320 (60.35) | 7,747 (4.57) | 59,484 (35.08) | ||
| Campanulaceae | Asterales | 167,287 | 82 | 37 | 10 | 38.8 | 104,955 (62.74) | 8,578 (5.13) | 53,754 (32.13) | ||
| Campanulaceae | Asterales | 162,321 | 83 | 44 | 10 | 38.3 | 100,110 (61.67) | 7,661 (4.72) | 54,550 (33.61) | ||
| Caprifoliaceae | Dipsacales | 156,875 | 81 | 38 | 8 | 38.4 | 90,137 (57.46) | 18,846 (12.01) | 47,892 (30.53) | ||
| Caprifoliaceae | Dipsacales | 155,078 | 81 | 39 | 8 | 38.6 | 88,858 (57.30) | 18,672 (12.04) | 47,548 (30.66) | ||
| Rubiaceae | Gentianales | 155,189 | 85 | 45 | 8 | 37.4 | 85,164 (54.88) | 18,207 (11.73) | 51,818 (33.39) |
Note:
Numbers in parentheses in the LSC, SSC and IRs columns are the percentage of the total length.
Figure 1Circular gene map of the Helwingia himalaica chloroplast genome.
Genes placed outside of the outer circle are transcribed in clockwise direction whereas genes inside are transcribed in counterclockwise direction. Different colours refer to genes from different functional groups. The area in darker gray in the inner circle indicates GC content while the lighter gray indicates AT content.
List of genes in the chloroplast genome of Helwingia himalaica.
| Category | Groups of gene | Name of genes |
|---|---|---|
| Protein synthesis and DNA-replication | Transfer RNAs | |
| Ribosomal RNAs | ||
| Ribosomal protein small subunit | ||
| Ribosomal protein large subunit | ||
| Subunits of RNA polymerase | ||
| Photosynthesis | photosystem I | |
| Photosystem II | ||
| Cythochrome b/f complex | ||
| ATP synthase | ||
| NADH-dehydrogenase | ||
| Large subunit rubisco | ||
| Miscellaneous group | Translation initiation factor | |
| Acetyl-CoA carboxylase | ||
| Cytochrome c biogenesis | ||
| Maturase | ||
| ATP-dependent protease | ||
| Inner membrane protein | ||
| Pseudogene unknown function | Conserved hypothetical chloroplast ORF |
List of repeated sequences in the chloroplast genome of Helwingia himalaica.
| Repeat length (bp) | Repeat A start site | Repeat A location | Repeat A region | Repeat B start site | Repeat B location | Repeat B region | Repeat type |
|---|---|---|---|---|---|---|---|
| 43 | 0 | IRa | 87797 | IRb | P | ||
| 30 | 9030 | LSC | 47728 | LSC | P | ||
| 27 | 45989 | LSC | 124368 | SSC | F | ||
| 26 | 43 | LSC | 87772 | IRb | P | ||
| 26 | 10811 | LSC | 10840 | LSC | P | ||
| 26 | 33886 | LSC | 33912 | LSC | F | ||
| 26 | 91380 | IRb | 154796 | IRa | F | ||
| 23 | 61775 | LSC | 61786 | LSC | F | ||
| 21 | 9036 | LSC | 37766 | LSC | F | ||
| 21 | 37766 | LSC | 47731 | LSC | P | ||
| 21 | 38950 | LSC | 69860 | LSC | F | ||
| 20 | 38564 | LSC | 38581 | LSC | F | ||
| 20 | 49313 | LSC | 49333 | LSC | F | ||
| 19 | 385 | LSC | 412 | LSC | P | ||
| 19 | 6791 | LSC | 6817 | LSC | F | ||
| 19 | 8756 | LSC | 38919 | LSC | P | ||
| 19 | 10620 | LSC | 38738 | LSC | F | ||
| 19 | 15636 | LSC | 15653 | LSC | F | ||
| 19 | 34034 | LSC | 111456 | IRb | R | ||
| 19 | 34034 | LSC | 134727 | IRa | C | ||
| 19 | 53790 | LSC | 81476 | LSC | P | ||
| 19 | 59571 | LSC | 59590 | LSC | F | ||
| 18 | 4719 | LSC | 66845 | LSC | R | ||
| 18 | 5785 | LSC | 34036 | LSC | R | ||
| 18 | 6349 | LSC | 90529 | IRb | F | ||
| 18 | 6349 | LSC | 155655 | IRa | P | ||
| 18 | 9101 | LSC | 37836 | LSC | F | ||
| 18 | 40424 | LSC | 42639 | LSC | F | ||
| 18 | 40973 | LSC | 43197 | LSC | F | ||
| 18 | 57793 | LSC | 121171 | SSC | C |
Notes:
rpl2(trnH-GUG) means spacer between rpl2 and trnH-GUG, etc.
P means palindromic match, F means forward (direct) match, R means reverse match, and C means complement match.
Figure 2Gene arrangement map of chloroplast genome alignment of seven representative species from seven campanulid families and Coffea arabica (as a reference) determined by Mauve software (Darling et al., 2004).
The polyline in the blocks indicates sequence similarity among these eight species. Line linking gene blocks among the eight species with same colour indicates ortholog. Gene blocks above are transcribed clockwise and those below are transcribed counterclockwise. The coding genes in the 14 main gene blocks are listed under the figure.
The 36 protein-coding genes shared by the 37 campanulid species and used for construction of the protein-coding gene phylogeny.
| Gene | Length (bp) | Gene | Length (bp) | Gene | Length (bp) |
|---|---|---|---|---|---|
| 1,539 | 2,205 | 144 | |||
| 246 | 246 | 1,458 | |||
| 744 | 113 | 417 | |||
| 708 | 135 | 415 | |||
| 363 | 1,062 | 207 | |||
| 1,516 | 1,062 | 207 | |||
| 306 | 120 | 114 | |||
| 477 | 222 | 747 | |||
| 963 | 111 | 618 | |||
| 114 | 186 | 435 | |||
| 96 | 117 | 418 | |||
| 2,253 | 132 | 336 |
Figure 3Phylogeny of 37 campanulid species using their complete chloroplast genomes.
In subgraph (A) numbers near nodes (on left) indicate the Bayesian posterior probability and numbers near nodes (on right) indicate the maximum likelihood bootstrap values for each clade present in the 50% majority-rule consensus tree. In subgraph (B) numbers near nodes indicate the maximum parsimony bootstrap values for each clade present in the 50% majority-rule consensus tree.
Figure 4Phylogeny of 37 campanulid species using their 36 shared coding genes.
In subgraph (A) numbers near nodes indicate the Bayesian posterior probability. In subgraph (B) numbers near nodes indicate the maximum likelihood bootstrap values for each clade present in the 50% majority-rule consensus tree. In subgraph (C) numbers near nodes indicate the maximum parsimony bootstrap values for each clade present in the 50% majority-rule consensus tree.