Literature DB >> 23577110

Complete plastid genome sequencing of Trochodendraceae reveals a significant expansion of the inverted repeat and suggests a Paleogene divergence between the two extant species.

Yan-xia Sun1, Michael J Moore, Ai-ping Meng, Pamela S Soltis, Douglas E Soltis, Jian-qiang Li, Heng-chang Wang.   

Abstract

The early-diverging eudicot order Trochodendrales contains only two monospecific genera, Tetracentron and Trochodendron. Although an extensive fossil record indicates that the clade is perhaps 100 million years old and was widespread throughout the Northern Hemisphere during the Paleogene and Neogene, the two extant genera are both narrowly distributed in eastern Asia. Recent phylogenetic analyses strongly support a clade of Trochodendrales, Buxales, and Gunneridae (core eudicots), but complete plastome analyses do not resolve the relationships among these groups with strong support. However, plastid phylogenomic analyses have not included data for Tetracentron. To better resolve basal eudicot relationships and to clarify when the two extant genera of Trochodendrales diverged, we sequenced the complete plastid genome of Tetracentron sinense using Illumina technology. The Tetracentron and Trochodendron plastomes possess the typical gene content and arrangement that characterize most angiosperm plastid genomes, but both genomes have the same unusual ∼4 kb expansion of the inverted repeat region to include five genes (rpl22, rps3, rpl16, rpl14, and rps8) that are normally found in the large single-copy region. Maximum likelihood analyses of an 83-gene, 88 taxon angiosperm data set yield an identical tree topology as previous plastid-based trees, and moderately support the sister relationship between Buxaceae and Gunneridae. Molecular dating analyses suggest that Tetracentron and Trochodendron diverged between 44-30 million years ago, which is congruent with the fossil record of Trochodendrales and with previous estimates of the divergence time of these two taxa. We also characterize 154 simple sequence repeat loci from the Tetracentron sinense and Trochodendron aralioides plastomes that will be useful in future studies of population genetic structure for these relict species, both of which are of conservation concern.

Entities:  

Mesh:

Year:  2013        PMID: 23577110      PMCID: PMC3618518          DOI: 10.1371/journal.pone.0060429

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The eudicot order Trochodendrales [1] contains only two extant genera, both of which are monotypic: Trochodendron Sieb. & Zucc. and Tetracentron Oliver. Historically, these two genera have been treated either as the separate families Trochodendraceae and Tetracentraceae, or as the combined family Trochodendraceae [1]–[7]. The Trochodendraceae sensu APG III [1] appear to have been widespread in the Northern Hemisphere during the Paleogene and Neogene [7]–[15]. However, the two extant species of the family have small geographic ranges and are restricted to eastern Asia [16]. Trochodendron aralioides Sieb. & Zucc. is a large, evergreen shrub or small tree native to the mountains of Japan to South Korea and Taiwan, and the Ryukyu Islands [2], [17], whereas Tetracentron sinense Oliver is a deciduous tree occurring in southwestern and central China and the eastern Himalayan regions. Both species are characterized by apetalous flowers arranged in cymose inflorescences and by loculicidal capsules that dehisce to release winged seeds [2], [5], [7], [18]. Although earlier researchers reported that wood of Trochodendrales wood lacked vessels and thus suggested that Trochodendrales were among the earliest-diverging angiosperms, recent research has documented the presence of vessels in the wood of both genera [2], [7], [19]. Molecular phylogenetic studies, including analyses of complete plastid genome sequences, have routinely recovered Trochodendrales as an early-diverging member of the clade Eudicotyledoneae (sensu [20]; all italicized clade names follow this system), specifically as part of a strongly supported clade with Buxales and Gunneridae, or core eudicots [21]–[27]. However, the relationships among Trochodendrales, Buxales, and Gunneridae have often been only weakly supported. In the 17-gene analysis of Soltis et al. [28], which included data from all three plant genomes, Trochodendrales and Buxales were subsequent sisters to Gunneridae, with 100% and 98% BS support, respectively. However, other studies have found Buxales to be sister to Gunneridae with only weak support [24], [26], [29]–[30], whereas in other analyses Trochodendrales have appeared as sister to Gunneridae [27], [31]–[32]. Complete plastid genome sequences have been used increasingly over the past decade to resolve deep-level phylogenetic relationships that have been unclear based on only a few genes. For example, recent plastid phylogenomic studies have helped to resolve key relationships among the earliest-diverging Mesangiospermae [33] as well as early-diverging Eudicotyledoneae and Pentapetalae [26], [34]. Indeed, the plastid genome represents an excellent source of characters for plant phylogenetics due to the generally strong conservation of plastid genome structure and its mix of sequence regions that vary tremendously in evolutionary rate [35]–[37], which enable plastid genome sequence data to be applied to phylogenetic problems at almost any taxonomic level in plants [26], [38]–[43]. It is now relatively inexpensive to generate complete plastid genome sequence due to rapid improvements in next-generation sequencing (NGS) technologies [25], [44]–[45] and due to the relatively small size of the plastid genome (∼150 kb) and its structural conservation, which enable dozens of plastomes to be multiplexed per sequencing lane and facilitate relatively straightforward genome assembly [45]–[48]. Despite the promise of NGS technology for plastid genomics, the complete plastomes of only eight genera of early-diverging eudicots have been reported: Ranunculus (Ranunculaceae, Ranunculales), Megaleranthis (Ranunculaceae, Ranunculales), Nandina (Berberidaceae, Ranunculales), Nelumbo (Nelumbonaceae, Proteales), Platanus (Platanaceae, Proteales), Meliosma (Sabiaceae, Sabiales), Trochodendron (Trochodendraceae, Trochodendrales) and Buxus (Buxaceae, Buxales). Previous phylogenetic analyses based on some of these complete genomes have not fully resolved the relationships among early-diverging eudicots, however; in addition to the uncertainty surrounding relationships of Buxales, Trochodendrales, and Gunneridae, the positions of Sabiales and Proteales remain poorly supported [26]–[27]. Plastome taxon sampling is still sparse in these clades, however, and additional sampling may help elucidate these recalcitrant relationships. In addition to their important role in phylogenetics, plastid genomes may be rich sources of population-level data. The non-recombination and uniparental inheritance of most plastid genomes can make plastid genomes extremely useful for population genetics, particularly for tracing maternal lineages [49]–[50]. For example, chloroplast simple sequence repeats (cpSSR) have been widely used in plant population genetics [51], including within early-diverging eudicots, where numerous cpSSR loci have been reported from the plastid genome of the endangered species Megaleranthis saniculifolia (Ranunculaceae) [52]. Here we report the complete plastid genome sequences of Tetracentron sinense and Trochodendron aralioides (the protein-coding and rRNA genes of Trochodendron cp genome were used for phylogenetic analyses in Moore et al. [26], but the cp genome structure of this genus has never been reported), as well as the results of new phylogenetic analyses based on adding Tetracentron and Megaleranthis genomes [52] to the 83-gene data set of Moore et al. [26]. We also compare the plastid genome structure of Trochodendron and Tetracentron, including the characterization of a significant expansion of the inverted repeat in both taxa, and we estimate the divergence time between the two genera. Finally, we characterize the distribution and location of cpSSRs in both Tetracentron sinense and Trochodendron aralioides, which provided further opportunity to study the population genetic structures of these two ancient relict species.

Results

Sequencing and Genome Assembly

Illumina paired-end sequencing produced 892.11 Mb of data for Tetracentron sinense. We obtained 9912310 raw reads of 90 bp in length. The N50 of contigs was 13,981 bp and the summed length of contigs was 143,709 bp. The mean coverage of this genome was 5424.2×. After de novo and reference-guided assembly, we obtained a cp genome containing nine gaps. PCR and Sanger sequencing were used for filling the gaps. Four junction regions between IRs and SSC/LSC were first determined based on de novo contigs, and subsequently confirmed by PCR amplifications and Sanger sequencing, sequenced results were compared with the assembled genome directly and no mismatch or indel was observed, which validated the accuracy of our assembly. The genome sequences of Tetracentron sinense and Trochodendron aralioides have been submitted to GenBank (GenBank IDs: KC608752 and KC608753).

General Features of the Tetracentron and Trochodendron Plastomes

The plastid genome size of Tetracentron sinense is 164,467 base pairs (bp) (Figure 1), and that of Trochodendron aralioides is 165,945 bp (Figure 2). Both genomes show typical quadripartite structure, consisting of two copies of an inverted repeat (IR) separated by the large single-copy (LSC) and small single-copy (SSC) regions (Table 1). The IR exhibits a significant expansion relative to most other angiosperms at the LSC/IR junction; specifically, the IR in both Tetracentron and Trochodendron has expanded to include the entirety of the rps19, rpl22, rps3, rpl16, rpl14, and rps8 genes (Figures 1, 2). The SSC/IR boundary occurs within the ycf1 gene, as is typical in angiosperms, but is slightly expanded in the Trochodendron genome to include 1461 bp of the 5′ end of ycf1 (versus 1083 bp in Tetracentron; Figure 3). This expansion of the IR at the SSC junction contributes to the difference in length between the two Trochodendrales plastomes; the remainder of the difference is largely the result of length differences among various noncoding regions (Table 2).
Figure 1

Map of the Tetracentron sinense plastid genome.

Figure 2

Map of the Trochodendron aralioides plastid genome.

Table 1

Basic characteristic of the Tetracentron sinense and Trochodendron aralioides plastid genomes.

Tetracentron Trochodendron
total genome length164467165945
IR length3023130744
SSC length1953918974
LSC length8446685483
total length of coding sequence9469995168
total length of noncoding sequence6976870777
overall G/C content38.1%38.0%

All values given are in base pairs (bp), unless otherwise noted.

Figure 3

Comparison of the IR junctions in Tetracentron and Trochodendron.

Table 2

The principal noncoding regions contributing to the size difference between the Tetracentron and Trochodendron plastid genomes.

Spacer region or intron names Tetracentron Trochodendron length difference
trnK-UUU/rps16 spacer8701308438
rps16/trnQ-UUG spacer15291797268
trnS-GCU/trnG-UCC spacer505658153
trnE-UUC/trnT-GGU spacer9571316359
trnT-UGU/trnL-UAA spacer11991309110
petA/psbJ spacer1146754−392
ycf1/ndhF spacer440325−115
*rpl16 intron865972107

All sizes are in base pairs. The only locus residing in the IR is marked with an asterisk (*).

All values given are in base pairs (bp), unless otherwise noted. All sizes are in base pairs. The only locus residing in the IR is marked with an asterisk (*). Both genomes contain 119 genes (79 protein-coding genes, 30 tRNA genes, and 4 rRNA genes) arranged in the same order, of which 24 are duplicated in the IR regions (Table 3). Sequence divergence between Tetracentron and Trochodendron in coding regions is low (Table 4, Figures 4, 5). Only 7 genes (rps11, rpoA, rpl32, rps16, ndhF, ycf1, and rpl36) exhibit divergences of more than 2%, and 12 genes have an identical sequence (Table 4, Figure 4). The genes ndhF, ycf1, and rpl36 have the highest sequence divergences (2.7%, 3.5% and 4.4%, respectively). The coding regions account for 57.5% and 57.3% of the Tetracentron and Trochodendron plastid genomes, respectively. For both cp genomes, single introns are present in 18 genes, whereas three genes (rps12, clpP, and ycf3) have two introns (Table 5). The overall genomic G/C nucleotide composition is 38.1% and 38.0% for Tetracentron and Trochodendron, respectively; detailed A/T contents of different regions of the plastome for both genomes are listed in Table 6. Due to the lower A/T content of the four rRNA genes, the IR regions possess lower A/T content than the single-copy regions.
Table 3

List of genes present in the plastid genomes of Tetracentron sinense and Trochodendron aralioides.

Group of genesName of genes
Protein synthesis and DNA replicationRibosomal RNAs rrn4.5 (×2) rrn5 (×2) rrn16 (×2) rrn23 (×2)
Transfer RNAs trnH-GUG trnK-UUU* trnQ-UUG trnS-GCU trnG-UCC* trnR-UCU trnC-GCA trnD-GUC trnY-GUA trnE-UUC trnT-GGU trnS-UGA trnG-GCC trnfM-CAU trnS-GGA trnT-UGU trnL-UAA*trnF-GAA trnV-UAC* trnM-CAU trnW-CCA trnP-UGG trnI-GAU* (×2) trnL-CAA (×2) trnV-GAC (×2) trnI-GAU (×2) trnA-UGC* (×2) trnR-ACG (×2) trnN-GUU (×2) trnL-UAG
small subunit rps2 rps3 rps4 rps7 (×2) rps8 rps11 rps12* (×2) rps14 rps15 rps16* rps18 rps19
Ribosomal proteins large subunit rpl2* (×2) rpl14 rpl16* rpl20 rpl22 rpl23 (×2) rpl32 rpl33 rpl36
RNA polymerase rpoA rpoB rpoC1* rpoC2
PhotosynthesisPhotosystem I psaA psaB psaC psaI psaJ
Photosystem II psbA psbB psbC psbD psbE psbF psbH psbI psbJ psbK psbL psbM psbN psbT psbZ
Cytochrome b6/f petA petB* petD* petG petL petN
ATP synthase atpA atpB atpE atpF* atpH atpI
NADH dehydrogenase ndhA* ndhB*(×2) ndhC ndhD ndhE ndhF ndhG ndhH ndhI ndhJ ndhK
Large subunit of Rubisco rbcL
Miscellaneous proteinsSubunit of Acetyl-CoA-carboxylase accD
c-type cytochrome synthesis gene ccsA
Envelope membrane protein cemA
Protease clpP*
Translational initiation factor infA
Maturase matK
Genes of unknown functionHypothetical conserved coding frame ycf1 ycf2(×2) ycf3* ycf4

Genes with introns are marked with asterisks (*).

Table 4

Comparisons of the protein-coding genes of Tetracentron and Trochodendron.

GeneLength in Tetracentron Length in Trochodendron Number of nucleotide differencesProportion of nucleotide differencesNumber of indel differences
petL 102102000
psaI 111111000
psaJ 129129000
psbE 252252000
psbF 120120000
psbJ 123123000
psbL 117117000
psbT 108108000
rpl23 288288000
rps19 279279000
rps7 468468000
rps8 399399000
rpl2 82582510.001210
rps3 65765710.001520
petD 50450410.001980
rpl16 50150110.002490
rpl14 36936910.002710
ycf2 68796897190.002761
ndhB 1533153350.003260
ycf3 50750720.003940
rpl33 20120110.004980
psbZ 18918910.005290
psaA 22532253120.005330
psbK 18618610.005380
rps12 37237220.005380
psbA 1062106260.005650
rpl20 35435420.005650
rpoC1 20492070120.005861
atpA 1524152490.005910
rpl22 48648030.006251
ndhJ 47747730.006290
psbD 1062106270.006590
petA 96396370.007270
rpoB 32133213240.007470
psbN 13213210.007580
psaB 22052205170.007710
psbC 14221422110.007740
atpH 24624620.008130
psaC 24624620.008130
ndhA 1095109590.008220
rps4 60660650.008250
infA 23423420.008550
atpB 14971497130.008680
cemA 69069060.00870
petG 11411410.008770
psbI 11111110.009010
rbcL 14281428130.00910
petB 64864860.009260
atpI 74474470.009410
clpP 60960960.009850
rps14 30330330.00990
atpE 40240240.009950
ccsA 966966100.010350
psbB 15271527160.010480
accD 14911491160.010730
ndhK 82285890.010951
ndhC 36336340.011020
petN 909010.011110
ndhG 53153160.01130
rpoC2 41374146500.012091
ndhD 15031503180.012640
rps2 71171190.012660
psbH 22222230.013510
ndhI 54354380.014730
atpF 55555590.016220
matK 15361536250.016280
ndhE 30630350.01651
rps18 30330350.01650
ndhH 11821182200.016920
ycf4 555555100.018050
rps15 27327350.018320
psbM 10510520.019050
rps11 41741790.021580
rpoA 10141014240.023670
rpl32 16216240.024690
rps16 22722760.026220
ndhF 22232223610.027440
ycf1 568856911950.03456
rpl36 11411450.043860

Genes are ranked from lowest to highest proportion of nucleotide differences.

Figure 4

Amount of sequence divergence between the protein-coding genes of Tetracentron and Trochodendron.

Figure 5

Sequence identity plot between Trochodendron and Tetracentron.

Table 5

Exon and intron lengths (bp) in plastid genes containing introns in Tetracentron sinense and Trochodendron aralioides, respectively.

GeneExon 1 (Te/Tr)Intron 1 (Te/Tr)Exon 2 (Te/Tr)Intron 2 (Te/Tr)Exon 3 (Te/Tr)
trnK-UUU 37/3735/35
trnG-UCC 24/24698/69848/48
trnL-UAA 35/35444/44250/50
trnV-UAC 39/39583/58537/37
trnI-GAU 42/42954/95435/35
trnA-UGC 38/38794/79435/35
petB 6/6793/797642/642
petD 8/8704/709496/496
atpF 145/145727/724410/410
ndhA 553/5531106/1084542/542
ndhB 777/777700/700756/756
rpl2 391/391671/674434/434
rpl16 9/9865/972402/402
rps12 114/114232/232538/53626/26
rpoC1 432/432728/7141617/1638
clpP 71/71682/710292/292659/650246/246
ycf3 124/124734/725230/230731/758153/153
rps16 40/40831/844227/227

The rps12 gene is trans-spliced, and hence the length of intron 1 is unknown.

Table 6

A/T content (%) of different regions in Tetracentron and Trochodendron.

Region Tetracentron Trochodendron
overall61.8661.98
LSC63.5063.74
IR57.6357.83
SSC67.8467.48
Protein-coding regions61.5861.53
Genes with introns are marked with asterisks (*). Genes are ranked from lowest to highest proportion of nucleotide differences. The rps12 gene is trans-spliced, and hence the length of intron 1 is unknown.

Characterization of SSR Loci

In all, 154 SSR loci (77 each from Tetracentron sinense and Trochodendron aralioides) were detected in the two plastid genomes, of which 123 are mononucleotide repeats, 28 are dinucleotide repeats, two are trinucleotide repeats, and one is a tetranucleotide repeat (Table 7). Nearly all of the SSR loci are composed of A/T repeats (Table 7), and these SSR loci are mostly present in noncoding regions. The tetranucleotide locus identified in Tetracentron is in the first intron of ycf3. The two trinucleotide loci in Trochodendron are both located in the spacer region between trnK-UUU and rps16. The unique C mononucleotide repeat from Trochodendron is present in the trnV-ndhC intergenic spacer region.
Table 7

Distribution of SSR loci in the plastid genomes of Tetracentron and Trochodendron.

BaseLengthPosition in plastid genome
SSR loci in Tetracentron
A102085–2094 7164–7173 9478–9487 17266–17275 39220–39229 47812–47821 58880–58889 69930–69939 124816–124825 136417–136426 141648–141657
119611–9621 46892–46902 47147–47157 50813–50823 75797–75807 80873–80883 82302–82312 133069–133079 160432–160442
12217–228 49977–49988 50332–50343 118899–118910 162450–162461 163452–163463 163940–163951
1465157–65170
1538842–38856
1739891–39907
1874838–74855
2272886–72907
T105266–5275 6724–6733 9153–9162 19332–19341 54468–54477 63461–63470 67706–67715 107277–107286 112508–112517 117373–117382 118300–118309 121204–121213 126456–126465 130614–130623
117004–7014 7679–7689 13144–13154 31361–31371 37925–37935 47779–47789 67810–67820 76013–76023 88492–88502
1255307–55318 71723–71734 84983–84994 85471–85482 86473–86484 118884–118895 119027–119038
1313902–13914
1472926–72939
AT101734–1743 20833–20842 50404–50413–63181–63190
124862–4873 12996–13007 114822–114833
1460686–60699
TA1034083–34092 34111–34120 114741–114750
1449132–49145
TAAA2046875–46894
SSR loci in Trochodendron
A10118854–118863 126258–126267 142993–143002 163821–163830 18142–18151 40389–40398 41060– 41069 51091–51100 6136–6145 68969–68978 76681–76690 86529–86538
11134406–134416 16427–16437 30306–30316 39963–39973 51490–51500 70911–70921 81823–81833 9789–9799
1210420–10431 48058–48069 48322–48333
13164932–164944
16161805–161820 73777–73792 75726–75741
1546189–46203
17214–230 83299–83315 9304–9320
T10108427–108436 120424–120433 121028–121037 122665–122674 131951–131960 164891–164900 20189–20198 40375–40387 48933–4894253154–53163 53339–53348 5700–5709 6030–6039 68604–68613 72934–72943 83282–83291 87599–87608
11127885–127895 14709–14719 55604–55614 57547–57557
1250271–50282
1373814–73826 86485–86497
1476896–76909
1548889–48903
1689609–89624
AT101724–1733 51556–51565 64459–64468
124921–4932 4943–4954 4984–4995 4998–5009 5044–5055 5085–5096 5099–5110 5145–5156 5186–5197 5200–5211
1873275–73292
TA101738–1747 21689–21698
TAA185016–5033 5218–5235
C1055999–56008

Phylogenetic and Molecular Dating Analyses

ML analyses of the 83-gene, 88-taxon data set yielded a tree with a similar topology and bootstrap support (BS) values (Figure 6) as that of the plastid phylogenomic study of Moore et al. [26]. The clades of Trochodendron+Tetracentron and Ranunculus+Megaleranthis were supported with 100% ML BS support. Trochodendrales are sister to the remaining angiosperms with high support (BS = 100%), but Buxaceae are sister to Gunneridae with only 67% BS support.
Figure 6

A maximum likelihood tree determined by GARLI (−ln L = −1095466.026) for the 83-gene, 88-taxon data set.

Numbers associated with branches are ML bootstrap support values. Error bars around nodes correspond to 95% highest posterior distributions of divergence times based on 6 fossils using the program BEAST. Eo = Eocene, Mi = Miocene, Ol. = Oligocene, Pa = Paleocene, Pl = Pliocene.

A maximum likelihood tree determined by GARLI (−ln L = −1095466.026) for the 83-gene, 88-taxon data set.

Numbers associated with branches are ML bootstrap support values. Error bars around nodes correspond to 95% highest posterior distributions of divergence times based on 6 fossils using the program BEAST. Eo = Eocene, Mi = Miocene, Ol. = Oligocene, Pa = Paleocene, Pl = Pliocene. Molecular dating analyses suggest that Trochodendron and Tetracentron diverged between 44-30 million ago. The crown group 95% highest posterior density (HPD) age estimates for other major lineages of Pentapetalae were as follows: Superasteridae (115-109 mya), Dilleniaceae+Superrosidae (116-112 mya), Superrosidae (114-111 mya), Santalales (98-75 mya), Caryophyllales (76-60 mya), Asteridae (104-99 mya), Rosidae (111-108 mya), Vitaceae+Saxifragales (114-110 mya), and Saxifragales (109-107 mya).

Discussion

Expansion of the IR Region in Trochodendrales Plastomes

The plastid genomes of Tetracentron and Trochodendron exhibit the typical gene content and genome structure of angiosperms [37], [53]–[54], with the notable exception of a significantly expanded IR region (Figures 1, 2, 3). This ∼4 kb expansion is responsible for the relatively large size of both Trochodendrales plastomes, which are ∼4–5 kb larger than the typical upper size range of angiosperm plastid genomes, including those of nearly all other early-diverging eudicots (Table 8). Significant expansion, contraction, and even loss of the IR appears to be an evolutionarily uncommon phenomena but are nonetheless associated with much of the more significant variation in plastome size in angiosperms. For example, the largest known angiosperm plastome, that of Pelargonium x hortorum, also possesses the largest known IR, at ∼76 kb in length [55]. Other significant IR expansions and contractions have been found in Campanulaceae [56]–[57], Apiaceae [58], and Lemna (Araceae) [59].
Table 8

Numbers of genes (including genes that span IR/SC junctions) in the IR regions of early-diverging eudicots.

Basal eudicot lineagesSpeciesGenes in IR regioncp genome size (bp)
Ranunculales Ranunculus macranthus 20155129
Megaleranthis saniculifolia 19159924
Nandina domestica 19156599
Proteales Nelumbo lutea 18163206
Platanus occidentalis 19161791
Sabiales Meliosma aff. cuneifolia 18160357
Buxales Buxus microphylla 18159010
Trochodendrales Tetracentron sinense 24164467
Trochodendron aralioides 24165945

Impact of Additional Taxon Sampling on Basal Eudicot Phylogeny

The inclusion of Megaleranthis and Tetracentron in our analyses had no effect on the relationships among the major early-diverging eudicot lineages, and very little effect on support values. Of the basal splits among the eudicots with BS values less than 100% in both the current tree and that of Moore et al. [26], all were within 3% BS value. For example, the sister relationship of Buxales and Gunneridae is 70% in Moore et al. [26] vs. 67% with the inclusion of Megaleranthis and Tetracentron, and the sister relationship of Sabiales and Proteales has BS support of 80% in Moore et al. [26] vs. 83% in the current analyses. These similar values are unsurprising given that Tetracentron and Trochodendron are found to be relatively closely related in our analyses. Indeed, the relatively low sequence divergence between the Tetracentron and Trochodendron plastid genomes supports the taxonomic placement of Tetracentraceae within Trochodenraceae, as advocated by APG III [1]. Although it is possible that the addition of the noncoding regions of the plastid genome (or at least those noncoding regions that can be aligned) to our data set may improve support for these relationships, we may have to look to the other plant genomes for a confident resolution of relationships among the early-diverging eudicots. In fact, the sister relationship of Buxales and Gunneridae received high support (BS = 98%) in the 17-gene analyses of Soltis et al. [28], which employed a combination of 11 plastid genes, 18S and 26S nuclear rDNA, and 4 mitochondrial genes. However, the sister relationship of Sabiales and Proteales were more poorly supported (BS = 59%) in Soltis et al. [28].

Divergence Time Between Tetracentron and Trochodendron

Cenozoic Trochodendrales fossils are known throughout the Northern Hemisphere, with the Paleocene Nordenskioldia the earliest certain fossil of the order [7]–[15]. Both Tetracentron and Trochodendron had wide distributions in the Northern Hemisphere during the Paleogene and Neogene. Fossil remains of Tetracentron have been found in Japan [60]–[61], Idaho [62], Princeton, British Columbia and Republic, Washington [63], and Iceland [15]; Trochodendron fossil remains have been reported from Kamchatka [64], Japan [11], Idaho and Oregon [11]–[12], Washington [7], and British Columbia [63]. Our estimate of the divergence time between the two genera of Trochodendraceae (44-30 mya) encompasses the recent estimate of 37-31 mya from Bell et al. [65], which was based on analysis of 567 taxa and three genes, as well as the mid-Eocene estimate of ∼45 mya derived from the rbcL analysis of Anderson et al. [66], which employed numerous fossil constraints from the early-diverging eudicots. The congruence among these studies and with the fossil record suggests that a mid- to late Eocene divergence for the two extant Trochodendraceae lineages may be a reasonable estimate.

Analysis of Plastid SSR Loci in the Trochodendrales

Because microsatellite loci, including cpSSRs, often exhibit high variation within species, they are considered valuable molecular markers for population genetics [67]–[69]. A limited number of SSR loci were recently characterized for Tetracentron [70], but no cpSSR loci are available for Trochodendraceae. The 77 cpSSR loci that were identified in both Tetracentron and Trochodendron represent ∼42% more loci than the 54 loci reported in the plastid genome of Megaleranthis (Ranunculaceae), the only other early-diverging eudicot for which a comprehensive analysis of cpSSR loci is available. The abundant and varied cpSSR loci identified in Trochodendrales will be useful in characterizing the population genetics of both extant species, which are of conservation interest in the wild because of their relatively narrow, presumably relictual distributions, and decreasing numbers [71]. Tetracentron is officially afforded second-class protection in China.

Materials and Methods

Sample Preparation, Sequencing, and Assembly

Fresh leaves of Tetracentron sinense were collected from the Kunming Institute of Botany at the Chinese Academy of Sciences, and a voucher was deposited at the Herbarium of Wuhan Botanical Garden, Chinese Academy of Science (HIB). Chloroplast DNA was isolated following the protocol of Zhang et al. [45], and an Illumina library was constructed following the manufacturer’s protocol (Illumina). The DNA was indexed by tag and sequenced together with eight other species in one lane of an Illumina Genome Analyzer IIx at Beijing Genomics Institute (BGI) in Shenzhen, China. Illumina Pipeline 1.3.2 was used conducting image analysis and base calling. Raw sequence reads produced by Illumina paired-end sequencing were filtered for high quality reads which were subsequently assembled into contigs with a minimum length of 100 bp using SOAPdenovo [72] with the Kmer = 57. Contigs were aligned to the Trochodendron aralioides plastid genome using BLAST (http://blast.ncbi.nlm.nih.gov/), and aligned contigs were ordered according to the reference genome.

Genome Annotation and Analysis

The Tetracentron and Trochodendron plastid genomes were annotated with DOGMA [73] and BLAST tools from NCBI (the National Center for Biotechnology Information). Physical maps were generated using GenomeVx [74] with subsequent manual editing. Sequence divergence between the Tetracentron and Trochodendron plastid genomes was evaluated using DnaSP version 5.10 [75], and genome sequence identity plots were generated using mVISTA [76] (http://genome.lbl.gov/vista/mvista/submit.shtml). Msatfinder ver. 1.6.8 [77] was used to identify SSR loci by manually setting repeat units.

Phylogenetic and Divergence Time Analyses

All protein-coding sequences, as well as all rRNA sequences, were extracted from the Tetracentron and Megaleranthis plastome [52] and added manually to the 83-gene, 86-taxon alignment of Moore et al. [26]. ML analyses were performed on the concatenated 83-gene data set using the following partitioning strategy: (1) codon positions 1 and 2 together; (2) codon position 3; and (3) rRNA genes. The optimal nucleotide sequence model was selected for each partition using jModelTest 2.1.1 using the Decision Theory (DT) criterion [78]. The following models were selected: TVM+I+Γ for codon positions 1+2 and for codon position 3, and TIM1+ I+Γ for rRNA. Partitioned ML analyses were conducted using GARLI 2.0 [79]. A total of ten search replicates were conducted to find the optimal tree, and nonparametric bootstrap support was assessed with 100 replicates [80]. All ML searches used random taxon addition to build starting trees. Divergence times were estimated using BEAST version 1.7.4 [81], using the same dating strategies employed in Moore et al. [26]. In addition to the three calibration points (used in Moore et al. [26]) of minimum ages of 131.8 mya for angiosperms [82]–[85], 125 mya for eudicots [83], [86], and 85 mya for the most recent common ancestor of Quercus and Cucumis [26], we additionally constrained the stem lineage of Malpighiales using a minimum of 89.3 my [87] and the node uniting Calycanthus and Liriodendron using 98 my [88], and set the age of Proteales to a minimum of 98 my [89].
  37 in total

1.  Automatic annotation of organellar genomes with DOGMA.

Authors:  Stacia K Wyman; Robert K Jansen; Jeffrey L Boore
Journal:  Bioinformatics       Date:  2004-06-04       Impact factor: 6.937

2.  Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms.

Authors:  Michael J Moore; Charles D Bell; Pamela S Soltis; Douglas E Soltis
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-28       Impact factor: 11.205

3.  Chloroplast simple sequence repeats (cpSSRs): technical resources and recommendations for expanding cpSSR discovery and applications to a wide array of plant species.

Authors:  Daniel Ebert; Rod Peakall
Journal:  Mol Ecol Resour       Date:  2009-01-28       Impact factor: 7.090

4.  CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP.

Authors:  Joseph Felsenstein
Journal:  Evolution       Date:  1985-07       Impact factor: 3.694

5.  The highly rearranged chloroplast genome of Trachelium caeruleum (Campanulaceae): multiple inversions, inverted repeat expansion and contraction, transposition, insertions/deletions, and several repeat families.

Authors:  M E Cosner; R K Jansen; J D Palmer; S R Downie
Journal:  Curr Genet       Date:  1997-05       Impact factor: 3.886

6.  Polymorphic simple sequence repeat regions in chloroplast genomes: applications to the population genetics of pines.

Authors:  W Powell; M Morgante; R McDevitt; G G Vendramin; J A Rafalski
Journal:  Proc Natl Acad Sci U S A       Date:  1995-08-15       Impact factor: 11.205

7.  Complete chloroplast DNA sequence from a Korean endemic genus, Megaleranthis saniculifolia, and its evolutionary implications.

Authors:  Young-Kyu Kim; Chong-wook Park; Ki-Joong Kim
Journal:  Mol Cells       Date:  2009-03-19       Impact factor: 5.034

8.  Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns.

Authors:  Robert K Jansen; Zhengqiu Cai; Linda A Raubeson; Henry Daniell; Claude W Depamphilis; James Leebens-Mack; Kai F Müller; Mary Guisinger-Bellian; Rosemarie C Haberle; Anne K Hansen; Timothy W Chumley; Seung-Bum Lee; Rhiannon Peery; Joel R McNeal; Jennifer V Kuehl; Jeffrey L Boore
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-28       Impact factor: 11.205

9.  Rapid and accurate pyrosequencing of angiosperm plastid genomes.

Authors:  Michael J Moore; Amit Dhingra; Pamela S Soltis; Regina Shaw; William G Farmerie; Kevin M Folta; Douglas E Soltis
Journal:  BMC Plant Biol       Date:  2006-08-25       Impact factor: 4.215

10.  Complete chloroplast genome sequence of a major allogamous forage species, perennial ryegrass (Lolium perenne L.).

Authors:  Kerstin Diekmann; Trevor R Hodkinson; Kenneth H Wolfe; Rob van den Bekerom; Philip J Dix; Susanne Barth
Journal:  DNA Res       Date:  2009-05-04       Impact factor: 4.458

View more
  26 in total

1.  The complete plastome of macaw palm [Acrocomia aculeata (Jacq.) Lodd. ex Mart.] and extensive molecular analyses of the evolution of plastid genes in Arecaceae.

Authors:  Amanda de Santana Lopes; Túlio Gomes Pacheco; Tabea Nimz; Leila do Nascimento Vieira; Miguel P Guerra; Rubens O Nodari; Emanuel Maltempi de Souza; Fábio de Oliveira Pedrosa; Marcelo Rogalski
Journal:  Planta       Date:  2018-01-16       Impact factor: 4.116

2.  The Linum usitatissimum L. plastome reveals atypical structural evolution, new editing sites, and the phylogenetic position of Linaceae within Malpighiales.

Authors:  Amanda de Santana Lopes; Túlio Gomes Pacheco; Karla Gasparini Dos Santos; Leila do Nascimento Vieira; Miguel Pedro Guerra; Rubens Onofre Nodari; Emanuel Maltempi de Souza; Fábio de Oliveira Pedrosa; Marcelo Rogalski
Journal:  Plant Cell Rep       Date:  2017-10-30       Impact factor: 4.570

3.  Complete Chloroplast Genome of Tanaecium tetragonolobum: The First Bignoniaceae Plastome.

Authors:  Alison Gonçalves Nazareno; Monica Carlsen; Lúcia Garcez Lohmann
Journal:  PLoS One       Date:  2015-06-23       Impact factor: 3.240

Review 4.  A review of the prevalence, utility, and caveats of using chloroplast simple sequence repeats for studies of plant biology.

Authors:  Gregory L Wheeler; Hanna E Dorman; Alenda Buchanan; Lavanya Challagundla; Lisa E Wallace
Journal:  Appl Plant Sci       Date:  2014-11-20       Impact factor: 1.936

5.  A precise chloroplast genome of Nelumbo nucifera (Nelumbonaceae) evaluated with Sanger, Illumina MiSeq, and PacBio RS II sequencing platforms: insight into the plastid evolution of basal eudicots.

Authors:  Zhihua Wu; Songtao Gui; Zhiwu Quan; Lei Pan; Shuzhen Wang; Weidong Ke; Dequan Liang; Yi Ding
Journal:  BMC Plant Biol       Date:  2014-11-19       Impact factor: 4.215

6.  Complete chloroplast genome of Macadamia integrifolia confirms the position of the Gondwanan early-diverging eudicot family Proteaceae.

Authors:  Catherine J Nock; Abdul Baten; Graham J King
Journal:  BMC Genomics       Date:  2014-12-08       Impact factor: 3.969

7.  Complete plastome sequencing of both living species of Circaeasteraceae (Ranunculales) reveals unusual rearrangements and the loss of the ndh gene family.

Authors:  Yanxia Sun; Michael J Moore; Nan Lin; Kole F Adelalu; Aiping Meng; Shuguang Jian; Linsen Yang; Jianqiang Li; Hengchang Wang
Journal:  BMC Genomics       Date:  2017-08-09       Impact factor: 3.969

8.  Complete Plastid Genome Sequencing of Four Tilia Species (Malvaceae): A Comparative Analysis and Phylogenetic Implications.

Authors:  Jie Cai; Peng-Fei Ma; Hong-Tao Li; De-Zhu Li
Journal:  PLoS One       Date:  2015-11-13       Impact factor: 3.240

9.  Mimosoid legume plastome evolution: IR expansion, tandem repeat expansions, and accelerated rate of evolution in clpP.

Authors:  Diana V Dugas; David Hernandez; Erik J M Koenen; Erika Schwarz; Shannon Straub; Colin E Hughes; Robert K Jansen; Madhugiri Nageswara-Rao; Martijn Staats; Joshua T Trujillo; Nahid H Hajrah; Njud S Alharbi; Abdulrahman L Al-Malki; Jamal S M Sabir; C Donovan Bailey
Journal:  Sci Rep       Date:  2015-11-23       Impact factor: 4.379

10.  The Complete Chloroplast Genome Sequences of Three Veroniceae Species (Plantaginaceae): Comparative Analysis and Highly Divergent Regions.

Authors:  Kyoung Su Choi; Myong Gi Chung; SeonJoo Park
Journal:  Front Plant Sci       Date:  2016-03-23       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.