Literature DB >> 35406981

Plastid Genome of Equisetum xylochaetum from the Atacama Desert, Chile and the Relationships of Equisetum Based on Frequently Used Plastid Genes and Network Analysis.

Anchittha Satjarak1, Linda E Graham2, Marie T Trest2, Patricia Arancibia-Avila3.   

Abstract

The modern pteridophyte genus Equisetum is the only survivor of Sphenopsida, an ancient clade known from the Devonian. This genus, of nearly worldwide distribution, comprises approximately 15 extant species. However, genomic information is limited. In this study, we assembled the complete chloroplast genome of the giant species Equisetum xylochaetum from a metagenomic sequence and compared the plastid genome structure and protein-coding regions with information available for two other Equisetum species using network analysis. Equisetum chloroplast genomes showed conserved traits of quadripartite structure, gene content, and gene order. Phylogenetic analysis based on plastome protein-coding regions corroborated previous reports that Equisetum is monophyletic, and that E. xylochaetum is more closely related to E. hyemale than to E. arvense. Single-gene phylogenetic estimation and haplotype analysis showed that E. xylochaetum belonged to the subgenus Hippochaete. Single-gene haplotype analysis revealed that E. arvense, E. hyemale, E. myriochaetum, and E. variegatum resolved more than one haplotype per species, suggesting the presence of a high diversity or a high mutation rate of the corresponding nucleotide sequence. Sequences from E. bogotense appeared as a distinct group of haplotypes representing the subgenus Paramochaete that diverged from Hippochaete and Equisetum. In addition, the taxa that were frequently located at the joint region of the map were E. scirpoides and E. pratense, suggesting the presence of some plastome characters among the Equiseum subgenera.

Entities:  

Keywords:  Equisetum; haplotype map; plastid genome

Year:  2022        PMID: 35406981      PMCID: PMC9002811          DOI: 10.3390/plants11071001

Source DB:  PubMed          Journal:  Plants (Basel)        ISSN: 2223-7747


1. Introduction

Equisetum L. is a genus of vascular plants that represents ancient Sphenopsida, a long-enduring clade known from fossils of the Devonian and later ages and, therefore, is considered useful in understanding the evolution of vascular plants. This genus is comprised of approximately 15 extant species, with a nearly worldwide distribution [1,2]. Previous studies have examined the evolutionary relationships among stem and crown Equisetum species using both morphology and genomic data. However, because morphology can vary as the result of hybridization and climate differences, molecular approaches have become popular. Recent studies have indicated three subgenera, including the primitive subgenus Paramochaete, and the later diverging subgenera Equisetum and Hippochaete. However, such relationships were estimated from relatively few plastid genes, e.g., rbcL, rps4, and trnL-F e.g., [3,4,5,6]. Among reported pteridophyte plastome sequences, only three were from Equisetum species: one from E. hyemale [7] and two from US and Korean E. arvense [8,9], which were placed in subgenera Equisetum and Hippochaete, respectively e.g., [3,4,5,6]. These reports revealed plastome variation among Equisetum species. The two E. arvense genomes differed by 417 bp and the E. hyemale genome was about 1.5 kbp smaller than that of E. arvense. In addition, rpl16 of E. arvense had an intron that is not present in E. hyemale [7,8,9]. These observations indicate that additional chloroplast genomes would be useful in evaluating evolutionary trends in this long-enduring genus. Previous Equisetum plastome information was obtained using PCR amplification or from organelle-enriched DNA. The advancement of sequencing technologies and computational techniques allowed us to obtain complete organelle genome sequences from the shotgun metagenomic data we have archived for E. xylochaetum, presenting an additional technical option for obtaining Equisetum plastid genomes. Therefore, in this study, we assembled the complete plastid genome of E. xylochaetum, a giant species endemic to the Atacama Desert of Chile, South America, and used the information obtained to explore the evolution of Equisetum plastid genomes and the phylogenetic relationship of Equisetum species, as well as to determine whether the phylogeny estimated by using the popular plastid conserved regions was congruent with the haplotype mapping results of the corresponding sequences. Results showed that we successfully constructed the de novo plastid genome of E. xylochaetum using the shotgun metagenomic data. Phylogenetic estimations and comparison of Equisetum plastid genomes showed that E. xylochaetum was in the subgenus Hippochaete and that the Equisetum plastid genomes from subgenera Hippochaete and Equisetum were conserved in terms of genome structure, gene content, and gene order. Furthermore, results from TCS haplotype mapping showed that some of the taxa had a higher level of nucleotide diversity and some of the taxa shared common nucleotide haplotypes. Therefore, more conserved nucleotide regions and complete plastid genomes are needed for a better understanding of the evolutionary relationships of Equisetum.

2. Results

The chloroplast genome of E. xylochaetum displayed a quadripartite structure. The single-copy regions were 93,902 bp and 9726 bp, with two reverse repeated regions (IRa and IRb) of 14,386 bp in length. The GC contents of the LSC, SSC, and IR regions individually, and of the cp genome as a whole, were 31.5%, 30.9%, 48.4%, and 33.9%, respectively. The E. xylochaetum plastome encoded a total of 119 unique genes, of which nine were duplicated in the IR regions. Seventy-eight were protein-coding genes, 38 were tRNA genes, and eight were rRNA genes. Fourteen genes contain introns (atpF, clpP, ndhA, ndhB, petB, petD, rpl2, rpoC1, rps12, ycf3, trnK(uuu), trnL(uaa), trnV(uac), and trnI(gau)) as shown in Figure 1.
Figure 1

Circular map of the Equisetum xylochaetum plastid genome, NCBI accession MW282958, drawn by OGDRAW version 1.3.1 [10]. Genes positioned on the outside of the map are transcribed counterclockwise and those inside the map are transcribed clockwise. The thick lines indicate the extent of the inverted repeat regions.

A comparison of Equisetum plastid genomes showed a collinear relationship, forming only one syntenic block in whole genome alignment. The genomes had similar genome size, % GC, gene content, gene length, and had identical gene order (Table 1 and Table 2). The protein-coding regions of E. xylochaetum plastid genes were subjected to purifying selection when compared against the corresponding protein-coding genes of E. hyemale and E. arvense.
Table 1

Comparison of general characters of Equisetum plastid genomes.

E. xylochaetum E. hyemale E. arvense (US)E. arvense (Korea)
Accession MW282958KC117177GU191334JN968380
genome size 132,400131,760133,309132,726
LSC 93,90292,58093,54292,961
SSC 9,72618,99419,46919,477
IRs 14,38610,09310,14910,144
%GC 33.933.733.433.4
Table 2

Protein-coding gene content and introns of Equisetum plastid genomes. Comparison showed percent identity and size of the gene and its derived proteins.

DNAProtein
No.Gene (# of Intron)Identical Site (%)Mean (bp)SD (bp)min (bp)max (bp)Identical Site (%)Mean (aa)SD (aa)min (aa)max (aa)
1. accD 92.81021.581948114389.934026.5316380
2. atpA 95.5153920.815271575985126.9508524
3. atpB 94.4147001470147096.34890489489
4. atpE 93.4396039639693.11310131131
5.atpF (1)97.5555055555573.91840184184
6. atpH 96.324602462461008108181
7. atpI 95.4747074774799.12480284248
8. ccsA 93.3943.51.594294591.1313.50.5313314
9. cemA 89.51438.521.71425147682.7478.57.2474491
10. chlB 94.31549.54.51545155493.2515.51.5514517
11. chlL 91.6879687388593.92922290294
12. chlN 92.81300.510.51290131190.4432.53.5429436
13.clpP (1)95.1615061561598.52040204204
14. infA 94.7243024324396.38008080
15. matK 88.7147031467147381.44891488490
16.ndhA (1)92.71101.81.31101110491.8366.30.4366367
17.ndhB (1)94.8147301473147394.34900490490
18. ndhC 95.9363036336398.31200120120
19. ndhD 95.1149701497149795.24980498498
20. ndhE 98.330303033031001000100100
21. ndhF 92.82221.51.52220222392.2739.50.5739740
22. ndhG 91.460617.258563385.72015.7194210
23. ndhH 95.3118201182118297.23930393393
24. ndhI 97.3549054954998.41820182182
25. ndhJ 93.9520.54.551652594.8172.51.5171174
26. ndhK 90.6747.89.173275386.8248.33243250
27. petA 92.4955.57.594896393.8317.52.5315320
28.petB (1)9664806486481002150215215
29.petD (1)96.948304834831001600160160
30. petG 97.411401141141003703737
31. petL 93.8960969693.53103131
32. petN 9996096961003103131
33. psaA 96.2225302253225399.67500750750
34. psaB 95.7220502205220599.27340734734
35. psaC 95.1246024624698.88108181
36. psaI 91.9111011111194.43603636
37. psaJ 97.712901291291004204242
38. psaM 96990999996.93203232
39. psbA 98.110620106210621003530353353
40. psbB 96.11527015271527995080508508
41. psbC 95142201422142299.44730473473
42. psbD 95.6106201062106287.33530353353
43. psbE 97.224602462461008108181
44. psbF 98.312001201201003903939
45. psbH 94.7225022522589.27407474
46. psbI 97.311101111111003603636
47. psbJ 99.212301231231004004040
48. psbK 97168016816896.45505555
49. psbL 98.311701171171003803838
50. psbM 98.2111011111194.43603636
51. psbN 95.51320132132934304343
52. psbT 97.4112.51.511111497.336.50.53637
53. psbZ 94.218901891891006206262
54. rbcL 96.1142801428142899.24750475475
55. rpl14 97.6369036936999.21220122122
56.rpl16 (1 in E. arvense)93.14230423423951400140140
57.rpl2 (1)94.3834.81.383483795.3277.30.4277278
58. rpl20 89.7347.31.334534885.2114.80.4114115
59. rpl21 91.3364.51.536336686120.50.5120121
60. rpl22 94.1372037237296.61230123123
61. rpl23 94.9273027327393.39009090
62. rpl32 95.3171017117198.25605656
63. rpl33 95.5201020120190.96606666
64. rpl36 93.911401141141003703737
65. rpoA 93.51.18.54.51014102393.5338.51.5337340
66. rpoB 93.93235.533.83216329492.91.00.511.310711097
67.rpoC1 (1)93.32060.33.92058206791.3685.81.3685688
68. rpoC2 92.34143214122416487.21380713731387
69. rps11 94.7396039639695.41310131131
70. rps12 98.137203723721001230123123
71. rps14 92.2306030630693.11010101101
72. rps15 95.2270027027092.18908989
73. rps18 96.5228022822898.77507575
74. rps19 95.7279027927998.99209292
75. rps2 95.27080708708972350235235
76. rps3 95.4657065765796.32180218218
77. rps4 94.1624062462492.32070207207
78. rps7 94.9468046846894.81550155155
79. rps8 95.7399039939995.51320132132
Protein-coding regions of Equisetum species were similar in size, ranging between having the same length in atpB, E, F, H, I, clpP, infA, ndhB, C, D, E, H, I, petB, D, G, L, N, psaA, B, C, I, J, M, psbA, B, C, D, E, F, H, I, J, K, L, M, N, Z, rbcL, rpl14, 16, 22, 23, 32, 33, 36, rps2, 3, 4, 7, 8, 11, 12, 14, 15, 18, and 19 to having a 195 bp or 64 amino acids difference in accD. These genes have a similar number and position of introns except for the presence of 753 bp of intron in rpl16 in E. arvense. The percentage of the identical nucleotide of the aligned sites ranged from 88.7 percent in matK to 99.2 percent in psbJ, while the percentage of the identical derived amino acid of the aligned sites ranged from 73.9 percent in atpF to 100 percent in atpH, ndhE, petB, D, G, N, psaJ, psbA, E, F, I, J, L, Z, rpl36, and rps2 (Table 1). Phylogenetic estimation of Equisetum using plastome protein-coding sequences suggested that the known complete plastid genomes of Equisetum species formed a monophyletic clade of the two subgenera, Hippochaete and Equisetum. The newly assembled E. xylochaetum plastome indicates placement within Hippochaete with E. hyemale (Figure 2).
Figure 2

Maximum-likelihood tree inferred from all Equisetum plastome protein-coding regions using a GTR+I+F model. The scale bar represents the estimated number of nucleotide substitutions per site. The bootstrap and posterior probability values are reported at the respective nodes. The values include the ML bootstrap values of nucleotide and protein data and the BI posterior probability of the nucleotide and protein data, respectively.

Single-gene ML phylogenetic analysis of atpB, matK, rpoB, rps4, and trnL-F resolved the known subgenera of Equisetum, including Paramochaete, Hippochaete, and Equisetum (Figure 3, Figure 4, Figure 5, Figure 6 and Figure 7). The majority of the Equisetum species were resolved with ML bootstrap values of at least 50. However, the monophyly of some Equisetum species could not be resolved. The monophyly of E. arvense and E. variegatum was not resolved in the matK tree, the monophyly of E. bogotense, E. laevigatum, E. myriochaetum, E. hyemale, and E. giganteum was not resolved in the rps4 tree, and the monophyly of E. hyemale, E. praealtum, E. ramosissimum, E. trachyodon, and E. xylochaetum was not resolved in the trnL-F tree. All hybrid taxa were phylogenetically placed within the clade consisting of the majority of their maternal parent, if the monophyly of the taxa was absent. In the case of the rps4 tree, these hybrids included Equisetum x fontqueri isolate 26093 located within the clade of E. telmateia, Equisetum x litorale isolates 41084 and 41085 with E. arvense, Equisetum x schaffneri isolates 40813 and 40824 with E. giganteum, and Equisetum x schaffneri isolate 40814 with E. myriochaetum. For trnL-F, the hybrid taxa Equisetum x ferrissii (AY226113) located in the clade with E. laevigatum, Equisetum x litorale isolates 41084 and 41085 with E. arvense, and Equisetum x schaffneri isolate 40814 with E. myriochaetum.
Figure 3

Phylogenetic estimation and TCS network of Equisetum atpB sequences. The scale bar of the tree represents the estimated number of nucleotide substitutions per site. The maximum-likelihood bootstrap values are reported at the respective nodes. The colours of taxa present in the tree correspond with the colours in the TCS haplotype map. The size of the circle represents the number of the taxa that share the same haplotype.

Figure 4

Phylogenetic estimation and TCS network of Equisetum matK sequences. The scale bar of the tree represents the estimated number of nucleotide substitutions per site. The maximum-likelihood bootstrap values are reported at the respective nodes. The colours of taxa present in the tree correspond with the colours in the TCS haplotype map. The size of the circle represents the number of the taxa that share the same haplotype.

Figure 5

Phylogenetic estimation and TCS network of Equisetum rpoB sequences. The scale bar of the tree represents the estimated number of nucleotide substitutions per site. The maximum-likelihood bootstrap values are reported at the respective nodes. The colours of taxa present in the tree correspond with the colours in the TCS haplotype map. The size of the circle represents the number of the taxa that share the same haplotype.

Figure 6

Phylogenetic estimation and TCS network of Equisetum rps4 sequences. The scale bar of the tree represents the estimated number of nucleotide substitutions per site. The maximum-likelihood bootstrap values are reported at the respective nodes. The colours of taxa present in the tree correspond with the colours in the TCS haplotype map. The size of the circle represents the number of the taxa that share the same haplotype.

Figure 7

Phylogenetic estimation and TCS network of Equisetum trnL-trnF. The scale bar of the tree represents the estimated number of nucleotide substitutions per site. The maximum-likelihood bootstrap values are reported at the respective nodes. The colours of taxa present in the tree correspond with the colours in the TCS haplotype map. The size of the circle represents the number of the taxa that share the same haplotype.

TCS haplotype network analyses using atpB, matK, and rpoB resolved distinct clades representing each of the Equisetum subgenera. At the species level, haplotype networks constructed using atpB and rpoB showed one haplotype for each Equisetum species. In contrast, maps of matK, rps4, and trnL-F resolved more than one haplotype for some species and resolved some haplotypes that consisted of more than one species. For matK, there was more than one haplotype for E. arvense and E. hyemale and there was one haplotype that consisted of sequences from E. arvense and E. variegatum (Figure 4). Haplotype maps of rps4 and trnL-F seemed to be more complex compared to those of atpB, matK, and rpoB. In the map of rps4 (Figure 6), we observed 10 haplotypes, of which, haplotypes 1–3 of E. bogotense appeared as a distinct group representing subgenus Paramochaete. A few haplotypes consisted of only one Equisetum species, which were haplotype 4 for E. palustre, haplotype 5 for E. diffusum, and haplotype 8 for E. scirpoides. The hybrid taxa were embedded within the same haplotypes as their maternal taxa. These included Equisetum x fontqueri isolate 26093 that was in haplotype 6 with E. telmateia, Equisetum x litorale isolates 41084 and 41085 in haplotype 7 with E. arvense, Equisetum x schaffneri isolates 40813 and 40824 in haplotype 9 with E. giganteum, and Equisetum x schaffneri isolate 40814 in haplotype 9 with E. myriochaetum. Some haplotypes consisted of many plant species, i.e., haplotype 7 and 9, where the majority of Equisetum and Hippochaete were placed together, respectively. Interestingly, a rps4 sequence from E. hyemale grouped with other sequences of that species but also was present as a unique haplotype, as haplotype 10 with E. praealtum isolate 41501. The map of trnL-F (Figure 7) resolved two distinct groups of haplotypes representing subgenus Paramochaete (haplotype 1) and subgenus Equisetum (haplotypes 2–8). Many of the Equisetum species were present as unique haplotypes, including E. bogotense (haplotype 1), E. palustre (haplotype 2), E. pratense (haplotype 3), E. telmateia (haplotype 4), E. sylvaticum (haplotype 5), E. fluviatile (haplotype 6), Equisetum x dycei (haplotype 7), and E. scirpoides (haplotype 9). Some Equisetum species were resolved as more than a single haplotype. E. hyemale isolate 20201 was resolved as a unique haplotype 14 while E. hyemale isolate 1273o was located in haplotype 10 with E. variegatum. For E. variegatum, in addition to its member in haplotype 10, E. variegatum isolates 40820 and 40823 were resolved as additional unique haplotypes 11 and 12. In addition, E. myriochaetum isolate 40826 was present as haplotype 15, while most members were located in haplotype 13. Most of the hybrid taxa in the trnL-F map were placed in the same haplotypes as their maternal taxa. Equisetum x ferrissii (AY226113) was located in haplotype 16 with its maternal taxon, E. laevigatum, Equisetum x ferrissii in haplotype 13 with the majority of E. hyemale, Equisetum x litorale isolates 41084 and 41085 in haplotype 8 with E. hyemale, Equisetum x schaffneri isolates 40813 and 40824 in haplotype 13 with E. giganteum, and Equisetum x schaffneri isolate 40814 in haplotype 13 with the majority of E. myriochaetum.

3. Discussion

In this study, we assembled the complete plastid genome of E. xylochaetum from shotgun metagenomes of E. xylochaetum sampled from two Atacama Desert locales exhibiting different degrees of disturbance. Results showed that the plastid genomes constructed from these two E. xylochaetum metagenome accessions were identical, suggesting that the Equisetum samples were from the same Equisetum population. Comparison of nucleotide, and their derived protein, sequences of this newly assembled E. xylochaetum plastid genome to those of E. hyemale and E. arvense showed that the Equisetum plastid genomes were highly conserved in terms of structure and function, even though the two subgenera (Hippochaete and Equisetum) might have diverged as early as 135 Mya during the early Cretaceous [4,6]. All the plastid protein-coding sequences were subjected to purifying selection, with genes of the same type having identical nucleotide percentages and having nucleotide identity ranging from 88.7–99.2 percent. The only major difference in gene structure was presence of the intron in E. arvense rpl16. To determine where and when the intron of rp116 originated in the Equisetum lineage, more Equisetum rpl16 sequences or complete plastid genomes are required. In a broad sense, the phylogenetic positions of Equisetum species inferred by using all protein-coding sequences along with their derived proteins and the single-gene analysis present in this study were congruent with results from previous studies that used a single-gene approach [11] or a combination of multi-genes and morphological characters e.g., [4,5,6], where Equisetum formed monophyletic clades of each subgenus and placed E. xylochaetum in Hippochaete. Despite the presence of the high conservation level of Equisetum plastid genes, it was surprising to us that the single-gene phylogenetic approach was not sufficient to resolve relationships among Equisetum taxa, especially those closely related taxa placed in subgenus Hippochaete, e.g., E. giganteum, E. variegatum, and E. hyemale. Therefore, it is evident that more Equisetum plastid genomes, plus additional molecular information from other genetic compartments, are needed. The addition of haplotype mapping provided in this study enhanced the understanding of how plastid genes from each taxon are related. In general, the haplotype maps reflected the relationship resolved from phylogenetic estimation using the corresponding nucleotide regions. Even so, these new maps aid the visualisation of how these plastome nucleotide data were interrelated to each other at the level of isolate, species, and subgenus. The presence of only one shared distinct haplotype of an Equisetum species, though its samples were collected from different locales, suggested a high conservation level of the corresponding genes within its plastid genomes. On the other hand, the presence of more than one haplotype at the specific level suggested the presence of nucleotide diversity, indicating the need to further examine the populations of E. arvense, E. bogotense, E. hyemale, E. variegatum, and E. myriochaetum. In addition, the presence of a haplotype consisting of more than one Equisetum species, e.g., haplotypes 8 and 9 of the rps4 map and haplotypes 8 and 13 of the trnL-F map, suggested that these conserved regions alone were not sufficient for studying the relationship and diversity of Equisetum taxa. These findings emphasize the need for more Equisetum plastid genomes. The presence of distinct haplotype(s) in the early-diverging species E. bogotense in rps4 and trnL-F suggested that these plastid sequences might not represent the ancestral characters of Equisetum. Instead, these E. bogotense samples may only represent the survival representatives of the extinct members that also evolved during the course of time. In contrast, according to the rps4 and trnL-F maps, the taxa that frequently occurred at the junction region between each subgenus were E. scirpoides and E. pratense, suggesting that these taxa might be particularly helpful for understanding how the Equisetum subgenera diverged.

4. Materials and Methods

Nucleotide data for Equisetum xylochaetum Mett. were obtained from GenBank BioProject PRJNA555713 [12], generated by metagenomic shotgun sequencing of the microbiome of giant Equisetum xylochaetum sampled from two streambed locales in the Atacama Desert of northern Chile that differed in the degree of human disturbance. The two raw data sets, separately archived in accessions SRX6486516 and SRX6486517, each represented pooled replicate DNA extractions from both above-ground green and below-ground non-green tissues. To obtain the complete chloroplast genome of E. xylochaetum, metagenomic sequences were trimmed using Trimmomatic v. 0.39 [13] using the parameter sliding window:4:30. Next, the trimmed sequences from the two raw data sets were independently assembled using MEGAHIT ver. 1.2.9 [14] with the parameter “bubble-level equal to 0” in order to prevent the merging of sequences that were highly similar, e.g., sequences from closely related species or sequences that display single nucleotide polymorphisms. Each assembly yielded a contig of the complete plastid genome of E. xylochaetum, and these two contigs were identical in sequence. To validate the assembly, we calculated the coverage of the plastid genomes using the methods described in Satjarak and Graham [15]. One of the two contigs, which had the mean coverage of 706 fold, was then selected for annotation of protein-coding genes using proteins inferred from E. arvense [8,9] and E. hyemale [7] as references. The tRNAs and rRNA genes were annotated using tRNAscan-SE On-line [16] and the RNAmmer 1.2 Server [17], respectively. The complete plastid genome of E. xylochaetum was deposited in GenBank under accession number MW282958. A representative plant specimen has been deposited at the University of Concepción herbarium under accession number CONC-CH 6005. We compared the plastid genome of E. xylochaetum obtained from this study to other complete Equisetum plastid genomes, including E. hyemale (KC117177), E. arvense from the US (GU191334), and E. arvense from Korea (JN968380). We examined general characteristics of the genomes, including the genome size, %GC, the gene content, gene length, gene order, and polymorphism of nucleotides within coding regions and their derived proteins. To consider nucleotide polymorphisms, we aligned the protein-coding sequences (Table 1) using Geneious translation alignment: global alignment with free end gap, standard genetic code, and Identity (1.0/0.0) cost matrix (Geneious ver. 9.1.3; https://www.geneious.com; accessed in 31 January 2022). The mode of evolution of protein-coding regions was performed using the method described in Mekvipad and Satjarak [18]. For the polymorphism of protein sequences, we aligned the derived protein sequences using MAFFT alignment: auto algorithm and Blosum62 scoring matrix [19]. To investigate the relationship of E. xylochaetum and other Equisetum complete chloroplast genomes, Psilotum nudum (NC_003386.1) was used as an outgroup. The protein-coding sequences and protein sequences (Table 1) were similarly aligned, trimmed using Trimal ver. 1.2 [20], and concatenated. The nucleotide data matrix was 60,987 bp and the protein data matrix consisted of 19,435 amino acids. Phylogenetic relationships were estimated using maximum likelihood and Bayesian frameworks as described in Satjarak and Graham [15]. To investigate whether the Equisetum relationship resolved from frequently-used nucleotide sequences reported in previous studies exhibited grades or evolutionary intermediates, we performed haplotype network analysis of selected, frequently-used Equisetum conserved regions. These included atpB, matK, rpoB, rps4, and trnL-F (Table 3). To prepare the data matrices, the conserved nucleotide regions were extracted from the complete plastid genomes and from DNA sequences from other published studies (Table 3). Next, the data were aligned, and the phylogenetic trees were estimated using the methods described above. The haplotype network analysis was calculated using (Templeton, Crandall, and Sing; TCS) [21] and visualized in PopArt v1.7 [22].
Table 3

Nucleotide sequences used in the single gene phylogenetic analysis and TCS haplotype mapping.

No.NameGenBank AccessionLocalityReferences
atpB
1. E. arvense GU191334USA[8]
2. E. arvense JN968380Korea[9]
3. E. hyemale KC117177unknown[7]
4.E. ramosissimum subsp. debileEU439074unknown[23]
5. E. telmateia AF313542unknown[24]
6. E. xylochaetum MW282958ChileThis study
7. Equisetum x ferrissii AF313541unknown[24]
matK
1. E. arvense JX392862China[25]
2. E. arvense JX392863Europe[25]
3. E. arvense AY348551unknown[26]
4. E. arvense GU191334USA[8]
5. E. arvense JN968380Korea[9]
6. E. bogotense KP757846unknown[27]
7. E. hyemale EU749486unknown[28]
8. E. hyemale EU749485unknown[28]
9. E. hyemale EU749484unknown[28]
10. E. hyemale EU749487unknown[28]
11. E. hyemale HF585136unknown[29]
12. E. hyemale KC117177unknown[7]
13. E. palustre MZ400482Sweden[30]
14. E. ramosissimum JF303895unknown[31]
15. E. scirpoides MZ400480Sweden[30]
16. E. variegatum MZ400481Sweden[30]
17. E. xylochaetum MW282958ChileThis study
rpoB
1. E. arvense HQ658110China[32]
2. E. arvense GU191334USA[8]
3. E. arvense JN968380Korea[9]
4. E. hyemale KC117177Unknown[7]
5. E. ramossissimum HQ658109China[32]
6. E. xylochaetum MW282958ChileThis study
rps4
1.E. arvense subsp. arvense isolate 41072MH750111Finland [5]
2. E. arvense AJ583677unknown[3]
3. E. arvense JN968380Korea[9]
4. E. arvense GU191334USA[8]
5.E. arvense subsp. arvense isolate 26084MH750108India (Himachal Pradesh) [5]
6.E. arvense subsp. arvense isolate 40833MH750109USA (California) [5]
7.E. arvense subsp. arvense isolate 41071MH750110Finland [5]
8.E. arvense subsp. boreale isolate 41073MH750112Finland/Norway (border) [5]
9.E. arvense subsp. boreale isolate 41074MH750113Finland/Norway (border)[5]
10.E. arvense x E. telmateia subsp. braunii isolate 40834MH750114USA (California) [5]
11. E. bogotense AF231898unknown[33]
12. E. bogotense AF313603unknown[24]
13. E. bogotense AJ583678unknown[3]
14.E. bogotense isolate 40800MH750115Argentina[5]
15.E. bogotense isolate 40802MH750116Ecuador[5]
16.E. bogotense isolate 40827MH750117Colombia[5]
17. E. diffusum AJ583679unknown[3]
18.E. diffusum isolate 40804MH750118 India [5]
19. E. fluviatile AJ583680unknown[3]
20.E. fluviatile isolate 41075MH750119Finland [5]
21.E. fluviatile isolate 41076MH750120Finland [5]
22. E. giganteum AJ583681unknown[3]
23.E. giganteum isolate 40806MH750121Chile[5]
24. E. hyemale AJ583682unknown[3]
25. E. hyemale KC117177unknown[7]
26.E. hyemale isolate 23252MH750123Norway[5]
27. E. laevigatum AJ583683unknown[3]
28.E. laevigatum isolate 40812MH750125USA (California) [5]
29. E. myriochaetum AJ583684unknown[3]
30.E. myriochaetum isolate 40825MH750126Mexico [5]
31.E. myriochaetum isolate 40936MH750127El Salvador [5]
32. E. palustre AJ583685unknown[3]
33.E. palustre isolate 17671MH750128UK (England, Norfolk) [5]
34.E. palustre isolate 39349MH750129UK (England, Surrey) [5]
35.E. praealtum isolate 41501MH750122USA (Ohio) [5]
36. E. pratense AJ583686unknown[3]
37.E. pratense isolate 39348MH750130Finland [5]
38.E. ramosissimum subsp. debileAJ583687unknown[3]
39.E. ramosissimum subsp. debileEU439173unknown[23]
40.E. ramosissimum subsp. debile isolate 24579MH750131Sri Lanka [5]
41.E. ramosissimum subsp. debile isolate 40837MH750132New Caledonia [5]
42.E. ramosissimum subsp. ramosissimum isolate 36802MH750133Spain (Andalucia) [5]
43. E. scirpoides AJ583688unknown[3]
44.E. scirpoides isolate 26090MH750134Greenland [5]
45.E. scirpoides isolate 40830MH750124Russia (Kamtschatka) [5]
46. E. sylvaticum AJ583689unknown[3]
47.E. telmateia subsp. brauniiAJ583690unknown[3]
48.E. telmateia subsp. braunii isolate 40817 MH750136USA (California) [5]
49.E. telmateia subsp. braunii isolate 40828MH750137Canada (British Columbia)[5]
50.E. telmateia subsp. braunii isolate 40832MH750138USA (California) [5]
51.E. telmateia subsp. braunii isolate 40836MH750139USA (California) [5]
52.E. telmateia subsp. telmateia isolate 41082MH750140Ireland[5]
53. E. variegatum AJ583691unknown[3]
54.E. variegatum isolate 11639MH750141UK (Wales) [5]
55.E. variegatum isolate 40819MH750148Ireland [5]
56.E. variegatum isolate 40820MH750142France (Pyrenees) [5]
57.E. variegatum isolate 40823MH750143USA (Keweenaw, Michigan) [5]
58.E. variegatum isolate 41083MH750144Ireland [5]
59.E. variegatum subsp. alaskanum isolate 40818MH750145USA (Alaska) [5]
60.E. variegatum subsp. alaskanum isolate 40821MH750146Canada (British Columbia)[5]
61.E. variegatum subsp. alaskanum isolate 40822MH750147Canada (Banff)[5]
62. E. xylochaetum MW282958ChileThis study
63.Equisetum scirpoides isolate 41089MH750135Finland[5]
64. Equisetum x ferrissii AF313590unknown[24]
65.Equisetum x fontqueri isolate 26093 (E. telmateia x E. palustre) MH750149UK (Scotland) [5]
66.Equisetum x litorale isolate 41084 (E. arvense x E. fluviatile) MH750150Ireland [5]
67.Equisetum x litorale isolate 41085 (E. arvense x E. fluviatile) MH750151Ireland [5]
68.Equisetum x schaffneri isolate 40813 (E. giganteum x E. myriochaetum) MH750152Mexico [5]
69.Equisetum x schaffneri isolate 40814 (E. myriochaetum x E. giganteum) MH750153Peru (cult RBG Edinburgh) [5]
70.Equisetum x schaffneri isolate 40824 (E. giganteum x E. myriochaetum) MH750154Mexico [5]
trnL-trnF
1. E. arvense JN968380Korea[9]
2. E. arvense GU191334USA[8]
3. E. arvense AY226125Franc[34]
4. E. arvense GQ428069unknown[35]
5. E. arvense HM590277Estonia[36]
6. E. arvense GQ244921unknown[37]
7.E. arvense subsp boreale isolate 41074MH750043Finland/Norway[5]
8.E. arvense subsp. arvense isolate 26084MH750038India[5]
9.E. arvense subsp. arvense isolate 26085MH750039UK[5]
10.E. arvense subsp. arvense isolate 40833MH750040USA[5]
11.E. arvense subsp. arvense isolate 41071MH750041Finland[5]
12.E. arvense subsp. boreale isolate 41073MH750042Finland/Norway[5]
13.E. arvense x E. telmateia subsp. braunii isolate 40834MH750044USA[5]
14. E. bogotense AY226124Colombia[34]
15.E. bogotense isolate 40800MH750045Argentina[5]
16.E. bogotense isolate 40801MH750046Chile[5]
17.E. bogotense isolate 40802MH750047Ecuador[5]
18.E. bogotense isolate 40803MH750048Chile[5]
19.E. bogotense isolate 40827MH750049Colombia[5]
20. E. diffusum AY226126India[34]
21.E. diffusum isolate 40804MH750050India[5]
22. E. fluviatile AY226121Canada[34]
23. E. fluviatile GQ244922unknown[37]
24.E. fluviatile isolate 41075MH750051Finland[5]
25.E. fluviatile isolate 41076MH750052Finland[5]
26. E. giganteum AY226118Ecuador[34]
27.E. giganteum isolate 40805MH750053Jamaica[5]
28.E. giganteum isolate 40806MH750054Chile[5]
29.E. giganteum isolate 40807MH750055Peru[5]
30.E. giganteum isolate 40810MH750057Argentina[5]
31.E. giganteum isolate 40811MH750058Argentina[5]
32. E. hyemale KC117177unknown[7]
33. E. hyemale AY327837unknown[34]
34.E. hyemale isolate 0796gGQ244923unknown[37]
35.E. hyemale isolate 1273oGQ244924unknown[37]
36.E. hyemale isolate 20201MH750061France[5]
37.E. hyemale isolate 23252MH750062Norway[5]
38.E. hyemale isolate 41088MH750063Finland[5]
39.E. hyemale subsp. affineAY226110USA[34]
40.E. iganteum isolate 40809MH750056Argentina[5]
41. E. laevigatum AY226112USA[34]
42.E. laevigatum isolate 40812MH750065USA[5]
43. E. myriochaetum AY226114USA[34]
44.E. myriochaetum isolate 40815MH750066USA[5]
45.E. myriochaetum isolate 40816MH750067USA[5]
46.E. myriochaetum isolate 40825MH750068Mexico[5]
47.E. myriochaetum isolate 40826MH750069Ecuador[5]
48.E. myriochaetum isolate 40936MH750070El Savador[5]
49.E. myriochaetum isolate 41080MH750071Guatemala[5]
50. E. palustre AY226123Canada[34]
51. E. palustre GQ244925unknown[37]
52.E. palustre isolate 39349MH750072UK[5]
53.E. praealtum isolate 40831MH750059USA[5]
54.E. praealtum isolate 41501MH750060USA[5]
55. E. pratense AY226122Canada[34]
56. E. pratense GQ244926unknown[37]
57. E. pratense HM590278Estonia[36]
58.E. pratense isolate 39348MH750073Finland[5]
59.E. pratense isolate 41086MH750074Finland[5]
60.E. pratense isolate 41087MH750075Finland[5]
61.E. ramosissimum subsp. debileAY226115Taiwan[34]
62.E. ramosissimum subsp. debile isolate 23679MH750076Reunion[5]
63.E. ramosissimum subsp. debile isolate 24579MH750077Sri Lanka[5]
64.E. ramosissimum subsp. debile isolate 40837MH750078New Caledonia[5]
65.E. ramosissimum subsp. ramosissimum isolate 36802MH750079Spain[5]
66.E. ramosissimum subsp. ramosissimum isolate 40829MH750080Turkey[5]
67. E. scirpoides AY226116Canada[34]
68. E. scirpoides GQ244927unknown[37]
69.E. scirpoides isolate 26090MH750082Greenland[5]
70.E. scirpoides isolate 40830MH750064Russia[5]
71.E. scirpoides isolate10933MH750081UK[5]
72. E. sylvaticum MH750083UK[5]
73. E. sylvaticum AY226120France[34]
74. E. sylvaticum GQ244928unknown[37]
75.E. sylvaticum isolate 41081MH750084Finland[5]
76.E. telmateia isolate 11642MH750089China[5]
77.E. telmateia isolate 41082MH750090Ireland[5]
78.E. telmateia subsp. brauniiAY226119USA[34]
79.E. telmateia subsp. braunii isolate 40817MH750085USA[5]
80.E. telmateia subsp. braunii isolate 40828MH750086Canada[5]
81.E. telmateia subsp. braunii isolate 40832MH750087USA[5]
82.E. telmateia subsp. braunii isolate 40836MH750088USA[5]
83.E. trachyodon isolate 41092MH750106Finland[5]
84. E. variegatum AY226117USA[34]
85.E. variegatum isolate 0584gGQ244929unknown[37]
86.E. variegatum isolate 0977oGQ244930unknown[37]
87.E. variegatum isolate 11639MH750091UK[5]
88.E. variegatum isolate 40819MH750098Ireland[5]
89.E. variegatum isolate 40820MH750092France[5]
90.E. variegatum isolate 40823MH750093USA[5]
91.E. variegatum isolate 41083MH750094Ireland[5]
92.E. variegatum subsp. alaskanum isolate 40818MH750095USA[5]
93.E. variegatum subsp. alaskanum isolate 40821MH750096Canada[5]
94.E. variegatum subsp. alaskanum isolate 40822MH750097Canada[5]
95. E. xylochaetum MW282958ChileThis study
96.E. xylochaetum isolate 40614MH750107Chile[5]
97. Equisetum sp. AY327838unknown[34]
98.Equisetum x dycei isolate 26083MH750099UK[5]
99.Equisetum x ferrissii (E. laevigatum x E. hyemale)AY226113USA[34]
100.Equisetum x ferrissii (Equisetum hyemale x laevigatum) AY226111Canada[34]
101.Equisetum x litorale isolate 41084MH750101Ireland[5]
102.Equisetum x litorale isolate 41085MH750102Ireland[5]
103.Equisetum x schaffneri isolate 40813MH750103Mexico[5]
104.Equisetum x schaffneri isolate 40814MH750104Peru[5]
105.Equisetum x schaffneri isolate 40824MH750105Mexico[5]

5. Conclusions

In summary, our study demonstrated that metagenomic data can be a useful way to obtain plastid genomes. The comparison of the de novo plastid genome of E. xylochaetum with other reported Equisetum plastomes showed a high degree of conservation in terms of structure, gene content, gene order, and nucleotide polymorphisms. Even so, this new plastid genome provided additional information about the evolution and diversity of Equisetum, e.g., the presence of an intron in rpl16. Haplotype analyses of the selected conserved nucleotides showed that some Equisetum species were distantly related to other taxa, inferred from the presence of distinct haplotypes. Many of the taxa appeared as shared haplotypes, suggesting that the molecular data we currently have might not be sufficient for a full understanding of the evolutionary relationship of Equisetum and that more Equisetum plastid genomes are needed.
  28 in total

1.  Origin and evolution of the chloroplast trnK (matK) intron: a model for evolution of group II intron RNA structures.

Authors:  Georg Hausner; Robert Olson; Dawn Simon; Ian Johnson; Erin R Sanders; Kenneth G Karol; Richard M McCourt; Steven Zimmerly
Journal:  Mol Biol Evol       Date:  2005-11-02       Impact factor: 16.240

2.  First insights into fern matK phylogeny.

Authors:  Li-Yaung Kuo; Fay-Wei Li; Wen-Liang Chiou; Chun-Neng Wang
Journal:  Mol Phylogenet Evol       Date:  2011-03-21       Impact factor: 4.286

3.  Origin of Equisetum: Evolution of horsetails (Equisetales) within the major euphyllophyte clade Sphenopsida.

Authors:  Andrés Elgorriaga; Ignacio H Escapa; Gar W Rothwell; Alexandru M F Tomescu; N Rubén Cúneo
Journal:  Am J Bot       Date:  2018-07-19       Impact factor: 3.844

4.  Horsetails are the sister group to all other monilophytes and Marattiales are sister to leptosporangiate ferns.

Authors:  Nils Knie; Simon Fischer; Felix Grewe; Monika Polsakiewicz; Volker Knoop
Journal:  Mol Phylogenet Evol       Date:  2015-05-18       Impact factor: 4.286

5.  New chloroplast primers for intraspecific variation in Dicranum scoparium Hedw. (Dicranaceae) and amplification success in other bryophyte species.

Authors:  A Lang; Y Naciri
Journal:  Mol Ecol Resour       Date:  2009-12-21       Impact factor: 7.090

6.  Comparative DNA sequence analyses of Pyramimonas parkeae (Prasinophyceae) chloroplast genomes.

Authors:  Anchittha Satjarak; Linda E Graham
Journal:  J Phycol       Date:  2017-03-06       Impact factor: 2.923

7.  Molecular phylogeny of horsetails (Equisetum) including chloroplast atpB sequences.

Authors:  Jean-Michel Guillon
Journal:  J Plant Res       Date:  2007-05-03       Impact factor: 3.000

8.  Complete plastid genomes from Ophioglossum californicum, Psilotum nudum, and Equisetum hyemale reveal an ancestral land plant genome structure and resolve the position of Equisetales among monilophytes.

Authors:  Felix Grewe; Wenhu Guo; Emily A Gubbels; A Katie Hansen; Jeffrey P Mower
Journal:  BMC Evol Biol       Date:  2013-01-11       Impact factor: 3.260

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  Evolution of organellar genes of chlorophyte algae: Relevance to phylogenetic inference.

Authors:  Nuttapong Mekvipad; Anchittha Satjarak
Journal:  PLoS One       Date:  2019-05-06       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.