Literature DB >> 30405200

Evolution of six novel ORFs in the plastome of Mankyua chejuense and phylogeny of eusporangiate ferns.

Hyoung Tae Kim1,2, Ki-Joong Kim3.   

Abstract

In this paper, three plastomes of Mankyua chejuense, Helminthostachys zeylanica, and Botrychium ternatum in Ophioglossaceae were completely sequenced in order to investigate the plastome evolution and phylogeny of eusporangiate ferns. They were similar to each other in terms of length and the gene orders; however, six unknown open reading frames (ORFs) were found between rps4 and trnL-UAA genes in M. chejuense. Similar sequence regions of six ORFs of M. chejuense were found at the plastomes of Ophioglossum californicum and H. zeylanica, as well as the mitochondrial genome (mitogenome) of H. zeylanica, but not in B. ternatum. Interestingly, the translated amino acid sequences of three ORFs were more similar to the proteins of distantly related taxa such as algae and bacteria than they were to proteins in land plants. It is likely that the six ORFs region arose from endosymbiotic gene transfer (EGT) or horizontal gene transfer (HGT), but further study is needed to verify this. Phylogenetic analyses suggested that Mankyua was resolved as the earliest diverging lineage and that Ophioglossum was subsequently diverged in Ophioglossaceae. This result supports why the plastome of M. chejuense have contained the most ancestral six ORFs in the family.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30405200      PMCID: PMC6220310          DOI: 10.1038/s41598-018-34825-6

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Chloroplast is an apparatus for photosynthesis in plant cells that holds an independent genome compared to nuclear and mitochondrial genomes. The plastomes of land plants are typically 120–160 kb in length and have quadripartite structures[1]. Because of the strong selective constraint for photosynthesis, land plant plastomes usually contain a set of unique 100–120 photosynthetic and housekeeping genes that originate from cyanobacteria[1]. Chloroplast gene(s) or duplicated part(s) of plastomes are frequently transferred to the nuclear or mitochondrial genome (mitogenome) through intracellular gene transfer (IGT)s[2,3]. However, gene transfer to the counter direction is very rare evolutionary events[2]. Only a few cases of IGT to plastomes have been reported in unrelated plant families, such as Apiaceae[4,5], Poaceae[6,7], Apocynaceae[8], and Anacardiaceae[9]. In all of these IGT cases, short portions of mitogenome were the donors to plastomes, and there has been no documented case of nuclear genome donor to plastome. Horizontal plastome capture through hybridization between similar species is one kind of horizontal gene transfer (HGT) that is relatively common in land plants. In addition, HGT has also been documented between far distant organisms such as plant-fungus, plant-bacteria, and plant-virus[10]. Many land plants live in the symbiotic associations with fungi or bacteria. Therefore, they have relatively high chances of HGT between distant organisms. However, on the plant side, the reported HGTs were engaged in mitogenomes or nuclear genomes, not in plastomes. Land plant plastomes do not normally recombine with other genomes, therefore it is very rare for them to act as a recipient of the HGT in land plant plastomes[10]. In contrast, in the green alga Euglena myxocylindracea, plastomes show intron gains from bacteria[11,12]. So far, more than 2,000 complete plastome sequences are available from public databases, such as NCBI. Plastomes, however, appear to be recalcitrant to the incorporation of foreign DNA by either IGT or HGT[9]. Only a few families, as mentioned in the previous paragraphs, have been recognized as containing DNA of nonplastome origin. However, most published reports on plastomes have been on those from seed plants. We still have limited complete plastomes for several major fern lineages. Ferns are usually divided into two groups: eusporangiate and leptosporangiate ferns. The eusporangiate ferns form basal paraphyletic assemblages because they include the eusporangiate fern clade. So far, 65 plastomes have been reported in leptosporangiate ferns[13-15]; in contrast, only nine plastomes in eusporangiate ferns have been sequenced from two species of Marattiales[16,17], two species of Psilotales[18,19], two species of Equisetales[18,20,21], and one species of Ophioglossales[18]. The order Ophioglossales of eusporangiate ferns contains a single family Ophioglossaceae, and this family consists of four genera (Ophioglossum, Botrychium, Helmintostachys, and Mankyua)[22]. Ophioglossum and Botrychium each consist of a number of species and are both relatively common in the northern hemisphere; however, both Helmintostachys and Mankyua are monotypic genera and show restricted distribution patterns in temperate regions of East Asia[23,24]. Among four genera, Mankyua has recently been described from a volcanic island in the Southern part of Korea as Mankyua chejuense[24]. It is a rare, endemic, and endangered plant species, and only a couple hundred individuals were reported to live in the specific habitats of small scattered volcanic craters called “Gotjawal” in Jeju island of Korea[25]. In the phylogeny of Ophiglossacae, Hauk, et al.[26] showed that Ophioglossum was the sister group to the clade of Helminthostachys + Botrychium s.l. using rbcL and trnL-F sequences. However, the phylogenetic studies including Mankyua have shown different phylogenetic relationships among four genera. Sun, et al.[27] suggested that M. chejuense was sister to the clade of Botrychium + Helminthostachys and that Ophioglossum was the sister group to the three genera. In contrast, Shinohara, et al.[28] suggested that Botrychium was sister to Ophioiglossum + Helminthostachys and that Mankyua was sister to the remaining taxa. In addition to the topological incongruences of the four genera, several nodes in the previous phylogenies of Ophioglossaceae were not strongly supported. As a result, the relationships among four genera in Ophioglossaceae remain unclear. In this paper, plastomes of M. chejuense, H. zeylanica, and B. ternatum were completely sequenced and compared with previously reported plastomes of O. californiacum in order to investigate the evolution of plastomes in Ophioglossaceae. During this study, we identified approximately 10 kb insertion with six unknown ORFs between rps4 and trnL-UAA genes of M. chejuense plastome. These six ORFs were located in the same direction as those in polycistronic genes. Therefore, we discussed the possible origins of the six ORFs through intensive comparative data analysis. In addition, the phylogeny of the four genera in the family Ophioglossaceae was reconstructed based on coding sequences of the plastome in order to resolve the enigmatic relationships among the four genera in Ophioglossaceae.

Results and Discussion

Genome structure and gene contents of plastomes in Ophioglossaceae

The four completely annotated plastome sequences reported in this study are available from the National Center for Biotechnology Information (NCBI) under the accession numbers of B. ternatum (KM817789), H. zeylanica (KM817788), and M. chejuensis 1,2 (NC017006, KP205433). The row Illumina MiSeq sequence data files also available from the NCBI database (Supplemental Table 1). We sequenced two different accessions of M. chejuensis using different methods: PCR-amplified Sanger sequencing and the MiSeq (Illumina, San Diego) NGS. The plastome of M. chejuense sequenced through PCR was 146,221 bp in length with a large single copy (LSC) region of 106,096 bp, a small single copy (SSC) region of 20,613 bp, and two inverted repeat (IR) regions of 9,756 bp each (Fig. 1). It contained 135 genes, including 84 protein coding genes, 8 ribosomal RNAs, 37 transfer RNAs, and six unknown ORFs. Four rRNA and five tRNA genes were duplicated in the IR region (Table 1). Sixteen genes had one intron while the clpP and ycf3 genes each had two introns. The plastome of M. chejuense sequenced through NGS was 146,225 bp. An average coverage depth of the plastome was approximately 400 times (Supplemental Table 1). A total of six poly-T length variations and 45 single nucleotide polymorphisms (SNPs) were found between two plastome sequences of M. chejuense (Supplementary Table S2). Thirty-seven SNPs were found at the coding regions; in particular, SNPs in petB and psbB accounted for 71% of the total SNPs. Interestingly, non-synonymous substitutions were almost three-fold the prevalence of synonymous substitutions in petB and psbB, and petB in plastome of M. chejuense sequenced by PCR method had one premature stop codon caused by substitution (TGG > TGA).
Figure 1

Maps of three plastomes in Ophioglossaceae. Arrows and dashes refer to inversion and expanded positions, respectively.

Table 1

Gene list of chloroplast genomes found in four genera in Ophioglossaceae.

Group of geneConserved genesM. chejuense (PCR)M. chejuense (NGS) H. zeylanica B. ternatum O. californicum
RNA genesRibosomal RNAsrrn4.5(x2), rrn5(x2), rrn16(x2), rrn23(x2)
Transfer RNAstrnA-UGCa(x2), trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnfM-CAU, trnG-GCC, trnG-UCC, trnH-GUG, trnI-CAU, trnI-GAUa(x2), trnK-UUUa, trnL-CAA, trnL-UAAa, trnL-UAG, trnM-CAU, trnN-GUU(x2), trnP-GGG, trnP-UGG, trnQ-UUG, trnR-ACG(x2), trnR-CCG, trnR-UCU, trnS-CGA, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnV-GAC(x2), trnV-UACa, trnW-CCA, trnY-GUAφtrnT-UGUφtrnT-UGU
Protein genesPhotosystem I psaA, psaB, psaC, psaI, psaJ, psaM
Photosystem II psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ
CytochromepetA, petDa, φpetB a petB a petB a petB a petB a
ATP synthaseatpA, atpB, atpE, atpFa, atpH, atpI
Chlorophyll biosynthesis chlL, chlN chlB chlB chlB φchlB chlB
Rubisco rbcL
NADH dehydrogeneasendhAa, ndhBa, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
ATP-dependent protease subunit P clpP a
Chloroplast envelope membrane protein cemA
Ribosomal proteinslarge unitsrpl2a, rpl14, rpl16a, rpl20, rpl21, rpl22, rpl23, rpl32, rpl33, rpl36
small unitsrps2, rps3, rps7, rps8, rps11, rps12a, rps14, rps15, rps18, rps19 rps4 rps4 rps4 φrps4 rps4
TranscriptionRNA polymerase rpoC1 a rpoA, φrpoB, rpoC2 rpoA, φrpoB, rpoC2 rpoA, rpoB, φrpoC2 φrpoA, rpoB, φrpoC2 rpoA, rpoB, rpoC2
TranslationInitiation factor infA
Miscellaneous proteins accD, ccsA matK matK φmatK matK
Hypothetical proteins & Conserved reading frameycf2, ycf3a, ycf4, ycf12ORF135, ORF187, ORF295, ORF372, ORF436, ORF531, φycf1ORF135, ORF187, ORF295, ORF372, ORF436, ORF531, φycf1 φycf1 ycf1 ycf1

(x2): duplicated genes, a: genes having introns φ: pseudogene.

Maps of three plastomes in Ophioglossaceae. Arrows and dashes refer to inversion and expanded positions, respectively. Gene list of chloroplast genomes found in four genera in Ophioglossaceae. (x2): duplicated genes, a: genes having introns φ: pseudogene. Even though intraspecific variations of plastomes in ferns have been reported[21], population studies of M. chejuense have shown extremely low genetic diversity[29]. Based on our observation of the reproduction of M. chejuense over three years, asexual reproduction by rhizomes was found to be very common. Therefore, these polymorphisms between two plastomes seem to not be genuine. It has been previously shown that after free-living cyanobacteria are engulfed by eukaryotes, numerous genes are translocated from plastids to nucleus[30]. Nuclear copies of organellar DNAs have frequently been found in land plants[31], and they have led to misleading phylogeny results[32]. Consequently, PCR-amplified sequences of the plastome of M. chejuense might be derived from the nuclear DNA rather than the plastome of M. chejuense, because the nuclear copies of plastid DNA were homologous to their counterparts in the plastome and their primer regions were shared. The plastome of H. zeylanica was 145,120 bp with an LSC region of 103,088 bp, an SSC of 19,950 bp, and two IR regions of 11,041 bp (Fig. 1). The plastome of B. ternatum was 139,127 bp with an LSC of 99,586 bp, an SSC of 20,569 bp, and two copies of IR with 9,486 bp (Fig. 1). Among four genera in Ophioglossaceae, rps16 and trnT-UGU genes were commonly lost; however, pseudo trnT-UGU remained in the H. zeylanica and B. ternatum plastomes between rps4 and trnL-UAA. The chlB, matK, petB, rpoA, rpoB, rpoC2, rps4, and ycf1 were pseudogenes in at least one plastome, but not in all plastomes. But, the status of pseudogene was not confirmed because we did not study the RNA editing. In addition, there was an inversion of trnT-GGU in the plastome of B. ternatum.

Six ORFs of M. chejuense and similar sequences in Ophioglossaceae

The most distinctive feature among the plastomes of the four genera was the region between trnT-GGU and ndhJ (Fig. 2A). Compared to B. ternatum plastome, the three plastomes of the other genera contained the expanded regions between trnT-GGU and ndhJ, but they were not identical (Fig. 2B). The intergenic spaces (IGSs) of rps4 - trnL (M. chejuense and O. californicum), trnT-GGU - trnfM-CAU (O. californicum and H. zeylanica) and trnF-GAA - ndhJ (H. zeylanica) were 1.5~10 times longer than the IGSs of these regions in B. ternatum. In particular, six unknown ORFs (ORF295, ORF436, ORF135, ORF372, ORF531, and ORF187) were found between rps4 and trnL-UAA genes in M. chejuense, and these six ORFs were located in the same direction as polycistronic genes. The expanded regions in O. californicum and H. zeylanica were partial of six ORFs of M. chejuense with structural mutations (Fig. 2C).
Figure 2

Alignments of plastomes in Ophioglossaceae. (A) Whole genome alignment. (B) Alignment between trnT-GGU and ndhJ. (C) Visualized alignment by MAUVE.

Alignments of plastomes in Ophioglossaceae. (A) Whole genome alignment. (B) Alignment between trnT-GGU and ndhJ. (C) Visualized alignment by MAUVE. Contigs generated by the de novo assembly of three sets of NGS data were hit to six ORFs region in the plastome of M. chejuense using blastn in order to investigate translocated ORFs into other genomes such as mitogenome. Two contigs of H. zeylanica with 10-11 coverage depths included sequences similar to six ORFs except for plastome contigs. One contig contained plastome genes of petN and psbM and mitogenome genes of nad7 and nad2 with six ORFs (Fig. 3A). IGS of rps4 - trnL-UAA in the plastome of M. chejuense corresponded highly with petN-psbM in the mitochondrial contig of H. zeylanica, even though there were rearrangements and insertions/deletions (Fig. 3B). Another contig contained a region similar to ORF295 with low similarity. Certain contigs of M. chejuense, H. zeylanica, and B. ternatum had similar sequences of six ORFs; however, they were less than 1,500 bp with less than three coverage depths (almost around 1). These contigs were removed upon further analyses because we could not verify the assembly errors.
Figure 3

(A) Comparison between plastome and mitogenome of H. zeylanica. (B) Sequence similarity between six ORFs regions of H. zeylanica and M. chejuense. Score/max ratio colouring with blue < = 0.25, green < = 0.5, orange < = 0.75, and red >0.75.

(A) Comparison between plastome and mitogenome of H. zeylanica. (B) Sequence similarity between six ORFs regions of H. zeylanica and M. chejuense. Score/max ratio colouring with blue < = 0.25, green < = 0.5, orange < = 0.75, and red >0.75. As the total length of NGS data of H. zeylanica was 4.25 Gb, and the lowest 1 C reported in Ophioglossaceae so far was 2.5 Gb[33,34], 10-11 coverage depths implied that this contig belonged to mitogenome rather than nuclear genome, even though many plastome and mitogenome sequences were found in nuclear genomes[35]. The translated amino acid sequences of ORF135, ORF295, and ORF436 between the plastome of M. chejuense and mitogenome of H. zeylanica had over 70% identity while that of ORF372 had 40% identity (Supplementary Fig. 1). The ORF187 and ORF531 of H. zeylanica underwent frame-shift mutations and rearrangement, respectively.

The origin of six ORFs

Based on blastp results (Table 2), the translated amino acid sequence of ORF295 was only similar to the protein of the green alga Roya anglica, which belongs to Streptophyta, and the translated amino acid sequence of ORF436 was similar to the proteins of Chlorophyta, which is a sister group of Streptophyta in Viridiplantae. Interestingly, the translated amino acid sequence of ORF531 was more similar to bacterial proteins than the proteins in Viridiplantae, even though the TrlaMp60 of Treubia lacunosa belonging to Streptophyta was hit to ORF531. Blastn results showed that the plastomes of very few species in ferns contained similar sequences to six ORFs (Supplementary Table S2).
Table 2

Results of blastp of six ORFs with e-value 10−2.

GeneDescriptionPhylumSpeciesMax.scoreTotal.scoreQuery.coverE.valueIdentAccession
ORF295hypothetical protein (chloroplast)StreptophytaRoya anglica898965%2.00E-1730%YP_009033761.1
ORF436hypothetical protein (chloroplast)ChlorophytaEttlia pseudoalveolaris95.595.541%3.00E-1732%YP_009105467.1
hypothetical protein (chloroplast)ChlorophytaPrasiola crispa68.668.642%4.00E-0927%AKZ21082.1
hypothetical protein (chloroplast)ChlorophytaSarcinofilum mucosum65.965.928%5.00E-0935%YP_009367460.1
hypothetical protein (chloroplast)ChlorophytaOstreobium sp. HV05007a67.467.428%1.00E-0835%ARQ82113.1
hypothetical protein (chloroplast)ChlorophytaGloeotilopsis planctonica67.867.837%2.00E-0829%AOC61661.1
ORF531hypothetical proteinCyanobacteriaPleurocapsa sp. PCC 731978.678.652%2.00E-1127%WP_019503236.1
hypothetical proteinCyanobacteriaChondrocystis sp. NIES-410275.175.150%2.00E-1027%WP_096724718.1
DUF3854 domain-containing proteinCyanobacteriaCyanothece sp. CCY011070.970.949%4.00E-0928%WP_008277431.1
DUF3854 domain-containing proteinCyanobacteriaCyanothece sp. CCY011070.170.149%7.00E-0929%WP_008277548.1
DUF3854 domain-containing proteinCyanobacteriaCyanothece sp. PCC 782269.369.321%9.00E-0936%WP_049802779.1
hypothetical proteinCyanobacteriaMyxosarcina sp. GI169.369.350%1.00E-0825%WP_052055931.1
DUF3854 domain-containing proteinCyanobacteriaCrocosphaera watsonii69.369.349%1.00E-0829%WP_007304689.1
DUF3854 domain-containing proteinCyanobacteriaTolypothrix bouteillei68.968.949%2.00E-0828%WP_050044965.1
hypothetical proteinCyanobacteriaPleurocapsa sp. CCALA 16168.268.250%3.00E-0827%WP_106238468.1
DNA primaseCyanobacteriaCrocosphaera watsonii WH 040265.565.525%5.00E-0836%CCQ65996.1
ATPaseCyanobacteriaAphanothece hegewaldii676748%7.00E-0828%WP_106459345.1
MULTISPECIES: DUF3854 domain-containing proteinCyanobacteriaCyanothece66.266.236%1.00E-0731%WP_009547941.1
DUF3854 domain-containing proteinCyanobacteriaCrocosphaera watsonii66.266.250%1.00E-0726%WP_007310072.1
hypothetical proteinCyanobacteriaMyxosarcina sp. GI165.965.950%1.00E-0725%WP_052055870.1
DNA primaseCyanobacteriaAphanothece hegewaldii65.965.950%2.00E-0724%WP_106459560.1
hypothetical proteinCyanobacteriaMyxosarcina sp. GI165.565.546%2.00E-0726%WP_052056024.1
DUF3854 domain-containing proteinCyanobacteriaCyanothece sp. CCY011064.764.734%4.00E-0731%WP_008278684.1
hypothetical protein TrlaMp60StreptophytaTreubia lacunosa62.462.422%7.00E-0737%YP_004927707.1
hypothetical proteinFirmicutesTumebacillus sp. AR2320861.661.624%1.00E-0627%WP_087457668.1
DUF3854 domain-containing proteinCyanobacteriaCyanothece sp. CCY011063.263.243%1.00E-0627%WP_008276837.1
hypothetical proteinCyanobacteriaMyxosarcina sp. GI162.462.450%2.00E-0625%WP_052056026.1
hypothetical protein BWK76_02530ProteobacteriaDesulfobulbaceae bacterium A261.661.628%3.00E-0630%OQX20052.1
hypothetical proteinCyanobacteriaMyxosarcina sp. GI161.261.250%4.00E-0624%WP_052055874.1
hypothetical proteinFirmicutesLachnospiraceae bacterium60.860.846%4.00E-0622%WP_099450353.1
hypothetical proteinCyanobacteriaMyxosarcina sp. GI159.759.750%1.00E-0524%WP_052055951.1
hypothetical protein C7H79_02365ProteobacteriaNitrosomonas sp. APG558.558.534%2.00E-0531%PSJ18450.1
hypothetical proteinActinobacteriStreptomyces pini58.558.530%3.00E-0525%WP_093850669.1
hypothetical proteinCyanobacteriaMyxosarcina sp. GI158.258.250%4.00E-0524%WP_052056112.1
hypothetical proteinActinobacteriMycobacterium szulgai58.258.236%4.00E-0524%WP_082965783.1
ATP-binding proteinActinobacteriStreptomyces coelicolor57.857.830%4.00E-0525%WP_011030338.1
hypothetical protein A5657_18130ActinobacteriMycobacterium szulgai57.857.836%5.00E-0524%OBK51436.1
hypothetical protein CBD94_01510ProteobacteriaGammaproteobacteria bacterium TMED23455.555.549%3.00E-0426%OUW91419.1
phage/plasmid primase P4CyanobacteriaStanieria cyanosphaera54.754.750%4.00E-0425%WP_015193635.1
MULTISPECIES: hypothetical proteinProteobacteriaAlteromonas54.754.728%5.00E-0425%WP_052010194.1
phage P4 DNA primase domain-containing proteinCyanobacteriaAnabaena sp. 9053.953.952%7.00E-0429%WP_015081293.1
hypothetical proteinCyanobacteriaNostoc sp. ‘Peltigera malacea cyanobiont’ DB399253.953.930%8.00E-0429%WP_099101112.1
DNA primaseFirmicutesEubacterium aggregans53.153.134%0.00129%WP_090304657.1
phage/plasmid primase P4 family C-terminal domain containing proteinProteobacteriaDesulfovibrio africanus53.553.520%0.00129%WP_005988925.1
primaseFirmicutesLactobacillus equicursoris53.553.528%0.00123%WP_008463426.1
DNA primaseCyanobacteriaCrocosphaera watsonii WH 040152.852.849%0.00122%CCQ62642.1
hypothetical proteinProteobacteriaThiotrichales bacterium HS_0853.153.146%0.00124%WP_103918394.1
primaseFirmicutesLactobacillus delbrueckii52.852.828%0.00223%WP_003622798.1
DUF3854 domain-containing proteinCyanobacteriaCrocosphaera watsonii52.452.449%0.00222%WP_053074885.1
MULTISPECIES: hypothetical proteinProteobacteriaDefluviimonas52.452.428%0.00229%WP_035839891.1
phage/plasmid primase P4ProteobacteriaMesorhizobium ciceri51.651.625%0.00325%WP_013531683.1
hypothetical protein AMDU1_APLC00062G0028EuryarchaeotaThermoplasmatales archaeon A-plasma51.651.618%0.00434%EQB70371.1
hypothetical protein BSZ19_16225ProteobacteriaBradyrhizobium japonicum51.251.248%0.00421%OSJ33189.1
hypothetical proteinCyanobacteriaAphanizomenon flos-aquae51.251.252%0.00428%WP_027404306.1
primaseProteobacteriaDesulfovibrio vulgaris51.651.622%0.00426%WP_010939463.1
primaseProteobacteriaDesulfovibrio vulgaris51.651.622%0.00426%WP_011792015.1
hypothetical protein A5769_14235ActinobacteriMycobacterium intracellulare51.251.211%0.00538%OBG17368.1
hypothetical proteinActinobacteriMycobacterium intracellulare51.251.211%0.00538%WP_081284074.1
hypothetical proteinProteobacteriaBradyrhizobium japonicum50.850.848%0.00621%WP_094184029.1
MULTISPECIES: DNA primaseFirmicutesLachnospiraceae50.850.833%0.00727%WP_066730774.1
hypothetical proteinProteobacteriaSandarakinorhabdus sp. AAP6250.150.124%0.00822%WP_017667662.1
hypothetical proteinFirmicutesLachnospiraceae bacterium TWA450.450.434%0.00927%WP_082039423.1
phage/plasmid primase, P4 familyFirmicutesLachnospiraceae bacterium TWA450.150.134%0.0127%KIR03447.1
Results of blastp of six ORFs with e-value 10−2. The first question regarding these six ORFs is whether or not they were created through the shuffling process of endogenous sequences in the plastome. The mitogenomes of land plants rapidly evolve structurally[36], and direct repeats and inverted repeats have served as good tools for rearrangement[37,38]. Additionally, double-strand break repairs with non-homologous end-joining affect the dynamic mitogenomic variation[39]. As a result, novel chimeric ORFs generated by the shuffling process have been reported in the mitogenomes of land plants[40,41]. However, in contrast with mitogenome, rearrangements of plastome have been generally restricted in land plants[1], especially in eusporangiate ferns[42]. Therefore, we ruled out the possibility of the six ORFs having been generated by the shuffling process of plastome sequences. The second question is where the six ORFs region originated from prior to translocation to M. chejuense (or Ophioglossaceae). Gene transfer from other genomes, such as mitogenome or nuclear genome to plastome, were previously thought to occur extremely rarely if at all[43]; however, recently reported gene transfers from mitogenome to plastome[6] have suggested the possibility of gene transfers from nuclear genome to plastome. Knox[44] proposed that large ORFs in the plastomes of 51 species belonging to the Campanulaceae sensu lato arose from nuclear genome and Spooner, et al.[4] provided the first evidence of a known nuclear sequence transferred into plastome. Martin, et al.[45] revealed that massive EGTs have occurred during the evolution from plastid to nucleus in land plants. This implies that plant nuclear genome contains many genes which are orthologous to bacterial genes. Therefore, if the six ORFs region in the plastome of M. chejuense was transferred from the nuclear genome of M. chejuense, they could be similar to the ancient DNAs which remain in green algae or bacteria but not in the plastomes of land plants (Fig. 4A). In addition, as the structures of plastome in land plants have been very conserved throughout evolution with the exception of certain lineages[1], it is likely that the translocated six ORFs region in the plastome of M. chejuense keeps its structure.
Figure 4

Integration models of six ORFs in Ophioglossaceae. (A) Ancient endosymbiotic gene transfer model. (B) Recent horizontal gene transfer model.

Integration models of six ORFs in Ophioglossaceae. (A) Ancient endosymbiotic gene transfer model. (B) Recent horizontal gene transfer model. Another possible scenario for six ORFs is gene transfer from bacteria to the ancestor of Ophioglossaceae or directly to more ancient clade of ferns (Fig. 4B). Although a few nuclear genome sequences have been reported in land plants and most of them belonged to angiosperms[46,47] (https://www.ncbi.nlm.nih.gov/genome), the six ORFs were only similar to the plastomes of very few fern species and the mitogenomes of H. zeylanica. In addition, the amino acid sequences of ORF436 and ORF531 were more similar to the genes of green algae or bacteria, which are distantly related to ferns, than to these of land plants. So far, many HGTs in land plants have been reported, and bacteria, fungi, and viruses have been agents of HGT in certain cases[48,49]. HGT from bacteria to eukaryotes has been detected in yeast[50] along with that from bacteria to organelle[51]. Therefore, it is conceivable that the six ORFs result from HGT from bacteria to ancestor of Ophioglossaceae or more ancient clade of ferns.

The phylogenetic relationships among genera in eusporangiate ferns

Eusporangiate ferns consist of four major families: Equisetaceae, Ophioglossaceae, Psilotaceae, and Marratiaceae. The generic relationships of eusporangiate ferns have been relatively well resolved by previous studies, except for those of the family Ophioglossaceae. Therefore, our phylogenetic study is focused on the family Ophioglossaceae. Four genera of Ophioglossaceae have different distribution patterns. Both Botrychium and Ophioglossum have cosmopolitan distributions[52]. H. zeilanica are distributed in Asia from India and Ceyon to South China, Taiwan, and tropical Australia[52], but M. chejuense is distributed only in Jeju Island of South Korea[24], specifically in twenty areas called “Gotjawal” created by volcanic activity. The trophophore of Mankyua is similar to that of Helminthostachys, but its sporophore is similar to that of Ophioglossum. In addition, Mankyua and Ophioglossum have subterranean vegetative reproduction[24]. Even though these intermediate features of Mankyua confused its phylogenetic position in Ophioglossaceae, phylogenetic analysis containing Mankyua is rare. Sun, et al.[27] presented that Ophioglossum is the sister of Mankyua + Helminthostachys + Botrychium through parsimony analysis using rbcL data, and Shinohara, et al.[28] suggested two different phylogenetic position of Mankyua by ML and Bayesian analysis using rbcL and matK. However, the bootstrap values for the clade comprised of more than two genera that were still under 90% according to previous studies. The phylogenetic relationships among all species used in this paper were almost identical between ML and Bayesian analysis (Fig. 5). Only the topology of ((Amborella, Illicium), (Trithuria (Nuphur, Nymphaea)) was supported as being stronger than that of (Amborella (Illicium (Trithuria (Nuphur, Nymphaea)) in terms of bootstrap value under ML analysis. Eusporangiate ferns, except for the Angiopteris, were monophyly with strong supports, and Ophioglossaceae also formed a clade. The phylogenetic relationships among the four genera in Ophioglossaceae in this study are completely different from those of Sun, et al.[27] and Shinohara, et al.[28]. Mankyua was firstly diverged from a common ancestor of Ophioglossaceae, and then Ophioglossum was subsequently diverged from a common ancestor of Helminthostachys and Botrychium. Finally, Helminthostachys diverged from a sister group with Botrychium. The phylogenetic relationships of Ophioglossum, Helminthostachys, and Botrychium and not for Mankyua correspond with those described in Hauk, et al.[26]. They described that the ophioglossoid (Ophioglossum s.l.) and botrychioid (Helminthostachys + Botrychium s.l.) diverged relatively early in the evolutional history of the Ophioglossaceae.
Figure 5

Phylogeny of eusporangiate ferns using 44 complete plastome sequences of ferns and its relatives. Numbers on the branches refer to ML bootstrap/Bayesian posterior probability. Dash and star stand for less than 50/0.5 and 100/1.0, respectively.

Phylogeny of eusporangiate ferns using 44 complete plastome sequences of ferns and its relatives. Numbers on the branches refer to ML bootstrap/Bayesian posterior probability. Dash and star stand for less than 50/0.5 and 100/1.0, respectively. Considering molecular phylogenetic analysis and the morphological characters of Ophioglossaceae, it seems that the ancestor of Ophioglossaceae have linear, fleshy spikes and vegetative reproduction. The ophioglossoid derived from a common ancestor have specialized their own trophophore and botrychioid have kept their trophophore and have specialized their own sporophore. In addition, the longitudinally dehiscent of sporangium in H. zeylanica was not plesiomorphic but apomorphic characteristic.

Materials and Methods

Plants materials and DNA extraction

H. zeylanica, M. chejuense, and B. ternatum were sampled at Cambodia and Jeju Island, Korea. The voucher specimens were deposited in the Korea University herbarium (KUS, K.-J. Kim et al., TCA2009-0806; K.-J. Jo et al., 2012–0028; K.-J. Kim et al., 2011–1638; Kim et al., 2012–0053). Total genomic DNA was extracted from fresh leaves using the CTAB method[53]. The DNA was purified using ultra-centrifugation in a cesium chloride/ethidium bromide gradient, then further purified by dialysis[54].

Sequencing of the plastome of M. chejuense by PCR method and assembling

The total genomic DNA of M. chejuense was PCR-amplified in order to construct a plastome map using a series of primer sets designed based on three plastome sequences of Psilotum nudum, Adianthum capillus-veneris, and Angiopteris evecta[16,55]. Both the long-range PCR method and normal PCR method were employed using overlapping primer sets. The PCR condition for long range amplification was as follows: initial denaturation step for 4 min at 94 °C, then 35 cycle amplifications consisting of 30 sec denaturation at 94 °C, 30 sec annealing at 53~65 °C, and about 1 min/kb extension at 68 °C, followed by an extension period of 7 min at 72 °C. The PCR condition for normal amplification was as follows: initial denaturation step for 4 min at 94 °C, then 35 cycle amplifications consisting of 30 sec denaturation at 94 °C, 30 sec annealing at 47~52 °C, and about 2 min extension at 72 °C, followed by an extension period of 3 min at 72 °C. The PCR products were purified with the MEGAquick-spin kit (iNtRON, Seoul, Korea) and the cleaned products were sequenced in both directions using an ABI 3730XL automatic sequencer. Sequence contigs were assembled using Sequencher 4.7 (Gene Code Corporation, Ann Arbor, MI, USA).

Sequencing of the plastomes of H. zeylanica, M. chejuense, and B. ternatum by NGS and assembling

The genomic DNAs of H. zeylanica, M. chejuense, and B. ternatum were sequenced using MiSeq (Illumina, San Diego, CA, USA) (Supplementary Table S3). The raw reads were trimmed by trimmomatic 0.36[56] with LEADING:10 (trimming the leading sequences until quality >10), TRAILING:10 (trimming the trailing sequences until quality >10), SLIDINGWINDOW:4:20 (trimming the window of size four for reads with the average quality less than 20), and MINLEN:50 (removing reads less than 50 bp in length). We followed the assembly method described by Kim, et al.[57] using the plastome sequences of O. californicum[18] and M. chejuense (NC017006) sequenced through PCR in this paper. Certain regions with low coverages caused by simple sequence repeats were verified using PCR.

Gene annotation

Genes in four plastomes were annotated compared with previously reported plastomes in eusporangiate ferns based on similarity. Coding genes and tRNAs were re-checked by blastp[58] and tRNAscan-SE[59]. ORFs were annotated using with >303 bp in length.

Analyses of six ORFs in Ophioglossaceae

Six ORFs of M. chejuense were searched using blastn with 11 word size and 10−5 e-value and blastp with 3 word size and 10−2 e-value[58] in order to investigate the homology with previously reported sequences in GenBank. In order to investigate the translocation of six ORFs into other genomes like nuclear or mitochondrial genome (mitogenome), three NGS raw data were de novo assembled using megahit[60] and contigs were hit to six ORFs using blastn[58]. Mauve[61] and Circoletto[62,63] were used to visualize sequence similarity between six ORFs contigs in Ophioglossaceae.

Phylogenetic relationships among four genera in Ophioglossaceae

The 44 complete plastome sequences of ferns and their relatives were used to resolve the unclear intergeneric relationships in Ophioglossaceae (Supplementary Table S4). Eighty-four protein coding genes were extracted from each plastome. Each gene was aligned by MAFFT[64] and 84 aligned genes were concatenated into a single aligned sequence. The best-fit nucleotide substitution models for each gene position in a single concatenated sequence were evaluated using Partitionfinder V2.1.1[65,66]. The maximum likelihood (ML) analysis was inferred by RAxML Black Box[67] in CIPRES Science Gateway[68] and Bayesian inference (BI) analysis was inferred by Mrbayes[69] under GTR substitution model with gamma-distributed rate variation and a proportion of invariable sites (ngen = 1,000,000, samplefreq = 200, burninfrac = 0.25). Supplementary Dataset 1
  54 in total

1.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

2.  Gene transfer from organelles to the nucleus: how much, what happens, and Why?

Authors: 
Journal:  Plant Physiol       Date:  1998-09       Impact factor: 8.340

3.  Plastome Sequencing of Ten Nonmodel Crop Species Uncovers a Large Insertion of Mitochondrial DNA in Cashew.

Authors:  Samar O Rabah; Chaehee Lee; Nahid H Hajrah; Rania M Makki; Hesham F Alharby; Alawiah M Alhebshi; Jamal S M Sabir; Robert K Jansen; Tracey A Ruhlman
Journal:  Plant Genome       Date:  2017-11       Impact factor: 4.089

4.  Recent horizontal intron transfer to a chloroplast genome.

Authors:  Elena V Sheveleva; Richard B Hallick
Journal:  Nucleic Acids Res       Date:  2004-02-03       Impact factor: 16.971

5.  New reports of nuclear DNA content for 407 vascular plant taxa from the United States.

Authors:  Chengke Bai; William S Alverson; Aaron Follansbee; Donald M Waller
Journal:  Ann Bot       Date:  2012-10-24       Impact factor: 4.357

6.  MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space.

Authors:  Fredrik Ronquist; Maxim Teslenko; Paul van der Mark; Daniel L Ayres; Aaron Darling; Sebastian Höhna; Bret Larget; Liang Liu; Marc A Suchard; John P Huelsenbeck
Journal:  Syst Biol       Date:  2012-02-22       Impact factor: 15.683

7.  Plastid phylogenomics of the cool-season grass subfamily: clarification of relationships among early-diverging tribes.

Authors:  Jeffery M Saarela; William P Wysocki; Craig F Barrett; Robert J Soreng; Jerrold I Davis; Lynn G Clark; Scot A Kelchner; J Chris Pires; Patrick P Edger; Dustin R Mayfield; Melvin R Duvall
Journal:  AoB Plants       Date:  2015-05-04       Impact factor: 3.276

8.  Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

Authors:  Yeong Deuk Jo; Yoomi Choi; Dong-Hwan Kim; Byung-Dong Kim; Byoung-Cheorl Kang
Journal:  BMC Genomics       Date:  2014-07-04       Impact factor: 3.969

9.  Plastid Phylogenomics Resolve Deep Relationships among Eupolypod II Ferns with Rapid Radiation and Rate Heterogeneity.

Authors:  Ran Wei; Yue-Hong Yan; A J Harris; Jong-Soo Kang; Hui Shen; Qiao-Ping Xiang; Xian-Chun Zhang
Journal:  Genome Biol Evol       Date:  2017-06-01       Impact factor: 3.416

10.  Fern genomes elucidate land plant evolution and cyanobacterial symbioses.

Authors:  Fay-Wei Li; Paul Brouwer; Lorenzo Carretero-Paulet; Shifeng Cheng; Jan de Vries; Pierre-Marc Delaux; Ariana Eily; Nils Koppers; Li-Yaung Kuo; Zheng Li; Mathew Simenc; Ian Small; Eric Wafula; Stephany Angarita; Michael S Barker; Andrea Bräutigam; Claude dePamphilis; Sven Gould; Prashant S Hosmani; Yao-Moan Huang; Bruno Huettel; Yoichiro Kato; Xin Liu; Steven Maere; Rose McDowell; Lukas A Mueller; Klaas G J Nierop; Stefan A Rensing; Tanner Robison; Carl J Rothfels; Erin M Sigel; Yue Song; Prakash R Timilsena; Yves Van de Peer; Hongli Wang; Per K I Wilhelmsson; Paul G Wolf; Xun Xu; Joshua P Der; Henriette Schluepmann; Gane K-S Wong; Kathleen M Pryer
Journal:  Nat Plants       Date:  2018-07-02       Impact factor: 15.793

View more
  3 in total

1.  Structural Variation of Plastomes Provides Key Insight Into the Deep Phylogeny of Ferns.

Authors:  Xin-Yu Du; Li-Yaung Kuo; Zheng-Yu Zuo; De-Zhu Li; Jin-Mei Lu
Journal:  Front Plant Sci       Date:  2022-05-02       Impact factor: 6.627

Review 2.  Horizontal Gene Transfer Involving Chloroplasts.

Authors:  Ewa Filip; Lidia Skuza
Journal:  Int J Mol Sci       Date:  2021-04-25       Impact factor: 5.923

3.  The dynamic evolution of mobile open reading frames in plastomes of Hymenophyllum Sm. and new insight on Hymenophyllum coreanum Nakai.

Authors:  Hyoung Tae Kim; Jung Sung Kim
Journal:  Sci Rep       Date:  2020-07-06       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.