Literature DB >> 30226857

The complete plastid genomes of Ophrys iricolor and O. sphegodes (Orchidaceae) and comparative analyses with other orchids.

Luca Roma1, Salvatore Cozzolino1, Philipp M Schlüter2,3, Giovanni Scopece1, Donata Cafasso1.   

Abstract

Sexually deceptive orchids of the genus Ophrys may rapidly evolve by adaptation to pollinators. However, understanding of the genetic basis of potential changes and patterns of relationships is hampered by a lack of genomic information. We report the complete plastid genome sequences of Ophrys iricolor and O. sphegodes, representing the two most species-rich lineages of the genus Ophrys. Both plastomes are circular DNA molecules (146754 bp for O. sphegodes and 150177 bp for O. iricolor) with the typical quadripartite structure of plastid genomes and within the average size of photosynthetic orchids. 213 Simple Sequence Repeats (SSRs) (31.5% polymorphic between O. iricolor and O. sphegodes) were identified, with homopolymers and dipolymers as the most common repeat types. SSRs were mainly located in intergenic regions but SSRs located in coding regions were also found, mainly in ycf1 and rpoC2 genes. The Ophrys plastome is predicted to encode 107 distinct genes, 17 of which are completely duplicated in the Inverted Repeat regions. 83 and 87 putative RNA editing sites were detected in 25 plastid genes of the two Ophrys species, all occurring in the first or second codon position. Comparing the rate of nonsynonymous (dN) and synonymous (dS) substitutions, 24 genes (including rbcL and ycf1) display signature consistent with positive selection. When compared with other members of the orchid family, the Ophrys plastome has a complete set of 11 functional ndh plastid genes, with the exception of O. sphegodes that has a truncated ndhF gene. Comparative analysis showed a large co-linearity with other related Orchidinae. However, in contrast to O. iricolor and other Orchidinae, O. sphegodes has a shift of the junction between the Inverted Repeat and Small Single Copy regions associated with the loss of the partial duplicated gene ycf1 and the truncation of the ndhF gene. Data on relative genomic coverage and validation by PCR indicate the presence, with a different ratio, of the two plastome types (i.e. with and without ndhF deletion) in both Ophrys species, with a predominance of the deleted type in O. sphegodes. A search for this deleted plastid region in O. sphegodes nuclear genome shows that the deleted region is inserted in a retrotransposon nuclear sequence. The present study provides useful genomic tools for studying conservation and patterns of relationships of this rapidly radiating orchid genus.

Entities:  

Mesh:

Year:  2018        PMID: 30226857      PMCID: PMC6143245          DOI: 10.1371/journal.pone.0204174

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Plastids such as chloroplasts are important plant organelles involved in the photosynthetic process thus providing essential energy to plants [1]. Plastids have small circular genomes, ranging from 135 to 160 kb [2-4]. Most angiosperm plastid genomes so far annotated have a quadripartite structure containing two copies of Inverted Repeat (IR) regions, separating a Large Single Copy (LSC) and Small Single Copy (SSC) regions [5-7]. Recently, with the extraordinary advances in sequencing platforms, many plastid genomes have been annotated and have provided valuable tools for the understanding of plant phylogenies and genome evolution e.g. [8]. Plastid structure and gene order are generally stable, and the rate of nucleotide substitution is slow [9] so that plastid genomes were traditionally considered to have experienced rearrangements rarely enough to be suitable to demarcate major plant groups [10]. Nonetheless, several angiosperm lineages show extensive gene order changes in plastid genomes that are often correlated with increased rates of nucleotide substitutions and gene and/or multiple intron losses [11, 12]. These rearrangements in the plastid genome have been found to be often associated with repeated sequences [2]. The family Orchidaceae consists of more than 700 genera and approximately 28,000 species [13], which are distributed in a wide variety of habitats. So far, several complete plastid genomes have been annotated in different orchid lineages. These studies revealed that Orchidaceae often underwent accelerated plastome evolution including large inversions, shifts in boundaries between IRs and the two single copies, indels, intron losses, and pseudogene formation by stop codons often associated with shifts from heterotrophy to parasitism/heterotrophism [14,15]. Compared to other angiosperms, photosynthetic orchids were also found particularly variable in the conservation of NADH dehydrogenase (ndh) genes [16], that encode components of the thylakoid complex involved in the redox level of the cyclic photosynthetic electron transporters. The number of intact and degraded ndh genes present in the orchids plastomes varies even among closely related species suggesting that this specific gene class may be actively degraded in Orchidaceae [17]. This is not surprising as gene transfer from plastid to nucleus is known to occur frequently during evolutionary processes as even the complete loss of some plastid-encoded ndh genes seems to not affect the plant life [15]. Indeed, there is no clear-cut evidence of phylogenetic signal in the pseudogenization or loss of the ndh genes. For instance, no correlation with phylogeny was found for ndh genes loss in the Epidendroideae lineages while related species of Oncidiinae show a consistent loss of two ndh genes (ndhF and ndhK) and pseudogenization by gene truncation of other five genes (ndhA, D, H, I and J) [18]. The IR/SC junctions represent another hotspot of orchid plastome evolution, with the rearrangement of flanking regions leading to expansion or contraction of the inverted repeat regions. Different types of junctions have been reported in orchids, with considerable variation particularly in the ycf1 gene [19]. It has been hypothesized that the exhibited usage bias of A/T base pairs typical of all known orchid ycf1 genes would render less stable the DNA in the ycf1 gene thus leading to the higher recombination of IR/SSC junction [20]. This often leads to a consequent partial or complete degradation of the ndhF gene, or even, in some case, to its transfer to mitochondrial DNA by intraorganellar recombination [17]. Despite Orchidaceae represents approximately 1/8 of all flowering plants [13], most published plastid sequences belong to tropical orchid lineages, while there is a remarkable dearth of information for the important temperate terrestrial subtribe Orchidinae with only two Habenaria and one Platanthera species plastomes having been annotated so far [17, 21]. With the aim to fill this gap, we sequenced the complete plastid genomes of Ophrys iricolor and Ophrys sphegodes. These species are representative of the two main diverging lineages of the Mediterranean Ophrys, a sexually deceptive genus belonging to the subtribe Orchidinae characterized by an elevated taxonomic complexity due to a very fast radiation by pollinator shifts [22, 23]. The specific aims of the present study were to (i) annotate the complete plastid genome sequences of two Ophrys species, (ii) evaluate the homology between these two plastomes, (iii) investigate any significant characteristics suggesting plastome rearrangement in Ophrys and their phylogenetic signal, and (iv) explore significant changes in gene content and gene order in the subtribe Orchidinae compared to other orchid subtribes.

Materials and methods

Genome sequencing, assembling and annotation

DNA was extracted from a specimen of Ophrys iricolor (collected between Miamou and Agios Kyrillos, Crete, Greece; N34.9693, E24.9154; under permit number 118565/3022 issued by the Ministry of Environment and Energy in Athens on 13.02.2015) and from a specimen of Ophrys sphegodes (collected between Cagnano Varano and San Nicandro Garganico, Apulia, Italy; N41.9133, E15.6784 under permit number 173 issued by the National Park of Gargano in Monte Sant’Angelo (FG) on 12.01.2016). Whole genomic libraries were sequenced in paired-end mode, 2 x 150 bp, using the Illumina HiSeq 4000 platform (Illumina Inc., San Diego, CA, USA) at the Functional Genomics Centre Zurich (Switzerland). The obtained reads were trimmed using the software TRIMMOMATIC v. 0.36 [24] and the resulting trimmed reads (309,012,252 reads for O. sphegodes and 251,959,572 reads for O. iricolor) were de novo assembled using NOVOPLASTY v. 2.5.2 [25]. The gene annotation of the Ophrys plastid genomes was carried out using the software GESEQ v. 1.42 [26] and BLAST v. 2.6.0 [27] searches. From this initial annotation analysis, putative starts and stops of the gene exons, along with the positions of the related introns, were determined based on comparisons to homologous genes in other plastid genomes [28]. All tRNA genes were verified by using tRNAscan-SE server v. 1.3.1 [29]. The physical maps of the plastid circular genomes were drawn using Organellar Genome DRAW (OGDRAW) v. 1.2.1 [30]. The complete plastome sequences of Ophrys sphegodes and O. iricolor were deposited in the Sequence Reads Archive (NCBI-SRA) database under the accession number SRP148126. BLAST v. 2.6.0 [27, 31] was used to check whether deleted part of the ndhF gene in the O. sphegodes plastid genome was translocated into the nuclear genome. Reads were realigned against the assembled scaffolds of O. sphegodes nuclear genome (unpublished) using BWA v. 0.7.16 and converted in BAM [32] format using SAMtools v. 1.5 [33]. Finally, a BLASTX search was performed to annotate the nuclear O. sphegodes scaffold1075174.

Genome structure, deletions validation, and repeat sequences

The software MAFFT v. 7.205 [34] and the Perl script Nucleotide MUMmer (NUCmer) available in MUMmer 3.0 [35] were employed to compare the plastome structures between O. sphegodes and O. iricolor. To detect putative errors in the de novo assemblies, the trimmed reads were mapped to the assembled genomes using the aligner BWA [32], converted to BAM format using SAMtools [33] and finally visualized using the IGV genome browser v. 2.4 [36]. To validate the deletion in silico, BAM files were further analysed using the software BEDtools coverage v. 2.21.0 [37] which generated a table in BED format containing an interval “windows” with coverage information across the two Ophrys plastomes. The BED file format was in turn used to visualize the sequencing coverage in regions of interest using the software CNView v. 1.0 [38]. To experimentally validate the ndhF deletion in O. sphegodes/O. iricolor, we designed primers for both the flanking and internal regions of ndhF from the assembled plastomes (S1A Fig). With these primers, we PCR amplified DNAs of O. sphegodes and O. iricolor from different localities and of O. incubacea and O. fusca, as close relatives to O. sphegodes and O. iricolor, respectively and O. insectifera as distant related. PCR reaction conditions were as described in [39], with 5 ng of total DNA as template. Amplification products were visualized on 2% agarose gel using a 100 bp ladder as standard. PCR products and ladder were stained with ethidium bromide and photographed using a digital camera. Confirmatory sequences of the PCR products were done with ABI3130 automatic sequencer following manufacture instructions. Simple sequence repeats (SSRs) or microsatellites were detected using the MIcroSAtellite (MISA) Perl script v. 1.0 [40]. Thresholds were set at eight repeat units for mononucleotide SSRs, four repeat units for di- and trinucleotide SSRs, and three repeat units for tetra-, penta- and hexanucleotide SSRs as done in [41]. We also analysed tandem repeat sequences from the plastid genomes of O. sphegodes and O. iricolor and searched for forward, reverse and palindromic repeats by using REPuter [42]. We limited the maximum computed repeats and the minimal repeat size to 50 and 8, respectively and with a Hamming distance equal to 1.

Prediction of RNA editing sites and identification of positive signatures in plastid protein-coding genes

Potential RNA editing sites in protein-coding genes of Ophrys plastome were predicted by the program PREPACT v. 2.0 [43] using the following 30 highly homologous reference genes from Phalaenopsis aphrodite: accD, atpA, atpB, atpF, atpI, ccsA, clpP, matK, petB, petD, petG, petL, psaB, psaI, psbB, psbE, psbF, psbL, rpl2, rpl20, rpl23, rpoA, rpoB, rpoC1, rpoC2, rps2, rps8, rps14, rps16, and ycf3. In order to identify putative genes under positive selection, the 67 protein-coding genes present in sixteen Orchidaceae plastomes (Ophrys iricolor, AP018716 O. sphegodes AP018717, Cattleya crispata NC_026568.1, Corallorhiza odontorhiza KM390021.1, Cymbidium aloifolium NC_021429.1, Cypripedium japonicum KJ625630.1, Goodyera procera NC_029363.1, Habenaria pantlingiana NC_026775.1, Masdevallia coccinea NC_026541.1, Phalaenopsis aphrodite NC_017609.1, Anoectochilus emeiensis NC_033895.1, Apostasia wallichii NC_030722.1, Dendrobium officinale KX377961.1, Phragmipedium longifolium KM032625.1, Platanthera japonica MG925368.1, Vanilla planifolia KJ566306.1) were downloaded from Genbank. We analysed all coding gene regions, except ndh genes, due to their frequent loss across the entire set of orchids listed here. In order to build a reference phylogenetic tree, all genes were aligned using MAFFT software v. 7.205 [44] and were concatenated using MESQUITE software v. 3.5 [45]. PARTITION FINDER software v. 2.1.0 [46] was used in order to search the best evolution model for each gene and a reference phylogenetic tree was built using RAxML software v. 8.2.10 using 1000 bootstrap replicates [47]. The positive signatures were analysed using SELECTON server v. 2.4 (http://selecton.tau.ac.il/index.html; [48], Ophrys iricolor was used as query sequence (i.e. the plastome type without ndhF deletion) and codon alignment was done using the software MAFFT v. 7.205 [44] implemented in SELECTON software. The phylogenetic tree was set as input in SELECTON analyses and branch lengths were automatically optimized from the software. The gene divergence was estimated by the sum of total branch lengths that link the operational taxonomical units to the common ancestor of Orchidaceae species sampled here as done in [28]. SELECTON software generated for each gene as output the number of putative sites under positive selection. In order to test whether positive selection is operating on a protein, a Likelihood Ratio Test for positive selection was performed with the comparison of M8 (allows positive selection) against M8a (null model). We consider in our analysis only sites where possible positive selection was inferred (lower bound > 1 and test with probability < 0.01). P-values were adjusted for multiple testing in R (R Core Team) using FDR method in the p.adjust function.

Results and discussion

Genome organization and features

The plastomes of the two Ophrys species are circular DNA molecules of 146,754 bp for O. sphegodes and 150,177 bp for O. iricolor with the typical quadripartite structure of plastid genomes of flowering plants (Fig 1): a pair of inverted repeats of 25,052 bp and 26,348 bp, respectively, separated by a large single copy (LSC) region (80,471 bp and 80,541, respectively) and a small single copy (SSC) region of 16,179 bp and 16,940 bp, respectively for O. sphegodes (DDBJ accession number AP018717) and O. iricolor (DDBJ accession number AP018716). The size of the Ophrys plastid genome was comparable to other published plastomes of photosynthetic orchids. The plastomes of the two Ophrys species are largely collinear with the exception of a large deletion in the ndhF gene in O. sphegodes.
Fig 1

Gene map of Ophrys sphegodes (a) and Ophrys iricolor (b) plastid genomes. Genes drawn inside the circle are transcribed in the clockwise direction, and genes drawn outside are transcribed in the counter-clockwise direction. Different functional groups of genes are colour-coded. The darker grey in the inner circle corresponds to G/C content, and the lighter grey corresponds to A/T content. LSC, Large Single Copy; SSC, Small Single Copy; IRA/B, Inverted Repeat A/B. The enlargement shows that the loss of the partial duplicated gene of ycf1 and the truncation of ndhF gene in O. sphegodes are correlated with the shift of the junction between the IR and SSC.

Gene map of Ophrys sphegodes (a) and Ophrys iricolor (b) plastid genomes. Genes drawn inside the circle are transcribed in the clockwise direction, and genes drawn outside are transcribed in the counter-clockwise direction. Different functional groups of genes are colour-coded. The darker grey in the inner circle corresponds to G/C content, and the lighter grey corresponds to A/T content. LSC, Large Single Copy; SSC, Small Single Copy; IRA/B, Inverted Repeat A/B. The enlargement shows that the loss of the partial duplicated gene of ycf1 and the truncation of ndhF gene in O. sphegodes are correlated with the shift of the junction between the IR and SSC. The percentage of plastid reads in total WGS data was 5.43% for O. sphegodes and 1.96% for O. iricolor. The lowest average coverage of the assembled plastid genomes used was 13,673x for O. sphegodes and 3,816x for O. iricolor. The G/C contents were 37.14% and 36.4% respectively for O. sphegodes and O. iricolor, similar to other angiosperms (Table 1). The Ophrys plastome is predicted to encode 107 distinct genes, 17 of which are completely duplicated in the IR regions resulting in a total of 124 genes (Table 2). The annotation revealed distinct protein-coding genes (seven of them completely duplicated, namely ndhB, rpl2, rpl23, rps7, rps12, rps19 and ycf2), 30 distinct tRNAs genes (five of them duplicated, trnH-GUG, trnL-CAA, trnN-GUU, trnR-ACG, trnV-GAC and one triplicated trnM-CAU), and four distinct rRNA genes (all of them completely duplicated: rrn4.5, rrn5, rrn16 and rrn23). A truncated gene ndhF, was identified in O. sphegodes but not in O. iricolor. Ten genes contain one intron (atpF, ndhA, ndhB, petB, petD, rpl2, rpl16, rps12, rps16 and rpoC1) and two genes (clpP and ycf3) contain two introns.
Table 1

Comparison of two Ophrys plastid genomes.

Ophrys sphegodesOphrys iricolor
Plastid reads16,782,955 bp4,952,605 bp
Average plastid coverage13,673 x3,816 x
G/C percentage37.14%36.4%
Large Single Copy Region80,471 bp80,541 bp
Small Single Copy Region16,179 bp16,940 bp
Inverted Repeats25,052 bp26,348 bp
Table 2

List of genes identified in the plastomes of Ophrys iricolor and Ophrys sphegodes.

Group of geneName of gene
Ribosomal RNA genesrrn16a; rrn23a; rrn4.5a; rrn5a
Transfer RNA GenestrnC-GCA; trnD-GUC; trnE-UUC; trnF-GAA; trnG-CCC; trnG-GCC; trnH-GUGa; trnL-CAAa; trnL-UAG; trnM-CAUc; trnN-GUUa; trnP-UGG; trnQ-UUG; trnR-ACGa; trnR-UCU; trnS-GCU; trnS-UGA; trnS-GGA; trnT-GGU; trnT-UGU; trnV-GACa; trnW-CCA; trnY-GUA
Small subunit of ribosomerps2; rps3; rps4; rps7a; rps8; rps11; rps12a; rps14; rps15; rps16; rps18; rps19a
Large subunit of ribosomerpl2a; rpl14; rpl16; rpl20; rpl22; rpl23a; rpl32; rpl33; rpl36
DNA-dependent RNA polymeraserpoA; rpoB; rpoC1; rpoC2
Genes for photosynthesis:
Subunits of photosystem I (PSI)psaA; psaB; psaC; psaI; psaJ; ycf3; ycf4
Subunits of photosystem II (PSII)psbA; psbB; psbC; psbD; psbE; psbF; psbH; psbI; psbJ; psbK; psbL; psbM; psbN, psbT; psbZ
Subunits of cytochrome b6fpetA; petB; petD; petG; petL; petN
Subunits of ATP synthaseatpA; atpB; atpE; atpF; atpH; atpI
Subunits of NADH dehydrogenasendhA; ndhBa; ndhC; ndhD; ndhE; ndhFb; ndhG; ndhH; ndhI; ndhK; ndhJ
Large subunits of RubiscorbcL
Other genes:
MaturasematK
Envelope membrane proteincemA
Subunit of acetyl-CoA carboxylaseaccD
C-type cytochrome synthesis geneccsA
ProteaseclpP
Component of TIC complexycf1d
Translation initiation factor IF-1infA
Genes of unknown functionycf2a

a Duplicated gene

b Truncated in O. sphegodes

c triplicated gene

d partially duplicated in O. iricolor

a Duplicated gene b Truncated in O. sphegodes c triplicated gene d partially duplicated in O. iricolor

Repeat sequence detection

The occurrence, type, and distribution of SSRs in Ophrys plastomes were analysed. In total, 213 SSRs were identified in O. sphegodes and O. iricolor. Three of these microsatellites occurred in the sequence portion that is deleted in O. sphegodes plastome (S1 Table). Homopolymers and dipolymers were the most common SSRs with, respectively, 71% and 24% occurrence. Seven and nine SSRs were present in compound formation in O. iricolor and O. sphegodes, respectively. Furthermore, the majority of O. sphegodes and O. iricolor SSRs are located in IGS regions (56.2% and 55%), followed by coding sequences (38.2% and 38%) and introns (5.6% and 7%), respectively. SSRs located in coding regions were found mainly in ycf1 and rpoC2 genes. A comparison of SSRs found in the two Ophrys species showed that 67 SSRs (31.5% of the total) were polymorphic between the two species. Among these polymorphic SSRs, 46 were located in the IGS regions, 5 in introns and 16 in genes (S1 Table). Ophrys sphegodes contains 15 directed repeats, 9 inverted repeats, 3 complementary repeats and 21 palindromic repeats, whose lengths range from 18 to 60 bp. Ophrys iricolor contains 15 directed repeats, 27 palindromic repeats, 2 complementary repeats and 5 inverted repeats, whose lengths range from 20 to 60 bp. Most of the O. iricolor and O. sphegodes repeats were located in IGS regions (65.3% and 66.7% respectively), others were located in genes (22.4%, in ycf2, petG, ndhC, psaA and 22.9%; in psbI, ndhC, ycf2, ndhA respectively) and introns (12.3% and 10.4% in clpP and rps16 intron respectively).

RNA editing sites prediction and positive signatures of adaptive evolution

The RNA editing is a post-transcriptional modification typical of plastid and mitochondrial DNA. The process originated early during the evolution of land plants and several RNA editing sites have been maintained or lost during angiosperms evolution [49, 50]. In our analysis, PREPACT found a total of 83 and 87 putative RNA editing sites in 25 genes in O. sphegodes and O. iricolor respectively (S2 Table), in line with previous report for other orchids [51]. The RNA editing sites predicted for plastid genes of Ophrys sphegodes and Ophrys iricolor occur in the first or second codon position with all nucleotide changes being from cytidine (C) to uridine (U), as very often reported in other angiosperms. In O. sphegodes the genes predicted to have RNA editing sites are matK (12 sites), rpoC1 (9 sites), rpoC2 (8 sites), rpoB (8 sites), accD (6 sites), rpoA (4 sites), atpA (4 sites), rpl2 (3 sites) rpl20 (3 sites), atpI (3 sites), ccsA (3 sites), ycf3 (3 sites), clpP (3 sites), petB (2 sites), rps16 (2 sites) and the atpF, petD, petL, psaB, psaI, psbF, rpl23, rps2, rps8 and rps14 genes with only one site. In O. iricolor the genes predicted were the same as O. sphegodes with few differences: ccsA (5 sites), atpA (3 sites), psaB (2 sites), rps14 (2 sites), atpF (2 sites) (S2 Table) which suggest a general conservation of the RNA editing mechanism within Ophrys but also that RNA editing evolution accumulated enough differences to differentiate two Ophrys species. A previous study has also found that the number of RNA editing sites predicted for protein-coding genes in orchids species is high in comparison with other monocots [51].Likelihood ratio test between a null model and an alternative model carried out following [52] shows that 24 genes are under positive selection (S3 Table); overall, the most divergent genes have the stronger signatures of positive selection (S2 Fig). In details, the positively selected genes were involved in different essential functions such as photosynthesis, PSII (psbA, psbB, psbE, psbH, psbM, psbN genes), large subunits of rubisco (rbcL), ATP synthase (atpI gene), cytochrome b6f (petB gene), subunits of RNA polymerase (rpoA, rpoB, rpoC1, rpoC2 genes), RNA maturation (matK gene), ribosomal proteins (rpl20, rpl22, rpl32, rpl33, rps12, rps19 genes), fatty acid biosynthesis (accD gene), cytochrome biosynthesis (ccsA gene), import of protein in the plastid (ycf1 gene), and unknown function (ycf2 gene). The high number of genes containing positive signatures (including the rbcL gene) among photosynthesis-related genes are consistent with previous observation on other monocots and may be related to the recent increase of diversification rate following adaptation to different ecological conditions. [53]. In particular, and as already suggested for other monocots as Arecaceae, many tropical orchid species grow as epiphytes in tropical forests and are shade adapted. The transition to the terrestrial habitus of all temperate orchid lineages (as Ophrys) may have promoted a new selective pressure for improving the photosynthesis efficiency under the new terrestrial ecological conditions [52]. Interestingly, some positively selected sites that were identified in our study (e.g., the accD and ycf1genes) have been found very variable also in other orchids and flowering plants [54]. In particular accD gene is a conserved plastid gene involved in de novo synthesis of fatty acids [55] and is essential for chloroplast functionality, leaf development and longevity [56]. Therefore accD has been associated in a significant manner with adaptation to the environment, including factors such as temperature, light, humidity, and atmosphere [57]. On the other hand, ycf1 is one of the largest plastid genes and it has been found extremely divergent in orchids plastomes [19], likely because of its position at IR/SC junction that generates large variation in sequence length and pseudogenes [58] as also found in our study.

Genomic comparison of Ophrys with other orchid plastomes

The Ophrys plastid genome is fully collinear both in gene order an gene orientation with the other available Orchidoideae. When compared with representative species belonging to the different subfamilies of the Orchidaceae (i.e., Epidendroideae, Cypripedioideae, Vanilloideae and Apostasioideae), we found that Cypripedioideae, Epidendroideae, Vanilloideae and Orchidoideae are largely collinear in plastid sequence with a few small exceptions: an inversion of the psbM—petN gene order in Epidendroideae and a gene inversion in the SSC of Vanilla (S3 Fig). In contrast to these four tribes, large rearrangements in gene order have been found in the supposed basal smaller tribe of Apostasioideae. However, under the assumption that the common plastid types observed in most orchids represent the primitive state, it is likely that the rearrangements found in Apostasia wallichii and Apostasia odorata (but not in the related Neuwiedia [59]) may be due to recent, terminal autoapomorphic changes rather than being representative of the ancestral gene order of the orchid family. As many ndh genes had either truncations or indels, resulting in frameshifts or pseudogenes in several orchid plastomes, we also compared ndh genes in the different tribes. Ophrys iricolor, like other Orchidoideae, has the complete set of ndh plastid genes, i.e. 11 functional genes, which is different from Apostasia wallichii and Vanilla planifolia in which the ndhB gene is truncated and from Vanilla planifolia where all other 10 ndh subunits are deleted. The presence of ndh genes within terrestrial Orchidoideae is ubiquitous, which contrasts with the extensive variation in presence/absence of ndh genes found within tropical orchid genera (see Cymbidium [17]). The functional role of the ndh genes seems closely related to the land adaptation of photosynthesis so they have been conserved in terrestrial, temperate orchid plastomes whereas they are partially lost in epiphytic, tropical orchid plastomes [60].

Boundaries between single copy and inverted repeat regions

Expansion or contraction of the IR region is one of the main causes of size variation among angiosperm plastid genomes [61] and it has found to be variable even among related orchid species as, for instance, within the Cymbidium genus [17]. The multiple genome alignment analysis using plastome sequences of O. sphegodes and O. iricolor revealed the loss of a ycf1 fragment in the IR and partial deletion of the ndhF gene in O. sphegodes (S4 Fig). In silico validation confirmed the partial ndhF gene loss in O. sphegodes and demonstrated that part of the ycf1 gene is duplicated in O. iricolor, which does not occur in O. sphegodes (Fig 2).
Fig 2

In silico validation of ndhF deletion (using software CNView) comparing O. sphegodes plastid reads against reference genome of O. iricolor (a) and O. iricolor plastid reads against reference genome of O. sphegodes (b). Y-axis represents normalized coverage values.

In silico validation of ndhF deletion (using software CNView) comparing O. sphegodes plastid reads against reference genome of O. iricolor (a) and O. iricolor plastid reads against reference genome of O. sphegodes (b). Y-axis represents normalized coverage values. In O. sphegodes, the loss of the partial duplicated gene of ycf1 and the partial deletion of ndhF gene are correlated with the shift of the junction between the IR and SSC (Fig 1) with a pattern very similar to some Cymbidium species [17]. High sequence variability, especially in the ycf1 gene at IR-SSC junction, have been frequently observed as a result of expansion and contractions events by gene conversion [62, 63]. While in silico validation by CNVIEW largely confirms the occurrence of the ndhF deletion in O. sphegodes, however, approximately 2% of O. sphegodes reads map on the plastid region corresponding to O. iricolor plastome type (i.e. where complete ndhF occurs). At the same time, IGV also reveals that 888 of O. iricolor reads map on the junction with ndhF deletion (i.e. corresponding to O. sphegodes plastome type). Thus, to confirm the occurrence of ndhF deletion in O. sphegodes/O. iricolor, we amplified DNA with primers for both the flanking and internal regions of ndhF. Further, to rule out any possible cross contamination (during the NGS steps) as cause of presence of both plastome types in both Ophrys species, different accessions were used in PCR validation. PCR amplifications with primers flanking ndhF yielded two amplicons in O. iricolor: a small one (0.25 Kb), corresponding to the plastid fragment with the ndhF deletion, and a larger amplicon (3.25 Kb) containing the undeleted ndhF gene. Only the small plastid fragment with the ndhF deletion (primers F1/R1) was detected in O. sphegodes. To exclude, in O. sphegodes, that the small fragment was selectively amplified due to its shorter size and higher copy number, we also amplified O. sphegodes and O. iricolor (as control) with primers located within the ndhF deletion (primers F2/R2). Contrary to expectation (i.e. no amplification in O. sphegodes) both species successfully amplified a 1.2 Kb fragment. However, the two species differed in their amplicon yield, i.e. we obtained a stronger amplification band in O. iricolor compared to O. sphegodes (S1 Fig b). Taken together, this suggest that both species contained copies with and without the ndhF deletion but with a different relative representation (high proportion of deletions in O. sphegodes and low in O. iricolor). The fact that all examined members of O. sphegodes and O. iricolor lineages (including the basal O. insectifera) share a similar PCR amplification pattern suggests that the deletion of ndhF has likely occurred only once during the early evolution of the genus Ophrys, i.e. immediately before the separation of the two main lineages. The presence of two plastome types (with a different relative representation) across the two lineages represents an unusual case of maintenance of plastid heteroplasmy likely established as consequence of retention of ancestral polymorphism or of plastid capture by hybridization. Both processes have been commonly suggested to explain the unusual genomic admixture detected among Ophrys species as they are characterized by very rapid radiation and recurrent hybridization [64, 65].

Genomic localization of deleted ndhF gene in O. sphegodes nuclear genome

BLAST search of the assembly for the deleted ndhF region from the plastid genome of O. sphegodes found the nuclear scaffold1075174 (length 5,436 bp) with a score of 924 and e-value of 0.0. Reads of whole genome sequencing were mapped against scaffold1075174 to check whether some reads overlap with the junction between plastid deleted region and the remaining part of this scaffold. A total of 124,961 reads mapped on the scaffold. BLASTX search for the scaffold1075174 (after excluding the deleted plastid region) revealed the presence of a reverse transcriptase, a GAG pre-integrase domain, and the gag-polypeptide of LTR copia-type. Twelve reads map on the junction between ndhF and the reverse transcriptase so confirming the connection between the two parts. This result represents a clear indication that the deleted plastid region has been inserted in a retrotransposon nuclear sequence of O. sphegodes (Fig 3). Most of the repetitive DNA in available orchid genomes are gypsy- and copia-like retrotransposons [66] and their activity is likely to significantly contributed to the orchid large genome size [67].
Fig 3

Results of BLASTX search of scaffold1075174 (length 5,436 bp): Putative domain hits are indicated by the colored arrows.

Conclusions

The complete plastid genomes provided here for two taxa from the rapidly evolving orchid genus Ophrys represents a source of novel information that can help resolve evolutionary questions. While the plastid gene order and organization reveal the signal of phylogenetic relationships among main species groups in this genus, the highly variable SSRs and tandem repeats with suitable level of intraspecific variation can be used as markers in phylogeographic and speciation studies among those closely related species. These relationships can now be explored with the novel genomic resources available today.

Distribution of simple sequence repeat (SSR) in Ophrys sphegodes and O. iricolor plastid genomes.

IGS: intergenic spacer. (DOCX) Click here for additional data file.

List of RNA editing sites predicted in protein-coding genes of Ophrys plastomes using PREPACT program.

High dashes indicate absence of RNA editing, * stop codon. (DOCX) Click here for additional data file.

Positive selection sites identified with selecton with d.f. = 1.

“Null” and “positive” columns list likelihood values obtained under the models M8a (null model) and M8 (positive selection), respectively. (DOCX) Click here for additional data file.

PCR validation of the ndhF deletion.

PCR amplifications using (a) F1 and R1 primers; (b) F2 and R2 primers. M = marker II (λ DNA / Hind III digested); 1 = O. fusca Campania, 2 = O. fusca Tuscany, 3 = O. iricolor Greece; 4 = O. sphegodes Campania; 5 = O. sphegodes Apulia; 6 = O. incubacea Apulia; 7 = O. insectifera Spain. A dotted line represents the IRB-SSC junction. Primer sequences: F1: 5’—GCTCCGTTCCATGCCTCATT– 3’ R1: 5’–TCGTCGTATGTGGGCTTTCC– 3’ F2: 5’–TTAGCAATTGCACCGACAAA– 3’ R2: 5’–TCTGTTTCCACCGGACAG– 3’. (TIFF) Click here for additional data file. Molecular evolution analyses of Ophrys plastid genes: a) divergence of protein-coding genes (gene divergence was estimated by the sum of total branch lengths in each gene tree inferred, mean ± SD); b) number of putative sites under positive selection. (TIF) Click here for additional data file.

Comparison of gene rearrangements in the plastid genomes among 10 species representative of the five Orchid subfamilies.

Genes are indicated in the colored boxes. Boxes colors represent gene families: purple = photosystem I; yellow = photosystem II; orange = NADH-dehydrogenase; light green = ribosome large subunit; light blue = ribosome small subunit; light red = rubisco subunit; red = RNA polymerase; green = ATP synthase; pink = cytochrome b/f complex; dark grey = acetyl-CoA carboxylase; light grey = hypothetical plastid reading frame (ycf series), protease, translation initiation factor IF-1; dark purple = maturase; dark blue = envelope membrane protein. (TIFF) Click here for additional data file.

Dot-plot analyses of Ophrys sphegodes and O. iricolor plastid genomes using Mummer software.

A positive slope indicates that compared sequences are in the same orientation; a negative slope indicates that compared sequences can be aligned, but their orientation is opposite. Red: Sequences in the same direction; Blue: inversions. (JPG) Click here for additional data file.
  58 in total

1.  The evolution of chloroplast RNA editing.

Authors:  Michael Tillich; Pascal Lehwark; Brian R Morton; Uwe G Maier
Journal:  Mol Biol Evol       Date:  2006-07-11       Impact factor: 16.240

2.  Complete chloroplast genome sequences of Solanum commersonii and its application to chloroplast genotype in somatic hybrids with Solanum tuberosum.

Authors:  Kwang-Soo Cho; Kyeong-Sik Cheon; Su-Young Hong; Ji-Hong Cho; Ju-Seong Im; Manjulatha Mekapogu; Yei-Soo Yu; Tae-Ho Park
Journal:  Plant Cell Rep       Date:  2016-07-14       Impact factor: 4.570

3.  Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns.

Authors:  Robert K Jansen; Zhengqiu Cai; Linda A Raubeson; Henry Daniell; Claude W Depamphilis; James Leebens-Mack; Kai F Müller; Mary Guisinger-Bellian; Rosemarie C Haberle; Anne K Hansen; Timothy W Chumley; Seung-Bum Lee; Rhiannon Peery; Joel R McNeal; Jennifer V Kuehl; Jeffrey L Boore
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-28       Impact factor: 11.205

4.  Complete chloroplast genome of Oncidium Gower Ramsey and evaluation of molecular markers for identification and breeding in Oncidiinae.

Authors:  Fu-Hui Wu; Ming-Tsair Chan; De-Chih Liao; Chen-Tran Hsu; Yi-Wei Lee; Henry Daniell; Melvin R Duvall; Choun-Sea Lin
Journal:  BMC Plant Biol       Date:  2010-04-16       Impact factor: 4.215

Review 5.  Genomics and chloroplast evolution: what did cyanobacteria do for plants?

Authors:  John A Raven; John F Allen
Journal:  Genome Biol       Date:  2003-03-03       Impact factor: 13.583

6.  GeSeq - versatile and accurate annotation of organelle genomes.

Authors:  Michael Tillich; Pascal Lehwark; Tommaso Pellizzer; Elena S Ulbricht-Jones; Axel Fischer; Ralph Bock; Stephan Greiner
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

7.  Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids.

Authors:  Zhitao Niu; Jiajia Pan; Shuying Zhu; Ludan Li; Qingyun Xue; Wei Liu; Xiaoyu Ding
Journal:  Front Plant Sci       Date:  2017-10-04       Impact factor: 5.753

8.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

9.  OrganellarGenomeDRAW--a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets.

Authors:  Marc Lohse; Oliver Drechsel; Sabine Kahlau; Ralph Bock
Journal:  Nucleic Acids Res       Date:  2013-04-22       Impact factor: 16.971

10.  Selecton 2007: advanced models for detecting positive and purifying selection using a Bayesian inference approach.

Authors:  Adi Stern; Adi Doron-Faigenboim; Elana Erez; Eric Martz; Eran Bacharach; Tal Pupko
Journal:  Nucleic Acids Res       Date:  2007-06-22       Impact factor: 16.971

View more
  8 in total

1.  A Phylogenomic Analysis of the Floral Transcriptomes of Sexually Deceptive and Rewarding European Orchids, Ophrys and Gymnadenia.

Authors:  Laura Piñeiro Fernández; Kelsey J R P Byers; Jing Cai; Khalid E M Sedeek; Roman T Kellenberger; Alessia Russo; Weihong Qi; Catharine Aquino Fournier; Philipp M Schlüter
Journal:  Front Plant Sci       Date:  2019-11-29       Impact factor: 5.753

2.  The complete chloroplast genome sequence of Platanthera chlorantha (Orchidaceae).

Authors:  Félix Lallemand; Michał May; Anna Ihnatowicz; Marcin Jąkalski
Journal:  Mitochondrial DNA B Resour       Date:  2019-07-22       Impact factor: 0.658

3.  Characterization of the complete plastome of Ophrys aveyronensis, a Euro-Mediterranean orchid with an intriguing disjunct geographic distribution.

Authors:  Joris A M Bertrand; Anaïs Gibert; Christel Llauro; Olivier Panaud
Journal:  Mitochondrial DNA B Resour       Date:  2019-09-26       Impact factor: 0.658

4.  The complete chloroplast genome sequence of Dactylorhiza majalis (Rchb.) P.F. Hunt et Summerh. (Orchidaceae).

Authors:  Michał May; Alžběta Novotná; Julita Minasiewicz; Marc-Andre Selosse; Marcin Jąkalski
Journal:  Mitochondrial DNA B Resour       Date:  2019-09-02       Impact factor: 0.658

5.  Comparative plastome analysis of Musaceae and new insights into phylogenetic relationships.

Authors:  Ning Fu; Meiyuan Ji; Mathieu Rouard; Hai-Fei Yan; Xue-Jun Ge
Journal:  BMC Genomics       Date:  2022-03-21       Impact factor: 3.969

6.  Plastome structure of 8 Calanthe s.l. species (Orchidaceae): comparative genomics, phylogenetic analysis.

Authors:  Consolata Nanjala; Vincent Okelo Wanga; Wyclif Odago; Elizabeth Syowai Mutinda; Emmanuel Nyongesa Waswa; Millicent Akinyi Oulo; Elijah Mbandi Mkala; Josiah Kuja; Jia-Xin Yang; Xiang Dong; Guang-Wan Hu; Qing-Feng Wang
Journal:  BMC Plant Biol       Date:  2022-08-03       Impact factor: 5.260

7.  Target Nuclear and Off-Target Plastid Hybrid Enrichment Data Inform a Range of Evolutionary Depths in the Orchid Genus Epidendrum.

Authors:  Carolina Granados Mendoza; Matthias Jost; Eric Hágsater; Susana Magallón; Cássio van den Berg; Emily Moriarty Lemmon; Alan R Lemmon; Gerardo A Salazar; Stefan Wanke
Journal:  Front Plant Sci       Date:  2020-01-29       Impact factor: 5.753

8.  Plastome Evolution and Phylogeny of Orchidaceae, With 24 New Sequences.

Authors:  Young-Kee Kim; Sangjin Jo; Se-Hwan Cheon; Min-Jung Joo; Ja-Ram Hong; Myounghai Kwak; Ki-Joong Kim
Journal:  Front Plant Sci       Date:  2020-02-21       Impact factor: 5.753

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.