Literature DB >> 32282915

Unprecedented Intraindividual Structural Heteroplasmy in Eleocharis (Cyperaceae, Poales) Plastomes.

Chaehee Lee1, Tracey A Ruhlman1, Robert K Jansen1,2.   

Abstract

Plastid genomes (plastomes) of land plants have a conserved quadripartite structure in a gene-dense unit genome consisting of a large inverted repeat that separates two single copy regions. Recently, alternative plastome structures were suggested in Geraniaceae and in some conifers and Medicago the coexistence of inversion isomers has been noted. In this study, plastome sequences of two Cyperaceae, Eleocharis dulcis (water chestnut) and Eleocharis cellulosa (gulf coast spikerush), were completed. Unlike the conserved plastomes in basal groups of Poales, these Eleocharis plastomes have remarkably divergent features, including large plastome sizes, high rates of sequence rearrangements, low GC content and gene density, gene duplications and losses, and increased repetitive DNA sequences. A novel finding among these features was the unprecedented level of heteroplasmy with the presence of multiple plastome structural types within a single individual. Illumina paired-end assemblies combined with PacBio single-molecule real-time sequencing, long-range polymerase chain reaction, and Sanger sequencing data identified at least four different plastome structural types in both Eleocharis species. PacBio long read data suggested that one of the four E. dulcis plastome types predominates.
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  RDR; chloroplast; homologous recombination; plastid genome; rearrangement; repeat

Mesh:

Year:  2020        PMID: 32282915      PMCID: PMC7426004          DOI: 10.1093/gbe/evaa076

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Plastids are one of the most essential features in photosynthetic eukaryotes. Plastid genomes (plastomes) of land plants have a conserved quadripartite structure in a gene-dense unit genome consisting of large and small single copy (LSC and SSC; ∼80 and 20 kb) regions separated by a large inverted repeat (IRA and IRB; ∼25 kb) (Wicke et al. 2011; Ruhlman and Jansen 2014). Plastomes of most land plants range from 120 to 170 kb in size with a median size and GC content of 154 kb and 37.6%, respectively (Wicke et al. 2011; Weng et al. 2014; Park et al. 2018). After massive transfer of plastid DNA to the host nucleus during symbiogenesis, most plastomes retain genes encoding ∼80 proteins, 30 tRNAs, and 4 rRNAs. Although variation in plastome size, structure, and gene content is uncommon in most photosynthetic taxa, extensive variation has been documented in several angiosperm lineages (Ruhlman and Jansen 2018). Expansion, contraction, and loss of the IR are the main factors causing plastome size variation, and considerable variation in IR extent has been observed in diverse lineages, including Geraniaceae (Guisinger et al. 2011; Blazier, Jansen, et al. 2016; Zhang et al. 2016; Weng et al. 2017; Ruhlman and Jansen 2018), Fabaceae (Dugas et al. 2015; Schwarz et al. 2015), Ericaceae (Fajardo et al. 2013; Martínez-Alberola et al. 2013), Berberidaceae (Ma et al. 2013), Trochodendraceae (Sun et al. 2013), Plantaginaceae (Zhu et al. 2016), Cactaceae (Sanderson et al. 2015; Solórzano et al. 2019), Annonaceae (Blazier, Ruhlman, et al. 2016), Campanulaceae (Knox 2014; Cheon et al. 2017), Papaveraceae (Park et al. 2018), and Passifloraceae (Rabah et al. 2019; Shrestha et al. 2019). The plastome of Pelargonium transvaalense is ∼243 kb in size due to a remarkably expanded IR (∼88 kb), leaving relatively small LSC (∼60 kb) and SSC (∼7 kb) regions (Weng et al. 2017). Similar to plastome size variation, examples of gene and intron losses have been documented in several angiosperm families. The loss of the entire suite of ndh genes in Erodium (Blazier et al. 2011), Orchidaceae (Chang et al. 2006; Wu et al. 2010; Delannoy et al. 2011; Barrett et al. 2014; Kim et al. 2015; Ruhlman et al. 2015), Alismatales (Peredo et al. 2013), and Carnegiea (Sanderson et al. 2015) and loss of accD, clpP, rpl20, rps7, and ycf1 genes and the introns of atpF, rpoC1, and clpP in Passiflora (Jansen et al. 2007; Rabah et al. 2019; Shrestha et al. 2019) are examples. The monocot family Poaceae (Poales) also experienced gene and intron losses, such as accD, ycf1, and ycf2 genes and the introns of clpP and rpoC1 (Hiratsuka et al. 1989; Maier et al. 1995; Morris and Duvall 2010). These changes do not occur in basal lineages of Poales (Bromeliaceae and Typhaceae) (Stevens 2001), which retain the structure and content of conserved plastomes (Guisinger et al. 2010; Redwan et al. 2015; Poczai and Hyvönen 2017). Although most plastomes contain very small amounts of repetitive DNA apart from the canonical IR, large numbers of repeats have been identified in several eudicot and monocot lineages with highly rearranged plastomes (Lee et al. 2007; Cai et al. 2008; Haberle et al. 2008; Guisinger et al. 2011; Zhang et al. 2011; Weng et al. 2014). In some of these, a significant positive correlation between the degree of genomic rearrangements and the number of repeats was detected (Guisinger et al. 2011; Weng et al. 2014). Repetitive DNA in plastomes plays a central role in genomic rearrangements and sequence divergence via illegitimate recombination and slipped-strand mispairing (Asano et al. 2004; Rogalski et al. 2006; Timme et al. 2007; Gray et al. 2009; Zhang et al. 2011). Although the large canonical IR is well known to reverse the polarity of SC regions through recombination (Kolodner and Tewari 1979; Bendich 2004), other inversions have been associated with short IRs (Kim and Lee 2005; Schwarz et al. 2015; Rabah et al. 2019). The typical IR does not occur in conifer plastomes but lineage-specific, short IRs are associated with large inversions. Since Tsumura et al. (2000) discovered inversion isomers associated with ∼1.2-kb IR in Pinaceae, examples of repeat associated isomeric plastomes within a single species or individual have been documented in several gymnosperm lineages including Pinaceae, Cupressaceae, Podocarpaceae, and Taxaceae (Wu et al. 2011; Guo et al. 2014; do Nascimento Vieira et al. 2016; Hsu et al. 2016; Qu et al. 2017; Fu et al. 2019). In angiosperms, two distinct plastome configurations that differed in the orientation of a ∼45-kb segment were detected among accessions of the legume Medicago truncatula with verification using DNA gel blot analysis (Gurdon and Maliga 2014). Recently, with the aid of PacBio long reads, alternative plastome arrangements were identified within the plastome of a single individual of Monsonia emarginata (Geraniaceae), and these structural variants were associated with large repeats (Ruhlman et al. 2017). Approximately 4,000 land plant plastomes have been sequenced and are publicly available (accessed on October 2, 2019 from NCBI). However, taxon sampling is biased in favor of certain major groups, limiting the knowledge of plastome organization and evolution in many lineages. For example, despite 616 sequenced plastomes of the order Poales available in NCBI, just eight belong to the non-Poaceae lineages Bromeliaceae, Typhaceae, Eriocaulaceae, Cyperaceae, and Joinvilleaceae (Guisinger et al. 2010; Redwan et al. 2015; Wysocki et al. 2016). Among 14 Poales families (APG IV 2016), Cyperaceae (sedges) is the second largest with more than 5,500 species and a cosmopolitan distribution. However, little is known about plastome organization and evolution within the family, which is represented by three plastomes in NCBI that have not yet been published. In this study, plastome sequences of two Eleocharis species were completed. In contrast to basal Poales, Eleocharis plastomes exhibited multiple structural types within a single individual, unprecedented levels of genome rearrangement and repetitive DNA, and extensive gene loss and duplication.

Materials and Methods

Plant Materials and DNA Extraction

Corms of water chestnut (Eleocharis dulcis, accession # PI 106274) were provided by the U.S. National Plant Germplasm System in the United States Department of Agriculture and Gulf coast spikerush, Eleocharis cellulosa, was collected in west Texas (supplementary table S1, Supplementary Material online). Corms of E. dulcis and live plants of E. cellulosa were grown in the University of Texas at Austin (UT-Austin) greenhouses. Because culms (a hollow or pithy stalk or stem of grasses, sedges, and rushes) play the primary role in photosynthesis due to highly reduced leaves modified into a tubular sheath at the base of culms, total genomic DNA was extracted from newly emergent culms using the cetyl trimethylammonium bromide method of Doyle and Doyle (1987) with modifications that included the addition of 3% polyvinylpyrrolidone (PVP; Sigma, St. Louis, Mo, USA) and 3% beta-mercaptoethanol (Sigma, St Louis, MO). Following treatment with RNase A (ThermoScientific, Lafayette, CO) and phase separation with chloroform, DNA was recovered by ethanol precipitation, resuspended in DNase-free water, and stored at –20 °C for genome sequencing. DNA of E. dulcis but not E. cellulosa was extracted from a single individual. Voucher specimens of E. dulcis and E. cellulosa were deposited in the Billie L. Turner Plant Resources Center (TEX-LL) at UT-Austin (supplementary table S1, Supplementary Material online).

Genome Sequencing, Assembly, and Annotation

Genomic DNAs were sent to the Beijing Genomics Institute for library preparation and Illumina sequencing. Approximately 40 million 150-bp, paired-end (PE) reads were generated on the Illumina HiSeq X-Ten sequencing platform (Illumina, San Diego, CA) and reads were assembled de novo using Velvet version 1.2.08 (Zerbino and Birney 2008) at the Texas Advanced Computing Center. Multiple de novo assemblies were performed with a range of k-mer sizes (intervals of four from 61 to 117), three minimum depths of coverage (200×, 500×, and 1,000×), default insert size estimation, and scaffolding off. Plastid contigs from all assemblies were imported into Geneious v.9.1.8 (http://www.geneious.com) (Kearse et al. 2012) and all protein-coding genes were identified by BLAST against reference plastomes of Nicotiana tabacum (NC_001879), Typha latifolia (NC_013823), Ananas comosus (NC_026220), and Hypolytrum nemorum (NC_036036) and by GeSeq with MPI-MP chloroplast references (Tillich et al. 2017). Once draft plastomes were assembled, all plastid contigs were mapped to drafts to identify the IR boundaries and mismatches among contigs. To validate any ambiguities and determine the IR boundaries, Bowtie2 v.2 (Langmead and Salzberg 2012) was used to map the Illumina reads to draft plastomes from which IRa had been removed. Verification of protein-coding genes was manually performed in Geneious by aligning genes with their homologs from the plastome sequences listed above as well as from Spinacia oleracea (NC_002202) and Arabidopsis thaliana (NC_000932). Because the H. nemorum plastome is publicly available in NCBI but has not yet been published, annotations were carefully examined and manually modified following the method described above prior to use in comparative analyses. All transfer RNA genes were identified using tRNAscan-SE v.2.0 (Lowe and Chan 2016). In Eleocharis and H. nemorum plastomes, genes that were annotated with low similarity (25–50%) to reference genes were further examined and regarded as putatively missing or nonfunctional based on the following criteria used in Rabah et al. (2019): absence of an open-reading frame with a complete conserved domain as searched in conserved domain database (www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) and presence of internal stop codons that interrupted the reading frame or the conserved domain. Linear plastome maps were drawn with OGDRAW v. 1.2 (Lohse et al. 2013). For repeat visualization, circular maps were drawn using Circoletto (Darzentas 2010). Whole-genome alignments were performed to examine the arrangement of locally colinear blocks (LCBs) of different plastome structural types of E. dulcis and E. cellulosa and to estimate the genomic rearrangements of Eleocharis plastomes compared with the basal Poales, T. latifolia, as a reference (supplementary fig. S1, Supplementary Material online) using progressiveMauve 2.3.1 in Geneious (Darling et al. 2010) with default parameters. One copy of the IR (IRa) was removed from plastomes prior to the Mauve alignment. The LCBs of E. dulcis, E. cellulosa, H. nemorum, A. comosus, and T. latifolia plastomes were manually numbered and annotated for genes within each LCB. Strand orientation was indicated with “–”, if the LCB was in the reverse orientation relative to the reference. Breakpoint (BP) and reversal distances were estimated using the web-based application CREx (Common Interval Rearrangement Explorer) (Bernt et al. 2007) with T. latifolia as a reference.

Long-Range Polymerase Chain Reaction and Sanger Sequencing

Nine oligonucleotide primers were designed for long-range polymerase chain reaction (PCR) using Primer 3 (supplementary table S2, Supplementary Material online) (Untergasser et al. 2012). Primer sequences were located in genes (petN, petD, rps2, rpoC2, rpoB, psbK, rpl20, rps8, and ndhJ) situated at the ends of syntenic blocks to confirm plastome assemblies. In addition, 11 primers were designed to perform nested PCR using amplicons of targeted regions (supplementary table S2, Supplementary Material online). Long-range PCR with multiple primer combinations was performed using the high-fidelity TaKaRa PrimeStar GXL DNA Polymerase (Takara Bio USA, Inc., Mountain View, CA). PCR reactions were performed in 12.5 µl, including 7 µl distilled water, 2.5 µl 5× PrimeSTAR GXL Buffer, 1 µl dNTP mixture, 0.5 µl DNA polymerase, 0.5 µl each primer, and 0.5 µl genomic DNA (∼50 ng). Each target was amplified using variable conditions depending on the size: initial denaturation at 98 °C for 1 min, followed by 30 cycles of denaturation at 98 °C for 10 s, annealing at 58–62 °C for 15 s, and extension at 68 °C for 50–150 s. PCR products were treated with Exonuclease I (New England BioLabs, Ipswich, MA) and Shrimp Alkaline Phosphatase (Fermentas, Glen Burnie, MD) to remove residual primers and PCR amplicons were Sanger sequenced at the Genome Sequencing and Analysis Facility at UT-Austin.

PacBio Sequencing

Genomic DNA of E. dulcis with a total mass of 13.1 μg was sent to Beijing Genomics Institute for 20-kb PacBio library construction and single-molecule real-time (SMRT) sequencing (Pacific Biosciences, Menlo Park, CA). PacBio long reads were generated from one SMRT cell on RSII system. Error correction of long reads was performed using the LSC v.2.0 tool (Au et al. 2012) with ∼46 million Illumina PE reads generated from DNA of the same E. dulcis individual. Error-corrected reads were mapped to complete plastome monomers and to genes at the end of each Illumina/Velvet-assembled syntenic block to detect reads that may support the presence of multiple structural types.

Repetitive DNA Sequence Analysis

Repeat analyses were performed to calculate the content of tandem and dispersed repeats. One copy of the IR (IRa) was removed from plastomes prior to repeat analyses. Dispersed repeats were detected by using the command line version BLAST v.2.8.1+ (Altschul et al. 1990) with a word size of 16 and percent identity of 80.0% and each Eleocharis, H. nemorum, A. comosus, and T. latifolia plastomes as both query and subject. BLAST results were carefully examined and duplicate repeats were eliminated. Tandem repeats were identified by using the web version of Tandem Repeats Finder v.4.09 with default parameters (Benson 1999). Total repeats were categorized into several size classes based on the length of BLAST hits. The proportion of total repeats and repeats by each size class was calculated for each plastome (length of repeats/length of plastome [without IRa] × 100).

Results

Multiple Structural Types in Eleocharis Plastomes

Plastome contigs assembled de novo from E. dulcis and E. cellulosa Illumina PE reads were very complex and suggested multiple arrangements of syntenic blocks (fig. 1 and supplementary figs. S2 and S3, Supplementary Material online). All possible arrangements of the syntenic blocks were assembled to yield the greatest number of unique draft plastome monomers. Careful examination of draft assemblies suggested four structural types that vary with respect to the arrangement of syntenic blocks in the LSC region.
. 1.

—Schematic representations of Eleocharis dulcis and E. cellulosa plastomes. (A) Unit-genome maps and repetitive DNA content of E. dulcis and E. cellulosa plastomes. Completed Eleocharis plastome sequences were submitted to OGDRAW (Lohse et al. 2013) to generate physical maps and Circoletto (Darzentas 2010) to visualize repetitive DNA. Structural type and plastome size (in parentheses) are shown below the species name. Syntenic blocks (numbered 1–6) detected by progressiveMauve (Darling et al. 2010) are depicted by open boxes; the negative symbol (–) indicates reverse-oriented strands relative to the reference (Typha). Red boxes indicate blocks that that vary in order among different structural types, whereas blocks encompassed by black boxes maintain their positions across types. Genes indicated in red font are located near the end of each syntenic block and were employed as annealing sites for PCR confirmation of plastome arrangements. The vertical black bar at the top of each linear map is provided for scale. The circular representation of each species is below each linear map with syntenic block numbers shown. Dispersed repeats and IR are shown within the circular map in blue and red, respectively. (B) Each syntenic block is illustrated with a different color and follows the same numbering convention as in (A). Representative genes near the end of each syntenic block are shown at the top. Gene symbols in white font indicate those in long-range PCR and correspond to the red gene symbols in (A). Different syntenic block arrangements are highlighted in red boxes for both Eleocharis species. IR, inverted repeat; SSC, small single copy region.

—Schematic representations of Eleocharis dulcis and E. cellulosa plastomes. (A) Unit-genome maps and repetitive DNA content of E. dulcis and E. cellulosa plastomes. Completed Eleocharis plastome sequences were submitted to OGDRAW (Lohse et al. 2013) to generate physical maps and Circoletto (Darzentas 2010) to visualize repetitive DNA. Structural type and plastome size (in parentheses) are shown below the species name. Syntenic blocks (numbered 1–6) detected by progressiveMauve (Darling et al. 2010) are depicted by open boxes; the negative symbol (–) indicates reverse-oriented strands relative to the reference (Typha). Red boxes indicate blocks that that vary in order among different structural types, whereas blocks encompassed by black boxes maintain their positions across types. Genes indicated in red font are located near the end of each syntenic block and were employed as annealing sites for PCR confirmation of plastome arrangements. The vertical black bar at the top of each linear map is provided for scale. The circular representation of each species is below each linear map with syntenic block numbers shown. Dispersed repeats and IR are shown within the circular map in blue and red, respectively. (B) Each syntenic block is illustrated with a different color and follows the same numbering convention as in (A). Representative genes near the end of each syntenic block are shown at the top. Gene symbols in white font indicate those in long-range PCR and correspond to the red gene symbols in (A). Different syntenic block arrangements are highlighted in red boxes for both Eleocharis species. IR, inverted repeat; SSC, small single copy region. Although the Illumina PE reads mapped to the four plastome types with ∼1,900× and 1,080× depth of coverage for E. dulcis and E. cellulosa (supplementary table S3, Supplementary Material online), respectively, these data were generated from 300- to 350-bp insert libraries and therefore could not confirm the presence of different plastome types. To address this and to confirm the putative multiple types, long-range PCR was performed for both species. Long-range PCR amplification provided strong evidence for multiple structural conformations of Eleocharis plastomes (fig. 1, table 1, and supplementary figs. S4 and S5, Supplementary Material online). All but 3 of the 20 unique junction regions among syntenic blocks 2–6 for each plastome type were amplified using multiple combinations of primers designed to anneal in nine genes located at the boundaries of each block (fig. 1 and supplementary table S2, Supplementary Material online). For E. dulcis, all junction regions were amplified and most of PCR amplifications yielded fragments of expected sizes, which ranged from ∼4.4 to ∼11 kb, confirming the presence of four plastome structural types (table 1). Although PCR of the 2–5 and (−3)–4 junctions exhibited two amplicons of different sizes, those with the stronger intensity were consistent with the expected sizes. Three fragments that were amplified from the 2–3 junction showed a similar intensity and one of them was congruent with the expected size. For E. cellulosa, all junctions for plastome types 1 and 4 were amplified with fragments of expected sizes except for the 5–6 and 4–6 junctions (table 1). Plastome types 2 and 3 were supported with PCR amplifications for 2 and 3 junctions. Unexpectedly, (−4)–5 and 5–6 junctions found only in E. cellulosa plastome types 1 and 4 were also amplified in E. dulcis (supplementary fig. S6 and table S4, Supplementary Material online). The sizes of each PCR amplicon from E. dulcis (∼8 kb and ∼4.8/∼3 kb) were very close to those from E. cellulosa (∼8.5 kb and ∼4.5 kb).
Table 1

Verification of Multiple Structural Types in Eleocharis Plastomes

SpeciesTypeSyntenic BlockPrimer FPrimer RSize (bp)PCRPCR Band Size (bp)Sanger Seq.PacBio Seq.
Eleocharis dulcis 12 – 3E_petNE_petD8,188+5,500/6,200/8,100+
3 – (−4)E_rps2E_rpoB4,842+∼5,000+a
(−4) – (−5)E_rpoC2E_rpl204,368+∼4,300+a
(−5) – 6E_psbKE_rps88,550+∼8,500
22 – 5E_petNE_psbK11,127+ ∼11,000 b /∼6,200+
5 – (−3)E_rpl20E_rps24,381+∼4,300+a+
(−3) – 4E_petDE_rpoC25,261+ ∼5,000 b /∼3,000+c+
4 – 6E_rpoBE_rps85,181+∼5,000+c+
32 – 5E_petNE_psbK11,128+ ∼11,000 b /∼6,200+
5 – 4E_rpl20E_rpoC24,368+∼4,300+a
4 – (−3)E_rpoBE_rps24,842+∼5,000+a
(−3) – 6E_petDE_rps85,610+∼5,500+d
42 – (−4)E_petNE_rpoB4,869+∼4,000+c
(−4) – 3E_rpoC2E_petD5,257+∼4,300+c+
3 – (−5)E_rps2E_rpl204,381+∼4,300+a+
(−5) – 6E_psbKE_rps88,550+∼8,500
Eleocharis cellulosa 1(−2) – 3E_ndhJE_petD5,387+∼5,400n/an/a
3 – (−4)E_rps2E_rpoB4,816+∼5,000n/an/a
(−4) – 5E_rpoC2E_psbK9,000+∼8,500n/an/a
5 – 6E_rpl20E_rps83,025+∼4,500n/an/a
2(−2) – 5E_ndhJE_psbK8,326n/an/a
5 – 4E_rpl20E_rpoC22,685+∼2,700n/an/a
4 – (−3)E_rpoBE_rps24,846+∼4,500n/an/a
(−3) – 6E_petDE_rps86,400n/an/a
3(−2) – 3E_ndhJE_petD5,387+∼5,000n/an/a
3 – (−4)E_rps2E_rpoB4,816+∼4,500n/an/a
(−4) – (−5)E_rpoC2E_rpl202,685+∼2,700n/an/a
(−5) – 6E_psbKE_rps89,339n/an/a
4(−2) – 3E_ndhJE_petD5,387+∼5,000n/an/a
3 – (−5)E_rps2E_rpl204,346+∼4,300n/an/a
(−5) – 4E_psbKE_rpoC29,001+∼8,000n/an/a
4 – 6E_rpoBE_rps83,495/2,312+∼5,000n/an/a

Note.—+ indicates that adjacencies of syntenic blocks were confirmed. F, forward; R, reverse; n/a, not available.

Entire junction was sequenced.

Band size in bold indicates the one with higher intensity.

Boundaries of junction were sequenced.

Almost entire junction was sequenced.

Verification of Multiple Structural Types in Eleocharis Plastomes Note.—+ indicates that adjacencies of syntenic blocks were confirmed. F, forward; R, reverse; n/a, not available. Entire junction was sequenced. Band size in bold indicates the one with higher intensity. Boundaries of junction were sequenced. Almost entire junction was sequenced. To further examine the E. dulcis plastome, the PCR amplicons for several junction regions were Sanger sequenced (table 1) and boundary sequences were compared with draft plastome assemblies from Illumina data. DNA sequences of seven junction boundaries consistently mapped to the expected regions in each plastome type. Among seven junctions, three, 3–(−4), (−4)–(−5), and 5–(−3), were fully sequenced by nested PCR using the PCR amplicons as templates with primers designed to anneal near the 3′ end (supplementary table S2, Supplementary Material online). One other region, (−3)–6, was almost fully sequenced. Long read PacBio SMRT sequencing was performed for E. dulcis. A total of 68,167 raw PacBio reads with the mean size of 10,035 bp was generated. Following correction with Illumina sequences, the read count was reduced to 61,379 with the mean size of 9,139 bp and mean GC content of 35.7%. Mapping of long reads to each plastome type, emphasizing reads containing genes situated at the end of each syntenic block, showed that the majority of reads were consistent with the plastome type 2 except for four reads. Specifically, when corrected long reads were mapped to one end of syntenic block 2, where petN gene is located, 119 of 123 mapped reads were consistent with 2–5 syntenic block junction, but four reads that showed partial inconsistency with plastome type 2 had the sequences of 2–3 junction (adjacency pair of petN–petD genes) in plastome type 1, confirming the presence of multiple structural types in a single individual of E. dulcis.

Plastome Features

Minimal size variation was detected among the four plastome types in E. dulcis and E. cellulosa, ranging from 196,668 to 199,561 bp and 192,023 to 193,234 bp, respectively (supplementary table S5, Supplementary Material online), all of which were larger than the median plastome size (152 kb) for 3,656 angiosperms (accessed on October 2, 2019 from NCBI). Hypolytrum nemorum (NC_036036), representing a basal lineage of Cyperaceae, had a plastome size of 180,648 bp with the IR expanded to ∼ 38 kb and was also larger than the median angiosperm plastome size but smaller than Eleocharis. The IR and SSC regions of both Eleocharis species were similar in size (IR: ∼36 kb, E. dulcis and ∼35 kb, E. cellulosa; SSC: ∼10 kb) with only seven genes (ndhF, trnL-UAG, rpl32, trnT-UGU, ndhD, psaC, and ndhE genes) in the SSC (table 2). However, since the size variation in the IR and SSC among different plastome types could not be identified with the current data, minor size variation in LSC region (115,003–117,896 bp, E. dulcis and 111,670–112,881 bp, E. cellulosa) accounted for the overall size variation among four types identified within each species.
Table 2

Summary of Major Features of Eleocharis, Hypolytrum, and Basal Poales Plastid Genomes

Family
Cyperaceae
Bromeliaceae
Typhaceae
Taxon Eleocharis dulcis (Type 2) Eleocharis cellulosa (Type 2) Hypolytrum nemorum Ananas comosus Typha latifolia
Genome size (bp)199,561193,234180,648159,636161,572
LSC (% of genome)117,896 (59.1)112,881 (58.4)95,644 (52.9)87,482 (54.8)89,140 (55.2)
SSC (% of genome)9,601 (4.8)10,311 (5.3)8,150 (4.5)18,622 (11.7)19,652 (12.2)
IR (% of genome)36,032 (18.1)35,021 (18.1)38,427 (21.3)26,766 (16.8)26,390 (16.3)
Total number of genes139132135131131
Number of unique genes105105110113113
Number of unique protein-coding genes (duplicated in IR)72 (14)72 (12)76 (12)79 (6)79 (6)
Number of unique tRNA genes (duplicated in IR)29 (10)29 (9)30 (8)30 (8)30 (8)
Number of unique rRNA genes (duplicated in IR)4 (4)4 (4)4 (4)4 (4)4 (4)
Number of genes with introns1717181818
GC content (%)32.632.834.937.436.6
GC content of IR/LSC/SSC (%)37.6/30.2/25.537.8/30.4/25.838.5/32.6/28.142.7/35.4/31.442.4/34.4/30.5
Genic DNA (% of genome, GC [%])76,207 (38.2, 38.4)74,394 (38.5, 38.5)98,905 (54.8, 38.1)91,317 (57.2, 40.2)91,212 (56.4, 39.9)
Intergenic spacers (% of genome, GC [%])123,354 (61.8, 29.1)118,840 (61.5, 29.3)81,743 (45.2, 31.1)68,611 (42.8, 33.7)70,541 (43.6, 32.4)
Gene density0.700.680.750.820.81
Putative gene losses accD, clpP, rpl23, rps18, ycf1, ycf2, infA, and trnT-GGU accD, clpP, rpl23, rps18, ycf1, ycf2, infA, and trnT-GGU accD, rps18, and infA
Putative gene duplications rpoA (2) rpl36 (4), trnfM-CAU (6), trnH-GUG (3), trnP-UGG (3), and trnV-GAC (3) rpl36 (2)* trnfM-CAU (3), trnH-GUG (3), and trnV-GAC (3) trnV-GAC (3)

Note.—The number in ( ) in putative gene duplications indicates the number of copies. Asterisk (*) on rpl36 gene indicates gene duplication in only plastome type 4 of E. cellulosa.

Summary of Major Features of Eleocharis, Hypolytrum, and Basal Poales Plastid Genomes Note.—The number in ( ) in putative gene duplications indicates the number of copies. Asterisk (*) on rpl36 gene indicates gene duplication in only plastome type 4 of E. cellulosa. Plastomes of both Eleocharis species contained the same number of unique genes (105; 72 protein-coding genes, 29 tRNA, and 4 rRNA genes), which was fewer than H. nemorum (110; 76 protein-coding genes, 30 tRNA, and 4 rRNA genes) and the highly conserved basal Poales plastomes (113; 79 protein-coding genes, 30 tRNA, and 4 rRNA genes) due to putative gene losses. However, the total number of genes in Eleocharis was greater than basal Poales due to gene duplication events (table 2). The total number of genes was higher in E. dulcis (139) than E. cellulosa (132) due to more extensive duplications (table 2). The plastomes of both species experienced several putative gene losses including accD, clpP, rpl23, rps18, ycf1, ycf2, infA, and trnT-GGU. The plastome of H. nemorum also lacked accD, rps18 and infA and experienced duplication of trnV-GAC (table 2). Gene density (the number of genes per kb) in E. dulcis, E. cellulosa, and H. nemorum plastomes was 0.70, 0.68 and 0.75 and the percentage of the genome that contained coding regions was 38.2%, 38.5%, and 54.8%, respectively, substantially lower than basal Poales (0.82 and 57.2% for A. comosus and 0.81 and 56.4% for T. latifolia) (table 2). The GC content of E. dulcis and E. cellulosa plastomes was low at 32.6% and 32.9%, respectively, compared with the earliest diverging taxon of Cyperaceae, H. nemorum (34.9%). All Cyperaceae had GC values less than the basal Poales T. latifolia (36.6%) and A. comosus (37.4%) and the mean value (37.5%) of 3,656 angiosperm plastomes (fig. 2 and table 2). The IR GC content of E. dulcis (37.6%) and E. cellulosa (37.8%) plastomes was higher than the mean for angiosperm plastome IRs, whereas the GC content of LSC and SSC regions for E. dulcis (30.2% and 25.5%) and E. cellulosa (30.4% and 25.8%) was lower (table 2).
. 2.

—Eleocharis plastomes exhibit atypical size and GC content. Plastome size and GC content of 3,656 angiosperms in the NCBI Genome database were plotted. Parasitic (yellow) and autotrophic (teal) species are indicated with different colors. The two Eleocharis plastomes are labeled with red font. Other Cyperaceae and larger plastomes, including Pelargonium and Annona cherimola, are indicated with teal font. The Erodium plastomes with high GC content are also represented with a teal label. bp, basepairs.

Eleocharis plastomes exhibit atypical size and GC content. Plastome size and GC content of 3,656 angiosperms in the NCBI Genome database were plotted. Parasitic (yellow) and autotrophic (teal) species are indicated with different colors. The two Eleocharis plastomes are labeled with red font. Other Cyperaceae and larger plastomes, including Pelargonium and Annona cherimola, are indicated with teal font. The Erodium plastomes with high GC content are also represented with a teal label. bp, basepairs. Overall, the organization of Eleocharis plastomes was substantially different from the highly conserved plastomes of basal Poales, Bromeliaceae and Typhaceae, whereas the basal taxon in Cyperaceae, H. nemorum, exhibited features intermediate between those of Eleocharis and basal Poales. Whole-genome alignment using progressiveMauve (fig. 3 and supplementary table S6, Supplementary Material online) identified 25 LCBs from five complete plastomes (E. dulcis, E. cellulosa, H. nemorum, A. comosus, and T. latifolia). The two basal Poales plastomes were collinear, whereas Hypolytrum had BP and reversal distances of 11 and 7, respectively compared with basal Poales plastomes (table 3). The BP and reversal distances were 16 and 13 (E. dulcis) or 16 and 14 (E. cellulosa) between Hypolytrum and Eleocharis plastomes, respectively. The largest BP distance (25) was between E. dulcis and basal Poales plastomes, whereas the largest reversal distance (21) was between E. cellulosa and basal Poales. The BP and reversal distances were 5 and 4 between the two Eleocharis species, respectively. Among the four plastome types in each Eleocharis species, six LCBs (1–6 syntenic blocks) were identified excluding small LCBs of large repeats (supplementary fig. S3, Supplementary Material online). Plastomes of both Eleocharis species had different orders of LCBs (3–5 syntenic blocks) in the LSC region and in E. cellulosa LCB 2 was reversed compared with E. dulcis.
. 3.

—Whole-plastome alignment of five Poales species. Newly completed plastomes of Eleocharis dulcis and E. cellulosa and publicly available basal Cyperaceae and Poales plastomes from NCBI (Hypolytrum nemorum, Ananas comosus, and Typha latifolia) were analyzed by progressiveMauve to identify LCBs with the Typha plastome as a reference. One copy of the inverted repeat was removed before the analysis and numerals at top indicate size in kilobases (kb). The corresponding LCBs among five plastomes are shaded and connected with a line of the same color. The histogram inside each block shows pairwise nucleotide sequence identity. LCBs that are flipped across the plane indicate an inverted strand.

Table 3

Pairwise Comparison of Breakpoint and Reversal Distances for Eleocharis, Hypolytrum, and Basal Poales Plastomes

Typha latifolia Ananas comosus Hypolytrum nemorum Eleocharis dulcis Eleocharis cellulosa
Typha latifolia
Ananas comosus 0/0
Hypolytrum nemorum 11/711/7
Eleocharis dulcis 25/2025/2016/13
Eleocharis cellulosa 24/2124/2116/145/4
—Whole-plastome alignment of five Poales species. Newly completed plastomes of Eleocharis dulcis and E. cellulosa and publicly available basal Cyperaceae and Poales plastomes from NCBI (Hypolytrum nemorum, Ananas comosus, and Typha latifolia) were analyzed by progressiveMauve to identify LCBs with the Typha plastome as a reference. One copy of the inverted repeat was removed before the analysis and numerals at top indicate size in kilobases (kb). The corresponding LCBs among five plastomes are shaded and connected with a line of the same color. The histogram inside each block shows pairwise nucleotide sequence identity. LCBs that are flipped across the plane indicate an inverted strand. Pairwise Comparison of Breakpoint and Reversal Distances for Eleocharis, Hypolytrum, and Basal Poales Plastomes

Repetitive DNA in Eleocharis Plastomes

Repeat analyses revealed that Eleocharis plastomes contain abundant repetitive DNA with minor variation among different structural types (supplementary tables S7 and S8, Supplementary Material online). Along with Eleocharis, basal Cyperaceae and Poales taxa were included in repeat analyses. The amount and proportion of dispersed repeats in Eleocharis plastomes (39,752 bp, 24.3% for E. dulcis and 31,118 bp, 19.7% for E. cellulosa) were substantially greater than in H. nemorum (8.8%, 12,520 bp), which in turn were greater than A. comosus (1.1%, 1,495 bp) and T. latifolia (0.9%, 1,210 bp) (table 4).
Table 4

Statistics of Dispersed and Tandem Repeats in Eleocharis, Hypolytrum, and Basal Poales Plastomes

Family
Cyperaceae
Bromeliaceae
Typhaceae
Species Eleocharis dulcis Type 2 Eleocharis cellulosa Type 2 Hypolytrum nemorum Ananas comosus Typha latifolia
Genome size (IRa excluded)163,529158,213142,221132,862134,642
GC %31.631.83436.335.5
Dispersed repeats (DRs)
 Length of DR39,75231,11812,5201,4951,210
 GC % of DR30.930.232.635.933.7
 GC % without DR31.83233.936.335.5
 % of DR in genome24.319.78.81.10.9
Tandem repeats (TRs)
 Length of TR5,8642,8336,6382,0573,270
 GC % of TR25.925.527.818.413.5
 GC % without TR31.831.934.336.636
 % of TR in genome3.61.84.71.52.4
Total repeats
 Length of total repeats42,21632,69516,7183,5524,436
 GC % of total repeats30.429.931.125.719.1
 GC % without total repeats3232.134.336.636
 % of total repeats in genome25.820.711.82.73.3
Statistics of Dispersed and Tandem Repeats in Eleocharis, Hypolytrum, and Basal Poales Plastomes Dispersed repeats were grouped into five size classes (fig. 4). The E. dulcis plastome contained the largest number of dispersed repeats (321) ranging from 17 bp to ∼5 kb followed by E. cellulosa (238; 17 bp to ∼3 kb) and H. nemorum (157; 17 to ∼850 bp), whereas basal Poales had 57 and 47 repeats smaller than 60 bp (fig. 4 and supplementary table S8, Supplementary Material online). Repeats >1,000 bp were only detected in Eleocharis plastomes where they represented more than 10% and 8% of the E. dulcis and E. cellulosa genomes, respectively (fig. 4). The three Cyperaceae plastomes contain larger repeats in the range of 61–1,000 bp relative to other Poales, and these larger repeats constitute a substantial proportion of the plastome (fig. 4). In Eleocharis, many of the larger repeats in the >201 bp size range were concentrated between syntenic blocks 2–6 (fig. 1).
. 4.

—Repetitive DNA content in five Poales plastomes. (A) The number of dispersed repeats in different size classes. (B) The proportion of plastome that represents dispersed repeats in different size classes. bp, basepairs.

—Repetitive DNA content in five Poales plastomes. (A) The number of dispersed repeats in different size classes. (B) The proportion of plastome that represents dispersed repeats in different size classes. bp, basepairs. The greatest proportion of tandem repeats were identified in H. nemorum (6,638 bp; 4.7%) followed by E. dulcis (5,864 bp; 3.6%) (table 4). Unlike dispersed repeats, only E. dulcis plastome contained unusually large amounts of tandem repeats. Eleocharis cellulosa had fewer tandem repeats (2,833 bp; 1.8%) than T. latifolia (3,270 bp; 2.4%). Some tandem repeats were embedded in large dispersed repeats in Eleocharis plastomes, and there were instances where small dispersed repeats were located within tandem repeats. In the E. dulcis plastome, three copies of the largest tandem repeat (278 bp) with 99% sequence identity included trnfM-CAU in the IR. Most tandem repeats were found in intergenic spacers (IGSs) except for occurrences in the introns of several genes (rps16, ycf3, rpoC1, rpl16, and ndhB), including 16 copies of a 48-bp tandem repeat that share 84% sequence identity within the intron of rpoC2 causing an expansion of its size to 5,097 bp. Similar to E. dulcis, most tandem repeats in E. cellulosa were found in IGS regions. Three introns included tandem repeats, the introns of rps16, ndhB, and rpoC2. The rpoC2 gene (4,707 bp) was expanded by 12 copies of a 48-bp tandem repeat with 84% sequence identity within its intron. Three and seven copies of a 48- and 45-bp tandem repeat were identified within rpoC2 (4,707 bp) in H. nemorum. Overall, the exceptionally abundant repeats that enlarged IGS regions contributed to plastome size expansion in Eleocharis. Repetitive DNA also resulted in gene duplications in Eleocharis plastomes. Three copies of the trnfM-CAU were located in the 278-bp tandem repeat of the E. dulcis IR, resulting in six total copies of this sequence. Several other genes were multiplied, including four copies of rpl36 and two copies of rpoA in the LSC, and three copies of trnH-GUG, trnV-GAC, and trnP-UGG (two copies in the IRs). In E. cellulosa, whereas rpl36 was duplicated in a 593-bp dispersed repeat only in the LSC of plastome type 4, all four structural types had three copies of trnfM-CAU, trnH-GUG, and trnV-GAC (two copies in the IRs) located in various sized dispersed repeats (177–456 bp). In H. nemorum, one additional copy of trnV-GAC along with two in the IR was identified in the LSC within an 800-bp dispersed repeat with 98.5% sequence identity.

Discussion

Despite being the second largest family of Poales, plastome organization and evolution of Cyperaceae have not been examined. In this study, plastomes of two Eleocharis species, E. dulcis and E. cellulosa, were atypical with respect to size, gene content, GC content, gene order, and gene density compared with the highly conserved basal Poales plastomes. The most unusual feature was the presence of multiple plastome structural types within each Eleocharis species. Furthermore, plastome structural heteroplasmy was observed within single individuals and was associated with a remarkable abundance of repetitive DNA. The discussion mainly focuses on possible mechanisms that could generate different structural types in plastomes and comparisons to alternative plastome structures (isomeric forms) reported in a few distantly related seed plant lineages. The plastome organization of Eleocharis is also compared with basal Cyperaceae and other Poales.

Structural Heteroplasmy in Eleocharis Plastomes

In two species of Eleocharis, at least four different plastome structural types are present, and in one species, E. dulcis, this unprecedented structural heteroplasmy was identified in a single plant. Initial assemblies using Illumina short reads suggested multiple plastome types, a suggestion confirmed by long-range PCR, Sanger sequencing, and PacBio SMRT data. A few previous studies in conifers and Medicago (Fabaceae) detected isomeric plastomes associated with inversions mediated by lineage-specific short repeats (only for conifers) using a variety of approaches ranging from Southern hybridization to Illumina sequencing and PCR (Tsumura et al. 2000; Wu et al. 2011; Guo et al. 2014; do Nascimento Vieira et al. 2016; Hsu et al. 2016; Qu et al. 2017; Fu et al. 2019). Recently, PacBio long read data combined with Illumina read assembly demonstrated alternative gene order arrangements for the M. emarginata (Geraniaceae) plastome (Ruhlman et al. 2017). Although four major plastome structural types were clearly evident in Eleocharis, long-range PCR data suggested the possibility of a higher degree of heteroplasmy as shown in supplementary table S4, Supplementary Material online. A similar suggestion was made by Ruhlman et al. (2017) for Monsonia. The possibility that the unexpected PCR results may be the result of amplifications of plastid DNA that has been transferred to nuclear or mitochondrial genomes cannot be ruled out. It is also possible that PCR artifacts arising from PCR-mediated recombination (Lahr and Katz 2009; Alverson et al. 2011) could explain the unexpected amplicons. However, consistent sizes between PCR amplicons for most junctions, Sanger sequencing, Illumina assemblies, and PacBio reads strongly support the unprecedented structural heteroplasmy in Eleocharis. For decades, plastomes were considered circular with limited recombination. The presence of isomeric, presumed circular plastomes with different orientations of the single copy regions was attributed to intramolecular recombination between IR copies within a single unit genome and was referred to as “flip-flop” recombination (Palmer 1983, 1985; Brears et al. 1986; Stein et al. 1986). Two previous studies in Pinaceae adopted the flip-flop intramolecular recombination model to explain the different plastome isoforms (Tsumura et al. 2000; Wu et al. 2011), whereas other studies of conifers suggested a repeat-mediated homologous recombination mechanism based on a circular unit genome (Guo et al. 2014; do Nascimento Vieira et al. 2016; Hsu et al. 2016; Qu et al. 2017; Fu et al. 2019). These previous studies ignored the overwhelming evidence that the plastome exists as linear, branched and occasionally circular forms (Deng et al. 1989; Lilly et al. 2001; Oldenburg and Bendich 2004; Scharff and Koop 2006; Shaver et al. 2006). In fact, it has been shown that the different isomers of the SC regions occur by a recombination-dependent-replication (RDR) mechanism involving different unit genomes (Oldenburg and Bendich 2004; Maréchal and Brisson 2010). Thus, any discussion of the mechanism for generating multiple structural types of plastomes must consider RDR as the primary cause. Ruhlman et al. (2017) proposed that RDR between different linear copies of plastomes generated alternative plastome arrangements associated with large IRs (>2 kb) in M. emarginata (see fig. 4 in Ruhlman et al. 2017). The unusually large number and size of repeats (up to ∼5 kb in E. dulcis and ∼3 kb in E. cellulosa) could account for the large number of plastome structural types by providing homologous sequence for the RDR pathway. Many of the large (>1 kb) and intermediate (201–1,000 bp) repeats in Eleocharis plastomes are located between syntenic blocks 3–5 and may be involved with repeat-mediated RDR within or between unit genomes. Structural heteroplasmy may be more prevalent than appreciated, especially in other lineages that have plastomes with highly rearranged gene orders and abundant larger repeats, such as Passiflora, Trifolium, Campanulaceae, and Geraniaceae. The phylogenetic distribution of this phenomenon in Cyperaceae or other Poales is unknown due to limited taxon sampling in the family and order. The publicly available plastome of one of the basal taxa in Cyperaceae, H. nemorum, has not been published and no raw read data is available for analysis. Plastome sequences of two basal lineages, Bromeliaceae and Typhaceae, do not show structural heteroplasmy (Guisinger et al. 2010; Redwan et al. 2015; Poczai and Hyvönen 2017) and two other Poales lineages (Poaceae and Eriocaulaceae) do not exhibit multiple plastome types (Darshetkar et al. 2019; Orton et al. 2019). Additional sampling of plastome sequences in Cyperaceae and related families of cyperids is needed to explore the origin and extent of structural heteroplasmy in this clade.

Plastome Organization

In addition to remarkable structural heteroplasmy, Eleocharis plastomes have a number of unusual genomic characteristics compared with basal Poales, including size expansion, abundant repetitive DNA, higher degree of genome rearrangement, IR expansion, increased IGS, low GC content, low gene density, and numerous gene duplications and losses. In most angiosperms, increases in plastome size are caused by IR expansion as reported in several unrelated lineages, including Pelargonium (IR ∼ 88 kb; Chumley et al. 2006; Weng et al. 2014, 2017), Annona (IR ∼ 64.5 kb; Blazier, Ruhlman, et al. 2016), Lamprocapnos (IR ∼ 51 kb; Park et al. 2018), Asarum (IR ∼ 45 kb; Sinn et al. 2018), and Passiflora (IR ∼ 35 kb; Shrestha et al. 2019). Plastome size expansion in Eleocharis is mainly due to increased number and size of repeats and to a lesser extent IR expansion (tables 2 and 4). Increased plastome size in the basal member of Cyperaceae, H. nemorum, is also caused by these two phenomena. This is in contrast to other members of the Poales (Typha, Ananas, and Eriocaulon), which have typical IRs and much less repetitive DNA and size variation (Guisinger et al. 2010; Redwan et al. 2015; Darshetkar et al. 2019). Eleocharis plastomes have similar SSC/IR boundaries to H. nemorum but they have an independent IR expansion into the LSC. Despite IR expansion into both the SSC and LSC and increased number of repeats in the IGS regions of the IR in Eleocharis plastomes, the size of their IR is smaller than H. nemorum because they lack ycf1 and ycf2. Repeat content and IR expansion have contributed to the increase in plastome size of Eleocharis and both of these factors affect GC content. The IR has higher GC content than SC regions (table 2) so IR expansion should result in higher overall GC content in plastomes. However, Eleocharis plastomes had the lowest GC values (∼32.6%) among photosynthetic angiosperms despite their genome size expansion (fig. 2). Other Cyperaceae have a similar pattern, which is likely due to having more repeats in IGS regions, which are known to be GC poor (table 4, Cai et al. 2006). The prevalence of repeats in the IGS regions also contributed to the lower gene density in Eleocharis and Hypolytrum compared with basal Poales plastomes (table 2). Thus, unlike most other photosynthetic angiosperms, Eleocharis plastomes have experienced increased plastome size driven by IR expansion and accumulation of repeats, and low GC content and gene density. Whole-genome alignment revealed extensive gene order changes in Eleocharis plastomes (fig. 3 and table 3) relative to T. latifolia. The exceptional number of gene order changes is likely related to the abundance of repeats, which facilitate repeat-mediated homologous recombination. The IR was previously suggested to play a role in plastome structural stability based on the presence of extensive rearrangements in IR-lacking groups in legumes (Palmer and Thompson 1982) and Cryptomeria (Hirao et al. 2008). However, many studies failed to find a correlation between plastome stability and the presence of the IR in several angiosperm lineages, including Campanulaceae (Haberle et al. 2008; Knox 2014), Oleaceae (Lee et al. 2007), Ericaceae (Fajardo et al. 2013; Martínez-Alberola et al. 2013), Plantago (Zhu et al. 2016), Geraniaceae (Chumley et al. 2006; Guisinger et al. 2011; Weng et al. 2014; Blazier, Jansen, et al. 2016), and Passifloraceae (Rabah et al. 2019; Shrestha et al. 2019). In Eleocharis and other lineages, the presence of dispersed repeats was more likely the primary factor facilitating plastome rearrangements. Plastomes of basal Poales, Typha (Guisinger et al. 2010) and Ananas (Redwan et al. 2015), as well as Eriocaulon (Darshetkar et al. 2019) have a conserved gene content, similar to the ancestral angiosperm plastome (Ruhlman and Jansen 2014). However, Cyperaceae plastomes have numerous gene duplications and losses (table 2). Many gene duplications are the result of IR expansion into the SSC and LSC. The other duplicated genes are associated with the dispersed repeats. Gene losses are more extensive in Eleocharis than in H. nemorum (table 2). The missing Eleocharis genes have been pseudogenized or lost in several eudicot lineages, including Campanulaceae (Cosner et al. 1997; Haberle et al. 2008; Knox 2014; Cheon et al. 2017), Trifolium (Cai et al. 2008; Magee et al. 2010; Sabir et al. 2014), Ericaceae (Fajardo et al. 2013; Martínez-Alberola et al. 2013), Carnegiea (Sanderson et al. 2015), Silene (Erixon and Oxelman 2008; Sloan et al. 2012), Geraniaceae (Chumley et al. 2006; Guisinger et al. 2011; Weng et al. 2014, 2017; Blazier, Jansen, et al. 2016; Park et al. 2017), and Passifloraceae (Rabah et al. 2019; Shrestha et al. 2019). Plastomes of these groups have abundant repetitive DNA and highly rearranged gene order, a syndrome that is shared with Eleocharis. In monocots, Poaceae plastomes are structurally similar to basal Poales except for three inversions and plastome size reduction caused by several gene losses (Quigley and Weil 1985; Hiratsuka et al. 1989; Doyle et al. 1992; Maier et al. 1995; Michelangeli et al. 2003; Morris and Duvall 2010; Harris et al. 2013). The loss of accD, ycf1, ycf2, clpP, and rpl23 is homoplastic in Poales as the same losses have occurred independently in the distantly related families Poaceae and Cyperaceae (supplementary fig. S1, Supplementary Material online). More plastome sequencing in Cyperaceae lineages is needed to elucidate the evolution of clpP, ycf1, ycf2, and rpl23 in the family. Moreover, transcriptome data would allow the determination of the fate of missing genes in Eleocharis and other related Cyperaceae.

Conclusion

Analyses utilizing Illumina and PacBio SMRT sequences, long-range PCR, and Sanger sequencing data demonstrated unprecedented structural heteroplasmy within a single E. dulcis individual. At least four different plastome structural types are present in both Eleocharis species, and it is likely that this phenomenon is much more widespread in the family. The long-held notion that plastomes are highly conserved in structure within individuals and species must be reconsidered, especially for lineages with a prevalence of large dispersed repeats that are likely to exhibit structural heteroplasmy in plastomes. Click here for additional data file.
  92 in total

1.  A comparative analysis of the Lactuca and Helianthus (Asteraceae) plastid genomes: identification of divergent regions and categorization of shared repeats.

Authors:  Ruth E Timme; Jennifer V Kuehl; Jeffrey L Boore; Robert K Jansen
Journal:  Am J Bot       Date:  2007-03       Impact factor: 3.844

2.  Circoletto: visualizing sequence similarity with Circos.

Authors:  Nikos Darzentas
Journal:  Bioinformatics       Date:  2010-08-24       Impact factor: 6.937

3.  Inverted repeats in chloroplast DNA from higher plants.

Authors:  R Kolodner; K K Tewari
Journal:  Proc Natl Acad Sci U S A       Date:  1979-01       Impact factor: 11.205

4.  The chloroplast genome of Anomochloa marantoidea (Anomochlooideae; Poaceae) comprises a mixture of grass-like and unique features.

Authors:  Leah M Morris; Melvin R Duvall
Journal:  Am J Bot       Date:  2010-03-02       Impact factor: 3.844

5.  Tobacco plastid ribosomal protein S18 is essential for cell survival.

Authors:  Marcelo Rogalski; Stephanie Ruf; Ralph Bock
Journal:  Nucleic Acids Res       Date:  2006-08-31       Impact factor: 16.971

6.  tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes.

Authors:  Todd M Lowe; Patricia P Chan
Journal:  Nucleic Acids Res       Date:  2016-05-12       Impact factor: 16.971

7.  GeSeq - versatile and accurate annotation of organelle genomes.

Authors:  Michael Tillich; Pascal Lehwark; Tommaso Pellizzer; Elena S Ulbricht-Jones; Axel Fischer; Ralph Bock; Stephan Greiner
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

8.  Plastome phylogenomics and characterization of rare genomic changes as taxonomic markers in plastome groups 1 and 2 Poeae (Pooideae; Poaceae).

Authors:  Lauren M Orton; Sean V Burke; Melvin R Duvall
Journal:  PeerJ       Date:  2019-06-03       Impact factor: 2.984

9.  Complete nucleotide sequence of the Cryptomeria japonica D. Don. chloroplast genome and comparative chloroplast genomics: diversified genomic structure of coniferous species.

Authors:  Tomonori Hirao; Atsushi Watanabe; Manabu Kurita; Teiji Kondo; Katsuhiko Takata
Journal:  BMC Plant Biol       Date:  2008-06-23       Impact factor: 4.215

10.  Mimosoid legume plastome evolution: IR expansion, tandem repeat expansions, and accelerated rate of evolution in clpP.

Authors:  Diana V Dugas; David Hernandez; Erik J M Koenen; Erika Schwarz; Shannon Straub; Colin E Hughes; Robert K Jansen; Madhugiri Nageswara-Rao; Martijn Staats; Joshua T Trujillo; Nahid H Hajrah; Njud S Alharbi; Abdulrahman L Al-Malki; Jamal S M Sabir; C Donovan Bailey
Journal:  Sci Rep       Date:  2015-11-23       Impact factor: 4.379

View more
  6 in total

1.  Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences.

Authors:  Yan-Yan Guo; Jia-Xing Yang; Hong-Kun Li; Hu-Sheng Zhao
Journal:  Front Plant Sci       Date:  2021-02-09       Impact factor: 5.753

2.  Comparative Chloroplast Genomics of Sophora Species: Evolution and Phylogenetic Relationships in the Early-Diverging Legume Subfamily Papilionoideae (Fabaceae).

Authors:  Min Liao; Xin-Fen Gao; Jun-Yi Zhang; Heng-Ning Deng; Bo Xu
Journal:  Front Plant Sci       Date:  2021-12-16       Impact factor: 5.753

3.  Organelle Phylogenomics and Extensive Conflicting Phylogenetic Signals in the Monocot Order Poales.

Authors:  Hong Wu; Jun-Bo Yang; Jing-Xia Liu; De-Zhu Li; Peng-Fei Ma
Journal:  Front Plant Sci       Date:  2022-01-31       Impact factor: 5.753

4.  Putting small and big pieces together: a genome assembly approach reveals the largest Lamiid plastome in a woody vine.

Authors:  Luiz Henrique M Fonseca; Alison G Nazareno; Verônica A Thode; Alexandre R Zuntini; Lúcia G Lohmann
Journal:  PeerJ       Date:  2022-04-07       Impact factor: 2.984

5.  Complete chloroplast genome sequence of Lens ervoides and comparison to Lens culinaris.

Authors:  Nurbanu Tayşi; Yasin Kaymaz; Duygu Ateş; Hatice Sari; Cengiz Toker; M Bahattin Tanyolaç
Journal:  Sci Rep       Date:  2022-09-05       Impact factor: 4.996

6.  Comparative plastome analysis of Blumea, with implications for genome evolution and phylogeny of Asteroideae.

Authors:  Furrukh Mehmood; Abdur Rahim; Parviz Heidari; Ibrar Ahmed; Péter Poczai
Journal:  Ecol Evol       Date:  2021-05-06       Impact factor: 2.912

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.