Literature DB >> 30131884

Low-copy nuclear markers in Isoëtes (Isoëtaceae) identified with transcriptomes.

Peter W Schafran1,2, Gabriel Johnson2, W Carl Taylor2, Elizabeth A Zimmer2, Lytton J Musselman1.   

Abstract

PREMISE OF THE STUDY: Few genetic markers provide phylogenetic information in closely related species of Isoëtes (Isoëtaceae). We describe the development of primers for several putative low-copy nuclear markers to resolve the phylogeny of Isoëtes, particularly in the southeastern United States. METHODS AND
RESULTS: We identified regions of interest in Isoëtes transcriptomes based on low-copy genes in other plants. Primers were designed for these regions and tested with 16 taxa of Isoëtes and one species of Lycopodium. Parts of the pgiC, gapC, and IBR3 gene regions show phylogenetic signal within the North American and Mediterranean clades of Isoëtes.
CONCLUSIONS: Transcriptome data prove useful for identification and primer design of low-copy genes. Three new markers show potential for inferring phylogenies in regional clades of Isoëtes, and possibly across the entire genus.

Entities:  

Keywords:  IBR3; Isoëtes; Sanger sequencing; gapC; pgiC; primer design

Year:  2018        PMID: 30131884      PMCID: PMC5947607          DOI: 10.1002/aps3.1142

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Isoëtes L. (Isoëtaceae, Lycopodiophyta) is a cosmopolitan genus of ca. 250 recognized species. These heterosporous lycophytes consist of a 2–3‐lobed rootstock that bears linear, quill‐like, microphyllous leaves or sporophylls. All microphylls have the potential to develop into sporophylls (Foster and Gifford, 1974). Mega‐ and microsporangia are produced at the base of sporophylls, in some species covered by a layer of tissue called a velum. Traditionally, spore ornamentation and velum coverage have been considered taxonomically important. Although species inhabit a variety of ecological niches, from obligate aquatic to ephemeral terrestrial habitats, their morphology is extremely conserved. Phylogenetic studies in closely related clades of Isoëtes have been limited by a dearth of morphological features and molecular markers. Hoot and Taylor (2001) identified the nuclear ribosomal gene internal transcribed spacer (ITS), a LEAFY homolog nuclear gene intron (LFY), and the plastid atpB‐rbcL spacer region as informative markers in Isoëtes. However, although these markers and the plastid rbcL gene show utility in large‐scale, global phylogenies, they generally lose resolution at the regional level (Rydin and Wikström, 2002; Hoot et al., 2006; Larsén and Rydin, 2016). LFY is more variable than the other three markers and is fairly informative in recently diverged species groups (Taylor et al., 2004; Hoot et al., 2004). With only a single informative nuclear marker within groups such as the eastern North American clade, it is difficult to fully test phylogenetic hypotheses of reticulate evolution and incomplete lineage sorting. Transcriptomes provide a valuable tool for marker selection and PCR primer design in the absence of a sequenced genome, as is the case in Isoëtes. Databases such as the 1000 Plants project (http://www.onekp.com; Matasci et al., 2014) contain transcriptomes across all major lineages of land plants, allowing identification of unique marker regions for a group of interest. Here we describe use of transcriptome data to develop PCR primers for phylogenetically informative low‐copy nuclear markers in Isoëtes.

METHODS AND RESULTS

Markers of interest were selected based on a literature search of reportedly low‐copy nuclear markers in ferns and mosses (Table 1; Szövényi et al., 2006; Schuettpelz et al., 2008; Rothfels et al., 2013). Nucleotide sequences for these markers were obtained from the National Center for Biotechnology Information's (NCBI) GenBank (http://www.ncbi.nlm.nih.gov/genbank/; Clark et al., 2016) or TreeBASE (http://www.treebase.org; Sanderson et al., 1994) databases. Transcriptomes for three Isoëtes taxa were provided by other sources (I. echinospora Durieu from S. Hetherington, University of Oxford, Oxford, United Kingdom; and I. tegetiformans Rury and an unnamed Isoëtes species from the 1000 Plants project [http://www.oneKP.com]). Using the BLAST+ 2.4 software package (Camacho et al., 2009), local BLAST databases were constructed from each Isoëtes transcriptome. The sequences of selected fern (Rothfels et al., 2013) and moss (Szövényi et al., 2006) low‐copy nuclear markers were BLASTed against the transcriptome databases to identify those markers present as single‐copy in Isoëtes. These single‐copy marker regions were extracted from their respective transcriptome and aligned with marker sequences from the literature using Geneious version 7 (Kearse et al., 2012). Primer sequences from the literature were modified to match the Isoëtes transcriptome sequences.
Table 1

Primers designed for low‐copy markers identified in Isoëtes transcriptomes

Marker IDPrimer namesPrimer sequences (5′–3′) T a (°C)
pgiCpgiC_1156FpgiC_1900RF: GGTCTCCTAAGTGTCTGGAATGT55
R: GTTCTCCAAAATCAATTTCTCC
IBR3_1 IBR3_2F IBR3_6R F: CTCAAATCAGCTCATGCAATTG60
R: AGCTCCCAATCCAACACAGC
IBR3_2 IBR3_13F IBR3_16R F: CAATGACTGAACCGCAAGTTG60
R: GACCCAACGAGTCTCATGCAG
Transducin_1 Transducin_1F Transducin_1R F: GATGTGGTTGGTGAGTCTGG55
R: CACTTCATTGAACCTCAG
Transducin_2 Transducin_2F Transducin_2R F: GGAACAAAAGCAGGGACATTAG55
R: CATCAGAAGAGATGTCCATAC
gapC_short gapC_5F gapC_7R F: GAATCTACTGGTGTCTTCAC55
R: TTCTGGTTTATATTCATGCTCG
gapC_long gapC_5F gapC_9R F: GAATCTACTGGTGTCTTCAC55
R: ATGGTCCATCAACAGTYTTCTG

F = forward; R = reverse; T a = annealing temperature.

Primers designed for low‐copy markers identified in Isoëtes transcriptomes F = forward; R = reverse; T a = annealing temperature. Plants were collected from the field, and leaf tissue was desiccated with silica gel. Voucher specimens have been stored at the Old Dominion University herbarium (ODU) and/or the U.S. National Herbarium (US). DNA was extracted from approximately 200 mg of dried tissue with the QIAGEN DNeasy Plant Mini Kit (QIAGEN Inc., Valencia, California, USA) or AutogenPrep 965 (Autogen Inc., Holliston, Mississippi, USA) using standard protocols. Sixteen diploid taxa of Isoëtes and one species of Lycopodium L. (one individual per taxon) were selected from available DNAs to represent various levels of divergence (Appendix 1). Markers were amplified by PCR on an ABI 2720 thermocycler (Thermo Fisher Scientific Inc., Waltham, Massachusetts, USA), with a reaction mixture of 12.5 μL of 2× GoTaq PCR master mix (Promega Corporation, Madison, Wisconsin, USA), 0.5 μL of 0.1 mg/mL bovine serum albumin, 1.0 μL each of 10 μM forward and reverse primer, 7.5 μL of sterile distilled water, and 2.5 μL of DNA template (10–60 ng). PCR reactions were carried out with an initial melting period at 94°C (5 min), followed by 35 cycles of 94°C (30 s), annealing at 55–60°C (30 s), and extension at 72°C (1 min), with a final extension at 72°C (7 min). Amplification success was confirmed by electrophoresis using a 1.5% sodium boric acid–based agarose gel. Markers were selected for Sanger sequencing based on their producing a single band across all samples and for a maximum size of ~1000 bp. PCR products were treated with ExoSAP‐IT PCR cleanup enzyme mix (Affymetrix Inc., Santa Clara, California, USA) before cycle sequencing with BigDye Terminator v3.1 (Thermo Fisher Scientific Inc.). The labeled sequencing fragments were read on an ABI 3130xl Genetic Analyzer (Thermo Fisher Scientific Inc.), and the resulting chromatograms were edited and analyzed using Geneious (Kearse et al., 2012). Initial screening of primers showed that all amplify in at least some of the eastern North American taxa. Gel electrophoresis revealed that IBR3_1 and Transducin_2 are too long (~2000 bp) and Transducin_1 has both short and long copies in some individuals (~500 bp and ~1000 bp), making these poor candidates for a Sanger sequencing approach without needing molecular cloning or gel extraction. Although gapC_short readily amplified, it is contained within gapC_long, making sequencing of the shorter fragment redundant. pgiC, IBR3_2 (hereafter IBR3), and gapC_long (hereafter gapC) were selected for PCR and sequencing of the full taxa list (Appendices 2, 3).

pgiC

This primer pair is rooted in exons 14 and 16, and amplifies across introns 14, 15, and exon 15 of this locus (Rothfels et al., 2013). The region amplified easily across all taxa of Isoëtes and Lycopodium clavatum L., and generated consistently high‐quality sequence data. All sequences aligned well, with a total alignment length of 466 bp and pairwise identity of 83%. Excluding L. clavatum, alignment length decreases to 357 bp and pairwise identity increases to 89%. Sequence length between these species of Isoëtes ranges from 310 to 347 bp, with a mean of 324 bp (Table 2). This is approximately half the length of the same region in ferns tested by Rothfels et al. (2013).
Table 2

Alignment statistics for all sequences with quality scores >85%

Marker Isoëtes Lycopodium + Isoëtes
Amplicon length range, bp (Mean)Alignment length, bpPairwise % identityNo. of identical sites (%)No. of PIS (%)Amplicon length range, bp (Mean)Alignment length, bpPairwise % identityNo. of identical sites (%)No. of PIS (%)
pgiC310–347 (324)35789240 (67)80 (22)310–458 (331)46683192 (41)82 (18)
IBR3587–682 (659)70087415 (59)111 (16)
gapC443–543 (507)56185304 (54)95 (17)

PIS = parsimony informative sites.

Alignment statistics for all sequences with quality scores >85% PIS = parsimony informative sites.

gapC

The gapC gene encodes cytosolic glyceraldehyde‐3‐phosphate and is part of the GAPDH gene family (Strand et al., 1997; Wall, 2002; Szövényi et al., 2006). Primers designed by Szövényi et al. (2006) are rooted in exons 5 and 9 and amplify all exons and introns in between. However, given concern that the resulting marker in Isoëtes may be too long for Sanger sequencing, the primers designed for this study were rooted in exons 5 and 8, amplifying introns 5, 6, 7, and exons 6 and 7. This marker showed the least ability to routinely generate high‐quality sequence data. Although not detected in any of the transcriptomes available, it is possible this results from off‐target amplification of other members of the GADPH gene family (i.e., gapCp or an unnamed gapC/gapCp relative) (Schuettpelz et al., 2008; Rothfels et al., 2013). The Isoëtes‐only alignment is 561 bp and has a pairwise identity of 85% (Table 2).

IBR3

Unlike pgiC and gapC, this gene does not have an extensive history of use as a phylogenetic marker. The IBR3 gene is thought to encode an indole‐3‐butyric acid–specific peroxisomal enzyme related to acyl‐CoA dehydrogenases (Zolman et al., 2007). Rothfels et al. (2013) showed it to be single‐copy throughout selected fern lineages, and this also appears to be the case in Isoëtes. Primers for the IBR3 marker amplify most species of Isoëtes easily, with the exception of two members of the Mediterranean clade (I. histrix Bory & Durieu and I. nuttallii A. Braun ex Engelm.). Alignment of Isoëtes sequences is 700 bp long with 87% pairwise identity (Table 2).

CONCLUSIONS

Transcriptome mining is shown to be a useful tool for identification of putative low‐copy markers for primer design. Despite having access to transcriptomes of just three species of Isoëtes in the North American clade, primers could be designed for regions that show phylogenetic signal across widely divergent clades in the genus, and potentially across all Lycopodiophyta. Although techniques such as target enrichment allow for generation of data sets orders of magnitude larger (Mandel et al., 2014), design of primers for Sanger sequencing is still more time‐ and cost‐efficient in taxonomic groups for which just a few markers may be needed to infer well‐resolved phylogenies.
Taxona Phylogenetic cladeb Collection localityVoucher (Herbarium)c GenBank accession no.
pgiCIBR3gapC
Isoëtes butleri Engelm.Clade ETexas, USA Schafran 47 (ODU) KY243331 KY270816 KY270832
I. echinospora DurieuClade ENew York, USA Schafran NY‐4 (ODU) KY243333 KY270818 KY270835
I. engelmannii A. BraunClade ETennessee, USA Schafran 46 (ODU) KY243334 KY270819
I. flaccida Shuttlew. var. chapmanii Engelm.Clade E( (= I. flaccida)Florida, USA Bolin JB_FL_01 (ODU) KY243332 KY270817 KY270833
I. flaccida var. flaccida Clade EFlorida, USA Schafran FL‐01 (ODU) KY243335 KY270820 KY270836
I. histrix Bory & DurieuClade ESicily, Italy A. Troia s.n.d KY243347
I. lithophila N. Pfeiff.Clade ETexas, USA Schafran 61 (ODU) KY243336 KY270822 KY270838
I. longissima BoryClade B )(= I. velata)Sicily, Italy A. Troia s.n.d KY243348 KY270823 KY270839
I. melanopoda J. Gay & Durieu subsp. melanopoda Clade EMississippi, USA Taylor 6796 (US) KY243338 KY270825 KY270841
I. melanopoda subsp. silvatica D. F. Brunt. & D. M. Britton Clade E (= I. melanopoda s.l.) North Carolina, USA Schafran NC‐05 (ODU) KY243342 KY270828 KY270845
I. melanospora Engelm.Clade EGeorgia, USA Schafran 12 (ODU) KY243339 KY270826 KY270842
I. nuttallii A. Braun ex Engelm.Clade BCalifornia, USA Taylor 6734 (US) KY243351
I. piedmontana (N. Pfeiff.) C. F. ReedGeorgia, USA Schafran 18 (ODU) KY243341 KY270827 KY270844
I. storkii T. C. PalmerClade ECosta Rica Taylor 6760 (US) KY243352 KY270829 KY270846
I. tegetiformans RuryGeorgia, USA Schafran 19 (ODU) KY243343 KY270830 KY270847
I. valida (Engelm.) CluteClade EPennsylvania, USA Schafran 37 (ODU) KY243344 KY270831
Lycopodium clavatum L.New York, USA Schafran s.n.e MG434746

a One individual was sampled per taxon.

b Per Larsén and Rydin (2016).

c Herbaria are abbreviated according to Index Herbariorum (http://sweetgum.nybg.org/science/ih/).

d Tissue samples provided by A. Troia (Università degli Studi di Palermo, Palermo, Italy); not deposited in a recognized herbarium.

e Voucher deposited in P. Schafran's personal collection.

TaxonAmplificationSequencing
pgiCIBR3gapCpgiCIBR3gapC
Isoëtes butleri ++++++
I. echinospora ++++++
I. engelmannii +++++
I. flaccida var. chapmanii ++++++
I. flaccida var. flaccida ++++++
I. histrix ++NANA
I. lithophila ++++++
I. longissima ++++++
I. melanopoda subsp. melanopoda ++++++
I. melanopoda subsp. silvatica ++++++
I. melanospora ++++++
I. nuttallii ++NANA
I. piedmontana ++++++
I. storkii ++++++
I. tegetiformans ++++++
I. valida +++++
Lycopodium clavatum +++NA

+ = successful amplification or sequence quality >85%; — = no amplification or sequence quality <85%; NA = sequencing not attempted.

I. butleri I. echinospora I. engelmannii I. flaccida var. chapmanii I. flaccida var. flaccida I. histrix I. lithophila I. longissima I. melanopoda subsp. melanopoda I. melanopoda subsp. silvatica I. melanospora I. nuttallii I. piedmontana I. storkii I. tegetiformans I. valida
I. echinospora 10/34/37
I. engelmannii 4/30/—8/26/—
I. flaccida var. chapmanii 6/56/4410/57/282/52/—
I. flaccida var. flaccida 6/56/4310/58/282/53/—0/3/10
I. histrix 73/—/—78/—/—72/—/—70/—/—70/—/—
I. lithophila 4/41/516/36/402/39/—4/68/454/69/4479/—/—
I. longissima 74/217/20678/225/19473/218/—73/236/18873/235/19149/—/—74/225/198
I. melanopoda subsp. melanopoda 8/33/5710/13/476/28/—6/59/536/60/5172/—/—4/38/1574/221/208
I. melanopoda subsp. silvatica 8/35/3511/36/264/28/—6/61/316/62/2974/—/—6/40/3774/225/19810/38/44
I. melanospora 9/26/4712/23/325/16/—7/47/377/48/3675/—/—7/34/3075/218/20111/25/361/27/37
I. nuttallii 67/—/—69/—/—68/—/—70/—/—70/—/—54/—/—81/—/—49/—/—72/—/—69/—/—70/—/—
I. piedmontana 8/26/4011/23/144/16/—6/47/306/48/2774/—/—6/34/4274/218/19110/25/440/27/251/2/3669/—/—
I. storkii 13/29/4715/24/3411/25/—11/50/1411/50/1677/—/—4/30/3775/220/1929/26/4515/29/3416/21/3873/—/—15/21/36
I. tegetiformans 7/31/5410/28/406/21/—8/53/408/54/3672/—/—5/40/5474/221/19710/30/6110/28/3811/19/4767/—/—10/19/3814/24/42
I. valida 6/52/—14/51/—8/48/—10/14/—10/15/—75/—/—8/62/—75/239/—12/53/—12/56/—13/44/—67/—/—12/44/—17/44/—11/48/—
Lycopodium clavatum 227/—/—229/—/—226/—/—225/—/—225/—/—225/—/—248/—/—227/—/—223/—/—228/—/—229/—/—236/—/—228/—/—226/—/—229/—/—228/—/—
  10 in total

1.  Use of the nuclear gene glyceraldehyde 3-phosphate dehydrogenase for phylogeny reconstruction of recently diverged lineages in Mitthyridium (Musci: Calymperaceae).

Authors:  Dennis P Wall
Journal:  Mol Phylogenet Evol       Date:  2002-10       Impact factor: 4.286

2.  IBR3, a novel peroxisomal acyl-CoA dehydrogenase-like protein required for indole-3-butyric acid response.

Authors:  Bethany K Zolman; Michelle Nyberg; Bonnie Bartel
Journal:  Plant Mol Biol       Date:  2007-02-03       Impact factor: 4.076

3.  Nuclear DNA-based markers for plant evolutionary biology.

Authors:  A E Strand; J Leebens-Mack; B G Milligan
Journal:  Mol Ecol       Date:  1997-02       Impact factor: 6.185

4.  Revealing unknown or extinct lineages within Isoetes (Isoetaceae) using DNA sequencesfrom hybrids.

Authors:  Sara B Hoot; Nancy S Napier; W Carl Taylor
Journal:  Am J Bot       Date:  2004-06       Impact factor: 3.844

5.  BLAST+: architecture and applications.

Authors:  Christiam Camacho; George Coulouris; Vahram Avagyan; Ning Ma; Jason Papadopoulos; Kevin Bealer; Thomas L Madden
Journal:  BMC Bioinformatics       Date:  2009-12-15       Impact factor: 3.169

6.  Transcriptome-mining for single-copy nuclear markers in ferns.

Authors:  Carl J Rothfels; Anders Larsson; Fay-Wei Li; Erin M Sigel; Layne Huiet; Dylan O Burge; Markus Ruhsam; Sean W Graham; Dennis W Stevenson; Gane Ka-Shu Wong; Petra Korall; Kathleen M Pryer
Journal:  PLoS One       Date:  2013-10-08       Impact factor: 3.240

7.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

Review 8.  Data access for the 1,000 Plants (1KP) project.

Authors:  Naim Matasci; Ling-Hong Hung; Zhixiang Yan; Eric J Carpenter; Norman J Wickett; Siavash Mirarab; Nam Nguyen; Tandy Warnow; Saravanaraj Ayyampalayam; Michael Barker; J Gordon Burleigh; Matthew A Gitzendanner; Eric Wafula; Joshua P Der; Claude W dePamphilis; Béatrice Roure; Hervé Philippe; Brad R Ruhfel; Nicholas W Miles; Sean W Graham; Sarah Mathews; Barbara Surek; Michael Melkonian; Douglas E Soltis; Pamela S Soltis; Carl Rothfels; Lisa Pokorny; Jonathan A Shaw; Lisa DeGironimo; Dennis W Stevenson; Juan Carlos Villarreal; Tao Chen; Toni M Kutchan; Megan Rolf; Regina S Baucom; Michael K Deyholos; Ram Samudrala; Zhijian Tian; Xiaolei Wu; Xiao Sun; Yong Zhang; Jun Wang; Jim Leebens-Mack; Gane Ka-Shu Wong
Journal:  Gigascience       Date:  2014-10-27       Impact factor: 6.524

9.  A target enrichment method for gathering phylogenetic information from hundreds of loci: An example from the Compositae.

Authors:  Jennifer R Mandel; Rebecca B Dikow; Vicki A Funk; Rishi R Masalia; S Evan Staton; Alex Kozik; Richard W Michelmore; Loren H Rieseberg; John M Burke
Journal:  Appl Plant Sci       Date:  2014-02-06       Impact factor: 1.936

10.  GenBank.

Authors:  Karen Clark; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; Eric W Sayers
Journal:  Nucleic Acids Res       Date:  2015-11-20       Impact factor: 16.971

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.