Literature DB >> 25202552

Primers for low-copy nuclear genes in the Hawaiian endemic Clermontia (Campanulaceae) and cross-amplification in Lobelioideae.

Yohan Pillon1, Jennifer Johansen1, Tomoko Sakishima1, Srikar Chamala2, W Brad Barbazuk2, Elizabeth A Stacy1.   

Abstract

PREMISE OF THE STUDY: Primers were developed to amplify 12 intron-less, low-copy nuclear genes in the Hawaiian genus Clermontia (Campanulaceae), a suspected tetraploid. • METHODS AND
RESULTS: Data from a pooled 454 titanium run of the partial transcriptomes of seven Clermontia species were used to identify the loci of interest. Most loci were amplified and sequenced directly with success in a representative selection of lobeliads even though several of these loci turned out to be duplicated. Levels of variation were comparable to those observed in commonly used plastid and ribosomal markers. •
CONCLUSIONS: We found evidence of a genome duplication that likely predates the diversification of the Hawaiian lobeliads. Some genes nevertheless appear to be single-copy and should be useful for phylogenetic studies of Clermontia or the entire Lobelioideae subfamily.

Entities:  

Keywords:  Clermontia; Hawai‘i; Lobelioideae; low-copy nuclear genes; next-generation sequencing; polyploidy

Year:  2013        PMID: 25202552      PMCID: PMC4105025          DOI: 10.3732/apps.1200450

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Although phylogenetic studies of plants rely heavily on plastid and nuclear ribosomal loci, the limitations of these loci are well known. Plastid loci are uniparentally inherited and susceptible to chloroplast capture (Rieseberg and Soltis, 1991). Furthermore, their low variation is particularly problematic for investigations of recent radiations such as those in the Hawaiian Islands, and indeed these markers have provided only limited resolution there. Nuclear ribosomal loci are subject to complex concerted evolution (Alvarez and Wendel, 2003), which can be incomplete so that multiple ancient alleles are maintained within a single individual, or rapid such that traces of hybridization are quickly erased. As a result, both plastid and ribosomal markers provide only incomplete information when hybridization is common. The genus Clermontia Gaudich. (Campanulaceae, subfamily Lobelioideae) comprises 22 species endemic to Hawai‘i, 13 of which are on the IUCN Red List of Threatened Species. Species identification in the field is often difficult, particularly in the absence of flowers, and apparent hybrids can be common. Clermontia and other Hawaiian lobeliads (six genera total) form a monophyletic clade that represents the largest plant radiation in Hawai‘i (Givnish et al., 2009). The members of this clade, as well as the members of clade 4 of Antonelli (2008), in which the Hawaiian clade is nested, are suspected paleotetraploids (Lammers, 1988). To find genetic markers that will be useful for phylogenetics and DNA barcoding within Clermontia, we used 454 data of partial cDNA libraries to design primers for single-exon nuclear loci in Clermontia and then tested their cross-amplification in Campanulaceae.

METHODS AND RESULTS

We obtained a pooled, partial transcriptome library from leaf and floral buds (fixed in the field in RNAlater [QIAGEN, Gaithersburg, Maryland, USA] and stored at −80°C) of seven taxa: Clermontia arborescens (H. Mann) Hillebr., C. clermontioides (Gaudich.) A. Heller, C. fauriei H. Lév., C. kakeana Meyen, C. kohalae Rock, C. parviflora Gaudich. ex A. Gray, and C. peleana Rock. RNA isolation, cDNA synthesis, and 454 sequencing were done at the University of Arizona Genetics Core Laboratory. The 454 run provided 1.4 million reads with an average length of 395 bp. 454 adapters, ribosomal RNA, and low-quality and low-complexity sequences were removed/trimmed using SeqClean (http://compbio.dfci.harvard.edu/tgi/software/), and each taxon was assembled separately by the TGI Clustering tools (TGICL; Pertea et al., 2003), using default settings. We conducted BLAST searches of the 400 most highly expressed genes in Arabidopsis (C. Fizames, personal communication) against our data in CLC DNA Workbench (CLC bio, Aarhus, Denmark) to identify a set of genes with high coverage within each of all or most of the species. We selected loci (generally only a small portion of a gene) that comprised a single, long exon (200 bp) with matches in multiple species, and designed primers with FastPCR (PrimerDigital Ltd., Helsinki, Finland; http://www.primerdigital.com/fastpcr.html) for their amplification using default settings. The presence of introns was tested by comparison with genomic and cDNA sequences in the Arabidopsis Information Resource database (www.arabidopsis.org). Avoiding introns allowed the direct sequencing of accessions even in the case of gene duplications; introns often contained indels, which often resulted in alleles of different lengths in heterozygotes or among copies of duplicated genes. Twelve exon regions were identified (Table 1, Appendix 1) and were tested on seven accessions: C. fauriei (the earliest diverging species within the genus), C. arborescens, C. kakeana, Cyanea asplenifolia Hillebr. (Cyanea is a Hawaiian endemic genus and putative sister group of Clermontia; Givnish et al., 2009), Hippobroma longiflora (L.) G. Don (belonging to a different major clade of Lobelioideae and a likely tetraploid; Antonelli, 2008), Lobelia erinus L. (one of the earliest diverging Lobelioideae; Antonelli, 2008), and Campanula persicifolia L. (Campanuloideae). Leaf material was collected in the field and dried in silica gel, and genomic DNA was extracted using the Nucleospin Plant II Kit (Macherey-Nagel, Düren, Germany). The nuclear regions were amplified using the following mix: 12.3 μL of H2O, 4 μL of GoTaq 5× Buffer (Promega Corporation, Madison, Wisconsin, USA), 2 μL of MgCl2 25 mM, 0.4 μL of dNTP 1.25 μM, 0.2 μL of each primer 10 μM, 0.1 μL of GoTaq Flexi DNA polymerase 5 U/μL (Promega Corporation), and 0.8 μL of DNA template. The following amplification program was used: 2 min at 94°C; 38 cycles of 1 min at 94°C, 1 min at 63°C, and 1 min at 72°C; and a final extension of 5 min at 72°C. PCR products were sequenced directly at the Core Genetics Laboratory at the University of Hawai‘i Hilo. The identity of each amplified gene was validated through BLAST or tBLASTx (Clerm4, Clerm10) searches in GenBank. In every case, the 10 best matches (identities >80%) were either the same gene from a different species or a gene that was not yet annotated.
Table 1.

Identity of the 12 intron-less, low-copy nuclear genes identified in this study, with primer sequences, results from cross-amplification tests, and inference of putative gene duplication.

GenePutative Arabidopsis homologPutative productForward primer sequence (5′–3′)Reverse primer sequence (5′–3′)Length (bp)Failed sequencesDuplication in Hawai‘i
Clerm1At1g14320RPL10 (ribosomal protein)GACCTGCTAGGTGTTACCGTATCTTCTGACGACCAGGGA484yes
Clerm2At1g61520Lhca*3 (chlorophyll-binding protein)TTGGATTCGACCCACTTGGACTTAAGGCTGGTCAGCAC594Campanulano
Clerm3At2g05070Lhcb2.2 (chlorophyll-binding protein)GGTGACTACGGATGGGACACCATCAGCCAGTCCAAGTGGGT382yes
Clerm4At3g26520TIP2 (tonoplast intrinsic protein)GATGACATTCGGGCTGGTACAGTAGACCCAGTGGTTGG210Campanulano
Clerm5At4g18100RPL32e (ribosomal protein)ATTGATTCCCGGGTGAGAAGTCACCACAACATCAAGCT248Campanula, LobeliaClermontia
Clerm6At3g13920EIF4-1 (translation initiation factor)TTTGAGAAGCCCTCTGCAATTGCCTACGCAACATGTCAA351yes
Clerm7At4g14880ATCYS3-A (acetylserine(thiol)lyase)TCTTATCGAACCAACAAGTGGTCGGATTGGCAGGATTCTC257no
Clerm8At5g05170CESA3 (cellulose synthase)AGAGCCATCGCAATTGGCTGGCGAAATACCACTCTGGAGC264no
Clerm9At3g23810SAHH2 (S-adenosyl-l-homocysteine hydrolase)CATGTCCTTAGCCGACTTCGACCATTAGCCTGCATTTGG608no
Clerm10At1g79550PGK (phosphoglycerate kinase)GTCAAGATGGCAAATGATTGCCCTTCAGTGGAAGCATGAGC212yes
Clerm11At1g66580SAG24 (senescence-associated gene/ribosomal protein L10e)GACCTGCTAGGTGTTACCGTCCCTCATACCAGTCTGGAGC339yes
Clerm12At3g56940Magnesium protoporphyrin IX monomethyl ester cyclaseTCTACACGACGGATTTCGAGGTGAGGCGTCTACCAAGCTC264yes

Missing sequences are due to failed amplification (Clerm5), weak amplification (Clerm4), or amplification of a different gene (Clerm2).

Identity of the 12 intron-less, low-copy nuclear genes identified in this study, with primer sequences, results from cross-amplification tests, and inference of putative gene duplication. Missing sequences are due to failed amplification (Clerm5), weak amplification (Clerm4), or amplification of a different gene (Clerm2). All 12 nuclear regions were successfully amplified and sequenced in Clermontia, Cyanea, and Hippobroma; a single gene was not amplified in Lobelia erinus, and three could not be sequenced in Campanula (Table 1). A high number of ambiguous bases were found consistently in the forward and reverse sequences of some accessions, suggesting the presence of multiple gene copies. In five genes (Clerm1, Clerm6, Clerm10, Clerm11, Clerm12), ambiguous sites were identical across the three Clermontia species and Cyanea but absent in the other genera (example in Fig. 1). To confirm the hypothesis of gene duplication, we separated alleles computationally from the direct sequences using PHASE (default settings) within the software DnaSP (Librado and Rozas, 2009), and built a neighbor-joining tree of the alleles in SeaView (Gouy et al., 2010) with default settings. In each of these five cases, we recovered two clades, each comprising a single allele from each of the four Hawaiian lobeliad species examined (example in Appendix S1). This pattern strongly suggests a genome duplication event that predates the divergence of Clermontia and Cyanea. Clerm5 was duplicated in Clermontia but apparently not in Cyanea. This is the only gene for which direct sequences turned out to be difficult to read due to the divergence of the two gene copies, which may be due to the presence of an intron not present in Arabidopsis. Five genes (Clerm2, Clerm4, Clerm7, Clerm8, and Clerm9) behaved as single-copy genes in Clermontia. No recombination was detected in those genes using genetic algorithms for recombination detection (GARD; Kosakovsky Pond et al., 2006; http://www.datamonkey.org/). The percentage of variable sites within each of these genes was comparable to those of the plastid loci rbcL, matK, and psbA-trnH and the nuclear ribosomal ITS and ETS (Table 2). Genotyping of a broader taxonomic sample of Clermontia revealed a much greater number of variants at these newly described nuclear genes and a different pattern of evolution compared to plastid genes (Pillon et al., 2013).
Fig. 1.

Example of electropherograms for Clerm12 comparing Clermontia fauriei, Cyanea asplenifolia, and Hippobroma longiflora. Only the forward sequence is shown; asterisks indicate ambiguous sites. The matching positions of multiple ambiguous sites in Clermontia and Cyanea implicate an ancestral gene duplication prior to the divergence of these two genera rather than heterozygosity. Note: black = G; green = A; red = T; blue = C.

Table 2.

Comparison of variation of the five apparently nonduplicated nuclear genes with three plastid and two nuclear ribosomal loci.

CharacteristicClerm2Clerm4Clerm7Clerm8Clerm9rbcLmatKpsbA-trnHITSETS
Length (bp)594210257264608599889460–462874–877594–600
No. of variable sites125455101691712
Percent variable sites2.02.41.61.90.81.71.81.91.92.0
1st position (nonsynonymous)2 (2)1 (1)01 (1)03 (1)9 (8)NANANA
2nd position (nonsynonymous)2 (2)001 (1)003 (3)NANANA
3rd position (nonsynonymous)8 (2)4 (0)4 (0)3 (0)5 (0)7 (3)4 (1)NANANA
Indels0000000212

Note: NA = not applicable.

For each gene, variation was measured on a sample that included Clermontia fauriei, C. kakeana, C. arborescens, and Cyanea asplenifolia.

Example of electropherograms for Clerm12 comparing Clermontia fauriei, Cyanea asplenifolia, and Hippobroma longiflora. Only the forward sequence is shown; asterisks indicate ambiguous sites. The matching positions of multiple ambiguous sites in Clermontia and Cyanea implicate an ancestral gene duplication prior to the divergence of these two genera rather than heterozygosity. Note: black = G; green = A; red = T; blue = C. Comparison of variation of the five apparently nonduplicated nuclear genes with three plastid and two nuclear ribosomal loci. Note: NA = not applicable. For each gene, variation was measured on a sample that included Clermontia fauriei, C. kakeana, C. arborescens, and Cyanea asplenifolia.

CONCLUSIONS

The selection of intron-less regions proved successful for the amplification and direct sequencing of several nuclear loci and the detection of duplicated genes. The large number of gene duplications shared between Clermontia and Cyanea strongly supports the hypothesis of whole-genome duplication that predates the diversification of the lobeliads in Hawai‘i. Whole genome duplication has similarly been demonstrated in Hawaiian silverswords (Barrier et al., 1999). Despite the genome duplication, we nevertheless identified a number of apparently single-copy genes, whether due to the loss of one copy in each case or the selectivity of our primers for one copy. Geographic and taxonomic patterns of variations of two of these markers within Clermontia are examined in a study of their potential use as DNA barcodes (Pillon et al., 2013). Click here for additional data file.
  9 in total

1.  TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets.

Authors:  Geo Pertea; Xiaoqiu Huang; Feng Liang; Valentin Antonescu; Razvan Sultana; Svetlana Karamycheva; Yuandan Lee; Joseph White; Foo Cheung; Babak Parvizi; Jennifer Tsai; John Quackenbush
Journal:  Bioinformatics       Date:  2003-03-22       Impact factor: 6.937

Review 2.  Ribosomal ITS sequences and plant phylogenetic inference.

Authors:  I Alvarez; J F Wendel
Journal:  Mol Phylogenet Evol       Date:  2003-12       Impact factor: 4.286

3.  GARD: a genetic algorithm for recombination detection.

Authors:  Sergei L Kosakovsky Pond; David Posada; Michael B Gravenor; Christopher H Woelk; Simon D W Frost
Journal:  Bioinformatics       Date:  2006-11-16       Impact factor: 6.937

4.  DnaSP v5: a software for comprehensive analysis of DNA polymorphism data.

Authors:  P Librado; J Rozas
Journal:  Bioinformatics       Date:  2009-04-03       Impact factor: 6.937

5.  Origin, adaptive radiation and diversification of the Hawaiian lobeliads (Asterales: Campanulaceae).

Authors:  Thomas J Givnish; Kendra C Millam; Austin R Mast; Thomas B Paterson; Terra J Theim; Andrew L Hipp; Jillian M Henss; James F Smith; Kenneth R Wood; Kenneth J Sytsma
Journal:  Proc Biol Sci       Date:  2009-02-07       Impact factor: 5.349

6.  SeaView version 4: A multiplatform graphical user interface for sequence alignment and phylogenetic tree building.

Authors:  Manolo Gouy; Stéphane Guindon; Olivier Gascuel
Journal:  Mol Biol Evol       Date:  2009-10-23       Impact factor: 16.240

7.  Interspecific hybrid ancestry of a plant adaptive radiation: allopolyploidy of the Hawaiian silversword alliance (Asteraceae) inferred from floral homeotic gene duplications.

Authors:  M Barrier; B G Baldwin; R H Robichaux; M D Purugganan
Journal:  Mol Biol Evol       Date:  1999-08       Impact factor: 16.240

8.  Higher level phylogeny and evolutionary trends in Campanulaceae subfam. Lobelioideae: molecular signal overshadows morphology.

Authors:  Alexandre Antonelli
Journal:  Mol Phylogenet Evol       Date:  2007-06-30       Impact factor: 4.286

9.  Potential use of low-copy nuclear genes in DNA barcoding: a comparison with plastid genes in two Hawaiian plant radiations.

Authors:  Yohan Pillon; Jennifer Johansen; Tomoko Sakishima; Srikar Chamala; W Brad Barbazuk; Eric H Roalson; Donald K Price; Elizabeth A Stacy
Journal:  BMC Evol Biol       Date:  2013-02-09       Impact factor: 3.260

  9 in total
  2 in total

1.  MarkerMiner 1.0: A new application for phylogenetic marker development using angiosperm transcriptomes.

Authors:  Srikar Chamala; Nicolás García; Grant T Godden; Vivek Krishnakumar; Ingrid E Jordon-Thaden; Riet De Smet; W Brad Barbazuk; Douglas E Soltis; Pamela S Soltis
Journal:  Appl Plant Sci       Date:  2015-04-06       Impact factor: 1.936

2.  A genome-scale mining strategy for recovering novel rapidly-evolving nuclear single-copy genes for addressing shallow-scale phylogenetics in Hydrangea.

Authors:  Carolina Granados Mendoza; Julia Naumann; Marie-Stéphanie Samain; Paul Goetghebeur; Yannick De Smet; Stefan Wanke
Journal:  BMC Evol Biol       Date:  2015-07-04       Impact factor: 3.260

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.