Literature DB >> 26528252

Single cell PCR amplification of diatoms using fresh and preserved samples.

Paul B Hamilton1, Keely E Lefebvre2, Roger D Bull1.   

Abstract

Single cell Chelex® DNA extraction and nested PCR amplification were used to examine partial gene sequences from natural diatom populations for taxonomic and phylogenetic studies at and above the level of species. DNA was extracted from cells that were either fresh collected or stored in RNAlater. Extractions from Lugol's fixation were also attempted with limited success. Three partial gene sequences (rbcL, 18S, and psbA) were recovered using existing and new primers with a nested or double nested PCR approach with amplification and success rates between 70 and 96%. An rbcL consensus tree grouped morphologically similar specimens and was consistent across the two primary sample treatments: fresh and RNAlater. This tool will greatly enhance the number of microscopic diatom taxa (and potentially other microbes) available for barcoding and phylogenetic studies. The near-term increase in sequence data for diatoms generated via routine single cell extractions and PCR will act as a multiproxy validation of longer-term next generation genomics.

Entities:  

Keywords:  RNAlater; barcoding; diatoms; fixation; phylogenetics; single cells

Year:  2015        PMID: 26528252      PMCID: PMC4604310          DOI: 10.3389/fmicb.2015.01084

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


Introduction

DNA barcoding has become common practice in animal and plant taxonomy (Hebert et al., 2003) with cytochrome c oxidase 1 (CO1), a mitochondrial gene, serving as the main animal barcoding gene (Hebert et al., 2004). In plants the chloroplast genes ribulose 1,5-biphosphate (rbcL) and megakaryocyte-associated tyrosine kinase (MatK) serve as two of the preferred barcode genes for taxonomic identifications (CBOL Plant Work Group, 2009). This is in contrast with the situation faced in diatom barcoding where several regions are presently identified as prominent taxonomic markers (e.g., Yoon et al., 2002; Evans et al., 2007). In some studies the conservative rbcL, CO1, and ribosomal complex gene 18S are considered to be good taxonomic characters for species determinations (Mann et al., 2010; Hamsher et al., 2011; Zimmermann et al., 2011). Ribosomal complex genes ITS, 18S, and 28S were also used both individually and in multi-gene studies to evaluate cryptic taxonomic variations at the genus and species levels (e.g., Amato et al., 2007; Vanormelingen et al., 2008; Pouličková et al., 2010; Kaczmarska et al., 2014). The smaller (~500 bp) chloroplast psbA gene was used in evolutionary studies but is not variable enough to be informative for species-level taxonomic studies (Souffreau et al., 2011, pers. obs.). In contrast, chloroplast gene psbC (>1000 bp) is more widely used across all the major orders (Theriot et al., 2010). With little consensus as to which marker best delimits diatom groups, the ability to amplify several genes including new genes from a single cell is essential for diatom taxonomy using DNA barcodes. In microbial studies, genetic sequencing has been successful across all the primary algal families Bacillariophyceae, Cyanophyceae, Chlorophyceae, Chrysophyceae, Cryptophyceae, Desmidiaceae, Euglenophyceae, Haptophyceae, Pyrrhophyceae, Raphidophyceae, Synurophyceae (e.g., Daugbjerg and Andersen, 1997; McCourt et al., 2000; Tomitani et al., 2006; Edvardsen et al., 2011; Bennett et al., 2014). However, compared to DNA research in Plantae and Animalae, there are much fewer sequences available for diatom taxa, leaving large taxonomic holes in DNA databases. As well, microbial genetic studies in algae are limited by the ability to collect enough material, and population genetic studies are all but absent. Culturing algae and other microbes to accumulate sufficient DNA has been the time-limiting step in microbial genetics. Cultures supply extra material for morphological identification and validation; however cultures are prone to alterations in structural morphology (Trainor et al., 1971; Estes and Dute, 1994). Single cell extraction and PCR protocols have been advanced in microbe research on live and fixed materials, although no single approach meets all aspects of routine enhanced multi-gene taxonomic research (e.g., Sherbakova et al., 2000; Ruiz Sebastián and O'Ryan, 2001; Lang and Kaczmarska, 2011). To date single cell extractions have been successfully completed with the Chrysophyceae (Auinger et al., 2008), Pyrrhophyceae (Richlen and Barber, 2005; Henrichs et al., 2007), and Bacillariophyceae (Lang and Kaczmarska, 2011). There are only a few published examples using non-cultured single cell amplifications (e.g., Godhe et al., 2002; Auinger et al., 2008; Lang and Kaczmarska, 2011). These amplifications are successful if there is sufficient initial DNA within the cell and the primer set is effective at maximizing amplification efficiency. In most cases, amplification of DNA for sequencing requires a nested amplification protocol. This nested approach is effective but has the potential to generate amplification errors (Ruck et al., 2014). In order to effectively utilize this nested approach, clear protocols for error checking and sequence validation much be established. Single cell genetic determinations are presently a novel and potentially efficient way to generate a large reference library of taxonomically informative data from single algal cells in complex environmental systems. This reference library will also contain an extensive database for population genetics and genetic biogeography studies (e.g., Alverson et al., 2007). There are a number of reagents which can be used in single cell DNA amplifications with algae (e.g., Bertozzini et al., 2005; Auinger et al., 2008). Chelex® resin is an effective DNA extraction tool with applications in molecular biology ranging from multicellular vertebrate tissues to single microbial cells (Hahn et al., 2000; Richlen and Barber, 2005). This extraction method was used in genetic investigations in forensic science (Legrand et al., 2002), population studies (Richlen and Barber, 2005), and evolution (Theriot et al., 2010). In mammalian research Chelex extractions have been used on fresh, frozen, alcohol preserved, and to a limited degree on formalin-fixed tissues (Legrand et al., 2002). The simplicity of this technique coupled with relatively cheap cost allows for quick PCR assays (e.g., Bowers et al., 2000; Reyes-Escogido et al., 2010) which has the potential for more detailed taxonomic barcoding initiatives and finer population genetic studies. The recovery of gene sequences from structurally fixed and preserved material has had some success across all the biological groups, but has not developed into a routine protocol used for sequencing (Connell, 2002; Godhe et al., 2002; Henrichs et al., 2007). Institutions with collections of fixed biological materials, like museums, are extremely interested in the recovery of genes and genomes from their historic collections. The temporal record hidden in fixed biological collections—with regard to speciation and population genetics—is waiting to be mined. In microbial genetics, gene sequences have been recovered from ethanol, Lugol's solution, buffered formalin and RNAlater fixations (Ambion, 1999; Connell, 2002; Godhe et al., 2002; Auinger et al., 2008; this study). The general recipe for success is removal or dilution of the traditional fixation solution followed by standard sample processing, DNA extraction, and gene sequencing. Buffered formalin preserved materials can be treated with cold methanol to minimize the impact of the fixation, while sodium thiosulfate is effective in capturing iodine from solutions and biological materials (Godhe et al., 2002; Auinger et al., 2008). In the Protista, ethanol preparation and fixation prior to sequencing is less common, although has been successful with some limitations (e.g., Godhe et al., 2002; Henrichs et al., 2007; Lang and Kaczmarska, 2011; Ivanova et al., submitted). RNAlater is a more recent stabilizer suitable for the storage of material prior to RNA and DNA sequencing with a high rate of recovery (Ambion, 1999). RNAlater has the advantage of being a good fixative and further prevents the degradation of RNA and DNA. At 4°C viable fixation can be maintained for a month and at −20°C samples can be maintained indefinitely (Ambion, 1999). The objective of this study was to evaluate a nested amplification protocol for multiple genes in diatoms from single cells under live and RNAlater preserved conditions. This will establish a standard multiproxy routine for reproducible barcoding with morphological analysis of natural populations. In this study new primer pairs for nested amplification of rbcL, 18S and psbA were designed and different DNA polymerases and cycling protocols were compared. Finally, examples are presented for how this protocol can be used to establish a more comprehensive reference library of taxonomic and physiological genetic data.

Materials and methods

Collection of samples

All samples were benthic or planktonic and collected from freshwaters with a wide range in pH (5.1–7.8) and eutrophication states (oligotrophic–mesotrophic) (Tables 1, 2). The samples were kept cool in transport and at room temperature in the lab in natural light prior to single cell isolations.
Table 1

List of 35 taxa isolated and sequenced for genes rbcL, psbA, and 18S, with Canadian Museum of Nature Accession numbers, GenBank Id numbers, sample media, source locality with date of collection, and collector.

TaxonAccession numberGenBank ID for rbcLGenBank ID for 18SGenBank ID for psbAIdentifierSample mediumLocality and date of collectionCollector
Stauroneis cf. gracilisCANA 93655KM999045KM998975KM999010A6R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Stauroneis cf. gracilisCANA 93655KM999046KM998976KM999011A5R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Stauroneis cf. ancepsCANA 93655KM999047KM998977KM999012B3R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Pinnularia sp.CANA 93252KM999048KM998978KM999013B5R10RNALaterAlymer, small pond (Quebec, Canada) 45.443007 N 75.810437 W, 30-Jan-2014P.B. Hamilton
Pinnularia sp.CANA 93252KM999049KM998979KM999014A2R10WaterAlymer, small pond (Quebec, Canada) 45.443007 N 75.810437 W, 30-Jan-2014P.B. Hamilton
Pinnularia sp.CANA 93251KM999050KM998980KM999015A1R10WaterAlymer, small pond (Ontario, Canada) 45.442997 N 75.810437 W, 30-Jan-2014P.B. Hamilton
Pinnularia sp.CANA 93250KM999051KM998981KM999016A3R10WaterAlymer, small pond (Quebec, Canada) 54.443033 N 75.810378 W, 30-Jan-2014P.B. Hamilton
Pinnularia sp.CANA 93249KM999052KM998982KM999017A4R10WaterAlymer, small pond (Quebec, Canada) 45.443022 N 75.810421 W, 30-Jan-2014P.B. Hamilton
Pinnularia sp.CANA 93251KM999053KM998983KM999018B1R10RNALaterAlymer, small pond (Quebec, Canada) 45.442997 N 75.810437 W, 30-Jan-2014P.B. Hamilton
Pinnularia sp.CANA 93641KM999054KM998984KM999019B1R7RNALaterDitch, Hwy 56, Adirondack Park (New York, USA) 44.30692 N 74.71650 W, 26-Oct-2013P.B. Hamilton
Pinnularia sp.CANA 93250KM999055KM998985KM999020A8R10RNALaterAlymer, small pond (Quebec, Canada) 54.443033 N 75.810378 W, 30-Jan-2014P.B. Hamilton
Pinnularia sp.CANA 93660KM999056KM998986KM999021E3R8WaterAlymer, small pond (Quebec, Canada) 45.443033 N 75.810383 W, 15-Nov-2013K.E. Lefebvre
Navicula sp.CANA 93656KM999057KM998987KM999022B8R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 28-Sep-2013F.D. Caron
Navicula sp.CANA 93655KM999058KM998988KM999023B7R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Navicula sp.CANA 93655KM999059KM998989KM999024B6R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Navicula sp.CANA 93655KM999060KM998990KM999025B5R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Navicula sp.CANA 93655KM999061KM998991KM999026B4R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Navicula sp.CANA 93655KM999062KM998992KM999027B1R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Navicula sp.CANA 93656KM999063KM998993KM999028C6R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 28-Sep-2013F.D. Caron
Navicula sp.CANA 93656KM999064KM998994KM999029C3R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 28-Sep-2013F.D. Caron
Sellephora sp.CANA 93647KM999065KM998995KM999030F4R6WaterMilledgeville, south of airport (Georgia, USA) 33.1472 N 83.2505 W, 22-Oct-2013P.B. Hamilton
Craticula cf. cuspidataCANA 93656KM999066KM998996KM999031C4R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 28-Sep-2013F.D. Caron
Craticula cf. cuspidataCANA 93655KM999067KM998997KM999032A3R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Craticula cf. cuspidataCANA 93655KM999068KM998998KM999033A2R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Craticula cf. cuspidataCANA 93655KM999069KM998999KM999034A1R6WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 07-Sep-2013F.D. Caron
Craticula cuspidataCANA 93647KM999070KM999000KM999035C6R7RNALaterMilledgeville, south of airport (Georgia, USA) 33.1472 N 83.2505 W, 22-Oct-2013P.B. Hamilton
Craticula cf. cuspidataCANA 93656KM999071KM999001KM999036WaterSmall lake near Lac a Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 28-Sep-2013F.D. Caron
Gyrosigma accuminatumCANA 93251KM999072KM999002KM999037RNALaterAlymer, small pond (Quebec, Canada) 45.442997 N 75.810437 W, 30-Jan-2014P.B. Hamilton
Gyrosigma accuminatumCANA 93251KM999073KM999003KM999038WaterAlymer, small pond (Quebec, Canada) 45.442997 N 75.810437 W, 30-Jan-2014P.B. Hamilton
Gyrosigma accuminatumCANA 93251KM999074KM999004KM999039WaterAlymer, small pond (Quebec, Canada) 45.442997 N 75.810437 W, 30-Jan-2014P.B. Hamilton
Gyrosigma accuminatumCANA 93249KM999075KM999005KM999040RNALaterAlymer, small pond (Quebec, Canada) 45.443022 N 75.810421 W, 30-Jan-2014P.B. Hamilton
Gyrosigma accuminatumCANA 93663KM999076KM999006KM999041RNALaterAlymer, small pond (Quebec, Canada) 45.443050 N 75.810367 W, 15-Nov-2013K.E. Lefebvre
Gyrosigma accuminatumCANA 93663KM999077KM999007KM999042RNALaterAlymer, small pond (Quebec, Canada) 45.443050 N 75.810367 W, 15-Nov-2013K.E. Lefebvre
Gyrosigma accuminatumCANA 93663KM999078KM999008KM999043RNALaterAlymer, small pond (Quebec, Canada) 45.443050 N 75.810367 W, 15-Nov-2013K.E. Lefebvre
Cymatopleura soleaCANA 93251KM999079KM999009KM999044RNALaterAlymer, small pond (Quebec, Canada) 45.442997 N 75.810437 W, 30-Jan-2014P.B. Hamilton
Table 2

List of 38 taxa isolated and sequenced for the gene rbcL, with Canadian Museum of Nature Accession numbers, GenBank Id numbers, sample media, source locality with date of collection, and collector.

TaxonAccession numberGenBank IDIdentifierSample mediumLocality and date of collectionCollector
Melosira variansCANA 100020KM999080I7R14WaterBlack Rapids, Rideau River (Ontario, Canada) 45.321990 N 75.699427 W, 19-Jun-2014P.B. Hamilton
Melosira variansCANA 100020KM999081I4R14WaterBlack Rapids, Rideau River (Ontario, Canada) 45.321990 N 75.699427 W, 19-Jun-2014P.B. Hamilton
Synedra sp.CANA 93622KM999082C7R5WaterAlymer, small pond (Quebec, Canada) 45.442838 N 75.810296 W, 04-Oct-2013P.B. Hamilton
SurirellaCANA 100068KM999083A7R14WaterPetawawa Forestry Station (Ontario, Canada) 45.992966 N 77.399194 W, 18-May-2014P.B. Hamilton
SurirellaCANA 93281KM999084B6R13WaterCreek, Adirondack Park (New York, USA) 44 00.715 N 74 49.155 W, 11-May-2014P.B. Hamilton
Cymatopleura soleaCANA 93636KM999085G3R5WaterAlymer, small pond (Quebec, Canada) 45.442970 N 75.810277 W, 10-Oct-2013P.B. Hamilton
Cymatopleura soleaCANA 93622KM999086C2R5WaterAlymer, small pond (Quebec, Canada) 45.442838 N 75.810296 W, 04-Oct-2013P.B. Hamilton
Stauroneis cf. gracilisCANA 93639KM999087B8R5WaterDitch, Hwy 36 (Ontario, Canada) 44.803911 W 76.509686 W, 28-Sep-2013P.B. Hamilton
Sellephora cf. laevissimaCANA 100068KM999088B2R14WaterPetawawa Forestry Station (Ontario, Canada) 45.992966 N 77.399194 W, 18-May-2014P.B. Hamilton
Neidium sp.CANA 100068KM999089G3R12WaterPetawawa Forestry Station (Ontario, Canada) 45.992966 N 77.399194 W, 18-May-2014P.B. Hamilton
Neidium tumescensCANA 100072KM999090C7R12WaterWetland, Adirondack Park (New York, USA) 43.90870 N 74.43944 W, 11-May-2014P.B. Hamilton
Neidium tumescensCANA 100072KM999091A7R12WaterWetland, Adirondack Park (New York, USA) 43.90870 N 74.43944 W, 11-May-2014P.B. Hamilton
Neidium sp.CANA 93372KM999092D1R12WaterPond, Hwy 56 (New York, USA) 44.39536 N 74.75690 W, 11-May-2014P.B. Hamilton
Cymbella sp.CANA 100068KM999093H1R12WaterPetawawa Forestry Station (Ontario, Canada) 45.992966 N 77.399194 W, 18-May-2014P.B. Hamilton
Cymbopleura subcuspidataCANA 100068KM999094H4R12WaterPetawawa Forestry Station (Ontario, Canada) 45.992966 N 77.399194 W, 18-May-2014P.B. Hamilton
Cymbopleura subcuspidataCANA 93374KM999095D7R13WaterWetland, Adirondack Park (New York, USA) 44.31925 N 74.72392 W, 10-May-2014P.B. Hamilton
Cymbopleura subcuspidataCANA 93374KM999096D6R13WaterWetland, Adirondack Park (New York, USA) 44.31925 N 74.72392 W, 10-May-2014P.B. Hamilton
Gomphonema cf. parvulumCANA 93654KM999097A6R8WaterAlymer, small pond (Quebec, Canada) 45.44288 N 75.81045 W, 05-Nov-2013K.E. Lefebvre
Encyonema sp.CANA 93281KM999098B8R13WaterCreek, Adirondack Park (New York, USA) 44.00715 N 74.49155 W, 11-May-2014P.B. Hamilton
Frustulia bahlsiiCANA 93284KM999099C6R13WaterBig Moose Lake, Adirondack Park (New York, USA) 43.81723 N 74.88400 W, 10-May-2014P.B. Hamilton
Frustulia bahlsiiCANA 93279KM999100A5R13WaterWetland, Adirondack Park (New York, USA) 44.44806 N 74.77439 W, 10-May-2014P.B. Hamilton
Frustulia bahlsiiCANA 93284KM999101C5R13WaterBig Moose Lake, Adirondack Park (New York, USA) 43.81723 N 74.88400 W, 10-May-2014P.B. Hamilton
Frustulia cf. saxonicaCANA 93281KM999102B5R13WaterCreek, Adirondack Park (New York, USA) 44.00715 W 74.49155 W, 11-May-2014P.B. Hamilton
Gyrosigma accuminatumCANA 93253KM999103E5R10IodineAlymer, small pond (Quebec, Canada) 45.443031 N 75.810467 W, 21-Mar-2014P.B. Hamilton
Gyrosigma accuminatumCANA 93663KM999104G3R9RNALaterAlymer, small pond (Quebec, Canada) 45.26583 N 75.48622 W, 15-Nov-2013K.E. Lefebvre
Navicula sp.CANA 93656KM999105C5R6WaterSmall lake near Lac A Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 28-Sep-2013F.D. Caron
Navicula sp.CANA 93656KM999106D1R6WaterSmall lake near Lac A Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 28-Sep-2013F.D. Caron
Navicula sp.CANA 93656KM999107C1R6WaterSmall lake near Lac A Serpent (Quebec, Canada) 45.627633 N 75.601467 W, 28-Sep-2013F.D. Caron
Navicula sp.CANA 93645KM999108E5R6WaterAlymer, small pond (Quebec, Canada) 45.44305 N 75.81043 W, 08-Oct-2013P.B. Hamilton
Navicula sp.CANA 93645KM999109E3R6WaterAlymer, small pond (Quebec, Canada) 45.44305 N 75.81043 W, 08-Oct-2013P.B. Hamilton
Navicula sp.CANA 93644KM999110D3R6WaterAlymer, small pond (Quebec, Canada) 45.44305 N 75.81043 W, 08-Oct-2013P.B. Hamilton
Navicula sp.CANA 93644KM999111D5R6WaterAlymer, small pond (Quebec, Canada) 45.44305 N 75.81043 W, 08-Oct-2013P.B. Hamilton
Navicula cf. cryptocephalaCANA 93655KM999112A8R6WaterSmall lake near Lac A Serpent (Quebec, Canada) 45 37.658 N 75 36.088 W, 07-Sep-2013P.B. Hamilton
Nitzschia cf. sigmoideaCANA 100068KM999113G2R12WaterPetawawa Forestry Station (Ontario, Canada) 45.992966 N 77.399194 W, 18-May-2014P.B. Hamilton
Nitzschia linearisCANA 93374KM999114D8R13WaterAdirondack Park (New York, USA) 44.31925 N 74.72392 W, 10-May-2014P.B. Hamilton
Eunotia sp.CANA 93373KM999115D2R13WaterBig Moose Lake, Adirondack Park (New York, USA) 43.81782 N 74.88404 W, 10-May-2014P.B. Hamilton
Aulacoseria granulataCANA 100020KM999116I6R14WaterBlack Rapids, Rideau River (Ontario, Canada) 45.321990 N 75.699427 W, 19-Jun-2014P.B. Hamilton
Aulacoseria granulataCANA 100020KM999117I3R14WaterBlack Rapids, Rideau River (Ontario, Canada) 45.321990 N 75.699427 W, 19-Jun-2014P.B. Hamilton
List of 35 taxa isolated and sequenced for genes rbcL, psbA, and 18S, with Canadian Museum of Nature Accession numbers, GenBank Id numbers, sample media, source locality with date of collection, and collector. List of 38 taxa isolated and sequenced for the gene rbcL, with Canadian Museum of Nature Accession numbers, GenBank Id numbers, sample media, source locality with date of collection, and collector.

RNAlater preservation

A 0.2 mL volume of fresh benthic sample was aliquoted into a 1.5 mL tube containing 1 mL of RNAlater tissue storage buffer (http://www.protocol-online.org/prot/Protocols/RNAlater-3999.html). The sample was hand shaken to mix, kept at room temperature for 24 h, and then stored in the dark at 4°C between 5 and 21 days before single cell isolation.

Lugol's fixation

Following the protocol from Henrichs et al. (2007), 0.2 mL of living benthic sample was aliquoted into a 1.5 mL tube and then fixed with Lugol's iodine solution (10 g I2, 20 g of KI, 200 mL ddH2O). In this study non-acidified Lugol's fixation (no glacial acetic acid) was used while Henrichs et al. (2007) used acidified Lugol's fixation. The merits of using non-acidified vs. acidified Lugol's fixatuion, was discussed in Throndsen (1978). Samples fixed with non-acidified Lugol's iodine solution varied in storage duration from 12 days to 20 years. Just prior to isolation, 20 μL of 1 M sodium thiosulfate was added to the samples and hand mixed until the Lugol's iodine solution was dissipated (became colorless).

Single diatom cell isolation

One to two milliliters of each sample (living, RNAlater preserved or Lugol's preserved) was placed on a large microscope slide. The sample was then diluted with ~1 mL of sterile, nuclease free water (Bioshop Canada Inc.) and examined under an inverted microscope (Leica) or a compound microscope (Nikon, with long working distance objectives) at 10x magnification. Individual cells were isolated through suction using 20–40 μl drawn-out disposable pipets, either with a Narishige micromanipulator or simple manual suction. This isolation procedure was modified from Throndsen (1978) by removing the use of Formvar film. The isolated cell with associated contaminants was transferred to a new water droplet of DNA nuclease free water. This isolation and transfer was repeated 2–5 times to remove any contaminants and/or preservative residue. Individual cells were then isolated for the final time and transferred to a 0.2 mL PCR tube containing 200 μL of 10% (w/v) Chelex® 100 solution (Richlen and Barber, 2005). The samples were stored from 1 to 51 days at 4°C in the dark until DNA extraction (Tables 1, 2; Supplement 1 in Appendix).

DNA extraction, PCR and sequencing protocol

For DNA extraction, Chelex-stored samples were incubated for 20 min at 95°C. They were then vortexed for 15 s and centrifuged for 15 s at 14,000 rcf. We also tested extraction without the incubation step on some samples, and had similar success with both protocols. For PCR study, the following primers were used (Table 3).
Table 3

Oligonucleotide primer sequences used in the nested amplifications.

NestedForward primers: name, (sequence)Reverse primers: name, (sequence)Author
arbcL66+ (5′-TTTAAGGAGAAATAAATGTCTCAATCTG-3′)rbcLdp7- (5′-AAASHDCCTTGTGTWAGTVTC-3′)Alverson et al., 2007; Daugbjerg and Andersen, 1997
arbcL40+ (5′-GGACTCGAATVAAAAGTGAACG-3′)rbcL1444-(5′-GCGAAATCAGCTGTATCTGTWG-3′)Ruck and Theriot, 2011
brbcL181F (5′-ACCGTGCTAAAGCTT-3′)rbcL1174R (5′-ACCRATTGTACCACC-3′)This study
a18SP2F (5′-CTGGTTGATTCTGCCAGT-3′)18SP4R (5′-TGATCCTTCYGCAGGTTCAC-3′)van Hannen et al., 1999; Guillou et al., 1999
b18S200F (5′-YGGSRWGAYRTGGTGADTCA-3′)18S1444R (5′-GVRTRCATCAGTGTAGCGCG-3′)This study
apsbAF (5′-ATGACTGCTACTTTAGAAAGACG-3′)psbAR (5′-GCTAAATCTARWGGGAAGTTGTG-3′)Yoon et al., 2002
bpsbA45Fdg (5′-WTTYATCGWGCTCCWCCAG-3)psbA648R (5′-RTGWGCTGCAACGATRTTRT-3′)This study

Nested, a, refers to the first primer amplifications; b, the primer used in the second amplification.

Oligonucleotide primer sequences used in the nested amplifications. Nested, a, refers to the first primer amplifications; b, the primer used in the second amplification. The first amplifications were performed in a 25 μL volume with a final concentration of 1 × PCR buffer (Bioshop Canada Inc.), 2 mmol L−1 MgCl2, 0.3 mmol L−1 dNTP, 0.4 μmol L−1 of each primer, 1 unit of Taq DNA polymerase (BioShop), and 5.0 μl of Chelex DNA extract supernatant. The following cycling conditions were used: 94°C for 210 s; followed by 36 cycles of 94°C for 50 s, 52°C for 50 s, and 72°C for 80 s; and then a final elongation step at 72°C for 15 min. For the second amplification (and the third amplification step when performed), all steps and concentrations were the same as above except that 1 μl of the product from the previous amplification was used as template. The success of the PCR was assessed by visualizing the products on a 1.5% agarose gel. Successful PCR products were purified using the enzymes Exonuclease I and shrimp alkaline phosphatase (USB Corporation). Big Dye version 3.1 (Life Technologies Corporation) was used for sequencing reactions using 0.6 μL of Big Dye in a 10 μL reaction. Sequencing reaction products were purified via ethanol-EDTA-sodium acetate precipitation. Nucleotide sequences were generated using automated cycle-sequencing on an Applied Biosystems 3130xl automated sequencer. To validate the use of Bioshop Taq DNA polymerase, seven of the samples were re-amplified for all three genes using Phusion® High-Fidelity DNA Polymerase (New England Biolabs). The optimized annealing temperatures used in the first and second Phusion amplifications were as follows: rbcL 57.7°C and 60.3°C; 18S 52.2°C and 60.3°C; psbA 52.2°C and 63°C. Further, to ensure that the error rate was not affecting our sequences, seven samples were re-amplified for rbcL using Phusion with 20, 25, and 30 cycles in the second amplification. The final PCR products were then sequenced and compared for base pair differences with the sequences obtained with Bioshop Taq using the standard 34 cycle second amplification step. Seventy-two rbcL sequences were studied from a wide range of diatom taxa. Within the pennate raphe bearing diatoms, 35 diatom cells were sequenced for all three genes to ensure that the method would allow for multi-gene sequencing. Sequences were assembled and edited in Geneious version 6.1.5 and consensus sequences were aligned using the MAFFT alignment tool. Consensus sequences were compared to the GenBank database using the Basic Local Alignment Search Tool (BLAST) to verify and ensure that no contaminants were sequenced. Initial Maximum Parsimony (MP) tree topologies of each gene were assessed in PAUP v.4.0 (Swofford, 2003), and phylogenetic model testing (using likelihood scores and AIC calculations) of each region was analyzed in JModel Test v.2.1.4 (Guindon and Gascuel, 2003; Darriba et al., 2012) to ensure that the data could be concatenated for analysis. Datasets had the preferred General Time Reversible model (GTR+I+G) (Tavaré, 1986) except the 18S dataset which had the Transitional Model (TIM3+I+G) (Posada, 2003). However, the initial topology tree of the 18S matched both the rbcL and psbA initial topology trees, and, using a chi-squared distribution, the delta values from TIM3+I+G and GTR+I+G (delta = 0.0000, K = 78; delta = 3.9436, K = 80, respectively) were not shown to be statistically different (P < 0.15). Therefore, the model GTR+I+G was used for Bayesian analysis (BI) and Maximum Likelihood (ML) for both the concatenated data set (rbcL, 18S, psbA) and the single gene dataset (rbcL). The BI was carried out with MrBayes v.3.1.2 (Huelsenbeck and Ronquist, 2001; Ronquist and Huelsenbeck, 2003), with a Monte Carlo Markov Chain (MCMC) run for 1 million generations for the concatenated gene data set and 5 million generations for the rbcL dataset with the default settings. Runs were sampled every 1000th generation. The first 250,000 and 1,250,000 were discarded as burn-in for the concatenated gene dataset and rbcL dataset, respectively. The convergence and stationarity of the BI results were analyzed in Tracer v1.6 (Rambaut et al., 2013) and topology convergences were analyzed in AWTY (Wilgenbusch et al., 2004). ML Bootstrap analysis (Felsenstein, 1985) was done in Garli v.2.01 (Zwickl, 2006), using the GTR+I+G model, with 1000 bootstrap replicates for both the concatenated dataset and rbcL data set. Fragilaria bidens (GenBank Acquisition AB430716.1) was used as the outgroup for our concatenated dataset as it was a close sister species to the taxa used in the concatenated analysis. Bolidomonas pacifica (GenBank Acquisition HQ912421.1) and Cyclotella meneghiniana (GenBank Acquisition KF959651.1) were used as outgroup and sister taxa, respectively, for the rbcL dataset as they were the closest relatives to the dataset taxa available on GenBank.

Results

Partial sequences for rbcL (1202–1305 bp), 18S (811–1144 bp), and psbA (537–578 bp) were determined for 35 single cell freshwater diatom isolates. An additional 37 partial rbcL sequences were determined for a variety of diatom genera including Melosira C.Ag., Aulacoseira Thwaites, Synedra (Ulnaria) Ehrenberg, Eunotia Ehrenberg, Navicula Bory, Neidium Pfitzer, Placoneis C.Mereschkowsky, Frustulia C.Ag., Gyrosigma Hassall, Stauroneis Ehrenberg, Craticula Grunow, Sellaphora C.Mereschkowsky, Pinnularia Ehrenberg, Cymatopleura W.Sm., Encyonema Kütz., Gomphonema Ehrenberg, Nitzschia Hassall, Hantzschia Grunow, and Surirella Turpin. Of the diatoms sequenced, 60 were from fresh living samples, 12 were from RNAlater fixed samples, and one from a Lugol's fixed sample (Tables 1, 2, Supplement 1 in Appendix). NCBI Blast searches using the new sequences resulted in matches consistent with the genus-level morphological identifications of our specimens. Method validation of the number of cycles (20, 25, 30, and 34) and type of DNA polymerase using five different taxa showed only one instance of base pair substitutions, though there we no differences in the overall sequence alignments. The recovered sequence lengths for rbcL and 18S were both within the average range for the diatom sequences found on GenBank (Table 4). The length of psbA sequence recovered was slightly below the gene length found for diatom sequences on Genbank (Table 4). The amplification success was 70%, 90%, and 96% for rbcL, 18S, and psbA, respectively (Table 5). The recovery success of 18S and psbA was higher than rbcL because only samples that amplified successfully for rbcL were processed for these two regions. We had very low amplification success with the Lugol's fixed samples (Table 5). In addition, within the Lugol's fixed samples 13 contained fungi 18S nuclear DNA. RNAlater amplification success was consistent across sample storage periods ranging from 5 to 21 days (Table 5).
Table 4

Range of sequence base pair lengths for the three genes studied from our samples and those reported from GenBank.

Our samplesGenBank samplesMean
rbcL1202–1298364–14751181 (±386)
18S996–11453–1915818 (±556)
psbA537–578660–1082790 (±110)

Mean ± SD represents base pair lengths from GenBank sequences.

Table 5

Amplification success rates for live, RNA.

GeneLiveRNAlaterLugol's
5–7 days storage8–14 days storage15–21 days storage22–30 days storage
Successful (total attempts)Successful (total attempts)Successful (total attempts)Successful (total attempts)Successful (total attempts)Successful (total attempts)
rbcL348 (446)9 (15)1 (3)4 (13)4 (7)4 (53)
18S183 (201)7 (8)1 (1)3 (4)1 (3)13* (19)
psbA65 (66)7 (7)1 (1)4 (4)NA1 (1)

All of the successful samples were found to contain residual fungi DNA.

Range of sequence base pair lengths for the three genes studied from our samples and those reported from GenBank. Mean ± SD represents base pair lengths from GenBank sequences. Amplification success rates for live, RNA. All of the successful samples were found to contain residual fungi DNA.

Multiple gene analysis

Individual topologies of the three genes (rbcL, 18S, psbA) showed no differences, neither did the ML (-LnL = 10058.4412) nor BI analyses, thus only the BI tree was shown with both the BI posterior probabilities (PP) and the ML bootstrap values (BS) (Figure 1). Our dataset showed significant separation at the family level for the following: Pinnulariaceae (PP = 100, BS = 100), Sellaphoraceae (100, 74), Stauroneidaceae (100, 85), Pleurosigmataceae (100, 100), Naviculaceae (100, 100). RNAlater preserved and fresh samples of the same taxa were found within the correct clades. Examples of this can be seen in the genera Craticula, Gyrosigma, and Pinnularia (Figure 1). In the genus Gyrosigma, a fresh sample and an RNAlater sample were significantly similar (100, 87), and came out on the same terminal branch (Figure 1, stars). The small branching of individual taxa within this genera were due to ≤5 bp differences. Although difficult to determine, low number of base pair differences could be either base pair substitution error or intrageneric variation (0.001%) between the concatenated sequences (>3000 bp).
Figure 1

Tree showing relationships of rbcL, 18S, and psbA using the best fit model, GTR+I+G. Statistical support is shown with numbers at nodes: Bayesian posterior probabilities (Maximum Likelihood bootstrap values). The stars indicates two Gyrosigma cells, one from a fresh sample and the other from an RNAlater preserved samples which had significantly similar sequences. Taxa are indicated to be from either RNAlater or fresh samples.

Tree showing relationships of rbcL, 18S, and psbA using the best fit model, GTR+I+G. Statistical support is shown with numbers at nodes: Bayesian posterior probabilities (Maximum Likelihood bootstrap values). The stars indicates two Gyrosigma cells, one from a fresh sample and the other from an RNAlater preserved samples which had significantly similar sequences. Taxa are indicated to be from either RNAlater or fresh samples. Slight branching between individuals was also present in the genera of Craticula (≤ 3 bp differences, 0.001%) and Pinnularia (≤ 14 bp differences, 0.004%), showing low levels of intrageneric variation (Figure 1). Individual cells were principally collected from benthic sediments, leading to the larger representation of Naviculaceae taxa.

Single gene analysis

For the single gene dataset using only rbcL, neither the ML (-LnL = 9690.8210) nor the BI trees showed any differences, thus only the BI tree was shown with both the BI PP and the ML BS values (Figure 2). All isolates from the same genus showed strongly supported monophyletic taxa. In particular, the genera Craticula, Pinnularia, Neidium, Frustulia, Cymatopleura, Surirella, Gyrosigma, Melosira, and Aulocoseira all had very high support values (PP = 100; BS = 100), while the genera Stauroneis (100, 65) and Navicula (100, 85) had supported monophyletic taxa groups. Cells of the same taxon, collected from the same location (±10 m area) were also more closely aligned in the tree compared to similar cell isolates of the same taxon from other locales (Figure 2, red arrows). Specimens from the same genus which were isolated from either fresh, RNAlater preserved or in one case Lugol's preserved samples were always in the same monophyletic group. The Gyrosigma specimen which was isolated from iodine fixation was placed with all other Gyrosigma isolates (PP = 100, BS = 100). As well, for both Gyrosigma and Pinnularia, fresh and RNAlater preserved isolates were on the same terminal branch (Figure 2, black arrows).
Figure 2

Phylogenetic relationships of Bacillariophyta rooted with . Numbers at the nodes indicate statistical support if both methods resulted in >50%: Bayesian Inference posterior probabilities (Maximum Likelihood bootstrap values). Taxa are indicated to be from either fresh, RNALater or iodine samples. Color blocks indicate the different orders of Bacillariophyta (A, Melosirales; B, Aulacoseirales;C, Fragilariales; D, Thalassiosirales; E, Eunotiales; F, Bacillariales; G, Naviculales; H, Cymbellales; I, Surirellales). The black arrows show two instances in which sequences from preserved cells had very similar sequences from fresh cell samples. The red arrows show three instances where taxa collected from the same location had identical sequences. Bolidomonas pacifica (GenBank Acquisition HQ912421.1) and Cyclotella meneghiniana (GenBank Acquisition KF959651.1) were used as an outgroup and sister group respectively, and their rbcL sequences were obtained from GenBank.

Phylogenetic relationships of Bacillariophyta rooted with . Numbers at the nodes indicate statistical support if both methods resulted in >50%: Bayesian Inference posterior probabilities (Maximum Likelihood bootstrap values). Taxa are indicated to be from either fresh, RNALater or iodine samples. Color blocks indicate the different orders of Bacillariophyta (A, Melosirales; B, Aulacoseirales;C, Fragilariales; D, Thalassiosirales; E, Eunotiales; F, Bacillariales; G, Naviculales; H, Cymbellales; I, Surirellales). The black arrows show two instances in which sequences from preserved cells had very similar sequences from fresh cell samples. The red arrows show three instances where taxa collected from the same location had identical sequences. Bolidomonas pacifica (GenBank Acquisition HQ912421.1) and Cyclotella meneghiniana (GenBank Acquisition KF959651.1) were used as an outgroup and sister group respectively, and their rbcL sequences were obtained from GenBank.

Specimen and sample fixation

Diatom specimens or populations for morphological study in association with DNA were cleaned and mounted for light microscopy (LM) and scanning electron microscopy (SEM) validation. A population of Gyrosigma acuminatum (Kütz.) Rabenh. for example (n = 30) illustrated a natural size diminution series (Length, 106.5–163.5; width, 18.5–27.5; stria density, 17–19 μm; areola density, 17–18 in 10 μm, Figures 3A–E). Additional specimens fixed in RNAlater and frozen (−17°C) for 2 days maintained frustule integrity with cytoplasm and chloroplast structure (Figure 3F). Chloroplasts were intact however alterations in the structure were observed; in some specimens there was slight shrinkage apically and transapically, while others had poorly defined chloroplast walls. Cyanophyceae (e.g., Phormidium sp., Oscillatoria cf. princeps Vaucher ex Gomont) and Chlorophyceae [e.g., Oedigonium sp., Pediastrum boryanum (Turp.) Menegh.] cells also maintained cell structure. The Chlorophyceae had intact but sometimes slightly altered chloroplasts (pers. obs.)
Figure 3

. (A) Live specimen from NHC pond (sequenced for rbcL), low magnification showing valve outline, and cytoplasmic structure. (B–D) LM micrographs showing the size reduction series from one sample location. (E) SEM illustration the internal view of the valve. (F) LM image of an RNAlater fixed specimen after freezing at −20°C and thawing. Scale bar = 10 μm.

. (A) Live specimen from NHC pond (sequenced for rbcL), low magnification showing valve outline, and cytoplasmic structure. (B–D) LM micrographs showing the size reduction series from one sample location. (E) SEM illustration the internal view of the valve. (F) LM image of an RNAlater fixed specimen after freezing at −20°C and thawing. Scale bar = 10 μm.

Discussion

The development of novel DNA extraction protocols has accelerated the exploitation of microbial genetic studies in health (e.g., Richlen and Barber, 2005), environment (e.g., Neilan, 1995; Kermarrec et al., 2013), and even diatom taxonomic research (e.g., Evans et al., 2007; Pouličková et al., 2010). Nested DNA amplifications have the potential to open the genetic vault of taxonomic information from single microbial cells. The simple methodology of single cell Chelex® DNA extraction followed by nested PCR has great implications for expanding the genetic reference library of information in algal research. This study uses diatoms as test organisms; preliminary PCR success using dinoflagellates (pers. obs.) is also recorded. The 70–96% amplification success rates using live and (31–100%) using RNAlater fixed samples for single cell PCRs is similar to recovery rates using cultures (Lang and Kaczmarska, 2011). DNA polymerases of different quality and price points (BioShop Taq® and Phusion®) produced (93%) the same sequence results. Comparisons of sequences using NCBI BLAST also supported the morphological taxa identifications of the specimens. In this study, the systematic associations of Gyrosigma to Navicula, Craticula to Stauroneis, and Pinnularia to Sellaphora (Figures 1, 2) were in agreement with other studies (Evans et al., 2007; Theriot et al., 2010). This success rate increases the utility of conserved genes known to be good for species level taxonomic discriminations (Hamsher et al., 2011). In this study, one existing nested primer set for rbcL was used while two new primer pairs were developed for the nested amplification of 18S and psbA in diatoms. The recovered sequence lengths for rbcL and 18S were within the average range of the lengths of diatom sequences found on GenBank while psbA sequence lengths were slightly below the average gene length found for diatom sequences (Table 4). One single cell DNA extraction provided enough amplification template for multiple DNA amplifications making the approach compatible for robust multi-gene analyses. In this study, 10 amplifications from a single cell extract were successfully performed. Based upon the replicated amplifications done, a conservative estimate suggests there could be enough template for 30–35 PCR amplifications per extraction. There is even potential that nested amplifications can be used to generate large-scale genetic datasets using next generation sequencing protocols optimized for low concentration DNA templates (e.g., initial amplification using the Multiple Displacement Amplification reaction, (Lasken, 2007); but see (Ning et al., 2014) for a review of current challenges). Some research suggests that the Chelex extraction method, which also conserves proteins, may not be the most suitable for Multiple Displacement Amplification reactions. However, one study found that keeping proteins was not an issue for whole gene amplification (Lepere et al., 2011). One concern related to using this technique is the potential for contamination during isolation of a single cell. Often diatom cells, like taxa in the genera Asterionella Hassall, Tabellaria Ehrenberg and Surirella, have epiphytes which may contaminate amplifications. In addition, single cells often have organic extracellular polymeric substances (EPS) which capture bacteria, fungi, and loose organics with remnant DNA (Das et al., 2013). Problems related to the amplification of DNA from non-target sources were observed in this study. In the Lugol's fixed samples, 18S nuclear DNA recovery from fungi was amplified in 13 samples. In this instance, the contamination problem was identified by BLASTing the sequencing product. Contamination from non-algae sources can be easily identified using this protocol and removed from study. Since contamination was only observed in selected Lugol's samples, we can conclude that with good isolations, fungi can be effectively removed from sample concerns using fresh and freshly fixed material. Contamination from more genetically associated algal sources maybe more problematic given the potential for cross-contamination during amplifications (Zhang et al., 1992; Ruck et al., 2014) and a limited genetic reference library available for comparison. However, in this study there was no evidence of contamination. Time consuming recovery of DNA from micro-organism cultures has limited the success of barcoding in the Protista. Ruck and Theriot (2011) developed a single cell diatom field isolation and rbcL DNA extraction procedure using Chelex. This approach is effective, although limited by time requirements for isolating cells in the field and no ability to reinvestigate the samples collected. Collecting live and fixed samples in the field for subsequent isolation back in the lab gives greater success in DNA recovery. Using this single cell extraction protocol alone, or with a limited number of replicated daughter cells (short-term cultures), will greatly increase the database of DNA sequences for diatoms and microbes. In cell sequencing, there are inherent problems with the destruction of the voucher specimen (Figure A1). Although reported that diatom valves can be recovered after DNA extraction in ethanol (Lang and Kaczmarska, 2011), the glass beads in the Chelex solution destroy diatom valves after centrifugation (pers. obs.). To limit the possibility of erroneous specimen identifications, many photomicrographs were taken of each single cell prior to DNA extraction. These were linked to morphological vouchers (cleaned diatom valves) collected from the same respective population. For example, diatom specimens of G. acuminatum were matched using DNA results and recoverable specimens from the natural population (Figure 3, Figure A1). To further reduce identification errors, replication of DNA sequencing results with comparison to more morphological specimens from a population can improve the validation of species identifications in association with both genetic and morphological variability. In the case of G. acuminatum, these gene sequences can be directly related to a morphological study of the species type (Sterrenberg, 1995). However, this approach can be highly problematic when taxa within a community have overlapping morphological characteristics. In these cases, additional detailed multi-gene studies of populations could enhance the resolution and identification of cryptic species (e.g., Pouličková et al., 2010).
Figure A1

Isolation photos of sequenced taxa aligned to their placement in Figure . Bolidomonas pacifica (GenBank Acquisition HQ912421.1) and Cyclotella meneghiniana (GenBank Acquisition KF959651.1) are not pictured as their sequences were used as an out group and sister group from GenBank respectively. The isolated taxa are aligned as in Figure 2 and shown with names and identifier. 1, Melosira varians I7R14; 2, Melosira varians I4R14; 3, Aulacoseira granulata I6R14; 4, Aulacoseira granulata I3R14; 5, Synedra sp. C7R5; 6, Eunotia sp. D2R13; 7, Nitschia cf. sigmoidea G2R12; 8, Nitschia linearis var. tenuis D8R13; 9, Gyrosigma acuminatum E5R10; 10, Gyrosigma acuminatum G3R9; 11, Gyrosigma acuminatum G6R9; 12, Gyrosigma acuminatum G5R9; 13, Gyrosigma acuminatum F8R9; 14, Gyrosigma acuminatum A5R10; 15, Gyrosigma acuminatum B3R10; 16, Gyrosigma acuminatum A6R10; 17, Gyrosigma acuminatum A7R10; 18, Navicula sp. C5R6; 19, Navicula sp. B8R6; 20, Navicula sp. B7R6; 21, Navicula sp. B5R6; 22, Navicula sp. B4R6; 23, Navicula sp. B1R6; 24, Navicula sp. B6R6; 25, Navicula cf. cryptocephala A8R6; 26, Navicula sp. C6R6; 27, Navicula sp. D1R6; 28, Navicula sp. C1R6; 29, Navicula sp. E5R6; 30, Navicula sp. E3R6; 31, Navicula sp. D3R6; 32, Navicula sp. C3R6; 33, Navicula sp. D5R6; 34, Sellaphora sp. F6R4; 35, Sellaphora cf. laevissima B2R14; 36, Gomphonema cf. parvulum A6R8; 37, Encyonema sp. B8R13; 38, Cymbella sp. H1R12; 39, Cymbopleura subcuspidata H4R12; 40, Cymbopleura subcuspidata D7R13; 41, Cymbopleura subcuspidata D6R13; 42, Pinnularia sp. E3R8; 43, Pinnularia sp. A8R10; 44, Pinnularia sp. A2R10; 45, Pinnularia sp. B1R10; 46, Pinnularia sp. A1R10; 47, Pinnularia sp. A3R10; 48, Pinnularia sp. A4R10; 49, Pinnularia sp. B5R10; 50, Pinnularia sp. B1R7; 51, Surirella sp. A7R14; 52, Surirella angusta B6R13; 53, Cymatopleura solea B2R10; 54, Cymatopleura solea G3R5; 55, Cymatopleura solea C2R5; 56, Neidium sp. G3R12; 57, Neidium tumescens C7R12; 58, Neidium tumescens A7R12; 59, Neidium sp. D1R12; 60, Frustulia bahlsii C6R13; 61, Frustulia bahlsii A5R13; 62, Frustulia bahlsii C5R13; 63, Frustulia saxonica B5R13; 64, Stauroneis cf. gracilis A6R6; 65, Stauroneis cf. gracilis B8R5; 66, Stauroneis cf. gracilis A5R6; 67, Stauroneis cf. anceps B3R6; 68, Craticula cf. cuspidata C4R6; 69, Craticula cf. cuspidata C2R6; 70, Craticula cf. cuspidata A3R6; 71, Craticula cf. cuspidata A2R6; 72, Craticula cf. cuspidata A1R6; 73, Craticula cuspidata C6R7.

Traditional fixation of microbial samples for morphological, ecological and physiological study has a long history (Throndsen, 1978; Simmons, 2014). Recent studies have demonstrated that gene sequences can be recovered from a variety of traditionally fixed samples (Connell, 2002; Godhe et al., 2002; Henrichs et al., 2007; Lang and Kaczmarska, 2011). Ethanol fixed samples are not commonly found in microbial museum collections because they are subject to evaporation and have negative extraction effects on cell pigments. In the current study, recovery of gene product from Lugol's fixed samples was poor 7.5 and 68% for rbcL and 18S genes respectively. With these product recoveries sequence success was reduced further with many isolates have low quality sequence results which were rejected. Only one psbC amplification and sequence determination was attempted and successful. No success was observed from formalin-fixed samples, although we did not immediately transfer formalin-fixed material into methanol storage as described by Godhe et al. (2002). Although Bertozzini et al. (2005), and Auinger et al. (2008) have successfully recovered DNA from dinoflagellates and chrysophyte fixed with Lugol's solution, we need more detailed methodological studies to improve percent success in the routine recover of DNA from diatoms in Lugol's and formalin fixation. However, Godhe et al. (2002) suggest that Lugol's solution and varying ethanol fixations have other shortcomings. In this study we also noted the presence of fungi in museum collections fixed with Lugol's, a potential problem with historically fixed collections. At present RNAlater represents an excellent genetic fixation protocol for sample collection, short term storage and long-term archiving of microbial collections. Cell wall structure in Chlorophyceae (Oedigonium sp., Pediastrum boryanum), Cyanophyceae (Phormidium sp., Oscillatoria cf. princeps) and diatoms were maintained in RNAlater fixed samples under cold and frozen conditions. Chloroplast integrity was even maintained for G. acuminatum during freezing at −17°C (Figure 3F). The specifications for RNAlater indicate that treated tissues can be stored at 25°C for 1 week, at 4°C for 1 month, or at −20°C indefinitely (Ambion, 1999). In this study, both fresh and RNAlater fixed samples had predictable extraction success (Table 5). This supports the adoption of RNAlater as a long-term diatom storage media. The recovery of DNA from archived museum and research collections is currently poor but quickly advancing, especially with vertebrate collections (e.g., Payne and Sorenson, 2002). However, museums and large collections should prioritize the implementation of storage and fixation techniques that maintain the molecular integrity of the samples. RNAlater preserved algae, including diatoms, subjected to freeze-thaw cycles showed some internal cell cytoplasmic alterations; however the chloroplasts and associated pyrenoids remained intact. RNAlater represents a good alternative for specimen, tissue and single cell preservation. DNA barcoding can help with species delimitation and refining the concept of cryptic species. For example in this rudimentary study with a small population of G. acuminatum (Figures 1, 3), gene sequences for rbcL showed 1–5 base pairs differences between the four specimen clades, collected from three different sites within our primary pond (NHC-1). C. cf. cuspidata showed no variability (no base pair differences) in specimens from another sample site. In contrast, up to 117 bp differences were observed within the Navicula clade from three different locations within a lake and up to 88 bp differences were noted in the Pinnularia clade from four different lakes and pond locations. These results suggest that by expanding the use of barcodes to many individuals within a diatom population, inter- and intraspecific questions can be routinely addressed. Historical problems in extracting, amplifying, and sequencing DNA from single-cells have limited the development of genetics as a tool in the study of global microbial diversity, biogeography, and physiology. In diatoms, DNA sequencing from single cells is a logical step forward in population, taxonomy and environmental genetic studies. More conventional morphometric studies routinely use sample populations to determine size diminution series and variability of morphological expression (e.g., Lange-Bertalot et al., 2011; Levkov et al., 2013). With detailed genetic studies of single cells, links to match morphological populations will be informative in understanding variations in genotypic and phenotypic expression. At the species level, single cell multi-gene sequences along with associated morphometrics can act as multi-proxy validation datasets for species identifications. Future developments with single cell sequencing may even advance next generation genomic research.

Author contributions

PH funded the project, developed in collaboration with RB the experimental design, identified the development with fixed samples, initiated the expansion to multiple gene and extractions, completed cell isolations and wrote the drafts of the manuscript. KL conducted cell isolations and the majority of the sequencing, edited and corrected the manuscript. This author developed new ideas on recovery of viable DNA, produced the figures and tables, wrote most of the methods and results. RB conducted the initial work in developing the single cell isolation protocol for the Canadian Museum of Nature Laboratory of Molecular Biodiversity, developed new primers for the nested amplification, experimental ideas for the development of the manuscript and contributed extensively to the writing and final editing of the manuscript.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  34 in total

1.  Biological identifications through DNA barcodes.

Authors:  Paul D N Hebert; Alina Cywinska; Shelley L Ball; Jeremy R deWaard
Journal:  Proc Biol Sci       Date:  2003-02-07       Impact factor: 5.349

2.  Purification of bacterial genomic DNA in less than 20 min using chelex-100 microwave: examples from strains of lactic acid bacteria isolated from soil samples.

Authors:  Lourdes Reyes-Escogido; Mario Balam-Chi; Ingrid Rodríguez-Buenfil; Jesús Valdés; Luis Kameyama; Francisco Martínez-Pérez
Journal:  Antonie Van Leeuwenhoek       Date:  2010-06-17       Impact factor: 2.271

3.  The evolutionary diversification of cyanobacteria: molecular-phylogenetic and paleontological perspectives.

Authors:  Akiko Tomitani; Andrew H Knoll; Colleen M Cavanaugh; Terufumi Ohno
Journal:  Proc Natl Acad Sci U S A       Date:  2006-03-28       Impact factor: 11.205

4.  Reproductive isolation among sympatric cryptic species in marine diatoms.

Authors:  Alberto Amato; Wiebe H C F Kooistra; Jung Hee Levialdi Ghiron; David G Mann; Thomas Pröschold; Marina Montresor
Journal:  Protist       Date:  2006-12-04

5.  AWTY (are we there yet?): a system for graphical exploration of MCMC convergence in Bayesian phylogenetics.

Authors:  Johan A A Nylander; James C Wilgenbusch; Dan L Warren; David L Swofford
Journal:  Bioinformatics       Date:  2007-08-30       Impact factor: 6.937

6.  Cryptic diversity in a cosmopolitan diatom known as Asterionellopsis glacialis (Fragilariaceae): Implications for ecology, biogeography, and taxonomy.

Authors:  Irena Kaczmarska; Laura Mather; Ian A Luddington; France Muise; James M Ehrman
Journal:  Am J Bot       Date:  2014-02-08       Impact factor: 3.844

7.  Barcoding diatoms: exploring alternatives to COI-5P.

Authors:  Sarah E Hamsher; Katharine M Evans; David G Mann; Aloisie Poulíčková; Gary W Saunders
Journal:  Protist       Date:  2011-01-15

8.  The single, ancient origin of chromist plastids.

Authors:  Hwan Su Yoon; Jeremiah D Hackett; Gabriele Pinto; Debashish Bhattacharya
Journal:  Proc Natl Acad Sci U S A       Date:  2002-11-15       Impact factor: 11.205

9.  DNA genotyping of unbuffered formalin fixed paraffin embedded tissues.

Authors:  Beatrice Legrand; Philippe de Mazancourt; Michel Durigon; Véronique Khalifat; Karine Crainic
Journal:  Forensic Sci Int       Date:  2002-02-18       Impact factor: 2.395

10.  Ten species in one: DNA barcoding reveals cryptic species in the neotropical skipper butterfly Astraptes fulgerator.

Authors:  Paul D N Hebert; Erin H Penton; John M Burns; Daniel H Janzen; Winnie Hallwachs
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-01       Impact factor: 11.205

View more
  4 in total

1.  Microbiomes of bloom-forming Phaeocystis algae are stable and consistently recruited, with both symbiotic and opportunistic modes.

Authors:  Margaret Mars Brisbin; Satoshi Mitarai; Mak A Saito; Harriet Alexander
Journal:  ISME J       Date:  2022-06-28       Impact factor: 11.217

2.  The structure and diversity of freshwater diatom assemblages from Franz Josef Land Archipelago: a northern outpost for freshwater diatoms.

Authors:  Sergi Pla-Rabés; Paul B Hamilton; Enric Ballesteros; Maria Gavrilo; Alan M Friedlander; Enric Sala
Journal:  PeerJ       Date:  2016-02-18       Impact factor: 2.984

Review 3.  Paleolimnology and resurrection ecology: The future of reconstructing the past.

Authors:  David R L Burge; Mark B Edlund; Dagmar Frisch
Journal:  Evol Appl       Date:  2017-12-14       Impact factor: 5.183

4.  High planktonic diversity in mountain lakes contains similar contributions of autotrophic, heterotrophic and parasitic eukaryotic life forms.

Authors:  Rüdiger Ortiz-Álvarez; Xavier Triadó-Margarit; Lluís Camarero; Emilio O Casamayor; Jordi Catalan
Journal:  Sci Rep       Date:  2018-03-13       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.