| Literature DB >> 26823635 |
J B Stielow1, C A Lévesque2, K A Seifert2, W Meyer3, L Iriny3, D Smits1, R Renfurm1, G J M Verkley1, M Groenewald1, D Chaduli4, A Lomascolo5, S Welti6, L Lesage-Meessen4, A Favel5, A M S Al-Hatmi7, U Damm8, N Yilmaz9, J Houbraken1, L Lombard1, W Quaedvlieg1, M Binder1, L A I Vaas10, D Vu1, A Yurkov11, D Begerow12, O Roehl12, M Guerreiro13, A Fonseca13, K Samerpitak14, A D van Diepeningen1, S Dolatabadi15, L F Moreno16, S Casaregola17, S Mallet17, N Jacques17, L Roscini18, E Egidi19, C Bizet20, D Garcia-Hermoso20, M P Martín21, S Deng22, J Z Groenewald1, T Boekhout23, Z W de Beer24, I Barnes25, T A Duong25, M J Wingfield24, G S de Hoog26, P W Crous27, C T Lewis2, S Hambleton2, T A A Moussa28, H S Al-Zahrani29, O A Almaghrabi29, G Louis-Seize2, R Assabgui2, W McCormick2, G Omer1, K Dukik1, G Cardinali18, U Eberhardt30, M de Vries1, V Robert1.
Abstract
The aim of this study was to assess potential candidate gene regions and corresponding universal primer pairs as secondary DNA barcodes for the fungal kingdom, additional to ITS rDNA as primary barcode. Amplification efficiencies of 14 (partially) universal primer pairs targeting eight genetic markers were tested across > 1 500 species (1 931 strains or specimens) and the outcomes of almost twenty thousand (19 577) polymerase chain reactions were evaluated. We tested several well-known primer pairs that amplify: i) sections of the nuclear ribosomal RNA gene large subunit (D1-D2 domains of 26/28S); ii) the complete internal transcribed spacer region (ITS1/2); iii) partial β -tubulin II (TUB2); iv) γ-actin (ACT); v) translation elongation factor 1-α (TEF1α); and vi) the second largest subunit of RNA-polymerase II (partial RPB2, section 5-6). Their PCR efficiencies were compared with novel candidate primers corresponding to: i) the fungal-specific translation elongation factor 3 (TEF3); ii) a small ribosomal protein necessary for t-RNA docking; iii) the 60S L10 (L1) RP; iv) DNA topoisomerase I (TOPI); v) phosphoglycerate kinase (PGK); vi) hypothetical protein LNS2; and vii) alternative sections of TEF1α. Results showed that several gene sections are accessible to universal primers (or primers universal for phyla) yielding a single PCR-product. Barcode gap and multi-dimensional scaling analyses revealed that some of the tested candidate markers have universal properties providing adequate infra- and inter-specific variation that make them attractive barcodes for species identification. Among these gene sections, a novel high fidelity primer pair for TEF1α, already widely used as a phylogenetic marker in mycology, has potential as a supplementary DNA barcode with superior resolution to ITS. Both TOPI and PGK show promise for the Ascomycota, while TOPI and LNS2 are attractive for the Pucciniomycotina, for which universal primers for ribosomal subunits often fail.Entities:
Keywords: DNA barcoding; ITS supplement; molecular taxonomy; phylogeny; species identification; universal primers
Year: 2015 PMID: 26823635 PMCID: PMC4713107 DOI: 10.3767/003158515X689135
Source DB: PubMed Journal: Persoonia ISSN: 0031-5850 Impact factor: 11.051
Overview on datasets and selected taxa.
| Dataset | Family | Main taxa | Quantity | Source, Collaborator |
|---|---|---|---|---|
| 96 | CBS, Houbraken | |||
| 96 | CBS, Yilmaz | |||
| 30 | Inst. Pasteur, Hermoso | |||
| 22 | Westmead Hospital, Meyer & Irinyi | |||
| Onygenales | 180 | CBS, Dukik, Moreno, De Hoog, Stielow | ||
| Ascomycetous yeast I | 207 | Westmead Hospital, Meyer & Irinyi | ||
| Ascomycetous yeast II | 96 | CBS, Groenewald, Boekhout | ||
| Ascomycetous yeast III | 24 | CIRM-Levures, INRA, Casaregola, Mallet & Jacques | ||
| Ascomycetous yeast IV | 83 | Microbiology Perugia, Cardinali & Roscini | ||
| 50 | Senkenberg Museum Goerlitz, Damm | |||
| 96 | CBS, Lombard | |||
| 96 | CBS, Al-Hatmi, Van Diepeningen | |||
| 50 | CBS, Samerpitak | |||
| 21 | FABI, Wingfield, De Beer, Barnes & Duong | |||
| 96 | CBS, Verkley & Stielow | |||
| 45 | CBS, Quaedvlieg | |||
| 90 | CBS, Egidi, Binder & Quaedvlieg | |||
| 20 | INRA, Chaduli, Lomascolo, Welti & Lesage-Meessen | |||
| 47 | Museum für Naturkunde Stuttgart, Eberhardt | |||
| Gasteromycetes | 32 | Royal Botanic Garden Madrid, Martin | ||
| Basidiomycetous yeast I | 98 | Westmead Hospital, Meyer & Irinyi | ||
| Basidiomycetous yeast II | 174 | DSMZ & Ruhr University Bochum, Yrukov & Begerow | ||
| Multi-Phyla | 182 | Agriculture and Agri-Food Canada (AAFC), Ottawa | ||
Benchmarking primers used to assess performance and versatility of newly designed primers.
| Locus | Primer | Oligo nucleotides (5’-3’) | Reference |
|---|---|---|---|
| ITS | ITS5 | GGAAGTAAAAGTCGTAACAAGG | |
| ITS4 | TCCTCCGCTTATTGATATGC | ||
| ITS1 | TCC GTA GGT GAA CCT GCG G | ||
| ITS1-F | CTT GGT CAT TTA GAG GAA GTA A | ||
| LSU | LROR | ACCCGCTGAACTTAAGC | |
| LR5 | TCCTGAGGGAAACTTCG | ||
| Btub2Fd | GTBCACCTYCARACCGGYCARTG | ||
| Btub4Rd | CCRGAYTGRCCRAARACRAAGTTGTC | ||
| Bsens | ATCACWCACTCICTIGGTGGTGG | ||
| Brev | CATGAAGAARTGIAGACGIGGG | ||
| ACT512f | ATGTGCAAGGCCGGTTTCG | ||
| ACT783r | TACGAGTCCTTCTGGCCCAT | ||
| CA5R | GTGAACAATGGATGGACCAGATTCGTCG | ||
| CA14 | AACTGGGATGACATGGAGAAGATCTGGC | ||
| fRPB2-5f | GAY GAY MGW GAT CAY TTY GG | ||
| fRPB2-7cF | ATG GGY AAR CAA GCY ATG GG | ||
| fRPB2-7cR | CCC ATR GCT TGY TTR CCC AT | ||
| EF1-983F | GCY CCY GGH CAY CGT GAY TTY AT | ||
| EF1-1567R | ACH GTR CCR ATA CCA CCR ATC TT |
Initial seed primers from ‘nucleic acid based’ computational predictions (Robert et al. 2011).
| Alignment group | Species in alignment (Al) | Primers (forward-reverse) | Functional annotation |
|---|---|---|---|
| 1 | 51 | acaagcgtttct-catcaagttcca | Hypothetical protein |
| 2 | 41 | acatggagaaga-catcaaggagaa | Gamma actin |
| 3 | 44 | accttcttgatg-atgttcttgatg | Translation elongation factor 1α |
| 3b | 46 | caagaacatgat-catcaagaaggt | Translation elongation factor 1α |
| 4 | 38 | agtacttgtagg-cttggccttgta | 60S ribosomal protein L15b |
| 5 | 51 | ggaacttgatgg-agaaacgcttgt | 60S ribosomal protein L15b |
| 6 | 45 | ggtatcaccatc-caacaagatgga | Translation elongation factor 1α |
| 7 | 45 | tccatcttgttg-gatggtgatacc | Translation elongation factor 1α |
| 8 | 36 | gtccatcttgtt-tacttgaaggaa | Translation elongation factor 1α |
| 9 | 56 | gttcttggagtc-gtccatcttgtt | Translation elongation factor 1α |
| 10 | 56 | aacaagatggac-ctccaagaacga | Translation elongation factor 1α |
| 11 | 41 | aacaagatggac-tcaccactgaag | Translation elongation factor 1α |
| 11b | 42 | tggtatctccaa-aacgtcaagaac | Translation elongation factor 1α |
| 12 | 58 | caacaagatgga-ctccaagaacga | Translation elongation factor 1α |
| 13 | 42 | cacttcttcatg-atggacgagatg | 60S ribosomal protein L15b |
| 14 | 37 | catatgcttgtc-gactcgtcatct | Putative CD box sno-RNA protein,putative protein S6 kinase |
| 15 | 43 | cttcagtggtga-ccatcttgttga | Translation elongation factor 1α |
| 16 | 51 | gaacttgatggt-agaaacgcttgt | Hypothetical protein |
| 17 | 36 | gttccttcaagt-caacaagatgga | Hypothetical protein |
| 18 | 36 | tccatcttgttg-acttgaaggaac | Translation elongation factor 1α |
| 19 | 48 | tcttgacgttga-gtccatcttgtt | Translation elongation factor 1α |
| 20 | 48 | ttcttgacgttg-gtccatcttgtt | Translation elongation factor 1α |
| 21 | 39 | tcgttcttggag-tcttgatgaagt | Translation elongation factor 1α |
| 21b | 53 | ttcttggagtca-tccatcttgttg | Translation elongation factor 1α |
| 22 | 45 | ttcttgacgttg-catgttcttgat | Translation elongation factor 1α |
| 23 | 37 | ttccttcaagta-caacaagatgga | Translation elongation factor 1α |
| 24 | 45 | ttcatcaagaac-tcaacgtcaaga | Translation elongation factor 1α |
| 25 | 41 | ttcagtggtgac-atcatgttcttg | Translation elongation factor 1α |
| 26 | 39 | tccttgatttcg-cttcatgacctt | Translation elongation factor 1α |
| 27 | 37 | tcaagaaggtcg-ctccaagaacga | Translation elongation factor 1α |
| 29 | 40 | caacaagatgga-aagttcatcaag | Translation elongation factor 1α |
| 30 | 47 | gttcttgacgtt-gtccatcttgtt | Translation elongation factor 1α |
| 31 | 48 | aacaagatggac-caacgtcaagaa | Translation elongation factor 1α |
| 32 | 39 | gacttgatgaac-ccatcttgttga | Translation elongation factor 1α |
| 35 | 40 | catcaagaaggt-gactccaagaac | Translation elongation factor 1α |
| 36 | 40 | atcaagaacatg-tcaccactgaag | Translation elongation factor 1α |
| 37 | 51 | caagcgtttctc-ccatcaagttcc | Translation elongation factor 1α |
| 38 | 39 | atctccaaggat-ctccaagaacga | Translation elongation factor 1α |
| 39 | 52 | atcaagaacatg-gactccaagaac | Translation elongation factor 1α |
| 40 | 45 | atcaagaacatg-tcaacgtcaaga | Translation elongation factor 1α |
| 41 | 40 | acttgatgaact-tccatcttgttg | Translation elongation factor 1α |
| 42 | 37 | acttcatcaaga-tcaacgtcaaga | Translation elongation factor 1α |
| 43 | 40 | acaagatggaca-gactccaagaac | Translation elongation factor 1α |
| 44 | 38 | aacaagatggac-aagttcatcaag | Translation elongation factor 1α |
| 45 | 39 | aaggtcatgaag-cgaaatcaagga | Translation elongation factor 2 |
| 46 | 42 | catctcgtccat-catgaagaagtg | Beta tubulin 2 |
| 48 | 39 | tccttgatttcg-ccatcttggaga | Translation elongation factor 3 |
| 53 | 35 | ggtatcaccatc-ctccaagaacga | Translation elongation factor 1α |
| 54 | 40 | ttcatcaagaac-tcaccactgaag | Translation elongation factor 1α |
Seed primer sequences highlighted bold, were those qualifying for the last and final cross-laboratory trial.
Initial test cultures used for primary laboratory trial I for primers inferred from ‘nucleic acid based’ computational predictions (Robert et al. 2011).
| CBS number | Taxon |
|---|---|
| CBS 513.88 | |
| CBS 818.72 | |
| CBS 1954 | |
| CBS 115846 | |
| CBS 123668 | |
| CBS 445.79 | |
| CBS 277.49 | |
| FGSC 9596 | |
| CBS 708.71 | |
| FGSC 10004 | |
| CBS 405.96 | |
| CBS 142.95 | |
| CBS 109036 | |
| CBS 127170 | |
| CBS 115943 |
Initial test cultures used for secondary laboratory trial II for primers inferred from ‘nucleic acid based’ computational predictions (Robert et al. 2011).
| CBS number | Taxon |
|---|---|
| CBS 674.68 | |
| CBS 118699 | |
| CBS 131.61 | |
| CBS 126972 | |
| FGSC 1144 | |
| FGSC 1156 | |
| CBS 136.29 | |
| CBS 114389 | |
| CBS 8758 | |
| CBS 113850 | |
| CBS 113843 | |
| CBS 126970 | |
| CBS 8710 | |
| FGSC 9075 | |
| FGSC 9935 | |
| CBS 123670 | |
| FGSC 1089 | |
| CBS 287.54 | |
| CBS 2605 | |
| CBS 113480 | |
| CBS 180.27 | |
| CBS 223.38 | |
| CBS 101191 | |
| CBS 372.73 | |
| CBS 127171 | |
| FGSC 9002 | |
| CBS 6054 | |
| CBS 126969 | |
| CBS 120258 | |
| CBS 117146 | |
| CBS 128304 | |
| CBS 658.66 | |
| CBS 126971 | |
| CBS 124811 | |
| CBS 7116 | |
| CBS 484 | |
| CBS 208.27 | |
| CBS 375.48 | |
| CBS 693.94 | |
| CBS 392.92 | |
| CBS 383.78 | |
| CBS 127.97 | |
| CBS 668.78 | |
| CBS 127172 | |
| CBS 127169 | |
| CBS 599 | |
| CBS 732 |
Super primers and best candidate primers inferred from ‘nucleic acid based’ computational predictions (Robert et al. 2011).
| Locus | Original primer name | Final primer name | Sequence (5’-3’) |
|---|---|---|---|
| Al33_54_73_F1_forward | EF1-1018F | GAYTTCATCAAGAACATGAT | |
| Al33_879_859_R2_reverse | EF1-1620R | GACGTTGAADCCRACRTTGTC | |
| Al_34_EF1_300_F1_forward | EF1-1002F | TTCATCAAGAACATGAT | |
| AL34_EF1_1050_R_Tail_reverse | EF1-1688R | GCTATCATCACAATGGACGTTCTTGGAG | |
| Al33_129_148_F2_forward | Al33_alternative_f | GARTTYGARGCYGGTATCTC | |
| Al28_EF1_400_f | EF1_alterantive_3f | TTYGARGCYGGTATCTC | |
| Al28_EF1_900_R | EF1_alternative_3r | GAVACRTTCTTGACGTTGAA | |
| AlGr52_412-433_f1 | 60S-908R | CTTVAVYTGGAACTTGATGGT | |
| Algr52_1102_1084_R1 | 60S-506F | GHGACAAGCGTTTCTCNGG | |
| Al50+51_EF3_2900_f | EF3-3185F | TCYGGWGGHTGGAAGATGAAG | |
| Al50+51_EF3_3300_R | EF3-3538R | YTTGGTCTTGACACCNTC | |
| Al47_EF3_1650_forward | EF3-3188F | GGHGGHTGGAAGATGAAG | |
| A47_EF3_2451_R1_reverse | EF3-3984R | TCRTAVSWGTTCTTGAACTT | |
| Al49_ EF3_44_63_F1_forward | EF3-3186F | CYGGHGGHTGGAAGATGAAG | |
| Al49_ EF3_846_829_r1_reverse | EF3-3984R2 | TCRTAVSWGTTCTTGAAC |
Super primers inferred from ‘protein-based’ computational predictions (Lewis et al. 2011).
| Locus | Original primer name | Final primer name | Sequence (5’ to 3’) |
|---|---|---|---|
| PF00162.1120.M13-8-F | PGK_480-F | ||
| PF00162.2081.M13-8-R | PGK_480-R | ||
| PF00162.1433.M13-8-F | PGK_483-F | ||
| PF00162.1793.M13-8-R | PGK_483-R | ||
| PF00162A.675.1-F | PGK_511-F | GTYGSTGCYYTGCCMACCATCAA | |
| PF00162A.1915.1-R | PGK_511-R | ATCTTGTCRGMRACCTTRGCACC | |
| PF00162.1127.1-F | PGK_533-F | GTYGAYTTCAAYGTYCC | |
| PF00162.2081.1-R | PGK_533-R | ACACCDGGDGGRCCGTTCCA | |
| PF02919.2708.M13.1-F | TOP1_501-F | ||
| PF02919.3469.M13.8-R | TOP1_501-R | ||
| PF08235.1463.8-F | LNS2_468-F | GGCCATGTGCTGAACATGATCGGHCGWGAYTGGAC | |
| PF08235.1821.8-R | LNS2_468-R | CGGTTGCCRAAKCCRGCATAGAAKGG |
Bases in bold: M13 primers with an ACGAT spacer for the forward primers.
Fig. 1Structural gene map and positional primer locations for fungal rDNA array.
Fig. 2Structural gene map and positional primer locations for fungal β-tubulin 2.
Fig. 3Structural gene map and positional primer locations for γ-actin.
Fig. 4Structural gene map and positional primer locations for fungal RNA polymerase subunit II.
Fig. 5Structural gene map and positional primer locations for fungal translation elongation factor-1α.
Fig. 6Structural gene map and positional primer locations for fungal translation elongation factor 3.
Fig. 7Structural gene map and positional primer locations for fungal 60S ribosomal protein L 10 (L1).
Fig. 8Structural gene map and positional primer locations for fungal TOP1 primarily based on Penicillium chrysogenum genome.
Fig. 9Structural gene map and positional primer locations for fungal PGK primarily based on Penicillium chrysogenum genome.
Fig. 10Structural gene map and positional primer locations for fungal LNS2 primarily based on Penicillium chrysogenum genome.
Fig. 11Mosaic plot, indicating proportions of success rates between datasets (a, b: complete genome approach; c, d: Pfam approach) and within tested primer pairs.
Fig. 12Barcode gap analysis for ITS, PGK, TOP1, LNS2, TUB2 and TEF1.
Fig. 13Multi-dimensional scaling of rescaled cophenetic correlation coefficients comparing distance matrices.
Rescaled cophenetic correlation coefficients from Mantel test comparing distance matrices.
| Gene/Dataset | Gene | ||||||
|---|---|---|---|---|---|---|---|
| 60S | Actin | All | ITS | LSU | |||
|
| |||||||
| 60S | 1 | 0.869922 | 0.945842 | 0.665818 | 0.741716 | 0.878008 | 0.828932 |
| Actin | 0.869922 | 1 | 0.86677 | 0.718523 | 0.709779 | 0.833556 | 0.7986 |
| All | 0.945842 | 0.86677 | 1 | 0.703723 | 0.801272 | 0.947347 | 0.879415 |
| 0.665818 | 0.718523 | 0.703723 | 1 | 0.58968 | 0.656179 | 0.654457 | |
| 0.741716 | 0.709779 | 0.801272 | 0.58968 | 1 | 0.739432 | 0.751655 | |
| ITS | 0.878008 | 0.833556 | 0.947347 | 0.656179 | 0.739432 | 1 | 0.822984 |
| LSU | 0.828932 | 0.7986 | 0.879415 | 0.654457 | 0.751655 | 0.822984 | 1 |
| 60S | All | ITS | LSU | ||||
|
| |||||||
| 60S | 1 | 0.833529 | 0.748277 | 0.564056 | 0.717349 | 0.818429 | |
| All | 0.833529 | 1 | 0.770656 | 0.773935 | 0.766443 | 0.737515 | |
| 0.748277 | 0.770656 | 1 | 0.502797 | 0.616714 | 0.687664 | ||
| 0.564056 | 0.773935 | 0.502797 | 1 | 0.523543 | 0.532899 | ||
| ITS | 0.717349 | 0.766443 | 0.616714 | 0.523543 | 1 | 0.645249 | |
| LSU | 0.818429 | 0.737515 | 0.687664 | 0.532899 | 0.645249 | 1 | |
| 60S | All | ITS | LSU | ||||
|
| |||||||
| 60S | 1 | 0.678678 | 0.621512 | 0.68173 | 0.88979 | 0.748801 | |
| All | 0.678678 | 1 | 0.77138 | 0.754324 | 0.698097 | 0.694224 | |
| 0.621512 | 0.77138 | 1 | 0.572193 | 0.68485 | 0.635149 | ||
| 0.68173 | 0.754324 | 0.572193 | 1 | 0.724653 | 0.670825 | ||
| ITS | 0.88979 | 0.698097 | 0.68485 | 0.724653 | 1 | 0.778558 | |
| LSU | 0.748801 | 0.694224 | 0.635149 | 0.670825 | 0.778558 | 1 | |