| Literature DB >> 24728005 |
Daniel P Tonge1, Catherine H Pashley2, Timothy W Gant3.
Abstract
Next generation sequencing technology has revolutionised microbiology by allowing concurrent analysis of whole microbial communities. Here we developed and verified similar methods for the analysis of fungal communities using a proton release sequencing platform with the ability to sequence reads of up to 400 bp in length at significant depth. This read length permits the sequencing of amplicons from commonly used fungal identification regions and thereby taxonomic classification. Using the 400 bp sequencing capability, we have sequenced amplicons from the ITS1, ITS2 and LSU fungal regions to a depth of approximately 700,000 raw reads per sample. Representative operational taxonomic units (OTUs) were chosen by the USEARCH algorithm, and identified taxonomically through nucleotide blast (BLASTn). Combination of this sequencing technology with the bioinformatics pipeline allowed species recognition in two controlled fungal spore populations containing members of known identity and concentration. Each species included within the two controlled populations was found to correspond to a representative OTU, and these OTUs were found to be highly accurate representations of true biological sequences. However, the absolute number of reads attributed to each OTU differed among species. The majority of species were represented by an OTU derived from all three genomic regions although in some cases, species were only represented in two of the regions due to the absence of conserved primer binding sites or due to sequence composition. It is apparent from our data that proton release sequencing technologies can deliver a qualitative assessment of the fungal members comprising a sample. The fact that some fungi cannot be amplified by specific "conserved" primer pairs confirms our recommendation that a multi-region approach be taken for other amplicon-based metagenomic studies.Entities:
Mesh:
Year: 2014 PMID: 24728005 PMCID: PMC3984086 DOI: 10.1371/journal.pone.0093849
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Details of the two mock fungal populations.
| Species (8GM) | Strain | Spore size (µm) | Spore complexity | Concentration |
| Alternaria alternata | EGS 35–193 | 25–60 ×3–3.5 | Multi-cellular (up to 15 cells) | 1×105/ml |
| Aspergillus fumigatus | NRRL 163 | 2.5–3.5 | Single celled | 1×105/ml |
| Botrytis cinerea | CABI 160282 | 7–14×6–9 | Single celled | 1×105/ml |
| Cladosporium herbarum | NCPF 2564 | 5.5–23×4–6 | One or two celled | 1×105/ml |
| Epicoccum nigrum | CABI 127257 | 15–25 | Multi-cellular (up to 15 cells) | 1×105/ml |
| Fusarium moniliforme | NCPF 2865 | 4–7×25–50 | 1 to 3 cells | 1×105/ml |
| Leptosphaeria coniothyrium | CABI 52734 | 3–5×15–29 | Multi-cellular (up to 20 cells) | 1×105/ml |
| Penicillium chrysogenum | NCPF 2715 | 1.8–3.5 | Single celled | 1×105/ml |
Concentration refers to the number of spores used for DNA extraction. ATCC - American Type Culture Collection, CABI - CAB International fungal culture collection, EGS - E.G. Simmons Culture Collection, FRR - Culture collection of CSIRO Sydney Australia, NCPF - National Collection of Pathogenic Fungi, NCYC - National Collection of Yeast Cultures, NRRL - Northern Regional Research Laboratory, UAMH - University of Alberta Microfungus Collection and Herbarium
Fusion oligonucleotide primers containing target specific sequences (White, Bruns et al., 1990, Issakainen, Jalava et al., 1999; emboldened), sequencing adapters (plain text) and the Ion Torrent key sequence (underlined).
| Region | Forward Primer | Reverse Primer | Tm (°C) |
|
|
|
| 66 |
|
|
|
| 60 |
|
|
|
| 68 |
Next generation sequencing read statistics.
| Sample ID | Genomic Region | Raw Reads | Full Length Reads | Length (bp) | Dereplicated Unique Reads | Denoised Unique Reads | Total Reads Available |
|
| ITS1 | 783,648 | 631,395 | 186.7 | 039,691 | 028,439 | 575,752 |
|
| ITS2 | 844,884 | 512,545 | 302.7 | 273,496 | 205,294 | 223,048 |
|
| LSU | 600,301 | 284,547 | 345.4 | 147,167 | 099,932 | 138,137 |
|
| ITS1 | 870,048 | 525,793 | 207.6 | 071,650 | 035,045 | 192,623 |
|
| ITS2 | 865,582 | 436,590 | 316.3 | 246,924 | 161,896 | 454,395 |
|
| LSU | 518,906 | 252,447 | 343.5 | 193,944 | 154,669 | 059,641 |
Raw Reads - the total number of reads generated by each sequencing run; Full Length Reads - the total number of reads representing a full length amplicon; Dereplicated Reads - the number of unique read clusters once all identical reads were collapsed; Denoised Reads - the number of unique read clusters once reads within 1% sequence identity were combined; Total Reads Available - the total number of reads (the sum of all clusters) available for further analysis.
Characteristics of OTUs formed by the USEARCH algorithm at an identity of 0.97. Only OTUs with a minimum of 25 reads were retained.
| Sample | 8GM | 8GM | 8GM | 20SM | 20SM | 20SM |
|
| ITS1 | ITS2 | LSU | ITS1 | ITS2 | LSU |
|
| 25 | 25 | 25 | 25 | 25 | 25 |
|
| 28 | 24 | 16 | 56 | 44 | 27 |
|
| 10 | 7 | 8 | 33 | 16 | 14 |
|
| 13 | 8 | 3 | 19 | 21 | 5 |
|
| 3 | 6 | 1 | 2 | 3 | 5 |
|
| 0 | 0 | 3 | 2 | 3 | 1 |
|
| 2 | 3 | 1 | 0 | 1 | 2 |
Taxonomic levels of discrimination afforded by each genomic region.
| Sample (8GM) | 8GM | 8GM | 8GM |
| Region | ITS1 | ITS2 | LSU |
| Detected/Expected | 7/8 | 8/8 | 8/8 |
| Alternaria alternata | G | O | G |
| Aspergillus fumigatus | S | S | S |
| Penicillium chrysogenum | G | G | G |
| Fusarium moniliforme | N | O | O |
| Leptosphaeria coniothyrium | S | S | O |
| Botrytis cinerea | G | G | G |
| Cladosporium herbarum | G | S | G |
| Epicoccum nigrum | S | O | O |
Species level - S, Section within genus level – Se, Genus level - G, Above Genus level - O, Not detected -N.
Figure 1Read Distributions - the number of sequencing reads assigned to each species, expressed as a percentage of the total number of sequencing reads.