| Literature DB >> 27570615 |
Hyun S Gweon1, Anna Oliver1, Joanne Taylor2, Tim Booth1, Melanie Gibbs1, Daniel S Read1, Robert I Griffiths1, Karsten Schonrogge1.
Abstract
Studying fungal biodiversity using data generated from Illumina MiSeq sequencing platforms poses a number of bioinformatic challenges with the analysis typically involving a large number of tools for each analytical step from quality filtering to generating identified operational taxonomic unit (OTU) abundance tables.Here, we introduce PIPITS, an open-source stand-alone suite of software for automated processing of Illumina MiSeq sequences for fungal community analysis. PIPITS exploits a number of state of the art applications to process paired-end reads from quality filtering to producing OTU abundance tables.We provide detailed descriptions of the pipeline and show its utility in the analysis of 9 396 092 sequences generated on the MiSeq platform from Illumina MiSeq. PIPITS is the first automated bioinformatics pipeline dedicated for fungal ITS sequences which incorporates ITSx to extract subregions of ITS and exploits the latest RDP Classifier to classify sequences against the curated UNITE fungal data set.Entities:
Keywords: DNA metabarcoding; bioinformatics; fungi; internal transcribed spacer; pipeline
Year: 2015 PMID: 27570615 PMCID: PMC4981123 DOI: 10.1111/2041-210X.12399
Source DB: PubMed Journal: Methods Ecol Evol Impact factor: 7.781
A comparison of the key differences between PIPITS and other automated bioinformatics pipeline dedicated for fungal ITS sequences
| Pipeline | Open‐source | Stand‐alone | Extract subregion | RDP classifier | UNITE DB |
|---|---|---|---|---|---|
| PIPITS | Yes | Yes | Yes | Yes | Yes |
| SCATA | Yes | Web‐based | – | – | Customisable |
| CLOTU | Yes | Web‐based | – | – | Customisable |
| PlutoF | Yes | Web‐based | – | – | Yes |
| ITScan | Yes | Web‐based | – | – | Customisable |
| CloVR‐ITS | Yes | Yes (VM) | – | – | – |
| FHiTINGS | Yes | Yes | – | – | Customisable |
| QIIME | Yes | Yes | – | Yes | Yes |
| UPARSE | – | Yes | – | – | Yes |
Figure 1Overview/workflow of PIPITS for Illumina ITS sequences.
Figure 2The number of sequences and time taken at each processing step. SYS1: Bio‐Linux 7, 16‐core Intel(R) Xeon(R) CPU @ 2·27GHz, 105 GB RAM; SYS2: Ubuntu 14·04, a standard desktop computer, 2·93 GHz quad‐core Intel Core i3, 8GB RAM.
Figure 3Proportion of most abundant operational taxonomic units representing 90% of the samples. In total, 91·2% of the reads (blue) were assigned to phylum Ascomycota, 7·0% (green) to phylum Basidiomycota and 0·04% to phyla belonging to Glomeromycota, Zygomycota and Chytridiomycota, while < 2% of the reads (orange) were unassignable to any of the known phylum in the data base. The three most dominant OTUs were [SH198382.06FU] (24·2%), [SH235673.06FU] (16·4%) and [SH210380.06FU] (14·11%).
Figure 4Length distribution of ITS2 sequences after extraction by PIPITS_FUNITS. Range: 101–461 bp, mean: 168, standard deviation: 28·7.
Figure 5FASTQ quality scores across forward and reverse reads in one of the samples. The blue bar represents the inter‐quartile range (25–75%), the grey bar represents the 10% and 90%, and the black line represents the mean quality.