| Literature DB >> 25551450 |
Anirban Dutta1, Disha Tandon1, M H Mohammed1, Tungadri Bose1, Sharmila S Mande1.
Abstract
MOTIVATION: Paired-end sequencing protocols, offered by next generation sequencing (NGS) platforms like Illumia, generate a pair of reads for every DNA fragment in a sample. Although this protocol has been utilized for several metagenomics studies, most taxonomic binning approaches classify each of the reads (forming a pair), independently. The present work explores some simple but effective strategies of utilizing pairing-information of Illumina short reads for improving the accuracy of taxonomic binning of metagenomic datasets. The strategies proposed can be used in conjunction with all genres of existing binning methods.Entities:
Mesh:
Year: 2014 PMID: 25551450 PMCID: PMC4281075 DOI: 10.1371/journal.pone.0114814
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic representation of different ‘Binpairs’ strategies for performing ‘taxonomic re-assignment’ of ‘paired-end’ reads.
(A) Outcome when the two taxons assigned belong to two different lineages, (B) Outcome when the two taxons assigned belong to the same lineage.
Figure 2Binning outcomes for the (A) ‘short clones’ and (B) ‘long clones’ mate-paired datasets.
The outcomes were the result of the application of the four ‘Binpairs’ strategies (SI, SII, SIII, SIV) in combination with different existing binning methods (MEGAN, DiScRIBinATE, SOrt-ITEMS, TWARIT and INDUS). The results obtained when reads from these datasets were binned as individual fragments (i.e. without utilizing pairing information) by the respective binning methods, are also depicted for comparison.
Comparison of taxonomic binning efficiency obtained with ‘Roche’ reads and ‘Illumina’ paired-end reads.
| Type of reads | Binning strategy | Taxonomic assignments (%) | ||||
| Genus andbelow | Family andOrder | Class andPhylum | Wrongassignments | Unassignedreads | ||
| Roche reads | MEGAN | 24.4 | 7.9 | 10.4 | 28.5 | 28.8 |
| Illumina paired-end reads | MEGAN+SIII | 17.8 | 6.2 | 6.0 | 10.5 | 59.5 |
| Illumina paired-end reads | MEGAN (paired) | 18.0 | 4.9 | 7.3 | 18.5 | 51.3 |
| Roche reads | DiScRIBinATE | 20.9 | 15.8 | 17.9 | 9.5 | 36.0 |
| Illumina paired-end reads | DiScRIBinATE+SI | 22.5 | 8.9 | 11.7 | 9.0 | 47.9 |
| Roche reads | SOrt-ITEMS | 21.6 | 15.1 | 15.4 | 25.2 | 22.8 |
| Illumina paired-end reads | SOrt-ITEMS+SI | 20.6 | 6.2 | 9.5 | 22.5 | 41.1 |
| Roche reads | INDUS | 16.4 | 2.3 | 14.3 | 13.0 | 54.1 |
| Illumina paired-end reads | INDUS+SIII | 17.1 | 3.8 | 3.0 | 4.9 | 71.3 |
| Roche reads | SPHINX | 18.4 | 4.5 | 13.8 | 16.7 | 46.6 |
| Illumina paired-end reads | SPHINX+SIII | 16.6 | 3.3 | 3.0 | 2.8 | 74.3 |
| Roche reads | TWARIT | 22.0 | 4.2 | 7.8 | 11.4 | 54.7 |
| Illumina paired-end reads | TWARIT+SIII | 15.6 | 3.6 | 2.9 | 7.3 | 70.6 |
Roche reads have been binned using different existing binning algorithms. Illumina paired-end reads have been binned using a combination of ‘Binpairs’ strategies and existing algorithms. Results of representative ‘Binpairs’ strategies, which provided the best results, are illustrated in this table.