| Literature DB >> 28818036 |
Mona Riemenschneider1, Alexander Herbst2, Ari Rasch2, Sergei Gorlatch2, Dominik Heider3,4,5.
Abstract
BACKGROUND: Multi-label classification has recently gained great attention in diverse fields of research, e.g., in biomedical application such as protein function prediction or drug resistance testing in HIV. In this context, the concept of Classifier Chains has been shown to improve prediction accuracy, especially when applied as Ensemble Classifier Chains. However, these techniques lack computational efficiency when applied on large amounts of data, e.g., derived from next-generation sequencing experiments. By adapting algorithms for the use of graphics processing units, computational efficiency can be greatly improved due to parallelization of computations.Entities:
Keywords: Classifier chains; High performance computing; Multi label classification
Mesh:
Year: 2017 PMID: 28818036 PMCID: PMC5561639 DOI: 10.1186/s12859-017-1783-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1General concept of classifier chanining. In general, classifier C knows the labels L 0,...,L of classifiers C 0,...,C in training process and in classification process the results of classifiers C 0,...,C . Here, the concept of classifier chaining is depicted for three class labels
Comparison between our GPU implementation and the non-parallelized Mulan framework for the classification of instances based on different data sets with different counts of instances and labels
| #Instances | Mulan | GPU | Speed-up | |
|---|---|---|---|---|
| NNRTI | 715 | 1563.7 | 109 | 14x |
| PI | 662 | 1998.6 | 128.2 | 15x |
| Emotions | 593 | 1577.3 | 157.7 | 10x |
| Scene | 2407 | 8920.3 | 300.9 | 29x |
| Yeast | 2417 | 270736.2 | 379.2 | 71x |
The runtimes are shown in milliseconds
Instances classified per second with increasing number of bootstrapped instances exemplarily shown for the PI dataset
| #Instances | Mulan | GPU |
|---|---|---|
| 1000 | 357 | 2,516 |
| 10,000 | 342 | 11,510 |
| 100,000 | 352 | 25,851 |
| 1,000,000 | 362 | 26,266 |