| Literature DB >> 27487242 |
Yuling Tian1, Hongxian Zhang1.
Abstract
For the purposes of information retrieval, users must find highly relevant documents from within a system (and often a quite large one comprised of many individual documents) based on input query. Ranking the documents according to their relevance within the system to meet user needs is a challenging endeavor, and a hot research topic-there already exist several rank-learning methods based on machine learning techniques which can generate ranking functions automatically. This paper proposes a parallel B cell algorithm, RankBCA, for rank learning which utilizes a clonal selection mechanism based on biological immunity. The novel algorithm is compared with traditional rank-learning algorithms through experimentation and shown to outperform the others in respect to accuracy, learning time, and convergence rate; taken together, the experimental results show that the proposed algorithm indeed effectively and rapidly identifies optimal ranking functions.Entities:
Mesh:
Year: 2016 PMID: 27487242 PMCID: PMC4972358 DOI: 10.1371/journal.pone.0157994
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Parallel model of B cell algorithm.
Correspondence between immune components and rank-learning components.
| Immune Components | Rank-learning Components |
|---|---|
| Antigen | List of documents and labels ( |
| Antibody | Ranking function |
| Training repository | Training set |
| Validation repository | Validation set |
| Affinity | IR evaluation measure |
Fig 2Antibody tree example.
Fig 3Preordered antibody encoding.
Fig 4Contiguous region mutation.
Algorithm configuration and parameters.
| Parameter | Meaning | Value |
|---|---|---|
| Height of antibody tree | 7 | |
| Iteration number | 60 | |
| Clone factor | 0.5 | |
| Size of population | 64 | |
| Experiment number | 10 |
MAP comparison on benchmarks.
| Algorithms | MAP(OHSUMED) | MAP(MQ2007) |
|---|---|---|
| RankSVM | 0.4334 | 0.4644 |
| RankBoost | 0.4411 | 0.4662 |
| ListNet | 0.4457 | 0.4652 |
| AdaRank-MAP | 0.4487 | 0.4577 |
| RankBCA | 0.4601 | 0.4710 |
Fig 5P@n on OHSUMED.
Fig 6NDCG@n on OHSUMED.
Fig 7P@n on MQ2007.
Fig 8NDCG@n on MQ2007.
Fig 9Learning curve of RankBCA.
Speed-up ratio comparison on OHSUMED dataset.
| Processor number | Time/s | Speed-up ratio |
|---|---|---|
| 1 | 5565 | 1 |
| 2 | 2914 | 1.91 |
| 4 | 1661 | 3.35 |
| 8 | 1212 | 4.59 |