| Literature DB >> 24564220 |
Emilie Pasche, Julien Gobeill, Olivier Kreim, Fatma Oezdemir-Zaech, Therese Vachon, Christian Lovis, Patrick Ruch.
Abstract
BACKGROUND: The large increase in the size of patent collections has led to the need of efficient search strategies. But the development of advanced text-mining applications dedicated to patents of the biomedical field remains rare, in particular to address the needs of the pharmaceutical & biotech industry, which intensively uses patent libraries for competitive intelligence and drug development.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24564220 PMCID: PMC4015144 DOI: 10.1186/1471-2105-15-S1-S15
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Example of a topic of the benchmark for the traditional .
Figure 2Example of a topic of the benchmark for the variant of the .
Tuning of the search engine regarding the prior art search task.
| Strategy | Tuning | P0 | MAP |
|---|---|---|---|
| Impact of the description field | Description | 2.20% | 1.34% |
| Impact of the metadata field | |||
| Metadata on TAC | 3.63% | 1.92% | |
| Metadata on TACD | 2.87% | 1.52% | |
| Impact of the weighting model | PL2 | 2.87% | 1.52% |
| Impact of the co-citation network | Without re-ranking | 5.36% | 3.47% |
| Impact of the use of IPC codes | |||
| IPC codes | 5.88% | 3.66% | |
The best tuning for each strategy is displayed in bold. P0 indicates the top-precision and MAP represents the mean average precision.
Tuning of the search engine regarding the traditional technical survey task.
| Strategy | Tuning | P0 | MAP |
|---|---|---|---|
| Impact of the description field | Description | 15.87% | 1.49% |
| Impact of the metadata field | |||
| Metadata on TAC | 30.30% | 1.60% | |
| Metadata on TACD | 19.51% | 1.48% | |
| Impact of the weighting model | PL2 | 19.51% | 1.48% |
| Impact of the co-citation network | Without re-ranking | 20.05% | 1.59% |
| Impact of the use of IPC codes | No IPC codes | 21.24% | 1.60% |
Results in bold are the most effective and are therefore selected to perform further experiments. P0 indicates the top-precision and MAP represents the mean average precision.
Tuning of the search engine regarding the variant of the technical survey task.
| Strategy | Tuning | P0 |
|---|---|---|
| Impact of the description field | Description | 23.63% |
| Impact of the metadata field | No metadata | 34.78% |
| Metadata on TACD | 33.59% | |
| Impact of the weighting model | PL2 | 33.59% |
| Impact of the co-citation network | Without re-ranking | 40.86% |
| Impact of the use of IPC codes | No IPC codes | 40.87% |
The tuning displayed in bold is selected for the further strategies. P0 indicates the top-precision.
Figure 3Welcome page of the Novartis search application. Example of an ad hoc search for the topic mentioned in figure 1.
Figure 4Example of metadata. Example of normalized metadata automatically assigned to a patent.