| Literature DB >> 26834982 |
Fabrizio Celli1, Johannes Keizer1, Yves Jaques1, Stasinos Konstantopoulos2, Dušan Vudragović3.
Abstract
The social media revolution is having a dramatic effect on the world of scientific publication. Scientists now publish their research interests, theories and outcomes across numerous channels, including personal blogs and other thematic web spaces where ideas, activities and partial results are discussed. Accordingly, information systems that facilitate access to scientific literature must learn to cope with this valuable and varied data, evolving to make this research easily discoverable and available to end users. In this paper we describe the incremental process of discovering web resources in the domain of agricultural science and technology. Making use of Linked Open Data methodologies, we interlink a wide array of custom-crawled resources with the AGRIS bibliographic database in order to enrich the user experience of the AGRIS website. We also discuss the SemaGrow Stack, a query federation and data integration infrastructure used to estimate the semantic distance between crawled web resources and AGRIS.Entities:
Keywords: AGRIS; Linked Data; Recommender Systems; SemaGrow; Text Categorization; Web Crawling
Year: 2015 PMID: 26834982 PMCID: PMC4722689 DOI: 10.12688/f1000research.6848.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. The process of crawling and indexing the web.
Figure 2. Execution of the web crawler using depth=3.
Figure 3. AgroTagger workflow.
Figure 4. RDF schema of AgroTagger output annotations.
Figure 5. Usage of the SemaGrow Stack as intermediate layer for the generation of the Recommender Database, which contains meaningful combinations between the AGRIS bibliographic database and the Crawler Database.
Example of naïve Similarity Score.
|
| 10 | 10 | 10 | 6 | 6 | 3 | 3 |
|
| 10 | 10 | 10 | 10 | 10 | 10 | 10 |
|
| 10 | 8 | 2 | 6 | 2 | 3 | 1 |
|
| 1.0 | 0.8 | 0.2 | 1.0 | 0.33 |
|
|
Revised example of Similarity Score.
|
| 10 | 10 | 10 | 6 | 6 | 3 | 3 |
|
| 10 | 10 | 10 | 10 | 10 | 10 | 10 |
|
| 10 | 8 | 2 | 6 | 2 | 3 | 1 |
|
| 1.0 | 0.8 | 0.2 | 1.0 | 0.33 | 1.0 | 0.33 |
|
| 1.0 | 0.8 |
| 1.0 |
|
|
|
Figure 6. An AGRIS mashup page displaying an AGRIS bibliographic record with related information from external data sources.
Figure 7. A widget displaying related resources from the web.
Figure 8. Cumulative distribution (percentage) of AGRIS records over the number of relevant recommendations.