| Literature DB >> 30630546 |
C Lokhorst1, R M de Mol1, C Kamphuis1.
Abstract
Insight into current scientific applications of Big Data in the precision dairy farming area may help us to understand the inflated expectations around Big Data. The objective of this invited review paper is to give that scientific background and determine whether Big Data has overcome the peak of inflated expectations. A conceptual model was created, and a literature search in Scopus resulted in 1442 scientific peer reviewed papers. After thorough screening on relevance and classification by the authors, 142 papers remained for further analysis. The area of precision dairy farming (with classes in the primary chain (dairy farm, feed, breed, health, food, retail, consumer) and levels for object of interest (animal, farm, network)), the Big Data-V area (with categories on Volume, Velocity, Variety and other V's) and the data analytics area (with categories in analysis methods (supervised learning, unsupervised learning, semi-supervised classification, reinforcement learning) and data characteristics (time-series, streaming, sequence, graph, spatial, multimedia)) were analysed. The animal sublevel, with 83% of the papers, exceeds the farm sublevel and network sublevel. Within the animal sublevel, topics within the dairy farm level prevailed with 58% over the health level (33%). Within the Big Data category, the Volume category was most favoured with 59% of the papers, followed by 37% of papers that included the Variety category. None of the papers included the Velocity category. Supervised learning, representing 87% of the papers, exceeds unsupervised learning (12%). Within supervised learning, 64% of the papers dealt with classification issues and exceeds the regression methods (36%). Time-series were used in 61% of the papers and were mostly dealing with animal-based farm data. Multimedia data appeared in a greater number of recent papers. Based on these results, it can be concluded that Big Data is a relevant topic of research within the precision dairy farming area, but that the full potential of Big Data in this precision dairy farming area is not utilised yet. However, the present authors expect the full potential of Big Data, within the precision dairy farming area, will be reached when multiple Big Data characteristics (Volume, Variety and other V's) and sources (animal, groups, farms and chain parts) are used simultaneously, adding value to operational and strategic decision.Entities:
Keywords: dairy chain; data analytics; data characteristics; data mining; expectations
Mesh:
Year: 2019 PMID: 30630546 PMCID: PMC6581964 DOI: 10.1017/S1751731118003439
Source DB: PubMed Journal: Animal ISSN: 1751-7311 Impact factor: 3.240
Figure 1Number of papers published relating to Big Data in the precision dairy farming area, per year of publication.
Distribution of papers published between 1994 and June 2017 within the precision dairy farming area
| Sublevel | ||||
|---|---|---|---|---|
| Main level | Animal | Farm | Network | Total |
| Dairy farm | 76 | 10 | 3 | 89 |
| Feed | 2 | 1 | 1 | 4 |
| Breed | 5 | 0 | 1 | 6 |
| Health | 43 | 6 | 1 | 50 |
| Food | 1 | 0 | 2 | 3 |
| Retail | 0 | 0 | 1 | 1 |
| Consumer | 0 | 0 | 0 | 0 |
| Total | 127 | 17 | 9 | 153 |
The number of papers are classified per main level and sublevel.
The totals exceed the number of unique papers used in the review (n=142) since some papers cover more than one main level or sublevel.
Distribution of papers published between 1994 and June 2017 (n=142) in the four different categories (and, where applicable, levels within category) of Big Data analytics
| Number of papers | |||
|---|---|---|---|
| Categories | Level | Level | Category |
| Supervised learning | 134 | ||
| Regression | 48 | ||
| Classification | 86 | ||
| Unsupervised learning | 19 | ||
| Clustering | 7 | ||
| Dimensionality reduction | 12 | ||
| Semi-supervised classification | 1 | ||
| Reinforcement learning | 0 | ||
| Total | 154 | ||
The total exceeds the number of unique papers used in the review (n=142) since some papers cover more than one main category of level of Big Data analytics.
Distribution of papers between 1994 and June 2017 (n=142) based on the Big Data aspect
| Aspects | Number of papers |
|---|---|
| Time-series | 70 |
| Streaming | 1 |
| Sequence | 10 |
| Graph | 6 |
| Spatial | 6 |
| Multimedia | 21 |
| Total | 114 |
The total is less than the number of unique papers used in the review (n=142) since some papers did not cover a Big Data aspect.