| Literature DB >> 35336456 |
Abstract
Acquiring useful data from agricultural areas has always been somewhat of a challenge, as these are often expansive, remote, and vulnerable to weather events. Despite these challenges, as technologies evolve and prices drop, a surge of new data are being collected. Although a wealth of data are being collected at different scales (i.e., proximal, aerial, satellite, ancillary data), this has been geographically unequal, causing certain areas to be virtually devoid of useful data to help face their specific challenges. However, even in areas with available resources and good infrastructure, data and knowledge gaps are still prevalent, because agricultural environments are mostly uncontrolled and there are vast numbers of factors that need to be taken into account and properly measured for a full characterization of a given area. As a result, data from a single sensor type are frequently unable to provide unambiguous answers, even with very effective algorithms, and even if the problem at hand is well defined and limited in scope. Fusing the information contained in different sensors and in data from different types is one possible solution that has been explored for some decades. The idea behind data fusion involves exploring complementarities and synergies of different kinds of data in order to extract more reliable and useful information about the areas being analyzed. While some success has been achieved, there are still many challenges that prevent a more widespread adoption of this type of approach. This is particularly true for the highly complex environments found in agricultural areas. In this article, we provide a comprehensive overview on the data fusion applied to agricultural problems; we present the main successes, highlight the main challenges that remain, and suggest possible directions for future research.Entities:
Keywords: artificial intelligence; data fusion; precision agriculture; sensors; variability
Mesh:
Year: 2022 PMID: 35336456 PMCID: PMC8952279 DOI: 10.3390/s22062285
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Categorization of data fusion approaches adopted in this work.
Figure 2Number of references selected in this review.
List of acronyms.
| Acronym | Meaning | Acronym | Meaning |
|---|---|---|---|
| AMSR-E | Advanced Microwave Scanning Radiometer | MLP | Multilayer Perceptron |
| on the Earth Observing System | MLR | Multiple Linear Regression | |
| ANN | Artificial Neural Network | MOA | Model Output Averaging |
| ASTER | Advanced Spaceborne Thermal Emission and | MODIS | Moderate-Resolution Imaging Spectroradiometer |
| Reflection | MSDF-ET | Multi-Sensor Data Fusion Model for Actual | |
| BK | Block Kriging | Evapotranspiration Estimation | |
| BPNN | Backpropagation Neural Network | MSPI | Maximum Sum of Probabilities Intersections |
| CACAO | Consistent Adjustment of the Climatology | NB | Naïve Bayes |
| to Actual Observations | NDSI | Normalized Difference Spectral Index | |
| CHRIS | Compact High Resolution Imaging Spectrometer | NDVI | Normalized Difference Vegetation Index |
| CNN | Convolutional Neural Network | NIR | Near-infrared Spectroscopy |
| CP-ANN | Counter-Propagation Artificial Neural Networks | NMDI | Normalized Multiband Drought Index |
| CV | Computer Vision | OLI | Operational Land Imager |
| DEM | Digital Elevation Model | PCA | Principal Component Analysis |
| DNN | Deep Neural Network | PDI | Perpendicular Drought Index |
| DRF | Distributed Random Forest | PLSR | Partial Least Square Regression |
| ECa | Apparent Soil Electrical Conductivity | RF | Random Forest |
| EDXRF | Energy dispersive X-Ray Fluorescence | RFR | Random Forest Regression |
| EKF | Extended Kalman Filter | RGB | Red–Green–Blue |
| ELM | Extreme Learning Machine | RGB-D | Red–Green–Blue-Depth |
| EMI | Electromagnetic Induction | RK | Regression Kriging |
| ESTARFM | Enhanced Spatial and Temporal Adaptive | RTK | Real Time Kinematic |
| Reflective Fusion Model | SADFAET | Spatiotemporal Adaptive Data Fusion | |
| ET | Evapotranspiration | Algorithm for Evapotranspiration Mapping | |
| FARMA | Fusion Approach for Remotely-Sensed Mapping | SAR | Synthetic Aperture Radar |
| of Agriculture | SF | Sensor Fusion | |
| GBM | Gradient Boosting Machine | SfM | Structure from Motion |
| GKSFM | Gaussian Kernel-Based Spatiotemporal Fusion Model | SKN | Supervised Kohonen Networks |
| GLM | Generalized Linear Model | SMLR | Stepwise Multiple Linear Regression |
| GNSS | Global navigation satellite system | SPA | Successive Projections Algorithm |
| HUTS | High-resolution Urban Thermal Sharpener | SPOT | Satellite Pour l’Observation de la Terre |
| INS | Inertial Navigation System | SRTM | Shuttle Radar Topographic Mission |
| IoT | Internet of Things | STARFM | Spatial and Temporal Adaptive Reflective Fusion |
| ISTDFA | Improved Spatial and Temporal Data Fusion | Model | |
| Approach | SVR | Support Vector Regression | |
| kNN | k-Nearest Neighbors | TLS | Terrestrial Laser Scanning |
| LAI | Leaf Area Index | TRMM | Tropical Rainfall Measuring Mission |
| LPT | Laplacian Pyramid Transform | TVDI | Temperature Vegetation Dryness Index |
| LR | Linear Regression | UAV | Unmanned Aerial Vehicle |
| LSTM-NN | Long Short-Term Memory Neural Network | XGBoost | Extreme Gradient Boosting |
Categories adopted for the data fusion techniques and the data being fused.
| No. | Classes of Data Fusion Technique | No. | Classes of Data Being Fused |
|---|---|---|---|
| 1 | Regression methods | 1 | RGB images |
| 2 | STARFM-like statistical methods | 2 | Multispectral images |
| 3 | Geostatistical tools | 3 | Hyperspectral images |
| 4 | PCA and derivatives | 4 | Thermal images |
| 5 | Kalman filter | 5 | Laser scanning |
| 6 | Machine learning | 6 | SAR images |
| 7 | Deep learning | 7 | Spectroscopy |
| 8 | Decision rules | 8 | Fluorescence images |
| 9 | Majority rules | 9 | Soil measurements |
| 10 | Model output averaging | 10 | Environmental/weather measurements |
| 11 | Others | 11 | Inertial measurements |
| 12 | Position measurements | ||
| 13 | Topographic records and elevation models | ||
| 14 | Historical data | ||
| 15 | Others |
References considered in this study–proximal scale. L, M, and H mean low-, mid-, and high-level data fusion, respectively. The numbers in the fourth column are those adopted in Table 2 for each “fused data” class.
| Reference | Application | Fusion Technique | Fused Data | Mean Accuracy |
|---|---|---|---|---|
| [ | Estimation of soil indices | SF (L), MOA (H) | 7 | 0.80–0.90 |
| [ | Sustainable greenhouse management | Decision rules (L) | 10 | N/A |
| [ | Human—robot interaction | LSTM-NN (L) | 11 | 0.71–0.97 |
| [ | Delineation of homogeneous zones in viticulture | GAN (L), geostatistical tools (L) | 2, 9 | N/A |
| [ | Delineation of homogeneous zones | Kriging and other geostatistical tools (L) | 2, 9 | N/A |
| [ | Estimation of crop phenological states | Particle filter scheme (L) | 2, 6, 10 | 0.93–0.96 |
| [ | Fruit detection | LPT (L) and fuzzy logic (L) | 1, 4 | 0.80–0.95 |
| [ | In-field estimation of soil properties | RK (L), PLSR (L) | 3, 9 | >0.5 |
| [ | Delineation of homogeneous management zones | Kriging (L), Gaussian anamorphosis (L) | 9, 15 | 0.66 |
| [ | Delineation of homogeneous management zones | Kriging (L), Gaussian anamorphosis (L) | 9, 15 | N/A |
| [ | Delineation of homogeneous management zones | Kriging (L),Gaussian anamorphosis (L) | 9, 15 | N/A |
| [ | Crop nutritional status determination | PCA (L) | 7, 8 | 0.7–0.9 |
| [ | Detection of olive quick decline syndrome | CNN (M) | 1 | 0.986 |
| [ | Monitoring Agricultural Terraces | Coregistering and information extraction (L/M) | 5 | N/A |
| [ | Prediction of canopy water content of rice | BPNN (M), RF (M), PLSR (M) | 2 | 0.98–1.00 |
| [ | Localization of a wheeled mobile robot | Dempster–Shafer (L) and Kalman filter (L) | 11, 12 | 0.97 |
| [ | Immature green citrus fruit detection | Color-thermal probability algorithm (H) | 1, 4 | 0.90–0.95 |
| [ | Delineation of management zones | K-means clustering (L) | 2, 9, 14 | N/A |
| [ | Segmentation for targeted application of products | Discrete wavelets transform (M) | 1 | 0.92 |
| [ | System for agricultural vehicle positioning | Kalman filter (L) | 11, 12 | N/A |
| [ | System for agricultural vehicle positioning | Kalman filter (L) | 11, 12 | N/A |
| [ | Yield gap attribution in maize | Empirical equations (L) | 15 | 0.37–0.74 |
| [ | Soil environmental quality assessment | Analytic hierarchy process, weighted average (L) | 15 | N/A |
| [ | Predict soil properties | PLSR (L) | 7, 9, 13 | 0.80–0.96 |
| [ | System for agricultural vehicle positioning | Discrete Kalman filter (L) | 11, 13 | N/A |
| [ | Estimating soil macronutrients | PLSR (L) | 7, 9 | 0.70–0.95 |
| [ | Citrus fruit detection and localization | Daubechies wavelet transform (L) | 1, 2 | 0.91 |
| [ | Estimation of agricultural equipment roll angle | Kalman filtering (L) | 11 | N/A |
| [ | Predicting toxic elements in the soil | PLSR, PCA, and SPA (L/M) | 7, 8 | 0.93–0.98 |
| [ | Review: image fusion technology in agriculture | N/A | N/A | N/A |
| [ | Heterogeneous sensor data fusion | Deep multimodal encoder (L) | 10 | N/A |
| [ | Agricultural vulnerability assessments | Binary relevance (L), RF (L), and XGBoost (L) | 10,14 | 0.67–0.98 |
| [ | Prediction of multiple soil properties | SMLR (L), PLSR (L), PCA/SMLR combination (L) | 7, 9 | 0.60–0.95 |
| [ | Prediction of environment variables | Sparse model (L), LR (L), SVM (L), ELM (L) | 10 | 0.96 |
| [ | Estimation of biomass in grasslands | Simple quadratic combination (L) | 2, 15 | 0.66–0.88 |
| [ | Plant disease detection | Kohonen self-organizing maps (M) | 3, 8 | 0.95 |
| [ | Water stress detection | Least squares support vectors machine (M) | 3, 8 | 0.99 |
| [ | Delineation of water holding capacity zones | ANN (L), MLR (L) | 7, 9 | 0.94–0.97 |
| [ | Potential of site-specific seeding (potato) | PLSR (L) | 2, 9 | 0.64–0.90 |
| [ | 3D characterization of fruit trees | Pixel level mapping between the images (L) | 4, 5 | N/A |
| [ | Measurements of sprayer boom movements | Summations of normalized measurements (L) | 11 | N/A |
| [ | Review: IoT and data fusion for crop disease | N/A | N/A | N/A |
| [ | Prediction of wheat yield and protein | Canonical powered partial least-squares (L) | 7, 10 | 0.76–0.94 |
| [ | Wheat yield prediction | CP-ANN (L), XY-fused networks (L), SKN (L) | 2, 7 | 0.82 |
| [ | Topsoil clay mapping | PLSR (L) and kNN (L) | 7, 9, 13 | 0.94–0.96 |
| [ | Fruit detection | CNN (L); scoring system (H) | 1, 2 | 0.84 |
| [ | 3D reconstruction for agriculture phenotyping | Linear interpolation (L) | 1, 10 | N/A |
| [ | Delineation of site-specific management zones | CoKriging (L) | 2 | 0.55–0.77 |
| [ | Orchard mapping and mobile robot localization | Laser data projection onto the RGB images (L) | 1, 5 | 0.97 |
| [ | Modelling crop disease severity | 2 ANN architectures (L) | 10, 15 | 0.90–0.98 |
| [ | Tropical soil fertility analysis | SVM (L), PLS (L), least squares modeling (L) | 2, 8 | 0.30–0.95 |
| [ | Internet of things applied to agriculture | Hydra system (L/M/H) | 9, 10, 15 | 0.93–0.99 |
| [ | Review: data fusion in agricultural systems | N/A | N/A | N/A |
| [ | Soil health assessment | PLSR (L) | 7, 9 | 0.78 |
| [ | Prediction of Soil Texture | SMLR (L), PLSR (L) and PCA (L) | 7, 8 | 0.61–0.88 |
| [ | Rapid determination of soil class | Outer product analysis (L) | 7 | 0.65 |
| [ | Navigation of autonomous vehicle | MSPI algorithm with Bayesian estimator (L) | 11, 12 | N/A |
| [ | Detection of cotton plants | Discriminant analysis (M) | 2, 7 | 0.97 |
| [ | Map-based variable-rate manure application | K-means clustering (L) | 2, 9 | 0.60–0.93 |
| [ | Navigation of autonomous vehicles | Kalman filter (L) | 11, 12 | N/A |
| [ | Robust tomato recognition for robotic harvesting | Wavelet transform (L) | 1 | 0.93 |
| [ | Navigation of autonomous vehicle | Self-adaptive PCA, dynamic time warping (L) | 1, 11 | N/A |
| [ | Recognition of wheat spikes | Gram–Schmidt fusion algorithm (L) | 1, 2 | 0.60–0.79 |
Also explores satellite data. Also explores aerial data.
References considered in this study–aerial scale. L, M, and H mean low-, mid-, and high-level data fusion, respectively. The numbers in the fourth column are those adopted in Table 2 for each “fused data” class.
| Reference | Application | Fusion Technique | Fused Data | Mean Accuracy |
|---|---|---|---|---|
| [ | Root zone soil moisture estimation | NN (M), DRF (M), GBM (M), GLM (M) | 2,11 | 0.90–0.95 |
| [ | Gramineae weed detection in rice fields | Haar wavelet transformation (L) | 1, 2 | 0.70–0.85 |
| [ | Monitoring agricultural terraces | Coregistering and information extraction (L) | 5 | N/A |
| [ | Spectral–temporal response surfaces | Bayesian data imputation (L) | 2, 3 | 0.77–0.83 |
| [ | Phenotyping of soybean | PLSR (L), SVR (L), ELR (L) | 1, 2, 4 | 0.83–0.90 |
| [ | Soybean yield prediction | PLSR (M), RF (M), SVR (M), 2 types of DNN (M) | 1, 2, 4 | 0.72 |
| [ | Crop monitoring | PLSR (M), RF (M), SVR (M), ELR (M) | 1, 2 | 0.60–0.93 |
| [ | Evapotranspiration estimation | MSDF-ET (L) | 1, 2, 4 | 0.68–0.77 |
| [ | Review: IoT and data fusion for crop disease | N/A | N/A | N/A |
| [ | Arid and semi-arid land vegetation monitoring | Decision tree (L/M) | 3, 5 | 0.84–0.89 |
| [ | Biomass and leaf nitrogen content in sugarcane | PCA and linear regression (L) | 2, 5 | 0.57 |
| [ | Review: data fusion in agricultural systems | N/A | N/A | N/A |
| [ | Navigation system for UAV | EKF (L) | 11, 12 | 0.98 |
| [ | Detection of cotton plants | Discriminant analysis (M) | 2 | 0.97 |
| [ | Vineyard monitoring | PLSR (M), SVR (M), RFR (M), ELR (M) | 2 | 0.98 |
Also explores proximal data. Also explores satellite data.
References considered in this study–orbital scale. L, M, and H mean low-, mid-, and high-level data fusion, respectively. The numbers in the fourth column are those adopted in Table 2 for each “fused data” class.
| Reference | Application | Fusion Technique | Fused Data | Mean Accuracy |
|---|---|---|---|---|
| [ | Soil moisture mapping | ESTARFM (L) | 2 | 0.70–0.84 |
| [ | Crop type mapping | 2D and 3D U-Net (L), SegNet (L), RF (L) | 2, 6 | 0.91–0.99 |
| [ | Estimation of surface soil moisture | ESTARFM (L) | 2 | 0.55–0.92 |
| [ | Delineation of homogeneous zones | Kriging and other geostatistical tools | 2, 9 | N/A |
| [ | Estimation of crop phenological states | Particle filter scheme (L/M) | 2, 6, 10 | 0.93–0.96 |
| [ | Evapotranspiration mapping at field scales | STARFM (L) | 2 | 0.92–0.95 |
| [ | In-field estimation of soil properties | RK (L), PLSR (L) | 3, 9 | >0.5 |
| [ | Estimation of wheat grain nitrogen uptake | BK (L) | 2, 3 | N/A |
| [ | Surface soil moisture monitoring | Linear regression analysis and Kriging (L/M) | 2, 15 | 0.51–0.84 |
| [ | Crop discrimination and classification | Voting system (H) | 2, 6 | 0.96 |
| [ | Review on multimodality and data fusion in RS | N/A | N/A | N/A |
| [ | Crop Mapping | Pixelwise matching (H) | 2, 6 | 0.94 |
| [ | Review on fusion between MODIS and Landsat | N/A | N/A | N/A |
| [ | Mapping crop progress | STARFM (L) | 2 | 0.54–0.86 |
| [ | Generation of spectral–-temporal response | Bayesian data imputation (L) | 2, 3 | 0.77–0.83 |
| [ | Delineation of management zones | K-means clustering (L) | 2, 9, 14 | N/A |
| [ | Mapping irrigated areas | Decision tree (L) | 2 | 0.67–0.93 |
| [ | Evapotranspiration mapping | Empirical exploration of band relationships (L) | 2, 4 | 0.20–0.97 |
| [ | Yield gap attribution in maize | Empirical equations (L) | 15 | 0.37–0.74 |
| [ | Change detection and biomass estimation in rice | Graph-based data fusion (L) | 2 | 0.17–0.90 |
| [ | Leaf area index estimation | STARFM (L) | 2 | 0.69–0.76 |
| [ | Evapotranspiration estimates | STARFM (M) | 2 | N/A |
| [ | Classification of agriculture drought | Optimal weighting of individual indices (M) | 2 | 0.80–0.92 |
| [ | Mapping daily evapotranspiration | STARFM (L) | 2 | N/A |
| [ | Mapping of cropping cycles | STARFM (L) | 2 | 0.88–0.91 |
| [ | Evapotranspiration partitioning at field scales | STARFM (L) | 2 | N/A |
| [ | Review: image fusion technology in agriculture | N/A | N/A | N/A |
| [ | Crop monitoring | PLSR (M), RF (M), SVR (M), ELR (M) | 1, 2, 4 | 0.60–0.93 |
| [ | Mapping of smallholder crop farming | XGBoost (L/M and H), RF (H), SVM (H), ANN (H), NB (H) | 2, 6 | 0.96–0.98 |
| [ | Estimation of biomass in grasslands | Simple quadratic combination (L/M) | 2, 15 | 0.66–0.88 |
| [ | Evapotranspiration estimation | MSDF-ET (L) | 1, 2, 4 | 0.68–0.77 |
| [ | Semantic segmentation of land types | Majority rule (H) | 2 | 0.99 |
| [ | Eucalyptus trees identification | Fuzzy information fusion (L) | 2 | 0.98 |
| [ | Review: IoT and data fusion for crop disease | N/A | N/A | N/A |
| [ | Wheat yield prediction | CP-ANN (M), XY-fused networks (M), SKN (M) | 2, 7 | 0.82 |
| [ | Drought monitoring | RF (M) | 2, 15 | 0.29–0.77 |
| [ | Crop type classification and mapping | RF (L) | 2, 6, 13 | 0.37–0.94 |
| [ | Time series data fusion | Environmental data acquisition module | 10 | N/A |
| [ | Evapotranspiration prediction in vineyard | STARFM (L) | 2 | 0.77–0.81 |
| [ | Daily NDVI product at a 30-m spatial resolution | GKSFM (M) | 2 | 0.88 |
| [ | Crop classification | Committee of MLPs (L) | 2, 6 | 0.65–0.99 |
| [ | Multisource classification of remotely sensed data | Bayesian formulation (L) | 2, 6 | 0.74 |
| [ | Fractional vegetation cover estimation | Data fusion and vegetation growth models (L) | 2 | 0.83–0.95 |
| [ | Land cover monitoring | FARMA (L) | 2, 6 | N/A |
| [ | Crop ensemble classification | mosaicking (L), classifier majority voting (H) | 2 | 0.82–0.85 |
| [ | Review: data fusion in agricultural systems | N/A | N/A | N/A |
| [ | In-season mapping of crop type | Classification tree (M) | 2 | 0.93–0.99 |
| [ | Building frequent landsat-like imagery | STARFM (L) | 2 | 0.63–0.99 |
| [ | Evapotranspiration mapping | SADFAET (M) | 2 | N/A |
| [ | Temporal land use mapping | Dynamic decision tree (M) | 2 | 0.86–0.96 |
| [ | High-resolution leaf area index estimation | STDFA (L) | 2 | 0.98 |
| [ | Monitoring cotton root rot | ISTDFA (M) | 2 | 0.79–0.97 |
| [ | Monitoring crop water content | Modified STARFM (L) | 2 | 0.44–0.85 |
| [ | Soil moisture content estimation | Vector concatenation, followed by ANN (M) | 2, 6 | 0.39–0.93 |
| [ | Impact of tile drainage on evapotranspiration | STARFM (L) | 2 | 0.23–0.91 |
| [ | Estimation of leaf area index | CACAO method (L) | 2 | 0.88 |
| [ | Mapping winter wheat in urban region | SVM (M), RF (M) | 2, 6 | 0.98 |
| [ | Leaf area index estimation | ESTARFM (L), linear regression model (M) | 2 | 0.37–0.95 |
| [ | Vineyard monitoring | PLSR (M), SVR (M), RFR (M), ELR (M) | 2 | 0.98 |
Also explores proximal data. Also explores aerial data.