| Literature DB >> 33879953 |
Ali Hamdi1, Khaled Shaban2, Abdelkarim Erradi2, Amr Mohamed2, Shakila Khan Rumi1, Flora D Salim1.
Abstract
Spatiotemporal data mining (STDM) discovers useful patterns from the dynamic interplay between space and time. Several available surveys capture STDM advances and report a wealth of important progress in this field. However, STDM challenges and problems are not thoroughly discussed and presented in articles of their own. We attempt to fill this gap by providing a comprehensive literature survey on state-of-the-art advances in STDM. We describe the challenging issues and their causes and open gaps of multiple STDM directions and aspects. Specifically, we investigate the challenging issues in regards to spatiotemporal relationships, interdisciplinarity, discretisation, and data characteristics. Moreover, we discuss the limitations in the literature and open research problems related to spatiotemporal data representations, modelling and visualisation, and comprehensiveness of approaches. We explain issues related to STDM tasks of classification, clustering, hotspot detection, association and pattern mining, outlier detection, visualisation, visual analytics, and computer vision tasks. We also highlight STDM issues related to multiple applications including crime and public safety, traffic and transportation, earth and environment monitoring, epidemiology, social media, and Internet of Things.Entities:
Keywords: Challenges Issues; Data Mining; Research Problems; Spatial; Spatiotemporal
Year: 2021 PMID: 33879953 PMCID: PMC8049397 DOI: 10.1007/s10462-021-09994-y
Source DB: PubMed Journal: Artif Intell Rev ISSN: 0269-2821 Impact factor: 9.588
Fig. 1Spatiotemporal data types. a spatiotemporal events of different types at different locations and timestamps. b spatiotemporal trajectories between locations ( and ) at time ( and ). (c and d) spatiotemporal point reference data at different locations at timestamps ( and ). (e and f) spatiotemporal raster data of regular grid at time ( and )
Fig. 2A taxonomy of the proposed STDM challenges structure. The survey is designed to cover the STDM related challenges from three different perspectives. We propose to investigate the general challenges that affect the STDM in terms of relationships, data, natures and limitations of research. Then, we discuss the STDM tasks and applications focusing on their related challenges
Fig. 6Cause-and-effect diagram of STDM general challenges. The figure shows a taxonomy of STDM challenging issues
Fig. 3A word-cloud visualisation of the most frequent used search keywords
Fig. 4Related work distributions for journal articles and conference proceedings
Fig. 5Related work distributions for years from 2011 to 2020
Fig. 7Examples of topological relationships between two areas
Fig. 8Different sources of data needed for crime analysis
Fig. 9Spatial scaling between a or b and c and zoning between a and b. The figure shows the impact of having different scales and zones on the analysis results
Fig. 10Vagueness due to data similarities stem from different criteria. Trajectory 2 and 3 have similar spatial attributes. However, they are semantically different
Fig. 11Dynamic changing of the spatiotemporal distribution of moving objects
Fig. 12An example of explaining a model prediction of flue based on different symptoms, from LIME Ribeiro et al. (2016)
Fig. 13Three different types of outliers Ji et al. (2019)
Fig. 14Drone-based object tracking (Hamdi et al. 2020b)
Fig. 15An example of semantic amodal visual object segmentation Zhu et al. (2017). The first row shows the original scene and its segments human-annotation. The second row visualises the depth and visible edges. Finally, the third one shows the semantic annotation of the invisible regions
Fig. 16The impact of different correlations among regions on RLRH demand forecast. For example, R7 is adjacent to R8, similar to R4 and R2, connected with R3, and distant or irrelevant to R6
Fig. 17Survial analysis under uncertainty (Sokota et al. 2019). The survival curve, red line, calculates the probability as a temporal function. The point-wise and simultaneous intervals covers the uncertainties
Fig. 18Mobile-collected data can be used to monitor different patterns of user’s walking, voice, tapping, and memory (Schwab and Karlen 2019)
Summary of STDM main challenges and their causes
| Issue | Challenges | Causes | Tasks | Applications |
|---|---|---|---|---|
| Spatio-temporal Relation-ships | Complexity | Discrete representation of continuous spatiotemporal data | S-Rao et al. ( | PS-Zhang et al. ( |
| Co-located objects influence each other | ||||
| Implicit-ness | Implicit relationships between spatiotemporal objects | CA-Senaratne et al. ( | PS-Wu et al. ( | |
| Non-independ-ent and Non-identical distrib-ution | Auto-correlation due to dependency relationships in space and time | S-Shekhar et al. ( | PS-Quick et al. ( | |
| Non-identical distribution across space and time | ||||
| Inter-disciplin-ary and Combined Data Mining | Various interrelated domains | Heterogeneous data requiring multiple STDM techniques | CA-Carrasco-Escobar et al. ( | PS-Song et al. ( |
| Environ-mental factors | ||||
| Opportunity | ||||
| Region Discretization | Scale effect | Scale dependency | CA-Damm et al. ( | PS-Quick et al. ( |
| Zoning effect | Zone dependency | |||
| Data Characteristics | Specificity | Spatiotemporal data tend to be unique to a particular space-time region. | CA-Rashidi et al. ( | PS-Malik et al. ( |
| Learned model is specific to a particular spatiotemporal region. | ||||
| Vagueness | Similarities rooted from different criteria | S-Shekhar et al. ( | PS-Albertetti ( | |
| Dynamic-ity | Continuous change through space and time. | CA-Huang et al. ( | PS-Kotevska et al. ( | |
| Social | Correlation with the socio-economic characteristics | CA-Steiger et al. ( | PS-Bogomolov et al. ( | |
| Network-ed | Influence between objects and trajectories. | |||
| Exponential number of relationships. | ||||
| Heterog-eneous and Non-stationary | Wide variation of data distributions over space and time. | S-Miller and Han ( | PS-Kadar et al. ( | |
| Different learning models for varying spatiotemporal regions. | ||||
| Limited Access and Privacy | Privacy issues | S-Giannotti and Pedreschi ( | PS-Ratcliffe ( | |
| Poor Quality | Uncertainties, Partial knowledge, Conjectures | CA-Albertetti ( | PS-Malik et al. ( | |
| Big data | Volume, variety and velocity | CA-Du et al. ( | PS-Zhang et al. ( | |
| Open Issues | Data Representations | Limited representations of spatiotemporal data |
Santos et al. ( | |
| Advanced Modelling | Depend on high-density locations while ignoring the temporally related attributes. |
Roth et al. ( | ||
| Visualisation | Developing techniques for spatial visualisation, while less consideration is given to spatiotemporal |
Ye et al. ( | ||
| Comprehen-sive Approaches | Focusing on certain problems and do not introduce comprehensive spatiotemporal solutions |
Ndehedehe et al. ( | ||
| Fairness, Accountability, Transparency, and Ethics (FATE) | Amplifying genders, denying people services, and racial biases . |
Dudík et al. ( |