| Literature DB >> 34901843 |
Ruwan Wickramarachchi1, Cory Henson2, Amit Sheth1.
Abstract
Scene understanding is a key technical challenge within the autonomous driving domain. It requires a deep semantic understanding of the entities and relations found within complex physical and social environments that is both accurate and complete. In practice, this can be accomplished by representing entities in a scene and their relations as a knowledge graph (KG). This scene knowledge graph may then be utilized for the task of entity prediction, leading to improved scene understanding. In this paper, we will define and formalize this problem as Knowledge-based Entity Prediction (KEP). KEP aims to improve scene understanding by predicting potentially unrecognized entities by leveraging heterogeneous, high-level semantic knowledge of driving scenes. An innovative neuro-symbolic solution for KEP is presented, based on knowledge-infused learning, which 1) introduces a dataset agnostic ontology to describe driving scenes, 2) uses an expressive, holistic representation of scenes with knowledge graphs, and 3) proposes an effective, non-standard mapping of the KEP problem to the problem of link prediction (LP) using knowledge-graph embeddings (KGE). Using real, complex and high-quality data from urban driving scenes, we demonstrate its effectiveness by showing that the missing entities may be predicted with high precision (0.87 Hits@1) while significantly outperforming the non-semantic/rule-based baselines.Entities:
Keywords: autonomous driving; entity prediction; knowledge graph embeddings; knowledge-infused learning; neuro-symbolic computing; scene understanding
Year: 2021 PMID: 34901843 PMCID: PMC8656233 DOI: 10.3389/fdata.2021.759110
Source DB: PubMed Journal: Front Big Data ISSN: 2624-909X
FIGURE 1Example scene KG with potentially missing entity.
FIGURE 2Knowledge-infused KEP as a post-processing step for computer vision entity prediction techniques, which takes a set of labels (L) as input and outputs a new set of labels (L’).
List of notations used.
|
| Knowledge graph |
|
| Set of all relations in |
|
| Set of all nodes in |
|
| Set of all class nodes in |
|
| Set of all instance nodes in |
|
| Set of triples; |
|
| Set of Scene instance nodes |
|
| Set of Entity class nodes |
| (i.e. Objects and Events) |
FIGURE 3(A) Basic structure of a scene, (B) Two types of scene: sequence scene and frame scene.
Relations associated with a Scene, including their domain and range.
| Relation | Domain | Range |
|---|---|---|
|
| SequenceScene | xsd:dateTime |
|
| SequenceScene | xsd:dateTime |
|
| Scene | SpatialRegion |
|
| SequenceScene | FrameScene |
|
| Event | Object |
|
| FrameScene | xsd:dateTime |
|
| Scene | Entity |
|
| Object | Event |
|
| FrameScene | SequenceScene |
|
| Scene | Scene |
|
| Scene | Scene |
Basic KG statistics of Pandaset (DSKG-P) and NuScenes, sampled at 4s (DSKG-N).
| Pandaset | NuScenes (sampled) | |
|---|---|---|
| # Triples | 3,301,929 | 819,084 |
| # Entity Classes | 38 | 31 |
| # Relations | 19 | 14 |
| # Sequence scene inst. | 103 | 850 |
| # Frame scene inst. | 8,240 | 4,498 |
| # Entity inst. | 53,248 | 277,287 |
| Triples per entity ratio | 62.01 | 2.95 |
| Avg. cardinality of entity class | 15.7 | 9.07 |
| Avg. cardinality of entity inst. | 369.6 | 57.78 |
FIGURE 4Co-occurrence of entity types within scenes in DSKG-N. Each cell value represents the frequency of Frames in which two entities co-occur, normalized row-wise by the total frequency of Frames in which the row entity occurs.
FIGURE 5Four main phases involved in Scene Entity Prediction (KEP) pipeline.
Details of selected KGE algorithms: class of algorithm, triple scoring functions and their space and time complexities. Notation used: , , n = # of training triples, , τ = |Ω| = # of convolution filters.
| Algorithm | Class | Scoring function | Space (S) and time (T) complexity |
|---|---|---|---|
| TransE | Geometric |
|
|
| HolE | Matrix factorization |
|
|
| ConvKB | Deep learning |
|
|
KEP results of association rule mining (ARM) baseline (A), DSKG generated using Pandaset (DSKG-P ) and NuScenes (DSKG-N ) on three algorithms, each experiment averaged with standard deviation across five runs (B,C), followed by the results of the additional investigations: different KG structures (D,E) and integration of external knowledge (F). Evaluation metrics: MRR = Mean Reciprocal Rank, H@K= Hits@K, Accu. = KEP Accuracy, Micro/Macro F1 = Micro/Macro-averaged-F1-score.
| Ranking metrics | KEP performance metrics | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| MRR | H@1 | H@3 | H@10 | Accu. (%) | Micro F1 | Macro F1 | |||
| (A) | ARM | — | — | — | — | — | 27.19 | 0.16 | 0.06 |
| (B) | DSKG-P
| TransE | 0.32 ± 0.03 | 0.16 ± 0.05 | 0.35 ± 0.04 | 0.71 ± 0.03 | 22.98 ± 4.33 | 0.26 ± 0.04 | 0.20 ± 0.02 |
| HolE |
|
|
|
|
|
|
| ||
| ConvKB | 0.29 ± 0.01 | 0.11 ± 0.02 | 0.31 ± 0.02 | 0.86 ± 0.02 | 17.83 ± 1.99 | 0.22 ± 0.02 | 0.17 ± 0.02 | ||
| (C) | DSKG-N
| TransE | 0.42 ± 0.03 | 0.22 ± 0.03 | 0.51 ± 0.03 | 0.91 ± 0.01 | 28.08 ± 2.45 | 0.32 ± 0.03 | 0.20 ± 0.01 |
| HolE | 0.23 ± 0.01 | 0.11 ± 0.01 | 0.22 ± 0.01 | 0.51 ± 0.03 | 13.80 ± 0.84 | 0.16 ± 0.01 | 0.11 ± 0.01 | ||
| ConvKB |
|
|
|
|
|
|
| ||
| (D) | DSKG
| TransE |
|
|
|
|
|
|
|
| HolE | 0.29 | 0.11 | 0.28 | 0.87 | 16.55 | 0.19 | 0.20 | ||
| ConvKB | 0.23 | 0.07 | 0.21 | 0.68 | 12.30 | 0.16 | 0.14 | ||
| (E) | DSKG
| TransE | 0.26 | 0.10 | 0.28 | 0.62 | 17.77 | 0.21 | 0.18 |
| HolE |
|
| 0.32 | 0.81 |
|
|
| ||
| ConvKB | 0.30 | 0.10 |
|
| 19.21 | 0.24 | 0.20 | ||
| (F) | DSKGSE | TransE | 0.30 | 0.18 | 0.32 | 0.50 | 24.53 | 0.27 | 0.17 |
| HolE |
|
|
|
| 74.52 |
|
| ||
| ConvKB | 0.29 | 0.13 | 0.32 | 0.71 | 21.01 | 0.26 | 0.22 | ||
“Bold” values in (B, C) indicate the peak performance for each metric in DSKG-R, while “underlined” values in (D,E, and F) indicate the same for each additional investigation.
FIGURE 6Different KG structures: (A) DSKG : KG with all instances + reified paths, (B) DSKG : KG with only reified paths, (C): DSKG : KG with reified and prototype paths.
FIGURE 7Hits@1 variation of KGE algorithms over different KG structures.
|
|