| Literature DB >> 34398659 |
Birgit Vogel-Heuser1, Felix Ocker1, Iris Weiß1, Robert Mieth1, Frederik Mann1.
Abstract
Modern production systems can benefit greatly from integrated and up-to-date digital representations. Their applications range from consistency checks during the design phase to smart manufacturing to maintenance support. Such digital twins not only require data, information and knowledge as inputs but can also be considered integrated models themselves. This paper provides an overview of data, information and knowledge typically available throughout the lifecycle of production systems and the variety of applications driven by data analysis, expert knowledge and knowledge-based systems. On this basis, we describe the potential for combining data analysis and knowledge-based systems in the context of production systems and describe two feasibility studies that demonstrate how knowledge-based systems can be created using data analysis. This article is part of the theme issue 'Towards symbiotic autonomous systems'.Entities:
Keywords: data analysis; digital twins; knowledge-based systems; production systems
Year: 2021 PMID: 34398659 PMCID: PMC8366910 DOI: 10.1098/rsta.2020.0368
Source DB: PubMed Journal: Philos Trans A Math Phys Eng Sci ISSN: 1364-503X Impact factor: 4.226
Selection of information artifacts throughout the lifecycle of a production system adapted from Biffl et al. [8].
| layer | design | operation and maintenance |
|---|---|---|
| production network | requirements, constraints | volume planning, Key Performance Indicators (KPIs), technical rules, order data, quality data |
| production line (segment) | two-dimensional-layouts | resource allocation, KPIs, logistics data, quality data, maintenance reports, test data |
| work unit, work station | layout plans, behavior models, three-dimensional-geometry-models, electrical construction, fluid plans, control programs, safety concepts | resource KPIs, resource states and alarm logs, sensor data, order data, production process control data, maintenance report, test data |
| component | behavior models, CAD construction, control programs, safety concepts, part lists | component state and alarm logs, production process control data, sensor data, test data |
| construction element | part lists, mechanical/electrical specifications, CAD-based construction | component health information |
Figure 1Model and data creation throughout the lifecycle.
Selected challenges of ML in manufacturing.
| challenge | example |
|---|---|
| information incompleteness [ | Missing units for the gathered sensor and actuator data: even though the technical documents of the equipment or sensor include such information, data analysts and the user of data-driven models often suffer from missing meta data. A complete representation of an aPS requires the inclusion of all information (data, expert knowledge, technical documents, etc.). |
| inconsistency [ | Changing sampling rates: adjustments of the process, introducing new tasks on the programmable logic controllers (PLCs) or other changes may influence the sampling rate of the data. It is important to detect such inconsistency to prevent wrong interpretation of data-driven results. |
| causality vs. correlation [ | Correlation ≠ causality: the correlation of two sensors represent statistical probability. A physical or technical dependency can not be proven without considering expert knowledge. |
| human influence [ | Trust, transparency, explainability: to ensure trust in ML and to cause the human to realize the recommendations, transparency must be given and the results have to be explainable. However, so-called black-box models miss these aspects nowadays. |
| high dimensionality [ | Irrelevant and redundant information: aPSs are equipped with a high number of sensors. However, only a subset of this data is relevant for a specific use case. The irrelevant and redundant information impact the performance of ML and need to be identified. |
| algorithm selection [ | Overwhelming number of different algorithms: selecting the best algorithms for the use case and the data characteristics is crucial to receive valid results. However, this is a challenging task due to the high number of different algorithms. A knowledge base is required, supporting data analyst and process expert in selecting appropriate algorithms. |
| interpretation of results [ | Wrong interpretation may lead to wrong decisions: condition monitoring aims to observe changes in the machine behavior in order to take appropriate maintenance action. However, the interpretation if a change of behaviour is critical often remains for the operator. |
| heterogeneous data sources [ | Link between data sources is missing: sensor and product quality data is processed in different systems. The link between those two sources is required to train a product quality model. However, the production of mass products such as small gear wheels does not allow the link between the data sources since the gear wheels are not traced individually. |
Selected challenges of KBS in manufacturing. pcc
| challenge | example |
|---|---|
| ontology creation | While the manual creation of the ontologies’ TBoxs is realistic, the population of ontologies, i.e. the creation of the instance level still poses a major challenge for engineers. Ideally, existing sources of information should be reused, but oftentimes the information stems from heterogeneous sources and is not easily interpretable, e.g. when only available as natural language text. |
| ontology evolution | Ontologies not only need to be created, but they also have to be updated continuously. Compared to ontology creation, information not only has to be formalized, but the existing ontology also needs to be checked for correctness continuously. |
| scalability | Even though many benefits arise from integrating information, there are also scalability limitations. There may be huge numbers of nodes and edges, if an ontology aims to capture an entire production system. Nodes would be required for everything, from products to employees to machines and their multitude of variables. |
| knowledge representation | Even though ontologies are well suited to represent the way humans think, namely in nodes connected via edges, they are not suitable for all types of data. For instance, there are limitations to how arithmetic operations can be performed using SWT and Binary Large Objects are also problematic. |
| understandability | While being computer-interpretable, it may be hard for humans to grasp the information captured by ontologies and insights gained from this information. For instance, even graduate mechanical engineering students have difficulties understanding the explanations provided by reasoners integrated into the ontology editor Protégé. |
| ontology combination and reuse | While combination of ontologies is in principle supported through the Open World Assumption, ontology matching is still a major challenge in practice. This is, among other reasons, due to the heterogeneity of ontologies in terms of design choices, domain of interest, structure and terminology. |
Figure 2Overview of knowledge-intensive applications throughout the lifecycle of a production system.
Available and required information for various applications throughout the lifecycle of production systems.
| application | required information | availability |
|---|---|---|
| inconsistency management | domain-specific models, product specifications, description of resource capabilities, mappings of the models or rules for inconsistency detection | models from engineering, expert knowledge, engineering documents for product specification, DTs for capabilities, reference ontologies and ML approaches (ideally) or expert knowledge (more realistic) for matching |
| test prioritization | historic test data, metrics to measure test results | documentation from prior projects, standards, gut feeling of testers |
| redlining | ECAD models and documents with on-site changes | models from engineering, changes made by operators (usually handwritten) |
| quality prediction | sensor data, quality data, algorithms | matching difficult due to time shift, quality can only be assessed via destructive testing |
| alarm analysis | alarm data, algorithms to be used for analysis | alarm data available from the field level, algorithm choice requires expert knowledge |
| smart factories | product and resource status, product features to be realized, resource capabilities, objective functions | product specification, specifications or DTs of the resources, expert knowledge and possibly customer preferences for defining objective functions |
| retrofit of control valves | failure mechanisms, physical dependencies | expert knowledge distributed across several employees or even companies along the supply chain |
Selected challenges and potential solutions by leveraging data analytics and KBS in an integrated way (DA = data analysis, KBS = knowledge-based systems).
| challenge | origin | potential approach |
|---|---|---|
| incomplete information | DA | KBS may be applied to infer missing information, e.g. missing units based on the associated sensor class. |
| inconsistency | DA | Inconsistencies in the data set may be identified and possibly resolved using SWT during preprocessing. |
| missing causality | DA and KBS | KBS may be used to formalize causalities experts are aware of, which can then be used to assess analysis results. In parallel, DA should be applied to identify potential causalities, which may be formalized in KBS if confirmed by experts. |
| interpretation of results | DA and KBS | If formalized KBs are available, inference mechanisms may be applied to infer unambiguous results. However, there are still limitations to the understandability of explanations provided by reasoners. |
| high dimensionality | DA | If experts can specify relevant features, their knowledge may be incorporated in a KB similar to causalities. These KBs can then be leveraged in a preprocessing step. |
| algorithm selection | DA | In order to support engineers in selecting appropriate algorithms, support systems relying on KBS seem promising. |
| heterogeneous data sources | DA and KBS | While DA may help to identify similarities in heterogeneous data sources, alignment with a common ontology for different data sources would enable semantic integration. |
| ontology creation | KBS | DA may be applied to populate the ontologies’ ABoxs. For instance, text mining allows engineers to automatically extract information from textual engineering documents. |
| scalability | KBS | DA could be applied to identify the views typically used by engineers from their interaction with KBS. |
| ontology combination | KBS | ML techniques may be applied to support ontology matching approaches. For instance, Natural Language Processing may be applied for terminological analyses as a part of the matching process. |
Excerpt of file extensions used by selected mechanical CAD tools for technical drawings.
| tool | supplier | file extension |
|---|---|---|
| AutoCAD 2021 | Autodesk | .dwg |
| Inventor 2021 | Autodesk | .idw |
| CATIA V5 | Dassault Systèmes | .CATdrawing |
| Solid Works | Dassault Systèmes | .slddrw |
| Creo 7.0 | PTC | .drw |
Figure 3Performance for crawler, ontology creation and query execution.
Entities and relations of ROMAIN identified in a German car maintenance forum (idiomatic translations of German terms are added in parentheses).
| ROMAIN | OCC | example |
|---|---|---|
| Maintainable item, Nonconformity, is BearerOf | 33 | motor (motor), probleme (problems), haben (have) |
| Maintainable item, State of Degradation, participatesIn | 27 | steuerkette (camshaft timing belt), km (km), haben (have) |
| Asset, Maintainable item, hasPart | 23 | motor (motor), steuerkette (camshaft timing belt), haben (have) |
| Maintainable item, Process of Degradation, participatesIn | 18 | wagen (car), km (km), fahren (drive) |
| Maintainable item, Function, hasFunction | 14 | motor (motor), gas (acceleration), geben (increase) |
Categories for entities and relations, which should be added to ROMAIN (idiomatic translations of German terms are added in parentheses).
| category | OCC | example triple |
|---|---|---|
| cost coverage | 13 | kosten (costs), BMW (BMW), übernehmen (bear) |
| operation mode | 12 | kmh (kilometres per hour), autobahn (highway), fahren (drive) |
| maintenance provider | 8 | ölwechsel (oilchange), bmw (bmw), machen (perform) |
| certificate | 3 | TÜV (vehicle inspection certificate by the Technical Inspection Agency), wagen (vehicle), bekommen (receive) |