| Literature DB >> 33800574 |
Arsalan Shahid1, Thien-An Ngoc Nguyen1, M-Tahar Kechadi1.
Abstract
Obesity is a major public health problem worldwide, and the prevalence of childhood obesity is of particular concern. Effective interventions for preventing and treating childhood obesity aim to change behaviour and exposure at the individual, community, and societal levels. However, monitoring and evaluating such changes is very challenging. The EU Horizon 2020 project "Big Data against Childhood Obesity (BigO)" aims at gathering large-scale data from a large number of children using different sensor technologies to create comprehensive obesity prevalence models for data-driven predictions about specific policies on a community. It further provides real-time monitoring of the population responses, supported by meaningful real-time data analysis and visualisations. Since BigO involves monitoring and storing of personal data related to the behaviours of a potentially vulnerable population, the data representation, security, and access control are crucial. In this paper, we briefly present the BigO system architecture and focus on the necessary components of the system that deals with data access control, storage, anonymisation, and the corresponding interfaces with the rest of the system. We propose a three-layered data warehouse architecture: The back-end layer consists of a database management system for data collection, de-identification, and anonymisation of the original datasets. The role-based permissions and secured views are implemented in the access control layer. Lastly, the controller layer regulates the data access protocols for any data access and data analysis. We further present the data representation methods and the storage models considering the privacy and security mechanisms. The data privacy and security plans are devised based on the types of collected personal, the types of users, data storage, data transmission, and data analysis. We discuss in detail the challenges of privacy protection in this large distributed data-driven application and implement novel privacy-aware data analysis protocols to ensure that the proposed models guarantee the privacy and security of datasets. Finally, we present the BigO system architecture and its implementation that integrates privacy-aware protocols.Entities:
Keywords: big data representation; big data security; healthcare data; privacy-aware models
Year: 2021 PMID: 33800574 PMCID: PMC8037603 DOI: 10.3390/s21072353
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Types of information in Electronic Health Records (I: identifiable, Q-I: quasi-identifiable, S: sensitive).
| Attribute Types | Description | Examples | Privacy |
|---|---|---|---|
| Identifiers | Person identification | name, email, address | phone number |
| Demographics | Person classification to a specific group of the population | race, age, gender, area, postal code, education, occupation, marital status | Q-I |
| Personal Biometrics | Medical information related to physical health | X-Ray, MRI, ultrasound, blood pressure, cholesterol, heart rate, allergies, ICU incidents, tests reports |
|
| Clinical information | Medical history | diagnoses, dosages, treatment services, medication, encounters, problems, therapies | Q-I, S |
| Mental information | Related to psychological, psychiatric, and psychosocial issues | sleep problems, psychology, excessive dieting, psychological sexual disorders |
|
| Life-style and activity information | Relevant to physical activities, life-style | physical activities, exercise regime, nutrition, energy consumption through exercises |
|
| Insurance and financial matters | Related to billing, reimbursements, insurance | DRG, financial class, primary and specialist providers | Q-I, S |
Figure 1A taxonomy of personal data-privacy techniques. Adopted from [22].
Figure 2Overview of the BigO system. (1) Citizen scientists. (2) Children are monitored at clinics, through smartwatch and mobile apps. (3) Anonymised and encrypted data transmission. (4) External data sources (maps, POIs, area statistics). (5) BigO cloud data aggregation and processing. (6) Data analytics and visualisation libraries and tools. (7) Secure distributed database storage. (8) Policy advisor service. (9) Policy planner service. (10) School and Clinical advisor services. (11) A clinician using web tools to monitor and guide children. (12) Web tool for policymaker decision support. (13) Policymakers identify childhood obesity conditions. (14, 15, 16) Policies are applied to hospitals, schools, and the community or regional level. (17) Applied policies affect Citizen Scientists, closing the loop and initiating another round of data collection and analysis.
Figure 3BigO information flow.
Figure 4Hierarchical classification of BigO’s required raw data sources. The smartphone is differentiated visually from its siblings, as it is a hybrid raw data source (due to the food advertisement photos).
Data collection modes that deployed in the BigO system. All modes include the use of a smartphone. Abbreviations: Smartphone (SP), Wristband (WB), and Mandometer (MM).
| Collected Data Type | Light | Standard | Enhanced |
|---|---|---|---|
| ine Accelerometry | SP | SP/WB | SP/WB |
| GPS | SP | WB (with GPS) | WB |
| Meal self-reporting | SP | SP | SP |
| Food Barcode scanning | SP | SP | SP |
| Food pictures | SP | SP | SP |
| Meal eating behaviour | - | Limited MM use | Extended MM use |
Figure 5BigO Data Warehouse Architecture.
Figure 6Component diagram of the BigO system.
Figure 7Privacy-aware data analysis protocol.
Figure 8Privacy-aware BigO architecture.