| Literature DB >> 29379849 |
Jaromír Salamon1, Roman Mouček1.
Abstract
Sentiment extraction and analysis using spoken utterances or written corpora as well as collection and analysis of human heart rate data using sensors are commonly used techniques and methods. On the other hand, these have been not combined yet. The collected data can be used e.g. to investigate the mutual dependence of human physical and emotional activity. The paper describes the procedure of parallel acquisition of heart rate sensor data and tweets expressing sentiment and difficulties related to this procedure. The obtained datasets are described in detail and further discussed to provide as much information as possible for subsequent analyses and conclusions. Analyses and conclusions are not included in this paper. The presented experiment and provided datasets serve as the first basis for further studies where all four presented data sources can be used independently, combined in a reasonable way or used all together. For instance, when the data is used all together, performing studies comparing human sensor data, acquired noninvasively from the surface of the human body and considered as more objective, and human written data expressing the sentiment, which is at least partly cognitively interpreted and thus considered as more subjective, could be beneficial.Entities:
Keywords: Common timeline; Heart rate; Sentiment analysis; Soft and hard data relation; Wearable technology
Year: 2017 PMID: 29379849 PMCID: PMC5779535 DOI: 10.1016/j.dib.2017.10.037
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Summary of heart rate data.
| 411,799 | 69,941 | |
| 6147 | 961 | |
| 9469 | 1440 | |
| 8074 | 1371 | |
| 2016-05-10 00:00:05 CET | 2016-08-15 00:00:00 CET | |
| 2016-06-29 23:58:55 CET | 2016-10-04 23:59:00 CET | |
| Every 5 s | 32 times per second | |
| Every 5–15 s | Every 1 min, | |
| i.e. 6–7 records per minute | i.e. 1 record per minute |
Only the samples where HR value is available (not equal to N/A), the total number of samples is 73,440.
Information was obtained from a company representative response in the Fitbit community forum [14]
Information was obtained from a company representative response in the Basis Peak forum [3] and [4] and from a prestige IT magazine [5].
Summary of gaps in data collections.
| 54 | 85 | 20 | 24 | |
| 7 | 10 | 0 | 0 | |
| 6 | 6 | 0 | 0 | |
| 6 | 12 | 4 | 9 | |
| 73 | 113 | 24 | 33 | |
Fig. 1Gaps in the data per day and hour.
Summary of the corpus data per tweet.
| 1029 | 1017 | |
| 780 | 718 | |
| 249 | 299 | |
| 20.56 | 20.32 | |
| 3.13:1 | 2.40:1 | |
| 35/50 (70%) | 35/50 (70%) | |
| 2016-05-10 07:36:17 CET | 2016-08-15 10:01:24 CET | |
| 2016-06-29 01:10:15 CET | 2016-10-04 00:03:35 CET |
Fig. 2Distribution of tweets per hour for both experiments.
Fig. 3Deviations between real tweeting time and expected tweeting time.
The time zone matrix for a particular data source system and specific data processing phases.
| UTC | CET | UTC | |
| CET | CET | UTC | |
| CET | CET | CET | |
| CET | CET | CET |
Fig. 4Fitbit Charge HR wearable from Fitbit company. Picture sourced from Fitbit press release kit: https://investor.fitbit.com/press/press-kit/charge-hr/default.aspx.
Fig. 5Basis Peak wearable from Basis company. Picture sourced from YouTube library (Peak is produced by Basis Company anymore): https://i.ytimg.com/vi/zhBYMR8t4_Y/maxresdefault.jpg.
Fig. 6HR data aggregation between two sentiment values.
Fig. 7HR data aggregation over sentiment values.
| Subject area | Health Informatics, Health Science |
| More specific subject area | Using wearables and social media to collect hard and soft data for further usage in health science |
| Type of data | Text files (4 CSV files) |
| How data was acquired | Pilot experiment – 2×50 days, one participant, Twitter – sentiment data, Fitbit Charge HR, Basis Peak – heart rate data |
| Data format | Raw, Preprocessed |
| Experimental factors | Heart rate collected 24×7 together with min. 20 tweets recorded per day during 2× 50 days' experiments |
| Experimental features | The paper describes the procedure of parallel acquisition of heart rate sensory data and tweets expressing sentiment and difficulties related to this procedure. |
| Data source location | Zurich, Switzerland |
| Data accessibility | Data is provided with this article |