| Literature DB >> 33495359 |
Ymir Vigfusson1,2, Thorgeir A Karlsson2, Derek Onken3, Congzheng Song4, Atli F Einarsson2, Nishant Kishore5, Rebecca M Mitchell3, Ellen Brooks-Pollock6, Gudrun Sigmundsdottir7,8.
Abstract
Epidemic preparedness depends on our ability to predict the trajectory of an epidemic and the human behavior that drives spread in the event of an outbreak. Changes to behavior during an outbreak limit the reliability of syndromic surveillance using large-scale data sources, such as online social media or search behavior, which could otherwise supplement healthcare-based outbreak-prediction methods. Here, we measure behavior change reflected in mobile-phone call-detail records (CDRs), a source of passively collected real-time behavioral information, using an anonymously linked dataset of cell-phone users and their date of influenza-like illness diagnosis during the 2009 H1N1v pandemic. We demonstrate that mobile-phone use during illness differs measurably from routine behavior: Diagnosed individuals exhibit less movement than normal (1.1 to 1.4 fewer unique tower locations; [Formula: see text]), on average, in the 2 to 4 d around diagnosis and place fewer calls (2.3 to 3.3 fewer calls; [Formula: see text]) while spending longer on the phone (41- to 66-s average increase; [Formula: see text]) than usual on the day following diagnosis. The results suggest that anonymously linked CDRs and health data may be sufficiently granular to augment epidemic surveillance efforts and that infectious disease-modeling efforts lacking explicit behavior-change mechanisms need to be revisited.Entities:
Keywords: call detail records; disease; influenza; outbreak; surveillance
Mesh:
Year: 2021 PMID: 33495359 PMCID: PMC8017972 DOI: 10.1073/pnas.2005241118
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Combining health records with call-data records. (Left) Cell towers act as a proxy for location, which, when coupled with the timestamp, allow movement inference. Different colors show inferred movements of a typical cell-phone user at different time periods over a period of 3 d. (Right) The epidemic curve for the 2009 H1N1v outbreak in Iceland, showing a single pronounced peak. The green dotted line shows the number of laboratory samples taken, the red line shows the number of those testing positive for H1N1v, and the black line shows the estimate of suspected H1N1v cases per week from the recorded ILI incidence (19). The expected H1N1v positive cases (blue dotted line) are extrapolated from the suspected ILI cases and the percentage of samples found positive each week.
Feature characteristics from the 29-d period around each individual’s DoD (additional characteristics are in )
| Control | Diagnosed | ||
| Feature | Mean | Mean | Anomalous days |
| − | |||
| Number of new locations visited | 0.5 | 0.43 | 1, 3 |
| Unique contact count | |||
| Incoming | 2.25 | 2.02 | |
| Outgoing | 2.50 | 2.28 | 1, 2, 3, 6 |
| Both | 4.04 | 3.67 | 2 |
| New contacts | |||
| Incoming | 0.61 | 0.50 | |
| Outgoing | 0.65 | 0.55 | 1, 3, 6 |
| Both | 1.19 | 1 | 0, 1, 3 |
| Call duration, total, s | |||
| Incoming | 190 | 480.51 | 0, 1, 2 |
| Outgoing | 162 | 435.09 | 0 |
| Both | 479.5 | 915.60 | |
| Calls count | |||
| Incoming | 3.10 | 2.84 | |
| Both | 6.66 | 6.22 | 1 |
| Texts count | |||
| Incoming | 2.71 | 2.87 | −10 |
| Outgoing | 1.74 | 1.93 | 1, 2 |
| Both | 4.46 | 4.79 | −10, 11 |
| Calls and texts count | |||
| Incoming | 5.78 | 5.71 | |
| Outgoing | 5.34 | 5.30 | 1, 2 |
| Both | 11.12 | 11.01 | 2 |
| Mean call duration | |||
| Incoming | 133.35 | 140.74 | −1, 0, 1, 2, 4, 11 |
| Outgoing | 107.96 | 106.74 | |
| − | |||
| Top 3 contacts by duration | |||
| ( | |||
| Incoming | 0.68 | 0.69 | |
| Outgoing | 0.70 | 0.69 | |
| Both | 1.37 | 1.38 | 1 |
| Remaining contacts by duration | |||
| Incoming | 1.57 | 1.33 | |
| Outgoing | 1.80 | 1.58 | 1, 2, 4, 6 |
| Both | 3.37 | 2.92 | 1, 4, 6 |
| Top 3 contacts by frequency | |||
| Incoming | 0.58 | 0.56 | |
| Outgoing | 0.63 | 0.61 | 1, 2 |
| Both | 1.21 | 1.17 | 1 |
| Remaining contacts by frequency | |||
| Incoming | 1.67 | 1.46 | 0 |
| Outgoing | 1.88 | 1.67 | −9, 2, 6 |
| Both | 3.54 | 3.13 | |
Fig. 2.Changes in average phone-use behavior associated with diagnosis. (Left) Users visit fewer locations on days around diagnosis. (Center) They make and receive longer phone calls on days near diagnosis. (Right) They initiate fewer calls on the days after diagnosis, with the exception of the day of diagnosis itself. Graphs display the mean deviation from “normal” routine behavior () for each group on the relative day of illness determined by DoD (day 0). CIs (2.5 to 97.5%) are calculated using bootstrapping ().
Fig. 3.Privacy-preserving data-sharing protocol. Privacy-preserving architecture for syndromic surveillance using CDR data for future experimental design. An independent third-party broker is provided with real-time deidentified CDR data, extracts features, and runs the prediction models to generate an epidemic curve (Left; O1). The broker could also be provided labeled anonymous health information to join with the CDR data to calibrate or retrain the classifiers (Right; O2). The design accommodates mutual distrust, ensuring that health officials cannot monitor behavior or track mobility of individuals, that MNOs are not provided with any health information of customers, and that the broker only operates on deidentified data.