Thang Hoang1, Pietro Coletti1, Alessia Melegaro2, Jacco Wallinga3,4, Carlos G Grijalva5, John W Edmunds6, Philippe Beutels7, Niel Hens1,7. 1. From the Interuniversity Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Agoralaan Gebouw D, Diepenbeek, Belgium. 2. Carlo F. Dondena Centre for Research on Social Dynamics and Public Policy, Bocconi University, Milano, Italy. 3. Centre for Infectious Disease Control, National Institute for Public Health, Bilthoven, The Netherlands. 4. Department of Biomedical Data Sciences, Leiden University, Leiden, The Netherlands. 5. Department of Health Policy, Vanderbilt University School of Medicine, Nashville, TN. 6. Department of Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, United Kingdom. 7. Centre for Health Economics Research and Modelling Infectious Diseases, Vaccine and Infectious Disease Institute, University of Antwerp, Antwerp, Belgium.
Abstract
BACKGROUND: Researchers increasingly use social contact data to inform models for infectious disease spread with the aim of guiding effective policies about disease prevention and control. In this article, we undertake a systematic review of the study design, statistical analyses, and outcomes of the many social contact surveys that have been published. METHODS: We systematically searched PubMed and Web of Science for articles regarding social contact surveys. We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines as closely as possible. RESULTS: In total, we identified 64 social contact surveys, with more than 80% of the surveys conducted in high-income countries. Study settings included general population (58%), schools or universities (37%), and health care/conference/research institutes (5%). The largest number of studies did not focus on a specific age group (38%), whereas others focused on adults (32%) or children (19%). Retrospective (45%) and prospective (41%) designs were used most often with 6% using both for comparison purposes. The definition of a contact varied among surveys, e.g., a nonphysical contact may require conversation, close proximity, or both. We identified age, time schedule (e.g., weekday/weekend), and household size as relevant determinants of contact patterns across a large number of studies. CONCLUSIONS: We found that the overall features of the contact patterns were remarkably robust across several countries, and irrespective of the study details. By considering the most common approach in each aspect of design (e.g., sampling schemes, data collection, definition of contact), we could identify recommendations for future contact data surveys that may be used to facilitate comparison between studies.
BACKGROUND: Researchers increasingly use social contact data to inform models for infectious disease spread with the aim of guiding effective policies about disease prevention and control. In this article, we undertake a systematic review of the study design, statistical analyses, and outcomes of the many social contact surveys that have been published. METHODS: We systematically searched PubMed and Web of Science for articles regarding social contact surveys. We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines as closely as possible. RESULTS: In total, we identified 64 social contact surveys, with more than 80% of the surveys conducted in high-income countries. Study settings included general population (58%), schools or universities (37%), and health care/conference/research institutes (5%). The largest number of studies did not focus on a specific age group (38%), whereas others focused on adults (32%) or children (19%). Retrospective (45%) and prospective (41%) designs were used most often with 6% using both for comparison purposes. The definition of a contact varied among surveys, e.g., a nonphysical contact may require conversation, close proximity, or both. We identified age, time schedule (e.g., weekday/weekend), and household size as relevant determinants of contact patterns across a large number of studies. CONCLUSIONS: We found that the overall features of the contact patterns were remarkably robust across several countries, and irrespective of the study details. By considering the most common approach in each aspect of design (e.g., sampling schemes, data collection, definition of contact), we could identify recommendations for future contact data surveys that may be used to facilitate comparison between studies.
Despite the great progress in infectious disease control and prevention that were initiated during the last century, infectious pathogens continue to pose a threat to humanity, as illustrated by severe acute respiratory syndrome, influenza, antimicrobial resistant bacteria, Ebola, and resurgent measles, potentially disrupting everyday life, burdening public health, and occasionally dominating media headlines well into the 21st century.Many infectious diseases can spread rapidly between people within and between age groups, households, schools, workplaces, cities, regions, and countries through a diversity of social contacts.[1]. Understanding and quantifying social mixing patterns is therefore of critical importance to establishing appropriate simulation models of the spread of infectious diseases. Such mathematical transmission models have become indispensable to guide health policy. Which interventions should be offered to which people in which circumstances? How would such interventions affect transmission chains and the disease burden throughout the population? What would be the population effectiveness and cost-effectiveness of such interventions? Well-informed answers to these questions require mathematical models. The validity of such models depends heavily on the appropriateness of their structure and their parameters, including what they assume about how people interact.Indeed, a transmission model’s integrated mixing patterns (i.e., who mixes with whom?) have a strong influence on the transmission parameters (i.e., who infects whom?). The latter are the most influential drivers for the outputs of such models. Whereas 20th-century models made strong assumptions about mixing patterns, it has become increasingly common to use empirical data on social interactions as a direct model input over the last decade.[2,3] For sexually transmitted infections, data from surveys on sexual behavior were available for use as an input for models. On the other hand, for infectious diseases that are transmitted by direct contact, minimal data on relevant social contacts was available. Edmunds et al[4] conducted a first study aimed to collect precisely this information using a convenience sample. This study was followed by a study that reported on relevant social contacts in a representative sample of the population that covered all ages in a city.[5] The landmark study that reported on relevant social contacts in representative samples for eight different European countries using contact diaries was the POLYMOD study.[6] Numerous other studies have been reported since. Several of these studies report on social mixing patterns as obtained through direct observation, contact diaries, or electronic proximity sensors. The strengths and weaknesses of these methods have been discussed.[7] Nevertheless, to our knowledge, a comprehensive review of the study designs for contact diaries and of major determinants of mixing patterns is lacking for this rapidly growing field of research, a gap which we aim to fill here.In the current article, we systematically retrieve and review the literature on social contact surveys. First, we provide an overview of the literature to help identify a standard. Second, we present the different approaches for data collection and identify strengths and limitations. Third, we report on the main determinants of contact. We use these findings to guide future studies.
METHODS
We conducted a systematic review in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.[8]
Search Strategy
We queried PubMed and ISI Web of Science, without time and language restriction up to 31 January 2018 using the following search string:(([survey*] OR [questionnaire*] OR [diary] OR [diaries]) AND ([social contact*] OR [mixing behavio*] OR [mixing pattern*] OR [contact pattern*] OR [contact network*] OR [contact survey*] OR [contact data])EndNote X7 was used to eliminate duplicates and manage the search results.[9]
Inclusion Criteria
We considered studies eligible if they fulfilled all of the following criteria: (1) primary focus on face-to-face contacts of humans, implying the physical presence of at least two persons during contact; (2) contacts relevant for the transmission of close-contact infections; (3) contacts recorded using a diary-type system on paper or in electronic format; (4) full-text version available.
Exclusion Criteria
We excluded studies that involved at least one of the following: (1) primary focus on human–animal or animal–animal contacts; (2) recording contacts exclusively relevant for sexually transmitted, food-, vector-, or water-borne diseases; (3) using exclusively proximity sensor devices or observational methods to collect contact data; (4) including contacts without physical presence (e.g., phone, internet/social media contacts) or without the possibility to distinguish them; (5) recording the frequency or regularity but not the number of contacts over a given time period; (6) meeting abstracts, books, theses, or unpublished journal articles.An overview of the selection process is presented in Figure 1. Title, abstract, and full-text were screened initially by the first author and double-checked by the second author.
FIGURE 1.
The PRISMA flowchart of the search process.
The PRISMA flowchart of the search process.
Data Extraction and Analysis
Three authors (T.V.H, P.C., N.H.) designed a data input form (see eTable 1; http://links.lww.com/EDE/B552). T.V.H. extracted relevant information from selected articles and inputted it in the form. P.C. performed verification to ensure data consistency and accuracy. We structured the data synthesis according to: (1) information on surveys and relevant articles: year and countries in which surveys were performed, authors and year of first publication, and relevant publications that used the same dataset; (2) information on survey’s methodology: study setting, study subjects, final sample size, response rates, sampling scheme, data collection tools, collection modes, study design (prospective, retrospective or both), and reporting period; (3) information recorded on participants and contactees; (4) characteristics of contacts reported: types, definition, location, duration, and frequency of contacts; (5) analysis results: average and median number of contacts, SD, quantiles/range, and relevance of determinants for number of contacts. Data that could not be found in individual articles were given a value of “not available” for the corresponding variable.This systematic review aggregates information of articles in the literature, so ethical review by an independent review board was unnecessary.
RESULTS
The Screening Process
Our search retrieved 1445 nonduplicate articles, with 73 suitable articles included in the review. Figure 1 shows the study selection process.
Country Settings
The 73 remaining articles covered 64 social contact surveys conducted in 24 countries spread over five continents: 12 European,[4-6,10-33] five Asian,[31,34-39] four African,[40-43] two American,[44-49] and one Oceanian[50-52] country. More details on number of social contact surveys in each of these countries are shown in the global map. Only 14 studies were conducted at the whole-country level,[6,13,14,26,28,32,37,38] whereas remaining studies focused on a region,[12,34,35,39,41,43,47] a city or town,[5,36,40,42,51] or a specific setting (school or university, health care facility, etc.), and were therefore not representative of the entire country. Figure 2 demonstrates that 40 out of 64 of the surveys were conducted in Europe followed by Asia with 10 surveys. In contrast, only a few surveys were conducted in other regions. In this representation, we count several countries separately, even if they were included as part of a single project.[6,11,31,41]
FIGURE 2.
Distribution of number of representative surveys based on continents and time periods. Countrywide surveys cover the whole country in which they were conducted.
Distribution of number of representative surveys based on continents and time periods. Countrywide surveys cover the whole country in which they were conducted.The number of surveys greatly increased over time from only four surveys before the year 2000 up to 37 surveys after 2009, indicating that social contact surveys are increasingly conducted. In addition, no survey was conducted outside Europe before 2005. One survey did not indicate the year it was conducted. For this study, we used the publication year minus two as a proxy.[4]
Study Settings and Subjects
More than half of the social contact surveys were conducted in the community or general population (58%). Of these, there were only four household-based surveys that asked every member of each participating household to complete the survey.[14,35,39,44] The majority of surveys conducted in the general population aimed at people of all ages (65%).[6,12,14,26-28,32-34,36-40,42-44,47] In contrast, two surveys excluded infants younger than 1 year of age,[5] and one excluded children less than 2 years.[35] Four surveys focused exclusively on adults,[29,41,51] two investigated contact patterns of infants (under 11 weeks[25] and under 1 year old[52]), and one was aimed at patients with pandemic influenza AH1N1pdm09 (swine flu).[20] More specific settings of schools or universities constituted 38% of the surveys, of which 11 surveys were conducted at schools (primary schools,[16] secondary schools,[23] high schools,[48,49,53-56] or a combination of those[20,21,46]) and 13 surveys were performed in universities.[4,10,17,19,24,31,33] In addition to school or university settings, we also identified one contact survey on nurses in a health-care setting,[15] one survey at a conference,[18] and one survey in a research institute.[30]
Sample Size and Response Rate
Among social contact surveys conducted in the general population, the smallest survey only consisted of 54 participants in Switzerland,[29] and the largest survey consisted of 5388 participants in the United Kingdom.[26] The largest survey in a school/university setting contained 803 participants in Germany[17] (see eFigure 2; http://links.lww.com/EDE/B552). The response rate was reported in 36 out of 64 surveys and ranged from 4% in population-based surveys[26] up to 100% in a school-based survey.[17] Of these 34 surveys, only three considered the response rate beforehand to estimate the sample size.[15,38,47] Instead of considering the response rate, some surveys established criteria to replace those who refused or were not reached after several attempts.[39-41] Twenty surveys determined sampling weights based on demographic characteristics of the populations to reduce the effects of sample bias.[6,11-13,25,26,32,36-41,51]
Sampling Methods
Approximately half (44%) of the surveys employed convenience sampling, in which subjects were selected based on their convenient accessibility to researchers.[57] This sampling technique was also used for the sake of comparing data collection tools,[18,49,50,56,58] data collection methods,[59] or study designs.[10,17,50]Seven surveys used random sampling.[5,26,28,34,35,51] Among these studies, only two surveys were considered representative of the entire country,[26,28] and the remaining surveys were representative of a region[34,35] or a city or town.[5,51] Three surveys employed multi-stage sampling,[15,38,39] and 10 surveys stratified sampling[12,14,25,37,40-42,47] that are easier to implement with respect to random sampling and can still remain representative. In addition, 10 surveys relied on quota sampling, which aimed to represent certain characteristics of a population (e.g., age, sex, geography, etc). Of these surveys, nine were conducted at the whole-country level,[6,13,32] and one survey focused on one specific region.[36] In addition, one survey used mixed samplings, in which a convenience sample of students was obtained in two schools, and a random sample of the general population was obtained in one province.[33] Five surveys used an online respondent-driven method, which can be considered as a snowball or chain sampling technique.[31,45,60] Only one survey did not state information on sampling techniques.[34] Finally, three surveys conducted at the general population level used a convenience sample,[29,44,50] therefore not relying on a sampling frame. More details on the distribution of sampling schemes based on time and regions are presented in eFigure 3; http://links.lww.com/EDE/B552.
Study Design
By prospective design, we mean that respondents are informed in advance of the day(s) that they are requested to record their contacts.[6,17,32] In a retrospective design, respondents recall their contacts over a past time period without prior warning or instruction that they would be requested to do so. Of 64 surveys, 29 (45%) used a retrospective design and 26 (41%) used a prospective design. Only four surveys 6%) used both designs for the purpose of comparison.[10,17,50] For five surveys (8%), it was not completely clear whether the study was prospective or retrospective.[20,24,30,48] eFigure 4; http://links.lww.com/EDE/B552 displays the trend of using study designs in social contact surveys over time, revealing that the retrospective design was more favored by researchers, except in the period 2005–2009 in which eight prospective surveys were implemented.[6,32]
Definition of Contact
Forty (63%) surveys distinguished physical and nonphysical contacts. Physical contacts were consistently defined as involving any sort of skin-to-skin touching (e.g., handshake, hug, kiss, etc). The definition of nonphysical contacts differed somewhat among surveys. Specifically, the majority of surveys using two types of contacts defined a nonphysical contact as a two-way conversation of at least three words at a distance that does not require raising one’s voice.[6,10,36,39,43,50,52,54,59] In some other surveys, the definition involved close proximity (e.g., verbal communication made within 2 m) without specification of a minimum number of words to be exchanged.[13,26,30,38,41,61] Of note, that since the POLYMOD contact studies were executed,[6] its contact definition was applied in several subsequent surveys.[33,36,39,43,44,52,55] Fifteen surveys used only one type of contact, either involving a face-to-face conversation[4,5,16,23,33,45] or being in close proximity,[31,48,56] both regardless of any skin-to-skin touching[37,49] or only involving direct skin-to-skin touching.[28,40] Only one survey attempted to record casual contacts occurring in an indoor location without the requirement for a conversation or any type of touching.[42] Eight remaining surveys added kissing or intimate sexual contact as different types of contacts[17,19,29,46] or asked respondents to record contacts made in small/large groups or occasional contacts within 2 m in local transportation or crowded places separately.[17]
Reporting Time Period
Greater than half the surveys asked respondents to report contacts they made during a single day, whereas only six surveys used a reporting time period of greater than 3 days. The longest time period identified is 3 days in a prospective survey[19,50] and ten weeks in a retrospective survey.[45] Seven surveys recorded both weekdays and weekend days on the same respondents.[6,10,17,19,52-54] Finally, Eames et al[20] quantified the changes in social contact patterns experienced by individuals experiencing an episode of Influenza A(H1N1) on two randomly assigned days: one day while being ill and one day when recovered.
Characteristics of Participants and Contactees
Most surveys collected a range of demographic background characteristics of study participants (e.g., age, sex, education, household size). Some surveys also asked participants to record any influenza-like-illness symptoms they experienced on the day of surveying[31,47,53,60] or whether their day was in any way special (due to holiday, sickness, etc.).[6,12]Among the characteristics of contactees, age and sex are considered to be the most important determinants of the mixing patterns given that they can help explain age and sex differences in the epidemiology of infectious diseases.[13] Thirty-six of 64 (56%) surveys recorded both age and sex of contactees, and 16 (25%) surveys recorded only age of contactees. In contrast, seven (11%) surveys required participants to simply report the number of different contactees without recording any of their characteristics.[17,29,45,47,49] In five surveys (8%), it was not clear what contactee characteristics participants had to report.[24,26,30,35,48] Along with age and sex, several surveys also asked participants to record health status of contactees and any symptoms they experienced, e.g., coughing, sneezing, fever, etc.[45,53-55] or whether they wore a protective mask.[53,54]
Information About Contacts
Participants were asked to record information about location, duration, and frequency of each contact in 77%, 67%, and 52% of contact surveys, respectively. All these contact characteristics were jointly recorded in 27 surveys (42%). For more details on the number of surveys considering these informations, see eFigure 5; http://links.lww.com/EDE/B552.
Mean Number of Contacts and Analysis of Determinants
Of the 64 surveys, 45 explicitly reported the average number of contacts measured without any stratification (Figure 3). To compare these survey results, we categorized them into 12 groups with different extents of representativeness (for country, region, or town or city), study designs and settings. In country-wide prospective surveys, the average number of contacts ranges from a minimum of 7.95 (95% confidence interval [CI] = 7.61, 8.29) in Germany[6] to a maximum of (95% CI = , ) in the United Kingdom.[26] In country-wide retrospective surveys, these values ranges from (95% CI = , ) in Taiwan[38] to (95% CI = , ) in Japan.[37] Six surveys conducted in the general population asked participants not to report details of professional contacts in the diary but to provide an estimate of the number and age distribution if they had more than 10 contacts (surveys in Finland,[6] Germany,[6] and the Netherlands[6]) or more than 20 professional contacts (surveys in Belgium[6,12] and France[13]). The additional professional contacts are not included in calculation of means presented in Figure 3. Among school- or university-based surveys, the highest number of contacts ( [95% CI = , ]) was observed in a secondary school in the United Kingdom.[23]
FIGURE 3.
Average number of contacts measured and 95% CI. For surveys reporting mean number and SD a 95% CI for the mean was computed. Surveys are labeled according to the publication’s first author, year and to the country in which the survey was performed. Ordering is performed based on increasing sample size within the specific design strata.
Average number of contacts measured and 95% CI. For surveys reporting mean number and SD a 95% CI for the mean was computed. Surveys are labeled according to the publication’s first author, year and to the country in which the survey was performed. Ordering is performed based on increasing sample size within the specific design strata.Figure 4 presents the number of surveys that have analyzed possible determinants for the number of contacts. For every determinant, we report the number of surveys that identified a relevant connection with the number of contacts (“yes”), the number of surveys that did not identify such a connection (“no”) and the number of surveys that did not delve into the matter. Strong evidence is identified for whether the age (34 yes vs. five no) and the household size (21 yes vs. four no) of the participant affected the number of contacts. Only five surveys identified sex as a relevant indicator for the number of social contacts, in contrast to 23 surveys that did not identify a relation. Social contacts are also affected by the daily routine (29 yes vs. six no) with a larger number of contacts during weekdays compared with the weekend (eFigure 6a; http://links.lww.com/EDE/B552 with the exception of Ref. 36). Similar results hold for term time versus holidays with all of the eight surveys that analyzed the issue identifying a larger number of contacts during term time (eFigure 6b; http://links.lww.com/EDE/B552). In addition, a self-reported healthy status is associated (five yes vs. one no) with a larger number of contacts with respect to feeling ill (eFigure 6c; http://links.lww.com/EDE/B552).
FIGURE 4.
Determinants of number of social contacts. Surveys are tagged as “Yes” if a relevant connection between the number of contacts and the determinant was identified, “No” if evidence was not identified, or “NA” if the given determinant was not analyzed.
Determinants of number of social contacts. Surveys are tagged as “Yes” if a relevant connection between the number of contacts and the determinant was identified, “No” if evidence was not identified, or “NA” if the given determinant was not analyzed.The relationship between social contacts and urbanization has been analyzed in three surveys. One survey found a larger number of contacts in peri-urban areas compared with rural areas,[43] one found the opposite,[40] and one did not find any evidence.[35] Finally, two surveys analyzed contacts during and outside flu seasons. Of these surveys, one survey[47] used a model to adjust for other factors (e.g., age and sex) and one did not.[55] However, both surveys identified no relevant effect (eFigure 6d; http://links.lww.com/EDE/B552).The Table provides summaries of all 64 social contact surveys.Summaries of Social Contact Surveys
DISCUSSION AND CONCLUSIONS
Social contact surveys are increasingly used to collect empirical data on human contact behavior and provide crucial inputs for mathematical models of infectious disease transmission. The POLYMOD project[6] presented the first large scale representative population surveys conducted in eight European countries. It also shared know–how both for data collection and analysis.To date, most of these contact surveys were conducted in high- and middle-income countries, whereas low-income countries, which have a higher burden of communicable diseases, were less studied in this respect. For this reason, there is a need to continue studying contact patterns more widely and in particular in low- and middle-income settings. It is also worth noting that in low- and middle-income countries, the choice to perform a general population representative survey may be less meaningful, given the large variety of different settings (urban, rural, etc.) that are simultaneously present.Most surveys did not clearly present sample size calculations, so we do not know to what extent important parameters, e.g., population size, confidence level, and margin of error, were taken into account.[62] Sample size estimation is even more important when one wants to compare social contact surveys between or among populations. Given the lack of a clear picture regarding which demographic and anthropologic factors are relevant in shaping contact patterns, inherent factors that may drive contact patterns are difficult to account for, thus making comparison among large populations, e.g., countries, even more difficult. It should be noted, however, that this is a relevant issue only when considering the contact matrix. Given the intrinsic network nature of social contact data more in general, sample size calculation becomes more involved and relies heavily on assumptions about the structural characteristic of the network and that no standard method to do so has emerged so far.[63,64] Indeed, the extensive analysis of this review has underlined a general lack of information on response rates, and a call for a better nonresponse analysis emerges as a guideline for future studies.The prospective design is subject to less recall bias than the retrospective design. This notion can be partly explained by the fact that respondents in the former are informed in advance about which days they will be assigned for reporting their contact information. Furthermore, they are also asked to keep a diary with them and finish reporting before the surveying day is elapsed. Thus, the prospective design requires more commitment from respondents. In return, a prospective design can obtain more reported contacts compared with retrospective design.[17,50] However, large-scale studies are needed to further confirm these conclusions.The use of self-reported diaries (paper or online) is the most commonly employed method in social contact surveys, associated with a smaller response rate with respect to e.g., face-to-face interviews. On the other hand, face-to-face interviews are more demanding in terms of fieldwork and data collection. No clear relationship in the number of contacts has been found when comparing online diaries with paper diaries,[10,36] while proximity sensors are more accurate in measuring short duration contacts, with the overall interaction pattern being similar between sensor and diaries[18,56] (for more details about data collection, see eAppendix; http://links.lww.com/EDE/B552). It should also be noted that proximity sensors usually perform a complete sampling of the interaction networks, whereas diary-based survey usually implement egocentric sampling (but complete sampling is not excluded in principle[14,35,39,44]). Although egocentric sampling does not allow estimation of several important network features, it still does not bias inference results, if properly taken into account.[65,66]The definition of a potentially infectious contact is of crucial importance given that it will be used as a surrogate for exposure to disease.[15] Contact definitions in most surveys cannot capture potential risks from all forms of transmission modes,[30] such as fomite or indirect contact. Even for droplet transmission, using a face-to-face conversation definition as the basis for recording nonphysical contacts might lead to underreporting potentially infectious events given that susceptible individuals are likely able to contract a respiratory infection by just standing next to infected individuals who are, for example, sneezing or coughing. Furthermore, it seems even more challenging to record common touching frequency of shared material objects, such as doorknobs, water taps, etc., with which one person may be able to infect another indirectly. Indeed, the more details on potentially infectious events we attempt to collect, the greater the burden we impose on respondents. However, it seems reasonable in future studies to consider at least two contact definitions, one that involves physical contact and one that does not.It is tempting to ask study participants to report their contacts as long as possible to gain insights into day-to-day variation. Nevertheless, the demanding task of diary–keeping may prevent many participants from recording the information for a long time in prospective studies.[67] Béraud et al[13] demonstrated that participants reported 6% (1%–10%) fewer contacts on the second day of the survey. In addition, the more contacts they reported on the first day, the larger the proportional decrease in number of contacts on the second day. For retrospective studies, a longer reporting period implies a longer recall period, with an associated larger bias. Therefore, in retrospective studies, researchers should try not to overstretch the reporting period.This review provides information on the most relevant determinants of social contacts identified in previous studies. Asking study participants to report too many characteristics of contactees imposes a burden on participants. When designing future surveys, it is therefore important to consider which characteristics may be sufficiently relevant to include as determinants. For example, collecting age of the participants and their contacts is informative, as some studies revealed that using age-related mixing patterns helped explain observed serological and infection patterns of infectious diseases like pertussis, varicella, and parvovirus-B19.[5,68,69] In addition, collecting information about location, duration, and frequency of contacts is also very essential for exploring mixing patterns and helping form effective strategies for disease prevention and control. In the case of school-aged children, a dominant number of contacts are made in school, leading to an indication that school closure can have a substantial impact on the spread of a respiratory infection.[3,20,23,70,71] Duration and frequency of contacts are important because they affect the probability of infecting another individual and if all contacts are treated equally, this may lead to wrongfully estimating the individual transmission probability.[29] Several studies found that close contacts with a duration of at least 15 minutes involving skin-to-skin touching were most predictive of the prevaccination prevalence of varicella zoster virus.[68,72] Therefore, the age of the contactee and duration of contact emerge as the most important information and should always be recorded in social contact surveys.A comparison among all surveys based on a quantity such as the average number of contacts can be problematic. The study sample serves as the first obstacle. Given different research questions or participant availability, not all the samples studied can be considered as representative of the target population. This notion is important especially because age is a relevant determinant of social contacts, and samples in which a specific age class is over-represented regardless of study design can induce a strong bias to the number of contacts measured. For example, the study reporting the largest number of contacts (70.3[23]) was performed in a secondary school in the United Kingdom. Once these caveats are taken into account, the Table can be of value in identifying all of the surveys sharing similar features when addressing a specific research question. For example, the POLYMOD survey[6] demonstrated that the main structure of social interaction among age categories was the same among several European Union countries, although the strength of the interaction could vary between countries. On the other hand, the average number of contacts measured in sub-Saharan countries by Dodd et al[41] is considerably reduced compared with the average for high-income countries. In fact, the development level can be important in determining social interactions, e.g., due to local population density or reduced school attendance.[43,44,72] Quantifying the impact of different demographic factors on social contacts would require re-analysis of the datasets on the same basis and goes beyond the scope of this review. However, this could be performed in the future as datasets of social contact surveys will be made available from researchers in a unified format.[73]This review used PubMed and Web of Knowledge for searching publications, possibly resulting in the omission of relevant publications. Nonetheless, the literature research step allowed us to recover more articles independently of a specific database, possibly recovering the ones we lost querying only two databases. Second, to the best of our knowledge, our search query failed to return the relevant articles of Leecaster et al[58] and Kwok et al,[74] which are eligible for this review. These articles are missing the words “survey”, “questionnaire”, and “diary” in the abstract and title, and therefore were missed by our searching method. The recent publication date (2016 and 2018) also prevented these articles from appearing in the references of the relevant articles. Another article that requires a specific clarification is the work of Watson et al[75] that asked participants to record whom they shared a meal with. This article was not included in our analysis, as considering a definition of social contact that is rather different from the body of this review.Since the POLYMOD survey, there has been an increasing trend in the number of social contact surveys used to collect empirical contact data. Social contact surveys have been conducted widely in many countries, but most focused on high-income countries. These surveys used a range of different study designs with different study subjects, settings, sampling scheme to study designs, data collection tools, and data collection methods. Moreover, the definition of “contact” and its characteristics also differ, making comparison of contact patterns among surveys even more difficult. Improvements towards a unified definition of “contact” and standard practice in data collection could help increase the quality of collected data, leading to more robust and reliable conclusions about contact patterns of individuals.This review demonstrates that contact surveys typically include of the order of a thousand participants, rely on convenience sampling, and use a retrospective design with paper diaries and self-reporting of contacts over a single day. Major determinants for this number of contactees include characteristics of the respondent (age, sex, and health status), time (weekday or weekend, and term time or vacation) and their immediate environment (household size and urban vs. rural). A typical number of different contactees reported per day is in the order of 20 for countrywide studies, a quantity that proved remarkably robust despite the many different study designs.From the results of this review, we formulate the following recommendations for future surveys collecting social contact data relevant for the spread of respiratory pathogens.(1) Study object: There is the need to continue the collection of social contact data, especially in low- and middle-income countries; still, in high-income countries, social contact surveys can detail interactions in epidemiologically relevant groups.(2) Random sample: Depending on the study objectives, participants should be selected as randomly as possible, so findings can be properly extrapolated to encompass target population.(3) Sampling: The sampling procedure should be described in detail, including response rates and information about at which stage and how participants can be excluded from the final sample, together with all the demographic factors considered when identifying the sample size.(4) Method: Online and paper diaries both proved to be reliable to measure the overall contact matrix, but are associated with different burden for the participant, also depending on the age.(5) Contact definition: At least two contact types should be included, one aimed to measure more casual contact and one aimed to measure physical contact.(6) Prospective versus retrospective: Prospective design should be preferred to retrospective design, since it allows respondent to remember more contact features.(7) Minimal contact information: Age and sex of the contacted person should definitely be included, as well as the duration, the frequency (intimate nature), and the location in which the contact took place.(8) Scaling information: Information on the size of the possible pool of contactees (like household or school size) proves valuable for testing general assumptions on the scaling of human interactions with the size of the population.(9) Behavioral change: Possible indicators of behavioral changes (feeling ill) should be included as well, given their large impact on disease spread.(10) Sharing data: Finally, we want to bring to attention that several datasets referred to in this review are available in a unified format (www.socialcontactata.org[73]) that is compatible with an R package for social contact analysis (Socialmixr[76]). Complying with this standard format will improve the dissemination of future surveys’ data and increase their value for the scientific community.
ACKNOWLEDGMENTS
This work received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement 682540 TransMID and 283955 DECIDE).
Authors: Joël Mossong; Niel Hens; Mark Jit; Philippe Beutels; Kari Auranen; Rafael Mikolajczyk; Marco Massari; Stefania Salmaso; Gianpaolo Scalia Tomba; Jacco Wallinga; Janneke Heijne; Malgorzata Sadkowska-Todys; Magdalena Rosinska; W John Edmunds Journal: PLoS Med Date: 2008-03-25 Impact factor: 11.069
Authors: Amy Gimma; James D Munday; Kerry L M Wong; Pietro Coletti; Kevin van Zandvoort; Kiesha Prem; Petra Klepac; G James Rubin; Sebastian Funk; W John Edmunds; Christopher I Jarvis Journal: PLoS Med Date: 2022-03-01 Impact factor: 11.069
Authors: Mélodie Monod; Alexandra Blenkinsop; Xiaoyue Xi; Daniel Hebert; Sivan Bershan; Simon Tietze; Oliver Ratmann; Marc Baguelin; Valerie C Bradley; Yu Chen; Helen Coupland; Sarah Filippi; Jonathan Ish-Horowicz; Martin McManus; Thomas Mellan; Axel Gandy; Michael Hutchinson; H Juliette T Unwin; Sabine L van Elsland; Michaela A C Vollmer; Sebastian Weber; Harrison Zhu; Anne Bezancon; Neil M Ferguson; Swapnil Mishra; Seth Flaxman; Samir Bhatt Journal: Science Date: 2021-02-02 Impact factor: 47.728
Authors: Jackie Kleynhans; Stefano Tempia; Meredith L McMorrow; Anne von Gottberg; Neil A Martinson; Kathleen Kahn; Jocelyn Moyes; Thulisa Mkhencele; Limakatso Lebina; F Xavier Gómez-Olivé; Floidy Wafawanaka; Azwifarwi Mathunjwa; Cheryl Cohen Journal: BMC Public Health Date: 2021-06-03 Impact factor: 3.295
Authors: Matthew Quaife; Kevin van Zandvoort; Amy Gimma; Kashvi Shah; Nicky McCreesh; Kiesha Prem; Edwine Barasa; Daniel Mwanga; Beth Kangwana; Jessie Pinchoff; W John Edmunds; Christopher I Jarvis; Karen Austrian Journal: BMC Med Date: 2020-10-05 Impact factor: 8.775
Authors: Andria Mousa; Peter Winskill; Oliver J Watson; Oliver Ratmann; Mélodie Monod; Marco Ajelli; Aldiouma Diallo; Peter J Dodd; Carlos G Grijalva; Moses Chapa Kiti; Anand Krishnan; Rakesh Kumar; Supriya Kumar; Kin On Kwok; Claudio F Lanata; Olivier Le Polain de Waroux; Kathy Leung; Wiriya Mahikul; Alessia Melegaro; Carl D Morrow; Joël Mossong; Eleanor Fg Neal; David J Nokes; Wirichada Pan-Ngum; Gail E Potter; Fiona M Russell; Siddhartha Saha; Jonathan D Sugimoto; Wan In Wei; Robin R Wood; Joseph T Wu; Juanjuan Zhang; Patrick Gt Walker; Charles Whittaker Journal: medRxiv Date: 2021-06-15