| Literature DB >> 26060991 |
Michael F Schober1, Frederick G Conrad2, Christopher Antoun3, Patrick Ehlen4, Stefanie Fail1, Andrew L Hupp3, Michael Johnston5, Lucas Vickers6, H Yanna Yan3, Chan Zhang7.
Abstract
As people increasingly communicate via asynchronous non-spoken modes on mobile devices, particularly text messaging (e.g., SMS), longstanding assumptions and practices of social measurement via telephone survey interviewing are being challenged. In the study reported here, 634 people who had agreed to participate in an interview on their iPhone were randomly assigned to answer 32 questions from US social surveys via text messaging or speech, administered either by a human interviewer or by an automated interviewing system. 10 interviewers from the University of Michigan Survey Research Center administered voice and text interviews; automated systems launched parallel text and voice interviews at the same time as the human interviews were launched. The key question was how the interview mode affected the quality of the response data, in particular the precision of numerical answers (how many were not rounded), variation in answers to multiple questions with the same response scale (differentiation), and disclosure of socially undesirable information. Texting led to higher quality data-fewer rounded numerical answers, more differentiated answers to a battery of questions, and more disclosure of sensitive information-than voice interviews, both with human and automated interviewers. Text respondents also reported a strong preference for future interviews by text. The findings suggest that people interviewed on mobile devices at a time and place that is convenient for them, even when they are multitasking, can give more trustworthy and accurate answers than those in more traditional spoken interviews. The findings also suggest that answers from text interviews, when aggregated across a sample, can tell a different story about a population than answers from voice interviews, potentially altering the policy implications from a survey.Entities:
Mesh:
Year: 2015 PMID: 26060991 PMCID: PMC4465184 DOI: 10.1371/journal.pone.0128337
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Voice vs. text on smartphones.
| Property | Voice | Text |
|---|---|---|
| Synchrony | Fully synchronous | Less or asynchronous |
| Medium | Auditory | Visual |
| Language | Spoken/heard | Written/read |
| Conversational structure | Turn-by-turn, with potential for simultaneous speech | Turn-by-turn, rarely but possibly out-of-sequence |
| Persistence of turn | No | Yes |
| Persistence of entire conversation | No | Yes, threaded |
| Social presence of partner | Continuous (auditory) presence | Intermittent evidence from content of texts; no additional evidence between texts |
| Nonverbal cues of emotional state and intentions | Always present: speech always has pitch and timing | Only present if added by sender through words (e.g., "LOL"), orthography (e.g., capital letters, punctuation) or emoticons (e.g., ☺) |
| Character of multitasking | Often simultaneous, especially when hands free, unless other task involves talking | Switching often required between texting and other tasks |
| Impact of environmental conditions | Potential (auditory) interference from ambient noise | Potential (visual) interference from visual glare |
| Impact of nearby others | Others may hear speaker’s side of conversation; potential audio interference from others’ talk | Others unlikely to see conversation on screen, though possible |
| Required connection to network (cellular or wifi) | Must be continuous | Can be intermittent |
Fig 1Experimental design and procedure.
Fig 2Procedures.
Participation, response rates and break-off rates.
| Mode | Invitations | Started Interview | Completed Interview | ResponseRate | Break-off Rate | Completed Online Debriefing |
|---|---|---|---|---|---|---|
|
| 316 | 169 | 164 |
| 2.96% | 160 |
|
| 414 | 187 | 162 |
| 13.37% | 159 |
|
| 227 | 176 | 163 |
| 7.39% | 158 |
|
| 325 | 185 | 159 |
| 14.05% | 157 |
| Total | 1282 | 717 | 648 |
| 9.62% | 634 |
aThe response rate (known as AAPOR RR1 [29]) is calculated as the number of complete interviews divided by the number of invitations.
bThe break-off rate is calculated as the number of people who dropped off during the survey divided by the number of people who started.
Respondent demographics across four modes.
| Demographic variables | Voice | Text | ||||||
|---|---|---|---|---|---|---|---|---|
| Human | Automated | Human | Automated | Chi-square statistics | Degrees of freedom |
| ||
| Age | 7.833 | 12 | 0.798 | |||||
|
| 56.30% | 62.70% | 57.00% | 64.70% | ||||
|
| 15.00% | 13.90% | 17.10% | 10.30% | ||||
|
| 12.50% | 12.00% | 12.70% | 11.50% | ||||
|
| 13.10% | 10.10% | 9.50% | 9.60% | ||||
|
| 3.10% | 1.30% | 3.80% | 3.90% | ||||
| Education | 7.275 | 12 | 0.839 | |||||
|
| 7.50% | 5.70% | 8.90% | 7.60% | ||||
|
| 28.80% | 28.30% | 24.10% | 28.70% | ||||
|
| 11.90% | 8.10% | 9.50% | 13.40% | ||||
|
| 28.10% | 37.10% | 33.50% | 29.90% | ||||
|
| 23.80% | 20.10% | 24.10% | 20.40% | ||||
| Income | 10.861 | 12 | 0.541 | |||||
|
| 17.50% | 18.20% | 13.90% | 16.60% | ||||
|
| 21.90% | 25.80% | 24.70% | 29.90% | ||||
|
| 16.30% | 17.60% | 22.80% | 18.50% | ||||
|
| 31.90% | 32.10% | 30.40% | 29.30% | ||||
|
| 12.50% | 6.30% | 8.20% | 5.70% | ||||
| Gender | 2.533 | 3 | 0.469 | |||||
|
| 41.90% | 49.70% | 49.40% | 47.80% | ||||
|
| 58.10% | 50.30% | 50.60% | 52.20% | ||||
| Hispanic origin | 3.896 | 3 | 0.273 | |||||
|
| 88.10% | 93.70% | 90.50% | 93.00% | ||||
|
| 11.90% | 6.30% | 9.50% | 7.00% | ||||
| Race | 5.367 | 9 | 0.801 | |||||
|
| 76.90% | 81.80% | 76.60% | 79.00% | ||||
|
| 9.40% | 3.80% | 7.00% | 5.70% | ||||
|
| 5.60% | 6.90% | 7.00% | 5.70% | ||||
|
| 8.10% | 7.60% | 9.50% | 9.60% | ||||
| Carrier | 3.731 | 6 | 0.713 | |||||
|
| 60.00% | 62.30% | 64.60% | 66.90% | ||||
|
| 10.00% | 7.60% | 10.10% | 10.20% | ||||
|
| 30.00% | 30.20% | 25.30% | 22.90% | ||||
a Age values are missing for two respondents (n = 632 instead of 634).
Fig 3Data quality across the four modes: (A) rounding, (B) straightlining, and (C) disclosure.
Fig 4Rounding effects for each question.
Fig 5Disclosure effects for each question.
Fig 6Multitasking across the four modes.
Fig 7Interview duration and median number of turns per survey question.
These timelines display the median duration of question-answer sequences with the median number of turns after each question.