| Literature DB >> 29924873 |
Susan C Weller1, Ben Vickers1, H Russell Bernard2, Alyssa M Blackburn3, Stephen Borgatti4, Clarence C Gravlee5, Jeffrey C Johnson5.
Abstract
Sample size determination for open-ended questions or qualitative interviews relies primarily on custom and finding the point where little new information is obtained (thematic saturation). Here, we propose and test a refined definition of saturation as obtaining the most salient items in a set of qualitative interviews (where items can be material things or concepts, depending on the topic of study) rather than attempting to obtain all the items. Salient items have higher prevalence and are more culturally important. To do this, we explore saturation, salience, sample size, and domain size in 28 sets of interviews in which respondents were asked to list all the things they could think of in one of 18 topical domains. The domains-like kinds of fruits (highly bounded) and things that mothers do (unbounded)-varied greatly in size. The datasets comprise 20-99 interviews each (1,147 total interviews). When saturation was defined as the point where less than one new item per person would be expected, the median sample size for reaching saturation was 75 (range = 15-194). Thematic saturation was, as expected, related to domain size. It was also related to the amount of information contributed by each respondent but, unexpectedly, was reached more quickly when respondents contributed less information. In contrast, a greater amount of information per person increased the retrieval of salient items. Even small samples (n = 10) produced 95% of the most salient ideas with exhaustive listing, but only 53% of those items were captured with limited responses per person (three). For most domains, item salience appeared to be a more useful concept for thinking about sample size adequacy than finding the point of thematic saturation. Thus, we advance the concept of saturation in salience and emphasize probing to increase the amount of information collected per respondent to increase sample efficiency.Entities:
Mesh:
Year: 2018 PMID: 29924873 PMCID: PMC6010234 DOI: 10.1371/journal.pone.0198606
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The examples.
| DOMAIN | INTERVIEW MODE (SAMPLE SIZE) | QUESTION ASKED | THE MOST FREQUENTLY MENTIONED ITEMS (Prevalence) |
|---|---|---|---|
| 1. Fruits (Brewer et al. 2002) | oral ( | “Think of all the different kinds of fruit people eat. Tell me the names of all the kinds of fruit you can remember. Please keep trying to recall if you think there are more kinds of fruit you might be able to remember.” | #1/Apple (0.97), #47/Orange (0.97), #4/Banana (0.91) |
| 2. Birds (Brewer 1995) | written ( | “What are all the kinds of birds? Please write down the names of all the birds you can think of.” [in 10 minutes] | Eagle (0.83), Crow (0.83), Hummingbird (0.81). |
| 3. Flowers (Brewer 1995) | written ( | “What are all the kinds of flowers? Please write down the names of all the flowers you can think of.” [in 10 minutes] | Rose (1.0), Carnation (0.98), Tulip (0.80), Daisy (0.80). |
| 4. Drugs (Brewer et al. 2002) | oral ( | “Think of all the different kinds of drugs or substances people use to get high, feel good, or think and feel differently. These drugs are sometimes called recreational drugs or street drugs. Tell me the names of all the kinds of these drugs you can remember. Please keep trying to recall if you think there are more kinds of drugs you might be able to remember. | #50/Heroin (0.93), #18/Cocaine (0.93), #59/Marijuana (0.93). |
| 5. Fabrics (Brewer 1995) | ( | “What are all the kinds of fabrics? Please write down the names of all the fabrics you can think of.” [in 10 minutes] | Cotton (1.0), Polyester (0.98), Silk (0.94), Wool (0.94) |
| 6. Illnesses-US (Weller 1983) | face-to-face ( | “Tell me all the illnesses you know of.” [Nonspecific and semantic prompts] | Cancer (0.75), Measles (0.65), Mumps (0.65). |
| 7. Illnesses-G (Guatemala) (Weller 1983) | face-to-face ( | “Puede Ud. decirme todas las enfermedades que conozca o que recuerde, por favor?” [Nonspecific and semantic prompts] | Sarampion (0.75), Varicela (0.60), Gripe (0.55), Amigdalitis (0.55) |
| 8. Sodas (Weller, n.d.) | written ( | “Please write down all the names for sodas or soda pops that you can think of. You have 3 minutes.” | Coca Cola (1.0), Pepsi (0.96), and Sprite (0.93). |
| 9. Holiday1 (Johnson, n.d.) | written ( | “Write down all the holidays you can think of.” | Christmas (0.88), Memorial Day (0.83), July 4th (0.83). |
| 10. Holiday2 (Johnson, n.d.) | written ( | “Write down all the holidays you can think of.” | Christmas (1.0), Memorial Day (1.0), Thanksgiving (0.96). |
| 11. LivingRoom (Johnson, n.d.) | written ( | “List all the things you would find in an American living room.” | Couch (0.91), TV (0.88), Coffee table (0.76) |
| 12. GoodLeader (Johnson, n.d.) | written ( | “What are the characteristics of a good team leader?” [in 5 min] | Good listener (0.33), Communicator (0.28), Good example (0.22) |
| 13. GoodTLeader (Johnson, n.d.) | written ( | “What are the characteristics of a good team leader?” [in 5 min] | Listener (0.45), Decisive (0.39), Honest (0.29), Respects others (0.29) |
| 14. GoodTeam1 (Johnson, n.d.) | written ( | “What are the characteristics of a good team? | Goals (0.50), Cooperation working together (0.44), Respect (0.31) |
| 15. GoodTeam3 (Johnson, n.d.) | written ( | “What are the characteristics of a good team? | Goals (0.45), Communication (0.39), Open-communication (0.39) |
| 16. GoodTeam2 Player (Johnson, n.d.) | written ( | “What are the characteristics of a good team player?” | Cooperative (0.31), Understands role (0.28), Respectful (0.22), Listens (0.22) |
| 17. Bad words (Borgatti, n.d.) | written ( | “What are all the bad words you can think of?” | Shit (0.90), Fuck (0.85), Asshole (0.73), Bitch (0.73). |
| 18. Industries1 (Borgatti, n.d.) | written ( | “What are all the industries you can think of?” | Automobile (0.70), Construction (0.67), Banking (0.63), Entertainment (0.63). |
| 19. Industries2 (Borgatti, n.d.) | written ( | “List all of the industries you can think of.” | Automobile (0.77), Health (0.63), Banking (0.60). |
| 20. CultInd1 (Borgatti, n.d.) | written ( | “Please list all of the cultural industries you can think of.” | Museum (0.39), Television (0.34), Music (0.30). |
| 21. CultInd2 (Borgatti, n.d.) | written ( | “Please list all of the cultural industries you can think of.” | Museum (0.34), School (0.31), Dance (0.24). |
| 22. ScaryThings (Borgatti, n.d.) | written ( | “List all the scary things you can think of.” | Death (0.79), Heights (0.62), The dark (0.61). |
| 23. Moms-OL (Gravlee et al., 2013) | online ( | “We’re studying things that mothers do, so, with this in mind, please start by listing all the things that mothers do. There’s no limit and there are no right or wrong answers, so take your time.” [Semantic prompting] | Cooking (0.56), Loving (0.51), Household Cleaning (0.44). |
| 24. Moms-F2F (Gravlee et al., 2013) | face-to-face ( | “Please list the all the things that mothers do. List as many things as you can think of. There’s no limit, so take your time.” [Semantic prompting] | Cooking (0.74), Working (0.58), Household Cleaning (0.58). |
| 25. Moms-PP (Gravlee et al. 2013) | paper and pen ( | “Please list the things that mothers do. List as many things as you can think of. There’s no limit, so take your time. Write each thing one below the other.” [Semantic prompting] | Cooking (0.74), Loving (0.60), Household Cleaning (0.53). |
| 26. Ethnic-OL (Gravlee et al. 2013) | online ( | “We’re studying the names of racial and ethnic groups, so, with this in mind, please start by listing all the racial and ethnic groups you can think of. There’s no limit and there are no right or wrong answers, so take your time.” [Semantic prompting] | White (0.79), Hispanic (0.68), Indian (0.64). |
| 27. Ethnic-F2F (Gravlee et al. 2013) | face-to-face ( | “Please list as many racial and ethnic groups as you can think of. There’s no limit, so take your time. [Semantic prompting] | Chinese (0.88), Asian (0.88), Japanese (0.85). |
| 28. Ethnic-PP (Gravlee et al. 2013) | paper and pen ( | “Please list as many racial and ethnic groups as you can think of. There’s no limit, so take your time. Write each group one below the other.” [Semantic prompting] | White (0.83), Hispanic (0.68), Asian (0.66). |
Estimated point of saturation and domain size.
| Example | Mean # Responses (range) | Total Unique Items | NSAT (Y<1.0) | Domain Size at Y<1.0 (DSAT) | Est. Total Domain Size (DTOT) | Capture-Recapture Pop Size | |
|---|---|---|---|---|---|---|---|
| Fruits | 33 | 22.1 (12–43) | 62 | 15 | 48 | 53 | 73c |
| Birds | 36 | 25.9 (10–41) | 121 | 35 | 114 | 130 | 203c |
| Flowers | 41 | 16.1 (7–42) | 141 | 33 | 130 | 143 | 251c |
| Drugs | 43 | 13.6 (3–33) | 92 | 33 | 84 | 101 | 115c |
| Fabrics | 63 | 15.0 (5–28) | 143 | 75 | 155 | 210 | 753c |
| Illnesses-US | 20 | 16.1 (6–31) | 144 | 53 | 217 | 237 | 758g |
| Illnesses-G | 20 | 12.3 (4–28) | 86 | 21 | 82 | 90 | 281g |
| Sodas | 28 | 16.3 (7–27) | 108 | 42 | 127 | 147 | 210c |
| Holiday1 | 24 | 13.0 (5–29) | 62 | 17 | 48 | 57 | 100c |
| Holiday2 | 23 | 17.8 (8–39) | 90 | 87 | 209 | 263 | 200c |
| LivingRoom | 33 | 12.9 (6–24) | 107 | 41 | 115 | 137 | 210c |
| GoodLeader | 36 | 6.6 (3–15) | 151 | 78 | 221 | 261 | 1165g |
| GoodTLeader | 31 | 9.7 (3–19) | 141 | 102 | 280 | 336 | 619g |
| GoodTeam1 | 36 | 6.4 (2–13) | 136 | 96p | 239 | 297 | 1615g |
| GoodTeam3 | 31 | 9.0 (1–18) | 135 | 88 | 240 | 289 | 607g |
| GoodTeam2 Player | 36 | 5.8 (2–14) | 136 | 189p | 428 | 555 | 1403g |
| BadWords | 92 | 15.8 (3–35) | 273 | 119 | 302 | 372 | 867c |
| Industries1 | 27 | 34.5 (7–69) | 413 | 138 | 875 | 919 | 1189c |
| Industries2 | 43 | 34.9 (7–67) | 510 | 194 | 1039 | 1108 | 1089c |
| CultInd1 | 44 | 12.2 (3–30) | 299 | 50nbi | 308 | 310 | 2224g |
| CultInd2 | 29 | 9.7 (1–26) | 203 | 50 | 239 | 256 | 2478g |
| ScaryThings | 99 | 16.9 (3–36) | 453 | 177 | 577 | 662 | 1213c |
| MomsOL | 55 | 17.8 (3–53) | 389 | 104 | 477 | 516 | 881c |
| MomsF2F | 50 | 30.2 (9–75) | 560 | 161 | 896 | 951 | 1631c |
| MomsP&P | 53 | 16.6 (5–51) | 337 | 109 | 443 | 488 | 738c |
| EthnicOL | 56 | 24.6 (5–90) | 304 | 70 | 312 | 338 | 615c |
| EthnicF2F | 48 | 33.3 (12–106) | 339 | 91 | 415 | 449 | 684c |
| EthnicPP | 53 | 17.9 (3–62) | 228 | 58 | 234 | 257 | 521c |
nbi = Negative binomial-identity, p = Poisson-log ; c = Chao’s Lower bound; g = gamma
Fig 1The number of unique items provided with increasing sample size.
Comparison of number of unique items obtained with full free lists and with three or fewer responses.
| Example | Mean # Responses | Total Unique Items Obtained | Free-list Unique Items | Three-list Unique Items | Three- list NSat (Y<1.0) | |||
|---|---|---|---|---|---|---|---|---|
| Fruits | 33 | 22.1 | 62 | 51 | 59 | 15 | 15 | 9 |
| Birds | 36 | 25.9 | 121 | 85 | 92 | 21 | 28 | 15 |
| Flowers | 41 | 16.1 | 141 | 92 | 113 | 15 | 21 | 11 |
| Drugs | 43 | 13.6 | 92 | 42 | 65 | 11 | 16 | 8 |
| Fabrics | 63 | 15 | 143 | 52 | 71 | 12 | 16 | 4 |
| Illnesses-US | 20 | 16.1 | 144 | 91 | 144 | 21 | 34 | 17 |
| Illnesses-G | 20 | 12.3 | 86 | 67 | 86 | 16 | 26 | 16 |
| Sodas | 28 | 16.3 | 108 | 53 | 91 | 15 | 20 | 10 |
| Holiday1 | 24 | 13 | 62 | 54 | 57 | 14 | 19 | 9 |
| Holiday2 | 23 | 17.8 | 90 | 44 | 76 | 12 | 17 | 9 |
| Living Room | 33 | 12.9 | 107 | 48 | 81 | 10 | 23 | 14 |
| Good Leader | 36 | 6.6 | 151 | 62 | 98 | 25 | 41 | 134 |
| GoodTLeader | 31 | 9.7 | 141 | 59 | 98 | 23 | 41 | 29 |
| Good Team1 | 36 | 6.4 | 136 | 47 | 87 | 23 | 36 | 46 |
| Good Team3 | 31 | 9 | 135 | 58 | 93 | 21 | 42 | 32 |
| Good Team2 Player | 36 | 5.8 | 136 | 41 | 81 | 24 | 45 | 49 |
| BadWords | 92 | 15.8 | 273 | 68 | 113 | 14 | 21 | 10 |
| Industries1 | 27 | 34.5 | 413 | 184 | 319 | 30 | 46 | 29 |
| Industries2 | 43 | 34.9 | 510 | 166 | 281 | 29 | 44 | 37 |
| CultInd1 | 44 | 12.2 | 299 | 106 | 163 | 26 | 47 | 55 |
| CultInd2 | 29 | 9.7 | 203 | 106 | 175 | 29 | 49 | 40 |
| Scary Things | 99 | 16.9 | 453 | 102 | 153 | 18 | 34 | 47 |
| MomsOL | 55 | 17.8 | 389 | 144 | 221 | 20 | 29 | 35 |
| MomsF2F | 50 | 30.2 | 560 | 193 | 279 | 17 | 25 | 20 |
| MomsPP | 53 | 16.6 | 337 | 115 | 191 | 21 | 33 | 30 |
| EthnicOL | 56 | 24.6 | 304 | 131 | 189 | 18 | 24 | 15 |
| EthnicF2F | 48 | 33.3 | 339 | 130 | 211 | 11 | 21 | 12 |
| EthnicPP | 53 | 17.9 | 228 | 80 | 137 | 12 | 20 | 11 |
Fig 2The number of unique items provided with increasing sample size when there are three or fewer responses per person.
Capture of salient items with full free list and with three or fewer responses.
| # Free-List Salient Items (≥ 20%, n = 20) | Free-list | Three or Fewer Responses | |||
|---|---|---|---|---|---|
| Fruits | 38 | 100% | 36.80% | 36.80% | 36.80% |
| Birds | 46 | 100% | 50.00% | 47.80% | 37.00% |
| Flowers | 31 | 100% | 54.80% | 48.40% | 41.90% |
| Drugs | 21 | 100% | 61.90% | 42.90% | 42.90% |
| Fabrics | 26 | 100% | 53.8% | 53.8% | 42.3% |
| Illnesses-US | 22 | 95.50% | 81.8% | 77.3% | 54.5% |
| Illnesses-G | 21 | 100% | 52.4% | 47.6% | 47.6% |
| Sodas | 23 | 100% | 69.6% | 65.2% | 56.5% |
| Holiday1 | 20 | 100% | 65.0% | 60.0% | 50.0% |
| Holiday2 | 22 | 90.90% | 72.7% | 68.2% | 54.5% |
| LivingRoom | 19 | 100% | 73.7% | 57.9% | 47.4% |
| GoodLeader | 4 | 100% | 100% | 100% | 100% |
| GoodTLeader | 8 | 100% | 87.5% | 75.0% | 62.5% |
| GoodTeam1 | 6 | 100% | 100% | 100% | 83.3% |
| GoodTeam3 | 6 | 100% | 83.3% | 66.7% | 66.7% |
| GoodTeam2 Player | 4 | 100% | 100% | 100% | 100% |
| BadWords | 25 | 100% | 56.0% | 56.0% | 44.0% |
| Industries1 | 48 | 97.90% | 50.0% | 47.9% | 37.5% |
| Industries2 | 49 | 98.00% | 53.1% | 49.0% | 38.8% |
| CultInd1 | 23 | 95.70% | 87.0% | 73.9% | 65.2% |
| CultInd2 | 5 | 80.00% | 100% | 80.0% | 60.0% |
| ScaryThings | 11 | 100% | 81.8% | 72.7% | 63.6% |
| MomsOL | 20 | 95.00% | 80.0% | 70.0% | 65.0% |
| MomsF2F | 31 | 96.80% | 51.6% | 51.6% | 41.9% |
| MomsPP | 18 | 100% | 83.3% | 72.2% | 61.1% |
| EthnicOL | 37 | 97.30% | 51.4% | 43.2% | 37.8% |
| EthnicF2F | 52 | 100% | 30.8% | 28.8% | 19.2% |
| EthnicPP | 40 | 97.50% | 35.0% | 32.5% | 20.0% |