Viet-Thi Tran1, Raphael Porcher2, Bruno Falissard3, Philippe Ravaud4. 1. Department of General Medicine, Paris Diderot University, 16 rue Henri Huchard, Paris 75018, France; METHODS Team, Centre de Recherche en Epidémiologie et Statistiques (CRESS), INSERM U1153, 1 Place du Parvis Notre Dame, Paris 75004, France; Centre d'Épidémiologie Clinique, Hôpital Hôtel-Dieu, Assistance Publique-Hôpitaux de Paris, 1 Place du Parvis Notre Dame, Paris 75004, France. Electronic address: thi.tran-viet@htd.aphp.fr. 2. METHODS Team, Centre de Recherche en Epidémiologie et Statistiques (CRESS), INSERM U1153, 1 Place du Parvis Notre Dame, Paris 75004, France; Centre d'Épidémiologie Clinique, Hôpital Hôtel-Dieu, Assistance Publique-Hôpitaux de Paris, 1 Place du Parvis Notre Dame, Paris 75004, France; Paris Descartes University, 12 Rue de l'École de Médecine, Paris 75006, France. 3. Paris Sud University, 15 Rue Georges Clemenceau, Orsay 91400, France; Centre de Recherche en Epidémiologie et Santé des populations (CESP), INSERM U1018, 16, avenue Paul Vaillant-Couturier, Villejuif 94807, France. 4. METHODS Team, Centre de Recherche en Epidémiologie et Statistiques (CRESS), INSERM U1153, 1 Place du Parvis Notre Dame, Paris 75004, France; Centre d'Épidémiologie Clinique, Hôpital Hôtel-Dieu, Assistance Publique-Hôpitaux de Paris, 1 Place du Parvis Notre Dame, Paris 75004, France; Paris Descartes University, 12 Rue de l'École de Médecine, Paris 75006, France; Department of Epidemiology, Columbia University Mailman School of Public Health, 116th St & Broadway, New York, NY, USA.
Abstract
OBJECTIVE: To describe methods to determine sample sizes in surveys using open-ended questions and to assess how resampling methods can be used to determine data saturation in these surveys. STUDY DESIGN AND SETTING: We searched the literature for surveys with open-ended questions and assessed the methods used to determine sample size in 100 studies selected at random. Then, we used Monte Carlo simulations on data from a previous study on the burden of treatment to assess the probability of identifying new themes as a function of the number of patients recruited. RESULTS: In the literature, 85% of researchers used a convenience sample, with a median size of 167 participants (interquartile range [IQR] = 69-406). In our simulation study, the probability of identifying at least one new theme for the next included subject was 32%, 24%, and 12% after the inclusion of 30, 50, and 100 subjects, respectively. The inclusion of 150 participants at random resulted in the identification of 92% themes (IQR = 91-93%) identified in the original study. CONCLUSION: In our study, data saturation was most certainly reached for samples >150 participants. Our method may be used to determine when to continue the study to find new themes or stop because of futility.
OBJECTIVE: To describe methods to determine sample sizes in surveys using open-ended questions and to assess how resampling methods can be used to determine data saturation in these surveys. STUDY DESIGN AND SETTING: We searched the literature for surveys with open-ended questions and assessed the methods used to determine sample size in 100 studies selected at random. Then, we used Monte Carlo simulations on data from a previous study on the burden of treatment to assess the probability of identifying new themes as a function of the number of patients recruited. RESULTS: In the literature, 85% of researchers used a convenience sample, with a median size of 167 participants (interquartile range [IQR] = 69-406). In our simulation study, the probability of identifying at least one new theme for the next included subject was 32%, 24%, and 12% after the inclusion of 30, 50, and 100 subjects, respectively. The inclusion of 150 participants at random resulted in the identification of 92% themes (IQR = 91-93%) identified in the original study. CONCLUSION: In our study, data saturation was most certainly reached for samples >150 participants. Our method may be used to determine when to continue the study to find new themes or stop because of futility.
Authors: Heather King; Mackenzie Magnus; Larry V Hedges; Chris Cyr; Deborah Young-Hyman; Laura Kettel Khan; Lori A J Scott-Sheldon; Jason A Saul; Sonia Arteaga; John Cawley; Christina D Economos; Debra Haire-Joshu; Christine M Hunter; Bruce Y Lee; Shiriki K Kumanyika; Lorrene D Ritchie; Thomas N Robinson; Marlene B Schwartz Journal: Child Obes Date: 2020-09 Impact factor: 2.992
Authors: Rochelle Embling; Aimee E Pink; Michelle D Lee; Menna Price; Laura L Wilkinson Journal: BMC Public Health Date: 2020-09-24 Impact factor: 3.295