Hossein Mohammadhassanzadeh, Ingrid Sketris, Robyn Traynor, Susan Alexander, Brandace Winquist, Samuel Alan Stewart.
Abstract
BACKGROUND: Isotretinoin, used to treat cystic acne, increases the risk of miscarriage and fetal abnormalities when taken during pregnancy. The Health Canada-approved product monograph for isotretinoin includes pregnancy prevention guidelines. A recent study by the Canadian Network for Observational Drug Effect Studies (CNODES) on the occurrence of pregnancy and pregnancy outcomes during isotretinoin therapy estimated poor adherence to these guidelines. Media uptake of this study was unknown; awareness of this uptake could help improve drug safety communication.
Keywords: knowledge translation; mass media; natural language processing; pharmacoepidemiology; readability
Year: 2020 PMID: 31934872 PMCID: PMC6996767 DOI: 10.2196/13296
Source DB: PubMed Journal: JMIR Form Res ISSN: 2561-326X
Figure 1. Methodology schematic for our study. CMAJ: Canadian Medical Association Journal; CNODES: Canadian Network for Observational Drug Effect Studies; TF-IDF: term frequency-inverse document frequency.
Readability formulas. C: number of characters; D: number of complex words; E: number of easy (non-complex) words; P: number of polysyllables; S: number of sentences; W: number of words; Y: number of syllables; AC: average number of characters per 100 words; AS: average number of sentences per 100 words.
| Readability score | Score type | Key statistical features | Formula |
| Flesch Reading Ease (FRES) | Numeric score (0-100) | Word length and sentence length | FRES = 206.83 - 1.015 × (W/S) - 84.6 × (Y/W) |
| Flesch-Kincaid Grade (FKRA) | US grade level | Word length and sentence length | FKRA = 0.39 × (W/S) + 11.8 × (Y/W) - 15.59 |
| Gunning Fog Index (FOG) | US grade level | Number of complex words | FOG = 0.4 × [(W/S) + 100 × (D/W)] |
| Simple Measure of Gobbledygook (SMOG) Index | US grade level | Number of complex words | SMOG = 1.0430 × √(P × 30/S) + 3.1291 |
| Automated Readability Index (ARI) | US grade level | Number of characters | ARI = 4.71 × (C/W) + 0.5 × (W/S) - 21.43 |
| Coleman Liau Index (CLI) | US grade level^a | Number of characters | CLI = 0.0588 × AC - 0.296 × AS - 15.8 |
| Linsear Write Index (LWI) | US grade level | Sentence length, number of polysyllables | (1) Take a 100-word sample of the text; (2) Calculate Val = [E + (3 × D)]/S; (3) If Val > 20, then LWI = Val/2; (4) If Val ≤ 20, then LWI = (Val - 2)/2 |
| Dale-Chall Readability Score (DCRS) | Numeric score (0-9.9) | Number of difficult words | DCRS = 0.1579 × (100 × D/W) + 0.0496 × (W/S) |
| Text Standard | US grade level | A voting system among the other metrics | The reading level that is most prevalent (the mode) among the other metrics calculated |
^a Grade level may also be understood as the number of years of formal education needed to understand a given text, particularly when the level exceeds the typical range of US grades (e.g., 1-12). For example, grades 13-16 suggest undergraduate training, 17-18 graduate training, and 19+ professional qualification.[63,67]
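As a rough illustration of how the formulas in the table turn raw counts into scores, the sketch below implements several of them in Python. It assumes the counts (W, S, Y, C, D, P, E) have already been obtained by some tokenizer and syllable counter, which is not shown; the function names are hypothetical and not taken from the study. The Flesch-Kincaid and Coleman-Liau functions follow the standard published forms (a plus sign on the syllable term and a minus sign on the sentence term, respectively).

```python
import math

def flesch_reading_ease(W, S, Y):
    # Constant as given in the table (often quoted as 206.835 elsewhere).
    return 206.83 - 1.015 * (W / S) - 84.6 * (Y / W)

def flesch_kincaid_grade(W, S, Y):
    return 0.39 * (W / S) + 11.8 * (Y / W) - 15.59

def gunning_fog(W, S, D):
    return 0.4 * ((W / S) + 100 * (D / W))

def smog_index(P, S):
    return 1.0430 * math.sqrt(P * 30 / S) + 3.1291

def automated_readability_index(C, W, S):
    return 4.71 * (C / W) + 0.5 * (W / S) - 21.43

def coleman_liau_index(C, W, S):
    AC = 100 * C / W  # average characters per 100 words
    AS = 100 * S / W  # average sentences per 100 words
    return 0.0588 * AC - 0.296 * AS - 15.8

def linsear_write_index(E, D, S):
    # E and D are counted on a 100-word sample, as in the table.
    val = (E + 3 * D) / S
    return val / 2 if val > 20 else (val - 2) / 2

# Illustrative (made-up) counts for a 100-word sample with 5 sentences.
counts = dict(W=100, S=5, Y=150, C=500, D=10, P=10, E=90)
print(round(flesch_kincaid_grade(counts["W"], counts["S"], counts["Y"]), 2))
```

With these made-up counts, a passage averaging 20 words per sentence and 1.5 syllables per word lands at roughly a 10th-grade level on Flesch-Kincaid; real scores depend heavily on how syllables and sentences are counted.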
Top 10 most frequent vocabulary terms (1-grams and 2-grams).
| 1-grams | Frequency | Ratio | 2-grams | Frequency | Ratio |
| pregnanc | 344 | 0.031 | pregnanc prevent | 74 | 0.007 |
| isotretinoin | 306 | 0.027 | birth defect | 63 | 0.006 |
| studi | 245 | 0.022 | birth control | 48 | 0.004 |
| drug | 226 | 0.020 | pregnanc test | 40 | 0.004 |
| women | 188 | 0.017 | women take | 39 | 0.003 |
| us | 165 | 0.015 | prevent program | 37 | 0.003 |
| birth | 163 | 0.014 | British Columbia | 35 | 0.003 |
| research | 135 | 0.012 | live birth | 33 | 0.003 |
| treatment | 123 | 0.011 | pregnanc rate | 33 | 0.003 |
| acn | 118 | 0.010 | isotretinoin user | 31 | 0.003 |
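A frequency table like the one above can be sketched with a simple counter over stemmed tokens. This is a minimal illustration, not the study's pipeline: it assumes the text has already been tokenized, stop-word filtered, and stemmed, and it assumes the Ratio column is the term frequency divided by the total number of tokens. The helper names `ngram_counts` and `top_terms` are hypothetical.

```python
from collections import Counter

def ngram_counts(tokens, n):
    """Count contiguous n-grams in a list of (already stemmed) tokens."""
    return Counter(" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def top_terms(tokens, n, k=10):
    """Return the k most frequent n-grams as (term, frequency, ratio) tuples."""
    total = len(tokens)  # assumed denominator for the Ratio column
    counts = ngram_counts(tokens, n)
    return [(term, freq, round(freq / total, 3)) for term, freq in counts.most_common(k)]

# Tiny made-up token stream for illustration only.
tokens = ["pregnanc", "prevent", "program", "pregnanc", "test", "pregnanc", "prevent"]
print(top_terms(tokens, 1, k=2))
print(top_terms(tokens, 2, k=1))
```

In practice, 2-grams would be counted within each article separately and then summed, so that the last word of one article is not paired with the first word of the next.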
Figure 2. Cosine similarity values (between 0 and 1) between the media articles and CNODES publications, including the CMAJ article, podcast, and press release, using TF-IDF calculations. Resulting dendrogram of hierarchical agglomerative clustering. Three clusters and 2 singletons, resulting from a cutoff point of 0.5. CMAJ: Canadian Medical Association Journal; CNODES: Canadian Network for Observational Drug Effect Studies; TF-IDF: term frequency-inverse document frequency.
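The grouping described in the Figure 2 caption (clusters plus singletons at a cutoff of 0.5) can be illustrated with a small agglomerative procedure. The sketch below is an assumption-laden example: it uses average linkage and treats the cutoff as a distance threshold on 1 minus cosine similarity, neither of which is confirmed by the caption. The function name and the similarity matrix are made up.

```python
def agglomerative_clusters(sim, cutoff=0.5):
    """Average-linkage clustering on distances (1 - similarity).

    Merging stops once the closest pair of clusters is farther apart
    than `cutoff` (assumed here to be a distance threshold).
    """
    clusters = [[i] for i in range(len(sim))]

    def avg_distance(a, b):
        return sum(1.0 - sim[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > 1:
        # Find the closest pair of current clusters.
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                d = avg_distance(clusters[x], clusters[y])
                if best is None or d < best[0]:
                    best = (d, x, y)
        d, x, y = best
        if d > cutoff:
            break
        merged = clusters[x] + clusters[y]
        clusters = [c for k, c in enumerate(clusters) if k not in (x, y)] + [merged]
    return sorted(sorted(c) for c in clusters)

# Made-up similarity matrix: documents 0-2 resemble each other, 3 and 4 do not.
sim = [
    [1.0, 0.9, 0.8, 0.1, 0.2],
    [0.9, 1.0, 0.7, 0.2, 0.1],
    [0.8, 0.7, 1.0, 0.1, 0.1],
    [0.1, 0.2, 0.1, 1.0, 0.3],
    [0.2, 0.1, 0.1, 0.3, 1.0],
]
print(agglomerative_clusters(sim))
```

Documents that never come within the cutoff of any cluster remain as singletons, which is how isolated articles such as the two singletons in the figure would arise under these assumptions.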
Figure 3. Trend of similarity (cosine similarity) between the media articles and the CNODES publications: CMAJ article, podcast, and press release.
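To make the similarity measure concrete, the sketch below computes TF-IDF weights and the cosine similarity between a reference document and two articles in plain Python. It is a minimal illustration under stated assumptions: documents are lists of already stemmed, non-empty token lists, and the IDF uses a smoothed variant, which may differ from the weighting used in the study. All names and example tokens are hypothetical.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one {term: weight} dict per document (docs are token lists)."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))  # document frequency
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({
            # Smoothed IDF: log((1 + n) / (1 + df)) + 1 (one common variant).
            term: (count / len(doc)) * (math.log((1 + n) / (1 + df[term])) + 1.0)
            for term, count in tf.items()
        })
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * math.sqrt(sum(w * w for w in v.values()))
    return dot / norm if norm else 0.0

# Made-up token lists standing in for a reference publication and two articles.
reference = ["isotretinoin", "pregnanc", "prevent", "studi"]
article_a = ["isotretinoin", "pregnanc", "studi", "women"]
article_b = ["health", "data", "research"]

ref_vec, a_vec, b_vec = tfidf_vectors([reference, article_a, article_b])
print(cosine(ref_vec, a_vec), cosine(ref_vec, b_vec))
```

Because term weights are non-negative, the similarity stays between 0 (no shared terms, as with `article_b`) and 1 (identical term profiles).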
Most common terms, both overall and within each cluster.
| Cluster^a | Cluster 1 | Cluster 2 | Cluster 3 | Singleton 28 | Singleton 29 |
| Clinically important terms | | | | | |
| isotretinoin | 240 (2)^b | 49 (2) | 6 (48) | 7 (2) | 4 (7) |
| accutan^c | 46 (32) | 12 (20) | 17 (11) | —^d | 3 (12) |
| pregnanc | 263 (1) | 57 (1) | 8 (33) | 12 (1) | 5 (3) |
| drug | 166 (3) | 28 (6) | 23 (7) | 1 (50) | 8 (1) |
| birth | 127 (7) | 22 (9) | 8 (33) | 1 (50) | 5 (3) |
| Top 5 terms in Cluster 1 | | | | | |
| studi | 160 (4) | 31 (3) | 42 (2) | 6 (3) | 6 (2) |
| us | 142 (5) | 18 (12) | 3 (107) | — | 2 (17) |
| women | 139 (6) | 30 (4) | 14 (14) | — | 5 (3) |
| treatment | 101 (8) | 16 (14) | 3 (107) | 2 (16) | 1 (39) |
| patient | 94 (9) | 11 (24) | 9 (27) | 2 (16) | 1 (39) |
| Top 5 terms in Cluster 2 | | | | | |
| studi | 160 (4) | 31 (3) | 42 (2) | 6 (3) | 6 (2) |
| women | 139 (6) | 30 (4) | 14 (14) | — | 5 (3) |
| prevent | 72 (16) | 30 (4) | 4 (74) | 5 (4) | 2 (17) |
| canadian | 62 (22) | 27 (7) | 6 (48) | 2 (16) | 1 (39) |
| take | 55 (28) | 24 (8) | 9 (27) | — | 3 (12) |
| Top 5 terms in Cluster 3 | | | | | |
| research | 82 (13) | 9 (27) | 43 (1) | — | 1 (39) |
| studi | 160 (4) | 31 (3) | 42 (2) | 6 (3) | 6 (2) |
| health | 66 (19) | 1 (411) | 38 (3) | — | — |
| data | 35 (44) | — | 38 (3) | — | — |
| said | 56 (27) | 7 (41) | 33 (5) | — | 4 (7) |
^a The terms in the table are stemmed versions of the actual terms (for example, us represents various forms of the verb use, and pregnanc stands for pregnancy).
^b Top 5 most frequent terms of each cluster exclude the 5 clinically important terms.
^c The first number in each cell shows the frequency of occurrence of the term, and the second number, in parentheses, shows the ranking of the term among all the terms in that cluster.
^d Empty cells (represented with a —) are the terms that do not appear in the respective cluster/singleton.
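The "frequency (rank)" cells in the table above can be reproduced in spirit with a short ranking routine. The sketch below assumes tied terms share a rank and the following rank is skipped, which matches the pattern visible in the table (for example, women and prevent both rank 4 in Cluster 2, and drug follows at rank 6). The function name and the token list are hypothetical.

```python
from collections import Counter

def frequency_with_rank(tokens):
    """Map each term to (frequency, rank), with ties sharing the same rank."""
    counts = Counter(tokens)
    ordered = sorted(counts.items(), key=lambda item: -item[1])
    result = {}
    rank, prev_freq = 0, None
    for position, (term, freq) in enumerate(ordered, start=1):
        if freq != prev_freq:
            # A new frequency value starts a new rank equal to its position.
            rank, prev_freq = position, freq
        result[term] = (freq, rank)
    return result

# Made-up stemmed tokens standing in for all articles of one cluster.
cluster_tokens = ["pregnanc"] * 3 + ["studi"] * 2 + ["women"] * 2 + ["drug"]
table = frequency_with_rank(cluster_tokens)
print(table["women"], table["drug"], table.get("accutan"))
```

A term absent from the cluster returns `None` from `table.get(...)`, corresponding to the empty cells marked with a dash.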
Figure 4. Distribution of readability levels of articles based on the Text Standard measure.
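The Text Standard measure is described as a vote among the other metrics, taking the most prevalent reading level. A minimal sketch of such a vote follows; it assumes only grade-level scores are passed in, that each score is rounded to the nearest whole grade, and that ties are broken by choosing the lower grade. These choices are illustrative assumptions and may differ from the tool used in the study.

```python
from collections import Counter

def text_standard(grade_scores):
    """Return the most common rounded grade level among the given scores."""
    rounded = [round(score) for score in grade_scores]
    counts = Counter(rounded)
    top = max(counts.values())
    # Tie-break (assumption): prefer the lowest grade among the modes.
    return min(grade for grade, count in counts.items() if count == top)

# Made-up grade-level scores from six metrics for a single article.
scores = [12.1, 13.4, 12.9, 14.2, 13.0, 12.6]
print(text_standard(scores))
```

Here four of the six rounded scores agree on grade 13, so that level is reported even though the individual metrics range from 12 to 14.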
Average readability level of each cluster.
| Cluster | Flesch Reading Ease | Flesch-Kincaid Grade | Gunning Fog Index | SMOG Index | Automated Readability Index | Coleman Liau Index | Linsear Write Index | Dale-Chall Readability Score | Text Standard |
| Cluster 1 | 40.78 | 13.02 | 15.19 | 15.58 | 15.21 | 14.47 | 13.27 | 9.87 | 16th grade |
| Cluster 2 | 29.89 | 14.74 | 16.59 | 16.92 | 16.76 | 15.97 | 10.62 | 10.67 | 17th grade |
| Cluster 3 | 49.19 | 11.35 | 13.75 | 14.33 | 13.35 | 13.32 | 8.85 | 9.39 | 14th grade |
| Singleton 28 | 36.79 | 12.50 | 12.99 | 15.90 | 15.20 | 16.82 | 13.75 | 8.82 | 12th grade |
| Singleton 29 | 49.55 | 11.70 | 15.11 | 15.00 | 15.40 | 14.74 | 8.08 | 9.87 | 14th grade |