| Literature DB >> 31160946 |
Beverly L Smith-Keiling1, Hye In F Hyun1.
Abstract
In addition to human, close reading of student text with rubrics for assessment, educators use nonhuman, distant computer-assisted tools to help quantitatively measure otherwise qualitative keywords to prevent bias in grading and help read beyond the sentence for underlying cognitions. We apply the Linguistic Inquiry and Word Count (LIWC) software tool to analyze different forms of student writing used in STEM education and research to assess writing of native English speakers and non-native English Language Learners (ELLs), including international students. Available in several languages, LIWC measures four summary variables, Analytical Thinking, Clout, Authentic, and Emotional Tone, to provide outputs as raw word counts, as percentages of words used relative to the text compared with a dictionary of words in categories and sub-dictionaries, and as scores correlating these words algorithmically based on a dictionary of terms associated with underlying meanings. This tool can help measure student personal reflective writing for underlying psychosocial indicators or the cognitive and analytical process in other science writing. By selecting key variables, or creating a personal dictionary, LIWC can be used to analyze scientific writing to detect progressive development of student analytical writing from early draft to final version for different informal and formal writing styles. We share methods, examples, and the potential for using LIWC measures of cognitive processes for different measures of student writing in science courses.Entities:
Year: 2019 PMID: 31160946 PMCID: PMC6508918 DOI: 10.1128/jmbe.v20i1.1709
Source DB: PubMed Journal: J Microbiol Biol Educ ISSN: 1935-7877
Sample LIWC2015 code dictionaries.
| Category | Abbrev | Examples | Words in category |
|---|---|---|---|
| Word count | WC | ||
| Summary language variables | |||
| Analytical thinking | Analytic | ||
| Clout | Clout | ||
| Authentic | Authentic | ||
| Emotional Tone | Tone | ||
| Words/sentence | WPS | ||
| Words>6 letters | Sixltr | ||
| Linguistic dimensions | |||
| Total function words | funct | it, to, no, very | 491 |
| Total pronouns | pronoun | I, them, itself | 153 |
| 1st pers singular | I | I, me, mine | 24 |
| Articles | article | a, an, the | 3 |
| Other grammar | |||
| Common verbs | verb | eat, come, carry | 1,000 |
| Common adjectives | adj | free, happy, long | 764 |
| Comparisons | compare | greater, best, after | 317 |
| Interrogatives | interrog | how, when, what | 48 |
| Numbers | number | second, thousand | 36 |
| Quantifiers | quant | few, many, much | 77 |
| Psychological processes | |||
| Affective processes | affect | happy, cried | 1,393 |
| Anxiety | anx | worried, fearful | 116 |
| Cognitive processes | cogproc | cause, know, ought | 797 |
| Insight | insight | think, know | 259 |
| Causation | cause | because, effect | 135 |
| Discrepancy | discrep | should, would | 83 |
| Tentative | tentat | maybe, perhaps | 178 |
| Certainty | certain | always, never | 113 |
| Differentiation | differ | hasn’t, but, else | 81 |
Adapted with permission from LIWC language manual (7).
This small sample of words in each category from a larger dictionary provides example words and the total number of words in each category. As educators develop their own rubric, such as using terms for Comparisons as previously shown (8), they can include Linguistic Inquiry and Word Count (LIWC) codes useful for analysis of a variety of text styles using total raw Word count and four Summary Language Variables: Analytical Thinking, Clout, Authentic, Emotional Tone. Categories such as Linguistic dimensions include subcategory function words, pronouns, and use of “I.” This category is used in expressive and reflective writing along with the category Psychological processes, which includes subcategories Affective processes and Anxiety, but is not used with scientific writing. The frequency of pronouns and grammar is used in scientific writing to detect formality. Other Grammar includes subcategories Comparisons, Quantifiers, and other variables (not shown) such as Punctuation. These, along with the category Cognitive processes, are useful for both reflective and scientific writing.
FIGURE 1Screenshot of sample Linguistic Inquiry and Word Count (LIWC) output. LIWC analysis of student Biochemical (BQA) draft questions Q (reflective) and answers A (scientific) writing. Export to Excel shows left column filenames or de-identified student numbers from a file opened in LIWC software. Top row sample categories are selected based on the desired writing style analysis. Word count (WC), and summary variables Analytical Thinking, Clout, Authentic, and Emotional Tone are raw data or algorithmically determined and useful for many writing styles. Some variables help determine the complexity of writing: Words per sentence (WPS), Words > 6 letters (Sixltr). Categories are percentage scores of the number of words from the text relative to total word count, e.g., Pronouns, I, or other me personal pronouns are used for different purposes in different styles of writing, such as determining formality in scientific writing style. Categories are algorithmically nested under summary variables, e.g., Analytical Thinking summary variable comprises Comparison, Quantifier, Cognitive Processes, and others, according to the LIWC dictionary. Emotional Tone summary variable comprises Affect, Anxiety, Positive, Negative, and others. A variety of punctuation, e.g., Quotes, Apostrophes, Parentheses, etc., are useful for tracking scientific writing formality. We define formality of scientific writing per our grading rubric as having these features: zero to low personal pronouns, no quotes, no contractions, no apostrophes except the four expected for two 5′ and 3′ DNA primer ends in a Methods section. Scores matched hand-graded appropriate use of parentheses for defined abbreviations, citations, and chemical names but were not overused in layperson writing with increased definition of scientific terminology. “I” (0.00) was consistent with BQA scientific answers A, whereas questions Q had allowable pronouns in the reflective style. Higher use of quotes, apostrophes, and parentheses was detected with less formal writing such as use of contractions (“Conc’t sample”), which was found in other writing samples and corrected upon later rewrite of draft (BQA1D, BQA2D, BQA3D) to final versions (BQA1F, BQA2F, BQAS3F). LIWC scores were matched to hand-graded counts by two independent raters and reviewed by an external evaluator, with >95% agreement, and two additional independent in-class graders for comparison, quantitatively assessing levels of Comparisons, Quantifiers, and Cognitive reasoning and Analytical thought (8, Appendix 1 examples).
FIGURE 2Visual graphical sample for comparison. Once data are graphed, detected patterns are more easily seen between the Biochemical (BQA) Question Q part, which is reflective and poses a question with more personal pronouns “I”, and the Answer A part, which is formal scientific writing and has a higher Analytical Thinking score. Some progressive improvement can be detected from drafts (BQA1D, BQA2D, BQA3D) to their final versions (BQA1F, BQA2F, BQAS3F), with higher Analytic scores algorithmically detecting more formality in the scientifically written answer A section and with some detectable increases in total scores, e.g., BQA2 improved from 86 to 87, and BQA3 improved from 87 to 89. Clout as a measure of confidence is higher in some question sections than others, demonstrating that confidence can vary per different topics in the reflectively written question Q section. While Clout per percentage words in the dictionary as an indicator of confidence can be useful in psychosocial research studies, it is not useful in a written rubric for grading. Variables are not used for all genres, e.g., Clout psychosocial indicator is not used in the Answer portion, which is scientific and not reflective writing. These computer-generated scores were matched with hand-grading and visual inspection by the two authors. Examples are provided in Appendix 1.