Huiping Zhang, Ming Chen, Xuelan Li.
Abstract
This paper reports a cross-sectional study of developmental features in the second language (L2) writing of Chinese beginner learners of English, using four lexical richness measures (lexical sophistication, lexical variation, lexical density, and lexical errors) from the perspective of the language exposure hypothesis. Specifically, the study compares English compositions written by Chinese students in grades 7, 8, and 9 on these four measures. The compositions were sampled from the Writing Corpus of Chinese Beginner Learners of English, with sample sizes kept almost the same across the three grades. The analysis revealed that lexical richness in the beginner learners' writing samples is comparatively low, with learners transferring lexical features of the oral register to their L2 writing; furthermore, all four measures showed significant, albeit non-linear and unevenly paced, development across grade levels. Based on the findings, several suggestions for vocabulary teaching are provided.
Keywords: Chinese beginner learners; L2 writings; developmental features; language exposure hypothesis; lexical richness
Year: 2021 PMID: 34149559 PMCID: PMC8206476 DOI: 10.3389/fpsyg.2021.665988
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
FIGURE 1. Language exposure hypothesis.
FIGURE 2. Multidimensional model of lexical richness.
Number and percentage of word types per word list and per grade level.
| Category | Word list | Grade 7 types/% | Grade 8 types/% | Grade 9 types/% |
| High-frequency words | Word list 1 | 664/48.68 | 753/50.67 | 702/40.96 |
| High-frequency words | Word list 2 | 275/20.16 | 323/21.74 | 392/22.87 |
| High-frequency words | Total | 939/68.84 | 1,076/72.41 | 1,094/63.83 |
| Low-frequency words | Word list 3 | 86/6.30 | 110/7.40 | 141/8.23 |
| Low-frequency words | Not in the lists | Correct 202 (total 339)/14.81 | Correct 221 (total 300)/14.87 | Correct 338 (total 479)/19.72 |
| Low-frequency words | Total | Correct 288 (total 425)/21.11 | Correct 331 (total 410)/22.27 | Correct 479 (total 620)/27.95 |
| Total number of word types | | 1,364 | 1,486 | 1,714 |
Log-likelihood (LL) tests for lexical sophistication across grade levels.
| Grade levels | Word list 3: LL | Word list 3: Sig. (p) | Not in the lists: LL | Not in the lists: Sig. (p) | Total (lexical sophistication): LL | Total (lexical sophistication): Sig. (p) |
| 7–8 | −1.25 | 0.264 | 0.00 | 0.965 | −0.44 | 0.507 |
| 8–9 | −0.69 | 0.406 | −10.81 | 0.001 | −10.19 | 0.001 |
| 7–9 | −3.85 | 0.050 | −10.59 | 0.001 | −14.42 | 0.000 |
Type-token ratio (TTR) per grade level.
| Grade level | LL | Types | Tokens | TTR |
| 7 | Grades 7 and 8: LL = −5.04 Sig. ( | 1364 | 16,708 | 8.16% |
| 8 | Grades 8 and 9: LL = −15.35 Sig. ( | 1486 | 16,733 | 8.88% |
| 9 | | 1714 | 16,726 | 10.25% |
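The TTR column and the log-likelihood comparison can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes the two-cell form of the log-likelihood statistic that is common in corpus linguistics, and it assumes that a negative sign marks a lower relative frequency in the first (lower) grade. The function names `log_likelihood` and `ttr_percent` are hypothetical.

```python
import math

def log_likelihood(a, b, c, d):
    """Two-cell log-likelihood for a feature observed a times in a corpus
    of size c and b times in a corpus of size d."""
    e1 = c * (a + b) / (c + d)  # expected count in corpus 1
    e2 = d * (a + b) / (c + d)  # expected count in corpus 2
    ll = 0.0
    for observed, expected in ((a, e1), (b, e2)):
        if observed > 0:  # a zero count contributes nothing to the sum
            ll += observed * math.log(observed / expected)
    ll *= 2
    # Assumed sign convention: negative when corpus 1 has the lower
    # relative frequency.
    return -ll if a / c < b / d else ll

def ttr_percent(types, tokens):
    """Type-token ratio expressed as a percentage."""
    return 100 * types / tokens

# Type and token counts taken from the TTR table.
grade7 = {"types": 1364, "tokens": 16708}
grade8 = {"types": 1486, "tokens": 16733}

ttr7 = round(ttr_percent(**grade7), 2)
ttr8 = round(ttr_percent(**grade8), 2)
ll_7_8 = log_likelihood(grade7["types"], grade8["types"],
                        grade7["tokens"], grade8["tokens"])
print(ttr7, ttr8, round(ll_7_8, 2))
```

Under these assumptions the grade 7 and grade 8 TTR values match the table, and the log-likelihood comes out close to the reported value for grades 7 and 8. Other comparisons in the tables may rest on a different variant of the statistic, so this sketch is not guaranteed to reproduce every reported LL value.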
Number of word types per word list and per grade level.
| Grade level | Word list 1 | Word list 2 | Word list 3 | Not in the lists (correct vocabulary) |
| 7 | 664 | 275 | 86 | 202 |
| LL (grades 7 and 8) | LL = −5.46 Sig. ( | LL = −3.79 Sig. ( | LL = −2.91 Sig. ( | LL = −0.83 Sig. ( |
| 8 | 753 | 323 | 110 | 221 |
| LL (grades 8 and 9) | LL = 1.77 Sig. ( | LL = −6.70 Sig. ( | LL = −3.85 Sig. ( | LL = −24.72 Sig. ( |
| 9 | 702 | 392 | 141 | 338 |
Lexical density per grade level.
| Grade level | LL | Lexical word tokens | Tokens | Lexical density |
| 7 | Grades 7 and 8: LL = −10.62 Sig. ( | 6828 | 16,506 | 41.37% |
| 8 | Grades 8 and 9: LL = −0.09 Sig. ( | 7205 | 16,485 | 43.71% |
| 9 | | 7167 | 16,314 | 43.93% |
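The lexical density column follows directly from the two count columns. The sketch below assumes the usual definition, lexical word tokens divided by all tokens, expressed as a percentage; the dictionary layout and the name `lexical_density` are illustrative only.

```python
def lexical_density(lexical_tokens, tokens):
    """Share of lexical (content) word tokens among all tokens, in percent."""
    return 100 * lexical_tokens / tokens

# (lexical word tokens, tokens) per grade, taken from the table above.
counts = {7: (6828, 16506), 8: (7205, 16485), 9: (7167, 16314)}

density = {grade: round(lexical_density(lex, tok), 2)
           for grade, (lex, tok) in counts.items()}
print(density)  # {7: 41.37, 8: 43.71, 9: 43.93}
```

Note that the token totals here differ slightly from those in the TTR table, so the two tables appear to use different token counts as their base.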
Distribution of lexical word classes across grade levels.
| Grade level | Nouns: frequency/percentage | Nouns: normalized frequency | Verbs: frequency/percentage | Verbs: normalized frequency | Adjectives: frequency/percentage | Adjectives: normalized frequency | Adverbs: frequency/percentage | Adverbs: normalized frequency |
| 7 | 3552/52.02% | 212.59 | 1963/28.75% | 117.49 | 1160/16.98% | 69.43 | 153/2.24% | 9.16 |
| 8 | 3482/48.32% | 208.09 | 2288/31.76% | 136.74 | 1247/17.31% | 74.52 | 188/2.61% | 11.24 |
| 9 | 3389/47.29% | 202.62 | 2322/32.39% | 138.83 | 1256/17.52% | 75.09 | 200/2.79% | 11.96 |
| Total | 49.2% | | 30.9% | | 17.3% | | 2.5% | |
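The normalized frequencies in this table appear to be occurrences per 1,000 tokens. The sketch below checks that reading for grade 7, assuming the base is the grade 7 token total from the TTR table (16,708); both the base and the helper name `per_thousand` are assumptions rather than details stated in the source.

```python
def per_thousand(frequency, tokens):
    """Frequency normalized to occurrences per 1,000 tokens."""
    return 1000 * frequency / tokens

GRADE7_TOKENS = 16708  # assumed base: grade 7 token total from the TTR table

# Raw grade 7 frequencies per lexical word class, from the table above.
grade7_counts = {"nouns": 3552, "verbs": 1963, "adjectives": 1160, "adverbs": 153}

normalized = {word_class: round(per_thousand(freq, GRADE7_TOKENS), 2)
              for word_class, freq in grade7_counts.items()}
print(normalized)
# {'nouns': 212.59, 'verbs': 117.49, 'adjectives': 69.43, 'adverbs': 9.16}
```

Normalizing in this way makes the grades comparable even though their token totals differ slightly.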
Log-likelihood (LL) tests for lexical word classes across grade levels.
| Grade levels | Nouns: LL | Nouns: Sig. (p) | Verbs: LL | Verbs: Sig. (p) | Adjectives: LL | Adjectives: Sig. (p) | Adverbs: LL | Adverbs: Sig. (p) |
| 7–8 | 0.81 | 0.370 | −24.39 | 0.000 | −3.02 | 0.082 | −3.55 | 0.060 |
| 8–9 | 1.40 | 0.236 | −0.20 | 0.653 | −0.02 | 0.887 | −0.35 | 0.552 |
| 7–9 | 4.01 | 0.045 | −29.73 | 0.000 | −3.71 | 0.054 | −6.23 | 0.013 |
Percentage of misspellings per base list.
| Word lists | Frequency of misspellings/total frequency | Percentage |
| Base list 1 | 440/43,524 | 1.01% |
| Base list 2 | 173/3412 | 5.07% |
| Base list 3 | 94/781 | 12.04% |
Percentage of misspellings per base list and per grade level.
| Word lists | Types of misspellings: Grade 7 | Types of misspellings: Grade 8 | Types of misspellings: Grade 9 | Tokens of misspellings: Grade 7 | Tokens of misspellings: Grade 8 | Tokens of misspellings: Grade 9 |
| Base list 1 | 108/664 (16.3%) | 80/753 (10.6%) | 61/702 (8.7%) | 202/15,036 (1.3%) | 138/14,636 (0.9%) | 100/13,852 (0.7%) |
| LL | Grades 7 and 8: LL = 8.45 Sig. ( | Grades 8 and 9: LL = 1.41 Sig. ( | | Grades 7 and 8: LL = 10.46 Sig. ( | Grades 8 and 9: LL = 4.18 Sig. ( | |
| Base list 2 | 46/275 (16.7%) | 44/323 (13.6%) | 47/392 (11.9%) | 61/901 (6.7%) | 53/1059 (5.0%) | 59/1452 (4.1%) |
| LL | Grades 7 and 8: LL = 0.95 Sig. ( | Grades 8 and 9: LL = 0.37 Sig. ( | | Grades 7 and 8: LL = 2.60 Sig. ( | Grades 8 and 9: LL = 1.21 Sig. ( | |
| Base list 3 | 5/86 (5.8%) | 20/110 (18.2%) | 35/141 (24.8%) | 9/171 (5.3%) | 27/275 (9.8%) | 58/335 (17.3%) |
| LL | Grades 7 and 8: LL = −6.32 Sig. ( | Grades 8 and 9: LL = −1.26 Sig. ( | | Grades 7 and 8: LL = −2.88 Sig. ( | Grades 8 and 9: LL = −6.28 Sig. ( | |
| Total frequency of misspellings | 159 | 144 | 143 | 272 | 218 | 217 |
| LL | Grades 7 and 8: LL = 0.77 Sig. ( | Grades 8 and 9: LL = 0.00 Sig. ( | | Grades 7 and 8: LL = 6.04 Sig. ( | Grades 8 and 9: LL = 0.00 Sig. ( | |
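The overall misspelling percentages per base list can be recomputed by pooling the per-grade token counts above. The sketch below is only a consistency check under that pooling assumption; the variable names are illustrative.

```python
# (misspelled tokens, total tokens) for grades 7, 8, and 9,
# taken from the token columns of the table above.
tokens_by_list = {
    "Base list 1": [(202, 15036), (138, 14636), (100, 13852)],
    "Base list 2": [(61, 901), (53, 1059), (59, 1452)],
    "Base list 3": [(9, 171), (27, 275), (58, 335)],
}

pooled = {}
for name, grades in tokens_by_list.items():
    misspelled = sum(m for m, _ in grades)   # pooled misspelled tokens
    total = sum(t for _, t in grades)        # pooled tokens for this list
    pooled[name] = (misspelled, total, round(100 * misspelled / total, 2))

for name, (misspelled, total, pct) in pooled.items():
    print(f"{name}: {misspelled}/{total} = {pct}%")
```

Pooled this way, the misspelling rate rises from about 1% for base list 1 to about 12% for base list 3, in line with the per-list percentages reported earlier.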