Literature DB >> 23983466

Latent class analysis of reading, decoding, and writing performance using the Academic Performance Test: concurrent and discriminating validity.

Hugo Cogo-Moreira1, Carolina Alves Ferreira Carvalho, Adriana de Souza Batista Kida, Clara Regina Brandão de Avila, Giovanni Abrahão Salum, Tais Silveira Moriyama, Ary Gadelha, Luis Augusto Rohde, Luciana Monteiro de Moura, Andrea Parolin Jackowski, Jair de Jesus Mari.   

Abstract

AIM: To explore and validate the best returned latent class solution for reading and writing subtests from the Academic Performance Test (TDE). SAMPLE: A total of 1,945 children (6-14 years of age), who answered the TDE, the Development and Well-Being Assessment (DAWBA), and had an estimated intelligence quotient (IQ) higher than 70, came from public schools in São Paulo (35 schools) and Porto Alegre (22 schools) that participated in the 'High Risk Cohort Study for Childhood Psychiatric Disorders' project. They were on average 9.52 years old (standard deviation = 1.856), from the 1st to 9th grades, and 53.3% male. The mean estimated IQ was 102.70 (standard deviation = 16.44).
METHODS: Via Item Response Theory (IRT), the highest discriminating items ('a'>1.7) were selected from the TDE subtests of reading and writing. A latent class analysis was run based on these subtests. The statistically and empirically best latent class solutions were validated through concurrent (IQ and combined attention deficit hyperactivity disorder [ADHD] diagnoses) and discriminant (major depression diagnoses) measures.
RESULTS: A three-class solution was found to be the best model solution, revealing classes of children with good, not-so-good, or poor performance on TDE reading and writing tasks. The three-class solution has been shown to be correlated with estimated IQ and to ADHD diagnosis. No association was observed between the latent class and major depression.
CONCLUSION: The three-class solution showed both concurrent and discriminant validity. This work provides initial evidence of validity for an empirically derived categorical classification of reading, decoding, and writing performance using the TDE. A valid classification encourages further research investing correlates of reading and writing performance using the TDE.

Entities:  

Keywords:  Academic Performance Test; TDE; decoding; validity; writing

Year:  2013        PMID: 23983466      PMCID: PMC3748054          DOI: 10.2147/NDT.S45785

Source DB:  PubMed          Journal:  Neuropsychiatr Dis Treat        ISSN: 1176-6328            Impact factor:   2.570


Introduction

The assessment of word recognition and spelling is considered to be, in different alphabetic languages, an effective approach to estimating the performance of basic cognitive levels of processing related to decoding and encoding the writing system. The scores generated by the use of quantitative parameters of errors, both in reading and writing tests, allow inferences about the characteristics of cognitive processes linked to the principle of the alphabetic-orthographic system.1 New reliable and valid data in this area of study are crucial for improving our knowledge in this field and for helping clinicians, researchers, and policymakers; diagnoses of learning disorders and language difficulties in low- and middle-income countries depend on extremely scarce scientific data to drive an evidence-based framework. The Academic Performance Test (TDE)2 is a standardized and validated test for the Brazilian population that assesses both the performance of reading aloud and writing by dictation which are known as the subtests Reading and Writing, respectively. These subtests seek to eliminate interference as much as possible in interpretation and understanding using isolated items (single words). The TDE was created approximately 10 years ago and has been frequently used to measure the performance of basic skills learned in school. It is also one of the instruments used in the diagnosis of learning disorders. Despite its frequent use, the literature on the test so far is limited in a number of important ways. First, there are very few studies investigating the validity of the TDE among the Brazilian population, and the studies available rely on small samples and lack an assessment of concurrent mental disorders and a proper characterization of cognition. Second, recent research has pointed out some concerns related to discrimination, item difficulty, and the original criteria of the test.3 To our knowledge, no study has investigated the properties of the TDE using modern statistical techniques such as Item Response Theory (IRT). Third, as far as we are aware, there is no study investigating the TDE from a person-centered perspective (eg, Latent Class Analysis [LCA]), which allows us to investigate classes of subjects that are distinct from each other in both writing and decoding performance simultaneously. LCA is a form of cluster analysis initially introduced by Lazarsfeld and Henry in 1968.4 It is the most commonly applied latent structure model for categorical data,5 allowing the specification of statistical distributions through a model-based method, which differs from methods that apply arbitrary distance metrics to group individuals based on their similarity (for example, K-means clustering).6 In LCA, unlike in K-means clustering, a statistical model is built for the population from which the data sample was obtained.7 Latent class analysis may be used as a way to evaluate the diagnostic accuracy of a test when there is not a gold standard against which to compare it.8 The latent class refers to grouping by performance standards in all the items that measure them. These groupings should then be validated to confirm whether the instrument assesses what it proposes. Also, for a proper analysis of basic skills in the domain of the alphabetic principle, it is important to consider performance in both writing and reading simultaneously. Here, we make use of a large sample of children to investigate the suitability of test items and to explore and validate classes of subjects with problems in reading and writing simultaneously. It was hypothesized that if there is an underlying latent grouping of children, without a priori classification of the TDE’s decoding (word-level skill task) and writing (dictation) items, such categories will have concurrent and divergent validity with other gold standard measures. In other words, if the best latent class solution is a useful screening tool for assessing decoding and writing skills (via very high discrimination of the TDE’s items), latent groups of readers showing different levels of decoding and writing skills will be found. Indeed, if the children, who were classified as having different levels of reading and writing skills, show corresponding performance in direct measures of reading, this result will be taken as concurrent validity for the instrument. Conversely, discriminant validity will be granted if no associations are found between reading and writing ability and measurements that are not directly related to reading and writing skills.

Materials and methods

This report is part of a large, community school-based study, performed in multiple steps combining standardized evaluation from a psychiatric and cognitive neuroscience perspective, genetics, and neuroimaging to inform preventive strategies in developmental psychiatry. Our sample in the screening phase consisted of students in public schools with more than 10,000 students in the age range assessed from 57 public schools, located close to the research centers in Porto Alegre and São Paulo, Brazil. The methodological aspects of the project were extensively reviewed by Salum et al;9 they are briefly presented here: (1) screening; (2) psychiatric assessments; and (3) cognitive evaluation, where assessments of IQ and decoding and writing skills (via the TDE) were collected. This study was approved by the ethics committee of the University of São Paulo (IORG0004884, project Institutional Review Board registration number: 1132/08). Written consent was obtained from all the parents of the participants, and verbal assent was obtained from all the children. When appropriate, written assent also was obtained from the children.

Participants

Our sample was consisted of 1,945 children, who were evaluated by hearing and speech therapists, psychologists, and psychiatrists. They came from São Paulo (35 schools) and Porto Alegre (22 schools) which participated in screening and enrollment procedures; students were on average 9.52 years old (standard deviation [SD] = 1.856), in 1st grade to 9th grade, 53.30% male, and had an intelligence quotient (IQ) higher than 70 (the sample’s average IQ was 102.70 [SD = 16.44]). Only children with an estimated IQ higher than 70 were considered in order to avoid intellectual disabilities, which might be a potential confounder for low achievement in decoding and writing (word-level skills); the proposed cutoff for intellectual disability (<70) was based on both of the most commonly used diagnostics systems in psychiatry, the DSM-IV (Diagnostic Manual of Mental Disorders) and ICD-10 (International Classification of Diseases). Of the 1,945 children, 772 children were randomly selected from the population, and 1,173 came from the high-risk strata. Selection for the high-risk group involved a risk-prioritization procedure that focused on individuals with a family history of a disorder and/or ongoing symptoms in one of the five targeted domains (attention deficit hyperactivity disorder [ADHD], anxiety, obsessive compulsive disorder [OCD], psychosis, major depression, and learning disorders), as detected during screening.9

Academic Performance Test

The Academic Performance Test (TDE)2 is composed of three subtests: writing (isolated words in dictation); mathematics (oral problem solving and written calculations of mathematical operations); and reading (recognition of isolated words). The instrument is constructed and validated for the Brazilian population.2 In this research, only the writing and reading subtests were used. The TDE was administered by trained hearing and speech therapists, who gave the following instructions to the children: (a) for the reading subtest: “look at these words carefully and read them aloud”; and (b) for the dictation subtest: “now we are going to do a dictation. I will dictate a word. After that, I will read a sentence where this word appears, and, thus, I will read it [the word] once again. If there is some word which you do not know, try to write it in a way that you know.”

Training procedures

Training procedures consisted of two sessions with experienced professionals in the field. Full explanations about the project, instruments, procedures, and standardization for the clinical evaluations were provided, in order to avoid any assessment bias.

Intelligence

IQ was estimated using the vocabulary and block design subtests of the Weschler Intelligence Scale for Children, 3rd edition (WISC-III),10 the Tellegen and Briggs method which is a modified part-whole correlation formula offered for use in the case of non-independent test administration of the part-Wechsler subtest combination,11 and Brazilian normalizations.12 The WISC-III was administered by trained psychologists.

Potential confounder

Simplified Auditory Assessments13 were conducted by a hearing and speech pathologist. These assessments tested the elicitation of the auropalpebral reflex through instrumental sounds; sound location in five directions; sequential verbal memory for sounds with three and four syllables; and sequential nonverbal memory with three and four percussion musical instruments. The children were classified as having problems in auditory processing, or not. Auditory perception may be one source of individual variation in the phonological abilities that play a critical role in skilled reading, as well as in reading and writing disabilities.14 Also, in order to evaluate the children’s socioeconomic status (SES), the Brazilian Association of Research Companies (ABEP)15 questionnaire was used, which measures personal material wealth and is a standardized index of economic classification, based on the family’s power of consumption.

Psychiatric diagnosis

Child psychiatric diagnosis was established using the Development and Well-Being Assessment (DAWBA),16 which was administered to the biological parents of all children included in the project. The DAWBA is a structured interview administered by lay interviewers who also record verbatim responses, which were used to confirm or refute diagnoses of any reported problems. All questions are closely related to DSM-IV diagnostic criteria and focus on current problems causing significant distress or social impairment. A total of nine well-trained psychiatrists performed the rating procedures; they were trained and supervised closely by a senior child psychiatrist with extensive experience in rating the DAWBA. Cases about which raters had doubts about any specific diagnosis were scaled up and discussed between two child psychiatrists until consensus about the diagnosis was achieved.

Statistical analysis plan

First, in order to identify the highest discriminating TDE items, IRT was conducted (we began with this procedure, because the TDE has showed floor and ceiling effect for some items). Through this technique, it is possible to estimate the amount of latent trait (ability) required to correctly answer the item (here, individually, two different latent traits were considered: decoding [word-level reading] and writing [word-level writing], following IRT’s unidimensionality assumption). All 104 items are dichotomous (ie, had only a correct or wrong answer). In order to satisfy the unidimensionality’s postulate, we admitted that there is a dominant ability (latent trait) that is measurable by achievement in a conjunction of items in each of the TDE’s subtests.17 The greater the ability level, the higher the probability of a correct response, and vice versa. Indeed, the probability of success depends on three items’ parameters: discrimination (denoted by ‘a’, which describes how well an item can differentiate between examinees having abilities below the item location and those having abilities above the item location);18 difficulty (denoted by ‘b’, where typical values have the range −3≤b≤3, designating the difficulty of the item; in other words, it describes where the item functions along the ability scale); and guessing (normally used in multiple-choice tests; in other words, it is the probability of getting the item correct by guessing alone. In this research, the guessing parameter was not considered, because we are assuming that its probability is 0). Therefore, we worked using a two-parameter logistic model. Baker has pointed out that items with very high discrimination value have ‘a’>1.7;18 based on this criterion, in each latent trait, we looked for items with very high values on the discrimination parameter, which means that the item has a high power of differentiation (discrimination) between children with and without word-level reading and writing skills. Subsequently, the set of high discriminative items were analyzed via latent class analysis (LCA), a mixture model that aims to uncover unobserved heterogeneity in a population and to find substantively meaningful groups of people that are similar in their responses to measured variables.19 The idea behind the latent class was to find latent groups underlying the highest discriminative items (ie, items with ‘a’>1.7); in other words, latent groups under items that differentiate between children having abilities below the item location and those having abilities above the item location. To compare different numbers of latent classes’ solutions, different information criterion indices were used: Akaike’s Information Criterion (AIC); the Bayesian information criterion (BIC), in which small values correspond to better fit; and the sample size-adjusted BIC (ssaBIC). The Lo–Mendell–Rubin (LMR) test20,21 and bootstrapped likelihood ratio test (BLRT) were used to test the number of classes in this mixture analysis procedure; the former is obtained by running the k-class and k-1 class analyses and using the derivatives from both models to compute the P-value (a low P-value rejects the k-1 class model in favor of the k-class model). The latter is obtained by bootstrapping, following the procedure described by Asparouhov and Muthén.22 The classification quality of the model was evaluated according to the entropy criterion, in which the values range from 0 to 1, where values close to 1 indicate good classification. IRT and LCA were conducted via Mplus (v6.12; Muthén and Muthén, Los Angeles, CA, USA).23 In order to validate the best latent class solution (based on statistical and empirical evidence), concurrent and discriminant validity were assessed using the regression model from STATA (v12; StataCorp, College Station, TX, USA), considering robust SE to adjust for the cluster structure (school level). Covariates such as age, gender, intelligence quotient, auditory processing, and socioeconomic status were considered in the regression models. When intelligence quotient was used as the outcome, it was excluded from the covariates selection.

Results

IRT

Tables 1 and 2 show the discrimination (‘a’) and difficulty (‘b’) parameters of the TDE’s subtests (reading and writing, respectively) with their respective SE.
Table 1

Discrimination (‘a’) and difficulty (‘b’) parameters from the TDE’s reading subtest

Translation to EnglishWord in PortugueseEstimate ‘a’SEEstimate ‘b’SE
NowAgora9.5993.474−0.9660.303
WindowJanela9.1745.142−1.1320.299
BornNascimento6.2741.301−0.9230.303
WordPalavra5.7920.685−0.9640.294
DuckPato5.7061.903−1.3060.325
My (female form)Minha5.1861.027−0.9640.305
TapeFita5.0431.432−1.1370.304
Dear (male form)Querido4.9271.171−0.9170.306
CoinMoeda4.8890.92−1.0090.286
BrickTijolo4.7480.934−1.0770.295
BaitIsca4.2710.836−1.0190.281
ProjectProjeto3.9830.615−0.8860.296
SizeTamanho3.9650.845−0.8570.295
FieldCampo3.9220.811−0.8850.286
GlobeGlobo3.8010.756−0.7960.3
TruckCaminão3.7250.567−0.9330.289
SuccessSucesso3.6080.566−0.8480.303
ArtArte3.6020.735−0.9540.274
ShoeSapato3.5890.925−1.1640.302
NeedleAgulha3.5240.568−0.8410.296
StormTempestade3.3990.711−0.8230.282
BrushwoodMato3.3780.818−1.2290.312
ArmorArmadura3.3480.64−0.8540.278
CloverTrevo3.2830.628−0.8240.276
TrayBandeja3.2820.801−0.8490.286
FloristFloresta3.1730.655−0.7940.291
Brute, inhuman personBruto3.1120.483−0.8170.287
CandleVela3.1030.591−1.1420.303
CashewCaju2.9540.601−1.0940.303
BoneOsso2.9300.6−1.0930.29
RapidityRapidez2.9130.576−0.7820.287
Cigarette lighterIsqueiro2.8900.615−0.7760.287
WolfLobo2.8610.541−1.0800.312
HoneyMel2.8130.395−0.7860.286
GuitarGuitarra2.7910.482−0.5610.275
DiligentAplicado2.7070.468−0.7350.268
LawyerAdvogado2.6460.421−0.610.294
SackSaco2.5950.565−1.1460.301
SournessAzedo2.5850.392−0.8740.269
FearMedo2.5840.479−1.1210.294
GarageGaragem2.5700.546−0.7450.288
1. To abuse, 2. To go beyond limits or measureAbusar2.3900.462−0.6210.264
KioskQuiosque2.3850.452−0.6650.277
ShakerChocalho2.3490.474−0.6740.273
ExplanationExplicação2.2210.44−0.6040.276
SheetLençóis2.1420.368−0.5570.252
LuxuriousLuxuoso2.1190.323−0.6020.266
AeronauticsAeronáutica2.0760.316−0.5490.256
AtmosphereAtmosfera2.0740.32−0.3540.281
ClawGarra2.0550.285−0.7260.261
AtlasAtlas1.9790.319−0.430.264
Dripped (past tense of ‘to drip’)Pingado1.9180.345−0.5930.252
Lodging houseHospedaria1.9050.348−0.5180.248
Scotch tapeDurex1.8890.29−0.1870.257
ExhaustedExausto1.8770.284−0.2910.259
Curdled milkCoalhada1.8720.346−0.4920.242
Fatty part of milk, butterfatNata1.7860.362−1.0290.285
RepugnantRepugnante1.7750.269−0.0920.26
BacksideCostas1.7720.277−0.6180.242
BroughtTrouxe1.7580.275−0.3250.25
To get upAcordar1.7350.321−0.660.226
DowncastAcabrunhado1.6240.318−0.3850.234
PerseverancePerseverança1.6190.263−0.2650.225
ExceptionalExcepcional1.5370.2310.0620.227
Rescinded (past tense of ‘to rescind’)Rescindido1.4920.269−0.3930.234
To ricochetRicochetear1.4870.213−0.1230.2
HallSaguões1.4830.227−0.1160.243
MarsupialsMarsupiais1.4170.247−0.3860.229
HypocriteHipócrita1.4090.2120.0420.217
To piqueVangloriar1.3390.1830.0620.185

Note: Only values in bold were used to conduct LCA.

Abbreviations: LCA, latent class analysis; SE, standard error; TDE, Academic Performance Test.

Table 2

Discrimination (‘a’) and difficulty (‘b’) parameters from the TDE’s writing subtest

Translation to EnglishWord in PortugueseEstimate ‘a’SEEstimate ‘b’SE
To seeVer3.7201.080−0.8780.266
MoreMais2.5610.600−0.9250.313
OnlyApenas2.5200.578−0.8950.279
BurrowToca2.5200.292−1.2260.385
Hammer blowMartelada2.4150.379−0.6580.237
KindnessFavor2.0530.35−0.5760.247
Substantive of breakingQuebramento2.0260.273−0.3790.255
CollectivityColetividade1.9530.308−0.5770.267
UnknownDesconhecido1.8620.258−0.1070.289
BalanceBalance1.8440.262−0.0840.267
EffectiveEfetivo1.6570.267−0.6720.269
FortificationFortificação1.6340.2130.0490.258
LugubriousSoturno1.5840.175−0.2640.24
To crystallizeCristalizar1.5360.2010.230.276
BallBaile1.5130.219−0.5020.256
PrestigiousPrestigioso1.4680.230.2210.265
TapBica1.4550.304−1.0720.332
To digestDigerir1.4510.197−0.1160.251
DiscriminativeDiscriminativo1.4450.2190.1890.245
BoisterousRevolta1.4030.2130.1000.248
To commercializeComercializar1.3650.1590.7690.25
To bring before the courtAjuizar1.3330.1590.410.245
LazinessPreguiça1.3130.1430.8230.221
To take the lid offDestampar1.3050.1610.4190.228
IndustrializationIndustrialização1.2260.1470.8670.264
CompositionComposição1.1940.1720.4630.253
Shivering sensationCalafrio1.1680.1740.1460.254
LegitimacyLegitimidade1.1220.1510.2820.22
HelmetElmo1.1200.160.3450.225
To be comfortedConsolado1.0920.170.0790.276
ImpetuosityImpetuosidade1.0190.0951.2950.23
ManlyVaronil1.0110.1470.2080.236
SimilaritySimilaridade1.0100.1520.550.211
QuickRápida0.5240.0630.9230.208

Note: Only values in bold were used to conduct LCA.

Abbreviations: LCA, latent class analysis; SE, standard error; TDE, Academic Performance Test.

Regarding the reading subtest, 61 of 70 items showed very high discrimination (‘a’>1.7; the range of discrimination among all items was from 9.599 to 1.339); in the writing subset, 10 of 34 items showed very high discrimination (the range of discrimination was from 3.720 to 0.524). Therefore, for the next step, LCA, we considered 71 items (ie, 61+10) from the reading and writing domains.

LCA

Table 3 shows the fit indices for each model solution (with two, three, four and five-latent class solutions). The two-class solution has acceptable indices and grouped the children into two latent groups: children with poor decoding and writing skills (18.50%) and good decoding and writing skills (81.50%); however under the two-class solution the AIC, BIC, and ssaBIC are highest (among the other solutions). This sort of categorization (ie, dichotomization) might be helpful when it is important to have highly contrasting groups. However, the best statistical solution (AIC = 76,422.476, BIC = 77,560.866, ssaBIC = 76,934.954, entropy = 0.982) was achieved with the three-class solution, in which each class was labeled as follows: children with good decoding and writing skills (GDW; 63.70% of population), not-so-good decoding and writing skills (NsgDW; 22%), and poor decoding and writing skills (PDW; 14.30%), indicating that there is an intermediate class.
Table 3

LCA fit indices fit for the two-, three-, and four-class solutions

N classesFree parametersH0AICBICssaBICEntropyLMR ratioP-valueBootstrapped likelihood ratio test
2140−46537.60493355.20994157.67993712.8740.998***
3197−38014.23876422.47677560.86676934.9540.982−44414.643<0.0001<0.0001
4263−36148.71772823.43374343.21373507.6050.968−38014.2380.3323<0.0001
5329−35051.65970761.31872622.48871617.1830.947−36148.7170.1092<0.0001

Note: *No value.

Abbreviations: AIC, Akaike Information Criterion; BIC, Bayesian Information Criterion; ssaBIC, sample size adjusted; LMR, Lo–Mendell–Rubin likelihood ratio; LCA, latent class analysis; H0, null hypothesis.

As might be noted from Table 4, the majority of children (403) who were classified as having NsgDW skills in the three-classes solution were categorized as children with GDW skills in the two-class solution. The four-class solution does not have good interpretability either in terms of reading and writing skills or statistics.
Table 4

Cross-tabs of two- and three-class solutions

Two-class solutionThree-class solution
Total
GDW skilledNsgDW skilledPDW skilled
GDW skilled1,23834801,586
PDW skilled080279359
Total1,2384282791,945

Abbreviations: GDW, good decoding and writing skills; NsgDW, not-so-good decoding and writing skills; PDW, poor decoding and writing skills.

Discriminant and concurrent validation

According to Table 5, the three-class solution was not found to predict a major depression diagnostic (ie, having NsgDW skills [odds ratio {OR}: 1.373, P = 0.315] or PDW skills [OR: 1.534, P = 0.424] in relation to GDW skills), indicating a discriminant validation; in other words, our latent groups were not correlated with depression diagnosis.
Table 5

Discriminating validity of three-class solution through logistic regression

OutcomesTested predictorEstimate (odds ratio)Robust SEP-value95% confidence interval
Major depression diagnostic (binary outcome)NsgDW skills group1.4470.4720.2580.7622.744
PDW skills group1.7210.8030.2440.6894.299
Age1.2960.10580.0011.1041.521
Female1.1330.3320.6700.6372.015
SES0.6520.1650.0920.3971.072
IQ1.0090.0080.2610.9921.026
Problems on hearing1.3960.2820.0980.9392.075
Combined ADHD diagnostic (binary outcome)NsgDW skills group2.0560.6080.0151.1513.673
PDW skills group2.2580.8340.0271.8374.659
Age0.9660.7680.6650.8261.129
Female0.5360.1490.0260.3100.927
SES0.8000.1780.3190.5171.239
IQ0.9830.6080.0160.9700.996

Abbreviations: ADHD, attention deficit hyperactivity disorder; NsgDW, not-so-good decoding and writing skills; PDW, poor decoding and writing skills; SES, socioeconomic status; SE, standard error; IQ, intelligence quotient.

Regarding the concurrent validity, children with NsgDW and PDW skills were shown to have approximately twice the probability of combined ADHD in relation to children with GDW skills (OR: 2.056, P = 0.015 and OR: 2.258, P < 0.027, respectively). In the same way, multinomial logistic regressions showed in Table 6, where the estimated IQs were tested as predictors of the three-class solution’s outcomes, considering both Brazilian norms and the Tellegen and Briggs method, returned the result that NsgDW and PDW skills were predictors of estimated IQ, indicating concurrent validity.
Table 6

Concurrent validity of estimated IQ (Brazilian normalization) and Tellegen and Briggs methods through multinominal regression

OutcomesTested predictorEstimateRobust SEP-valueConfidence interval 95%
NsgDW skills groupEstimated IQ/Brazilian normalization−0.3310.042<0.001−0.414−0.248
Age−0.5540.047<0.001−0.646−0.462
Female−0.2930.0950.001−0.472−0.114
SES−0.0740.0950.434−0.2600.111
Problems on hearing−0.2330.1310.076−0.4910.024
PDW skills groupEstimated IQ/Brazilian normalization−0.5760.069<0.001−0.712−0.440
Age−1.3040.840<0.001−1.469−1.140
Female−0.5310.2050.010−0.933−0.128
SES−0.3830.1750.028−0.726−0.040
Problems on hearing−0.0330.1710.847−0.3690.303
NsgDW skills groupEstimated IQ (Tellegen and Briggs11)−0.0300.003<0.001−0.377−0.022
Age−0.5460.046<0.001−0.636−0.455
Female−0.2860.0910.002−0.464−0.455
SES−0.0800.0940.392−0.2660.104
Problems on hearing−0.2210.1230.086−0.4730.031
PDW skills groupEstimated IQ (Tellegen and Briggs11)−0.0490.006<0.001−0.062−0.371
Age−0.5760.069<0.001−1.461−1.139
Female−0.5130.2050.013−0.917−0.110
SES−0.3890.1770.028−0.737−0.041
Problems on hearing−0.0170.1700.921−0.3510.317

Note: The presented estimations for each latent class (NsgDW and PDW) were the results from each one compared to the reference value (GDW), which was omitted due the statistical comparison.

Abbreviations: ADHD, attention deficit hyperactivity disorder; GDW, good decoding and writing skills; NsgDW, not-so-good decoding and writing skills; PDW, poor decoding and writing skills; SES, socioeconomic status; SE, standard error; IQ, intelligence quotient.

Discussion

We were able to demonstrate that a three-class solution encompassing children with good, not-so-good, and poor decoding (ie, word-level reading) and writing performance presented the best fit for our data. We also showed that this three-class latent solution is valid and reliable because it is clearly associated with ADHD and IQ and it is not associated with emotional disorders such as depression. It is important to stress, regarding IRT results, due to wide age range, the interpretation of IRT’s parameters (ie, ‘a’ and ‘b’) will give values of discrimination and difficulty centered on the mean age (9.52 years old, SD = 1.85). It is expected that the items (for reading and writing subtests) become easier throughout the children’s development process and therefore the values for ‘b’ will decrease significantly; this effect is called the schooling effect, and it is expected due to the children’s increasing literacy. Discrimination concerning items also can change throughout a student’s school years; however, it is more stable regarding the schooling effect than difficulty parameter (ie, ‘b’), the higher it is (‘a’), the higher the information that ‘a’ brings to the latent trait (ie, θ); therefore, a very high ‘a’ was chosen for the cut-off parameter. The majority of the writing subtest was not used in the LCA, because they scored below the Baker’s cut-off parameter for a very high discriminative item (according to Baker, a high discriminative score is 1.35–1.69, moderate is 0.65–1.34, low is 0.35–0.64, and very low is 0.01–0.34).17 However, we cannot say that they did not contribute to measuring the underlying construct of writing ability, because all writing and reading items showed a factor loading statistically significant due to the fact that t-ratios for each item were ≥1.96 (ie, indicating that the factor loading had a P-value < 0.05. This can be obtained by dividing the ‘a’ estimate by its SE. For example, taking the last three discrimination estimates from the writing subtest, we have the following t-ratios: for ‘varonil’ [discrimination estimate/SE = 1.011/0.147 = 6.877], ‘similaridade’ [discrimination estimate/SE = 1.010/0.152 = 6.64], and ‘e rápida’ [discrimination estimate/SE = 0.524/0.063 = 8.317]). The bootstrapped likelihood ratio test did not help in deciding on the number of latent classes, because its P-values were <0.0001 for any reasonable number of latent classes. One explanation may be that the 71 items were not designed to capture latent classes; as described, they were selected based on the discrimination parameter supplied by IRT. In contrast, the three-class solution model may be a good enough approximate model and does give a reasonable interpretation of reading and writing skills. However, because the P-value derived from the Lo–Mendell–Rubin test is higher than 0.05 (P = 0.323), the three-class model was not rejected when compared to the four-class solution, corroborating the previous findings from other fit indices. The class counts based on the most likely posterior class for the four-class solution were, in terms of percentage: 58.62% (related to GDW skilled children), 22.74% and 7.65% (related to two splits of the intermediate category [ie, NsgDW skilled children], which was observed between GDW and PDW skilled children), and 10.98% related to PDW. As can be seen, the two intermediate latent groups within the four-class solution, in terms of reading and writing abilities, do not have a fair interpretation. Regarding statistics, the sample proportion (and as a consequence, the size) of each class within the four-class solution is too small, which also does not help much in the interpretability of classes. The reduction in the BIC and ssaBIC was about 3,000. However, taking into account the reduction from the two- to the three-class solution, the decline was around 16,000 for AIC, BIC, and ssaBIC; the progression in the reduction for the five-class solution is still around 2,000 (ie, from the four-class solution to the five-class solution, the difference is: 2,062 for AIC, 1,721 for BIC, and 1,890 for ssaBIC). The four-class solution does not have good interpretability either in terms of reading and writing skills or statistics due to the low proportion within one of the intermediate latent classes (7.56%). The three-class solution is preferable due to previous evidence and due to the nature of TDE’s items (basically centered on the decoding skills and, therefore, any other reading/writing domains such as comprehension). Within the four-class solution, two intermediate latent classes emerged as not having any atypical profile (such as high reading but low writing skills or vice versa); we opted, therefore, for the more parsimonious structure (GDW, NsgDW, and PDW skilled children). Indeed, regarding the number of underlying latent classes exclusively related to reading skills, a recent study has identified a three-class solution using 27 items (17 items pertaining to children’s reading aloud and ten to silent reading) from the Scale of Evaluation of Reading Competence of Students by the Teacher (EACOL). The EACOL was used to evaluate reading aloud and silent reading among a heterogenic sample of elementary school children (N = 335 children, with an average age of 9.75 years [SD = 1.2]); the classes were called good reader, not-so-good reader, and poor reader.24 No relation was observed between the three-class solution and a DAWBA diagnostic of major depression. Because the underlying constructs of the three-class solution were based on items related to decoding and writing skills, no correlation was expected to be found between latent classes and major depression. Our finding is in accordance with a study that found that reading difficulty did not directly influence the level of self-evaluation or depression.25 It is important to stress that a meta-analysis quantified the mean differences in depression measure scores and levels of clinical depression between students with and without learning disabilities, and it concluded that there is a lack of data-based studies on depression among students with learning disabilities.26 However, it must be emphasized that in this study, it was not possible to measure learning disorders/disabilities, due to the nature underlying the 71 items that formed the latent classes. Regarding the concurrent validity of IQ as a predictor of the three-class solutions’ outcomes (the children with GDW skills as the reference group in a multinomial regression model), children with NsgDW and PDW skills were predicted by both estimated IQ measures. The lower the class (ie, PDW), the lower the estimated IQ score, with GDW-skilled children constituting the reference group. In other words, IQ was shown to be a predictor of latent classes. A possible explanation is that, because estimated IQ is calculated using vocabulary and cubes subtests, some studies have pointed out that semantic knowledge is important for recognizing individual words.27,28 Moreover, the results are in accordance with a study that found that correlations between reading achievement tests and Weschler intelligence tests were substantially higher when the intelligence tests evaluated verbal abilities than when they evaluated non-verbal abilities. The magnitude of correlation between verbal IQ and word identification was around 0.62 (which is considered moderate) among older readers29 (older readers, in this paper, were understood to be in the 6th and 7th grades). It should be mentioned that this finding suggests that correlations between reading achievement and tests of intelligence may be an artifact of shared variance to which language-based abilities underlying performance on both sets of measures contribute.30 A recent publication highlighted the major concerns and controversies surrounding the Diagnostic and Statistical Manual of Mental Disorders IV for learning disorders; a concern with the IQ-achievement discrepancy criterion, for specific learning disorders, stated that the estimation of discrepancy is statistically flawed.31 Based on convergent evidence from the literature review, the above cited work group recommended that IQ-achievement discrepancy criterions used in DSM-IV should be eliminated. Corroborating the findings about the concurrent validity of the three-class solution and combined ADHD diagnostic, the authors have pointed out the frequent co-occurrence of ADHD and reading disability: 25%–40% of individuals with one disorder also meet the diagnostic criteria for the other.32–34 Considering reading disability and ADHD symptoms (inattentiveness and hyperactivity-impulsivity) as contiguous traits in population samples, other evidence about the correlation between the two might be pointed out.35–37 In the same way, a research study using a larger sample of 15-year-old children from public schools examined psychiatric morbidity and functional impairment of adolescents with and without poor reading skills during mid to late adolescence. Among children with poor reading skills, it found higher rates of concurrent attention deficit/hyperactivity, affective, and anxiety disorders, particularly social phobia and generalized anxiety disorder.38 Regarding the covariate age used on models where estimated IQs (the three-class solution was used as an outcome to test the estimated IQs’ prediction) were considered as an outcome, it is important to point out that a schooling effect was observed in which the higher the education level, the better the achievement in the accuracy of words.24 This finding strengthens the findings about the three-class solution’s concurrent validity. Also, in this same modelings’ contexts (ie, the three-class solution as an outcome), SES appeared to be a predictor of the lower classes (PDW skills) in comparison to the higher level class (GDW skills). This is consistent with previous studies where various developmental antecedents (eg, social deprivation, socioeconomic status, family size, maternal reading, a stimulating home environment, maternal depression, and child negligence) have been shown to have a small but significant relation to reading achievement.8,39 One strength to be addressed in this study is the possibility of associating IRT and LCA in order to provide, based on very highly discriminative items (‘a’>1.7), a categorical solution for a construct, which might be thought of as offering a person-centered perspective. One limitation, which is inherent to this type of analysis, is that this LCA is restricted to the given sample (ie, 1,945 subjects and their achievements and subgroups). However, it is important to stress that the three-class solution for decoding and writing skills was shown to be reliable and valid. The selection of the highly discriminative items via IRT involving differential item functioning between genders or among different school regions (ie, São Paulo/Porto Alegre) was not considered for discussion here. In other words, we did not test to see if examinees with equal ability but who are from different, unequal groups have an unequal probability of item success, because this paradigm is directly related to a dimensional view of the reading/writing construct. Therefore, it will be important to test differential item functioning in future research exploring the possibility of a dimensional construct underlying the TDE; this could be explored using the factor analysis with covariates or the multiple-group approach.

Conclusion

The three-class solution was shown to have discriminant and concurrent validity, enabling a more accurate grouping of students according to good, not-so-good, or poor performance and making this set of items a reliable indicator of deficits. The latent classes can assist researchers who require reliable and validated categories for diagnosis or identification of other behavior and performance patterns.
  18 in total

1.  Psychiatric comorbidity in children and adolescents with reading disability.

Authors:  E G Willcutt; B F Pennington
Journal:  J Child Psychol Psychiatry       Date:  2000-11       Impact factor: 8.982

2.  Insights into latent class analysis of diagnostic test performance.

Authors:  Margaret Sullivan Pepe; Holly Janes
Journal:  Biostatistics       Date:  2006-11-03       Impact factor: 5.899

Review 3.  Comorbidity between ADDH and learning disability: a review and report in a clinically referred sample.

Authors:  M Semrud-Clikeman; J Biederman; S Sprich-Buckminster; B K Lehman; S V Faraone; D Norman
Journal:  J Am Acad Child Adolesc Psychiatry       Date:  1992-05       Impact factor: 8.829

4.  Reading problems, psychiatric disorders, and functional impairment from mid- to late adolescence.

Authors:  David B Goldston; Adam Walsh; Elizabeth Mayfield Arnold; Beth Reboussin; Stephanie Sergent Daniel; Alaattin Erkanli; Dennis Nutter; Enith Hickman; Guy Palmes; Erica Snider; Frank B Wood
Journal:  J Am Acad Child Adolesc Psychiatry       Date:  2007-01       Impact factor: 8.829

5.  Old wine in new skins: grouping Wechsler subtests into new scales.

Authors:  A Tellegen; P F Briggs
Journal:  J Consult Psychol       Date:  1967-10

6.  [Reading ability of junior high school students in relation to self-evaluation and depression].

Authors:  Toshiya Yamashita; Takashi Hayashi
Journal:  No To Hattatsu       Date:  2012-01

7.  Depression among students with learning disabilities: assessing the risk.

Authors:  John W Maag; Robert Reid
Journal:  J Learn Disabil       Date:  2006 Jan-Feb

8.  Understanding comorbidity: a twin study of reading disability and attention-deficit/hyperactivity disorder.

Authors:  Erik G Willcutt; Bruce F Pennington; Richard K Olson; John C DeFries
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2007-09-05       Impact factor: 3.568

9.  Revisiting the association between reading achievement and antisocial behavior: new evidence of an environmental explanation from a twin study.

Authors:  Kali H Trzesniewski; Terrie E Moffitt; Avshalom Caspi; Alan Taylor; Barbara Maughan
Journal:  Child Dev       Date:  2006 Jan-Feb

10.  EACOL (Scale of Evaluation of Reading Competence by the Teacher): evidence of concurrent and discriminant validity.

Authors:  Hugo Cogo-Moreira; George B Ploubidis; Clara Regina Brandão de Ávila; Jair de Jesus Mari; Angela Maria Vieira Pinheiro
Journal:  Neuropsychiatr Dis Treat       Date:  2012-10-11       Impact factor: 2.570

View more
  2 in total

1.  Academic and Social Functioning Associated with Attention-Deficit/Hyperactivity Disorder: Latent Class Analyses of Trajectories from Kindergarten to Fifth Grade.

Authors:  George J DuPaul; Paul L Morgan; George Farkas; Marianne M Hillemeier; Steve Maczuga
Journal:  J Abnorm Child Psychol       Date:  2016-10

2.  Positive Attributes Buffer the Negative Associations Between Low Intelligence and High Psychopathology With Educational Outcomes.

Authors:  Mauricio Scopel Hoffmann; Ellen Leibenluft; Argyris Stringaris; Paola Paganella Laporte; Pedro Mario Pan; Ary Gadelha; Gisele Gus Manfro; Eurípedes Constantino Miguel; Luis Augusto Rohde; Giovanni Abrahão Salum
Journal:  J Am Acad Child Adolesc Psychiatry       Date:  2015-11-10       Impact factor: 8.829

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.