| Literature DB >> 25071998 |
Jyrki Launes1, Laura Hokkanen1, Marja Laasonen2, Annamari Tuulio-Henriksson3, Maarit Virta1, Jari Lipsanen1, Pentti J Tienari4, Katarina Michelsson5.
Abstract
Background. Attrition is a major cause of potential bias in longitudinal studies and clinical trials. Attrition rate above 20% raises concern of the reliability of the results. Few studies have looked at the factors behind attrition in follow-ups spanning decades. Methods. We analyzed attrition and associated factors of a 30-year follow-up cohort of subjects who were born with perinatal risks for neurodevelopmental disorders. Attrition rates were calculated at different stages of follow-up and differences between responders and non-responders were tested. To find combinations of variables influencing attrition and investigate their relative importance at birth, 5, 9, 16 and 30 years of follow-up we used the random forest classification. Results. Initial loss of potential participants was 13%. Attrition was 16% at five, 24% at nine, 35% at 16 and 46% at 30 years. The only group difference that emerged between responders and non-responders was in socioeconomic status (SES). The variables identified by random forest classification analysis were classified into Birth related, Development related and SES related. Variables from all these categories contributed to attrition, but SES related variables were less important than birth and development associated variables. Classification accuracy ranged between 0.74 and 0.96 depending on age. Discussion. Lower SES is linked to attrition in many studies. Our results point to the importance of the growth and development related factors in a longitudinal study. Parents' decisions to participate depend on the characteristics of the child. The same association was also seen when the child, now grown up, decided to participate at 30 years. In addition, birth related medical variables are associated with the attrition still at the age of 30. Our results using a data mining approach suggest that attrition in longitudinal studies is influenced by complex interactions of a multitude of variables, which are not necessarily evident using other multivariate techniques.Entities:
Keywords: Attrition; Birth cohort; Cohort studies; Follow-up; LTFU; Longitudinal studies; Perinatal risks; Prospective
Year: 2014 PMID: 25071998 PMCID: PMC4103077 DOI: 10.7717/peerj.480
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Area of residency.
The area of mothers’ place of residency. Dark (red): Helsinki, 60% of mothers (mean distance to hospital 8 km). Magenta: Espoo and Vantaa, 22% of mothers (mean distance 15 km). Pink: 18% of the mothers in the surrounding municipalities (average distance 30 km). (Adapted from the work of BishkekRocks under CC BY-SA 3.0, http://commons.wikimedia.org/wiki/File:Finnish_municipalities_2007.png.)
Figure 2Inclusion.
Inclusion of subjects in the 30-year longitudinal follow-up study of children at risk for neurodevelopmental disorders.
Figure 3Participation.
Participation of subjects in the 30-year longitudinal follow-up study of children in risk of neurodevelopmental disorders. Cases excluded due to death or severe handicap are not shown. Total number of potential participants was 994 at 5 and 9 years, and 864 at 16 and 30 years. Attrition thus was 16% at five, 24% at nine, 35% at 16 and 46% at 30 years. Numbers on the left column indicate participation at different age levels.
Figure 4Retention rate.
The retention rate of the follow-up of a cohort of 1,196 neonates with birth risks. The 202 children who died or were severely handicapped were excluded, thus the actual number of included children was 994 (dotted line). * Only the children who had participated at 5 or 9 years were invited at 16 years and 30 years.
Classification accuracy and five of the most important variables at each age level selected for classification by the random forest classification model.
Accuracy (overall fraction correct) calculated from 2∗2 contingency table (a + d/t).
| At birth | Age 5 | Age 9 | Age 16 | Age 30 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Birth related | Birth weight | 1.00 | Mother’s age | 1.00 | Gestational weeks | 0.52 | Mother’s age | 0.56 | Mother’s age | 0.87 |
| Mother’s age | 0.78 | Birth weight | 0.69 | Birth weight | 0.48 | Birth weight | 0.37 | Pregnancy complication | 0.58 | |
| Resuscitation | 0.77 | Hyperbilirubinemia | 0.58 | Mother’s age | 0.44 | Pregnancy | 0.31 | Birth weight | 0.41 | |
| Perinatal treatments | 0.75 | Pregnancy | 0.56 | Hyperbilirubinemia | 0.40 | Apgar 1 min | 0.20 | X-ray in pregnancy | 0.26 | |
| Gestational weeks | 0.75 | Gestational weeks | 0.55 | Respiratory | 0.38 | Apgar 5 min | 0.16 | Apgar 1 min | 0.22 | |
| SES related | Distance | 0.55 | Distance | 0.41 | Housing (5) | 0.85 | Childs’ security (9) | 0.44 | Parents’ view (16) | 0.79 |
| Father’s social class | 0.54 | Mother working | 0.39 | Parity (5) | 0.80 | Housing (5) | 0.33 | Childs’ security (9) | 0.66 | |
| Mother working | 0.44 | Father’s social class | 0.36 | Domestic dispute (5) | 0.53 | Special education (9) | 0.30 | Special education (9) | 0.37 | |
| Smoking in pregnancy | 0.35 | Smoking in pregnancy | 0.31 | No of occupants (5) | 0.45 | Father’s social class | 0.19 | Current activity (16) | 0.35 | |
| Marital status | 0.29 | Marital status | 0.23 | Father’s social class | 0.45 | Mother working | 0.14 | Subjective gain (9) | 0.30 | |
| Neurodevelopmental related | Dubowitz test (5) | 1.00 | NDS (5) | 1.00 | NDS (5) | 1.00 | ||||
| DAP (5) | 0.90 | DAP (9) | 0.38 | Dubowitz test (5) | 0.54 | |||||
| NDS coordination (5) | 0.72 | WISC (9) | 0.37 | ITPA (5) | 0.53 | |||||
| ITPA (5) | 0.72 | TOMI (9) | 0.33 | ITPA (9) | 0.48 | |||||
| NDS (5) | 0.71 | ITPA (9) | 0.32 | WISC VIQ (9) | 0.48 | |||||
| Classification accuracy | 0.81 | 0.74 | 0.90 | 0.76 | 0.96 | |||||
| (95% confidence limits) | 0.78–0.82 | 0.71–0.76 | 0.87–0.91 | 0.72–0.79 | 0.94–0.97 | |||||
Notes.
Numbers in parenthesis refer to the age in which the variable is measured.
Body mass index
Draw a Person test
Illinois test of psycholinguistic ability
Test of motor impairment
Neurodevelopmental screen
Wechsler intelligence test for children
Verbal intelligence quotient
Performance intelligence quotient
Figure 5Variable importance in random forest model.
Average importance of variables which predict participation/non-participation as calculated by the random forest classification model. The variables were classified into three categories which are (1) variables relating to child, delivery and pregnancy (Birth), (2) variables relating to neurodevelopment and behavior (Development), and (3) variables reflecting socioeconomic status (SES). Columns represent each age level of the longitudinal follow-up and are scaled to 100%.