| Literature DB >> 26805777 |
Abstract
The use of the term 'big data' has grown substantially over the past several decades and is now widespread. In this review, I ask what makes data 'big' and what implications the size, density, or complexity of datasets have for the science of human development. A survey of existing datasets illustrates how existing large, complex, multilevel, and multimeasure data can reveal the complexities of developmental processes. At the same time, significant technical, policy, ethics, transparency, cultural, and conceptual issues associated with the use of big data must be addressed. Most big developmental science data are currently hard to find and cumbersome to access, the field lacks a culture of data sharing, and there is no consensus about who owns or should control research data. But, these barriers are dissolving. Developmental researchers are finding new ways to collect, manage, store, share, and enable others to reuse data. This promises a future in which big data can lead to deeper insights about some of the most profound questions in behavioral science.Entities:
Mesh:
Year: 2016 PMID: 26805777 PMCID: PMC5021153 DOI: 10.1002/wcs.1379
Source DB: PubMed Journal: Wiley Interdiscip Rev Cogn Sci ISSN: 1939-5078
Illustrative Big Datasets Hosted by Governmental Entities
| Dataset | Sample | URL |
|---|---|---|
| Current Population Survey (CPS) |
| |
| Danish National Birth Cohort |
|
|
| Early Childhood Longitudinal Study (ECLS) | Birth cohort (ECLS‐B): |
|
| Japan Environment and Children's Study (JECS) |
|
|
| National Health and Nutrition Examination Survey (NHANES) | nationally representative sample of |
|
| National Longitudinal Survey of Youth (NLSY) |
|
|
| National Youth Fitness Survey (NYFS) |
|
|
| NICHD Study of Early Child Care and Youth Development (SECCYD) |
|
|
| NICHD Data and Specimen Hub (DASH) | Deidentified data from NICHD funded research studies |
|
| NIH MRI Study of Normal Brain Development |
|
|
| NIMH National Database for Autism Research (NDAR) |
|
|
| Norwegian Mother and Child Cohort Study (MoBa) |
|
|
| WHO Multicentre Growth Reference Study (MGRS) |
|
|
Illustrative Big Developmental Datasets Hosted by Nongovernmental Entities
| Dataset | Sample | URL |
|---|---|---|
| National Longitudinal Study of Adolescent Health (AddHealth) | Wave I: |
|
| Child Language Data Exchange System (CHILDES) |
|
|
| Colorado Adoption Project (CAP) |
|
|
| Databrary | >3500 h of video from |
|
| Developing Human Connectome Project | 20–44 weeks postconception |
|
| Family Life Project (FLP) |
|
|
| Genome of the Netherlands Project (GoNL) |
|
|
| Human Speechome Project | 10 hr video/day from 1 child from 0–3 years |
|
| Maternal Lifestyle Study (MLS) |
|
|
| Measures of Effective Teaching Project (MET) |
|
|
| MetroBaby |
|
|
| Panel Study of Income Dynamics (PSID) |
|
|
| Pediatric Imaging, Neurocognition, and Genetics (PING) |
|
|
| Progress in International Reading Literacy (PIRLS) |
| |
| Psychiatric Genomics Consortium (PGC) |
|
|
| Trends in International Math and Science Study (TIMSS) |
|
|
| Trends in International Math and Science Study Video (TIMSSVideo) | teachers recorded in |
|
| Twins Early Development Study (TEDS) | survey data from |
|
| Twin and Offspring Study in Sweden (TOSS) | target of |
|
Illustrative Measure‐Based Datasets
| Measure | Comments |
|---|---|
| Bayley Scales of Infant and Toddler Development | Data underlying published norms not available, |
| Computer‐based cognitive tutor data | LearnLab ( |
| Infant Behavior Questionnaire (IBQ) | Information |
| Laboratory Temperament Assessment Battery (Lab‐TAB) | Information |
| MacArthur Communicative Inventory (CDI) |
|
| Spatial learning | NSF‐funded Spatial Intelligence and Learning Center (SILC; |
| Teller Acuity Cards | Publications citing norms for 0‐ to 4‐year olds have been published |
| Wechsler Intelligence Scale for Children‐III (WISC‐III) | Designed for children ages 6–16. Data underlying norms not available. Published by Pearson ( |
| Wechsler Preschool and Primary Scale of Intelligence‐R (WPPSI‐R) | Designed for children aged 4–6.5 years. Data underling norms not available. |