| Literature DB >> 26963508 |
Abstract
The extent of research on children's speech in general and on disordered speech specifically is very limited. In this article, we describe the process of creating databases of children's speech and the possibilities for using such databases, which have been created by the LANNA research group in the Faculty of Electrical Engineering at Czech Technical University in Prague. These databases have been principally compiled for medical research but also for use in other areas, such as linguistics. Two databases were recorded: one for healthy children's speech (recorded in kindergarten and in the first level of elementary school) and the other for pathological speech of children with a Specific Language Impairment (recorded at a surgery of speech and language therapists and at the hospital). Both databases were sub-divided according to specific demands of medical research. Their utilization can be exoteric, specifically for linguistic research and pedagogical use as well as for studies of speech-signal processing.Entities:
Mesh:
Year: 2016 PMID: 26963508 PMCID: PMC4786280 DOI: 10.1371/journal.pone.0150365
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Speech database–structure and types of utterances used in our research.
| Task code | Type of part | # Patterns | Descripton | |
|---|---|---|---|---|
| Vowels | 5 | Czech | "a", "o", "u", "e", "i" | |
| English | "a", "o", "u", "e", "i" | |||
| Consonants | 10 | Czech | "m", "b", "t", "d", "r", "l", "k", "g", "h", "ch" | |
| English | "m", "b", "t", "d", "r", "l", "k", "g", "h", "ch" | |||
| Syllables | 9 | Czech | "pe", "la", "vla", "pro", "bě", "nos", "ber", "krk", "prst" | |
| English | "pe", "la", "vla", "for", "bě", "nose", "take", "neck", "finger" | |||
| Two-syllable words | 5 | Czech | "kolo", "pivo", "sokol", "papír", "trdlo" | |
| English | "wheel", "beer", "falcon", "paper", "boob" | |||
| Three-syllable words | 4 | Czech | "dědeček", "pohádka", "pokémon", "květina" | |
| English | "grandfather", "fairy tale", "Pokemon", "flower" | |||
| Four-syllable words | 3 | Czech | "motovidlo", "televize", "popelnice" | |
| English | "niddy noddy", "television", "dustbin" | |||
| Difficult words | 2 | Czech | "různobarevný", "mateřídouška" | |
| English | "varicoloured", "thyme" | |||
| Geminate words | 3 | Czech | "pohádková víla", "kouzelný měšec", "čarotvorný hrnec" | |
| English | "fairy", "magic pouch", "magic pot" | |||
| Accretion of range of words | 4 | Czech | "voda", "živá voda", "živá a mrtvá voda", "pramen s živou a mrtvou vodou" | |
| English | "water", "live water", "live and dead water", "source of live and dead water" | |||
| Sentence | 1 | Czech | "Když šla červená Karkulka k babičce, potkala zlého vlka." | |
| English | "When Little Red Riding Hood went to her grandmother, she met bad wolf." | |||
| Auditory differentiation | 10 | Czech | "pes—nes", "ten—den", "kůl—vůl", "hrát—brát", "ječí—ježí", "ble—ple", "kloč—kloč", "kvěš—kveš", "šný—šní", "vošl—vočl" | |
| English | Change in one phoneme in the word. For example: "pes—nes", … | |||
| Describe the picture | 1 | English | "Look at the laughable clown."—A spontaneous description of the girl's picture. | |
Description of All Databases.
Subgroup H-CH is for controls and subgroups SLI-CH I and SLI-CH II are for cases.
| H-CH | SLI-CH I | SLI-CH II | ||
|---|---|---|---|---|
| WITH DEFECT | ||||
| 45 | 16 | 13 | 26 | |
| 45 | 16 | 22 | 45 | |
| 25 | 17 | 33 | 46 | |
| 25 | 17 | 64 | 88 | |
| 70 | 33 | 46 | 72 | |
| 70 | 33 | 86 | 133 | |
| 4620 | 2178 | 5676 | 8819 | |
Fig 1Comparison to different vocalic triangles.
Blue is represents isolated vowels and orange represents vowels from the utterance “varicolored”. The color red represents vowels that are in the wrong place (graphs on the right side).
Fig 2Number of errors for all participants.
Blue represents cases, and orange represents controls. Samples with a higher number of errors are in a higher position.
The percentage rate of the correct classification of the methods used for distinguishing between two groups.
HM is for “hand-made” and ANN is for artificial neural networks.
| Method | Percentage success rate |
|---|---|
| 65.44% | |
| 85.14% | |
| 90.50% | |
| 92.36% |
The names and descriptions of 34 low-level descriptors used for our experiments from the openSMILE toolkit [47].
| Name | Number of coefficients | Description |
|---|---|---|
| 1 | Loudness as the normalized intensity raised to a power of 0.3. | |
| 15 | Mel-Frequency cepstral coefficients 0–14. | |
| 8 | Logarithmic power of Mel-frequency bands 0–7 (distributed over a range from 0 to 8 kHz) | |
| 8 | The 8 line spectral pair frequencies computed from 8 LPC coefficients. | |
| 1 | The envelope of the smoothed fundamental frequency contour. | |
| 1 | The voicing probability of the final fundamental frequency candidate. Unclipped means that it was not set to zero when it falls below the voicing threshold. |
The names and descriptions of 21 functionals used for our experiments from the openSMILE toolkit [47].
| Name | Description |
|---|---|
| The absolute position of the maximum value (in frames). | |
| The absolute position of the minimum value (in frames). | |
| The arithmetic mean. | |
| The slope (m) of a linear approximation of the contour. | |
| The offset (t) of a linear approximation of the contour. | |
| The linear error computed as the difference of the linear approximation and the actual contour. | |
| The quadratic error computed as the difference of the linear approximation and the actual contour. | |
| The standard deviation of the values in the contour. | |
| The skewness (3rd order moment). | |
| The kurtosis (4th order moment). | |
| The first quartile (25% percentile). | |
| The second quartile (50% percentile). | |
| The third quartile (75% percentile) | |
| The inter-quartile range: quartile2-quartile1. | |
| The inter-quartile range: quartile3-quartile2. | |
| The inter-quartile range: quartile3-quartile1. | |
| The outlier-robust minimum value of the contour, represented by the 1% percentile. | |
| The outlier-robust maximum value of the contour, represented by the 99% percentile. | |
| The outlier robust signal range 'max-min' represented by the range of the 1% and the 99% percentile. | |
| The percentage of time the signal is above (75% * range + min). | |
| The percentage of time the signal is above (90% * range + min). |
Fig 3Number of classifications in the group of cases.
Blue represents cases, and orange represents controls. Samples with a higher number of classifications are in a higher position.
Fig 4Number of classifications in the group of controls.
Blue represents cases, and orange represents controls. Samples with a higher number of classifications are in a higher position.
Fig 5Overview of the core functionality of the web tool for medical staff.
1a) Name of the participant; 1b) All participant information; 1c) This panel is for the description and information of all recordings.
Fig 6Common outputs of our analyses in the web tool.
This figure shows the results from all analyses (Formants, Tests of Utterance, Extracting Features and Artificial Neural Networks); 2a) The panel shows all graphs; 2b) This section contains all the calculated values in the tables.