Elizabeth Heller Murray1, Joanna Lewis2, Emily Zimmerman2. 1. Department of Communication Sciences and Disorders, Temple University, Philadelphia, Pennsylvania, United States of America. 2. Department of Communication Sciences and Disorders, Northeastern University, Boston, Massachusetts, United States of America.
Abstract
The variability of a child's voice onset time (VOT) decreases during development as they learn to coordinate upper vocal tract and laryngeal articulatory gestures. Yet, little is known about the relationship between VOT and other early motor tasks. The aims of this study were to evaluate the relationship between infant vocalization and another early oromotor task, non-nutritive suck (NNS). Twenty-five full-term infants (11 male, 14 female) completed this study. NNS was measured with a customized pacifier at 3 months to evaluate this early reflex. Measures of mean VOT and variability of VOT (measured via coefficient of variation) were collected from 12-month-old infants using a Language Environmental Analysis device. Variability of VOTs at 12 months was significantly related to NNS measures at 3-months. Increased VOT variability was primarily driven by increased NNS intraburst frequency and increased NNS burst duration. There were no relationships between average VOT or range of VOT and NNS measures. Findings from this pilot study indicate a relationship between NNS measures of intraburst frequency and burst duration and VOT variability. Infants with increased NNS intraburst frequency and NNS burst duration had increased VOT variability, suggesting a relationship between the development of VOT and NNS in the first year of life. Future work is needed to continue to examine the relationship between these early oromotor actions and to evaluate how this may impact later speech development.
The variability of a child's voice onset time (VOT) decreases during development as they learn to coordinate upper vocal tract and laryngeal articulatory gestures. Yet, little is known about the relationship between VOT and other early motor tasks. The aims of this study were to evaluate the relationship between infant vocalization and another early oromotor task, non-nutritive suck (NNS). Twenty-five full-term infants (11 male, 14 female) completed this study. NNS was measured with a customized pacifier at 3 months to evaluate this early reflex. Measures of mean VOT and variability of VOT (measured via coefficient of variation) were collected from 12-month-old infants using a Language Environmental Analysis device. Variability of VOTs at 12 months was significantly related to NNS measures at 3-months. Increased VOT variability was primarily driven by increased NNS intraburst frequency and increased NNS burst duration. There were no relationships between average VOT or range of VOT and NNS measures. Findings from this pilot study indicate a relationship between NNS measures of intraburst frequency and burst duration and VOT variability. Infants with increased NNS intraburst frequency and NNS burst duration had increased VOT variability, suggesting a relationship between the development of VOT and NNS in the first year of life. Future work is needed to continue to examine the relationship between these early oromotor actions and to evaluate how this may impact later speech development.
The infant suck reflex is one of the earliest motor reflexes to develop, emerging in utero around 15 weeks’ gestational age [1] and stabilizing around 34 weeks’ gestational age [2]. Infants have two types of suck: a nutritive suck used for feeding and a non-nutritive suck (NNS) characterized by the absence of nutrient delivery [1,3]. Infant non-nutritive suck is less complex than nutritive suck as it does not involve swallowing and given that it develops early, provides an early metric into the infants developing oromotor system. NNS is characterized by bursts of suck cycles, occurring at approximately 2 hertz, separated by pause periods for respiration, see Fig 1 [3]. NNS provides a window into central nervous system function, with disordered NNS patterns noted in infants who are preterm or who have neurological impairments [4-7]. Furthermore, NNS changes throughout the first year of life [8], and these changes are likely due to experience, anatomical growth, and a neurological system that are shifting from reflexive to more cortically driven [9]. In addition to providing information about current neuromotor development, evaluation of infant NNS can also provide information on future functional outcomes. Relationships have been found between infant suck and later oral feeding difficulties [10], language impairment [11-13], intelligence quotient [13], and other cognitive, developmental, and motor delays [14-18]. As infant suck is present at birth, understanding the relationship between the early infant suck measures and later developing speech, language, or cognitive skills will provide valuable information on neurodevelopment in both typical and vulnerable populations.
Fig 1
Example Non-nutritive suck (NNS) bursts: Infant NNS is arranged in bursts of sucking with pause periods for respiration.
Each burst contains cycles within it, which are depicted as black dots in this image. The frequency (Hz) within a burst is measured by the number of cycles per second. The strength of the suck cycle, or amplitude (cmH20), is measured by examining the top of each cycle where the black dot is located. This image depicts 50s of NNS data with 3 NNS bursts. Burst 1 has 27 cycles/burst (burst duration of 16 sec), Burst 2 has 15 cycles/burst (burst duration 8 seconds) and Burst 3 with 19 cycles/burst (burst duration 11 seconds). Amplitude of each cycle (black dot) can be determined looking at the y-axis.
Example Non-nutritive suck (NNS) bursts: Infant NNS is arranged in bursts of sucking with pause periods for respiration.
Each burst contains cycles within it, which are depicted as black dots in this image. The frequency (Hz) within a burst is measured by the number of cycles per second. The strength of the suck cycle, or amplitude (cmH20), is measured by examining the top of each cycle where the black dot is located. This image depicts 50s of NNS data with 3 NNS bursts. Burst 1 has 27 cycles/burst (burst duration of 16 sec), Burst 2 has 15 cycles/burst (burst duration 8 seconds) and Burst 3 with 19 cycles/burst (burst duration 11 seconds). Amplitude of each cycle (black dot) can be determined looking at the y-axis.During this time-period of infant NNS development, changes can also be seen in another motor action that uses overlapping musculature, speech production [19]. Although speech and non-speech tasks have distinct motor activation patterns [20-24], the similarity in the cyclical and rhythmic movements, musculature, and neural processes suggests that understanding the relationship between speech and infant suck can provide valuable information about development. One theoretical model that proposes this relationship is the Frame Content Theory of the Evolution of Speech Production [25]. This theory states that the frame is the continual mouth open-close rhythmic movement, seen in the jaw and tongue movement in feeding and sucking. As the infant develops and interacts with the environment, the content (e.g., vocalizations, verbal output) are superimposed on the frame [25]. Thus, this theory suggests early motor action of NNS (frame) will be related to the development of speech production (content).Babbling, one of the earliest stages of speech production in which infant produce speech-like oromotor movements, begins around six months of age [26], and typically consists of stop consonant-vowel productions (e.g., /dada/). The timing between the release of the stop consonant and start of the subsequent vowel, called voice onset time (VOT) can be measured in an acoustic signal, providing information on an infant’s oromotor control and coordination [27,28]. There are four proposed stages of VOT development, first described by Macken and Barton (1980) and later expanded by Hitchcock and Koenig (2013). During the first stage, infants’ VOTs range from 0 to 20 milliseconds (ms). This stage requires minimal coordination, as the vowel phonation begins almost immediately after the stop consonant closure is released. At this stage, articulatory accuracy of voiced bilabial /b/ and alveolar /d/ productions is more accurate than their unvoiced cognates (/p t/). The second stage involves the beginning of the development of the voicing contrast seen in adults, and infants begin producing voiceless stops with longer VOTs. During the third stage there is the continued elongation of VOTs for voiceless stops. This stage often involves an “overshoot” phase, in which VOTs for voiceless productions are significantly longer than adult productions. The fourth stage emerges around two to three years of age. During this stage, average VOTs are comparable to adults, yet, considerable variability continues to be present until around seven years of age [26,29-32].The purpose of the current study was to examine if there was a relationship between the early oromotor movements of NNS and babbling, a later developing movement that requires oromotor control. Based on the aforementioned frame/content model of speech production evolution [25], we hypothesized that aspects of infant NNS pertaining to cyclical jaw movements, such as NNS cycles/bursts, burst duration and intraburst frequency, at 3-months will be more related to advanced productions VOT productions at 12 months as they are building their content on a more mature frame. Consistent with previous work examining VOT in children [31], both average and variability metrics will be examined to elucidate information about the infant’s VOT developmental stage as well as the variability of their productions. Understanding the relationship between typical development of infant NNS and its relationship to the early motoric gesture of babbling can reveal important information about overall oromotor coordination abilities and provide a more comprehensive basis for understanding future neurodevelopmental outcomes.
Methods
Participants
Twenty-five total infants (11 male, 14 female) participated in this study. NNS measurements from eleven of these infants were reported in an earlier paper [8]. Infants were evaluated at 3 months (average age = 3.04 months, range = 2.56–3.76 months) and at 12 months (average age = 11.97 months, range = 11.53–12.33 months). Participants were all born full-term and had an average birthweight of 122.53 ounces (standard deviation (SD) = 18.34 ounces). All infants passed their neonatal auditory screening; by 12 months 57.2% of the infants had a history of ear infections. Hollingshead four-factor index (raw score of 8–66) of socioeconomic status based on marital status, employment status, educational attainment and occupational prestige was on average 57.26 (range = 30–66), and therefore in the mid-to-high SES range. All participants in this study were involved in a larger study examining the relation between early sucking, oral feeding, and vocal development across preterm and full-term infants. Infants were included in the current study if they were: (1) born full-term without congenital or chromosomal anomalies, (2) had usable suck samples at 3 and 12 months, and (3) produced a minimum of ten stop consonants within 20 minutes during their 12-month appointment. This study was approved by the institutional review board at Northeastern University. Participants were recruited by word of mouth, Facebook groups, and flyer distribution. All caregivers provided written consent for the study and were compensated for their participation.
Data collection
Data collection was completed in the infant’s home approximately one hour before a scheduled feed; NNS measurements were collected at the 3-month visit to capture this reflexive motor action, and babbling samples used for VOT analysis were collected at the 12-month visit. Measurements of the infant’s NNS were collected at 3-months with a custom-made research device that consisted of a 0-3-month Soothie pacifier (Philips, Avent) attached to a pressure transducer. The pressure transducer was attached to a data acquisition system (Power Lab, ADInstruments), allowing for real-time visualization of NNS using the LabChart software (ADInstruments). The pressure transducer in the custom-made research device was calibrated with an external pressure calibrator Meriam M1 Series Digital Manometer Calibrator; a range of pressure measurements from the NNS system were recorded simultaneously with both pressure transducers and used to produce a linear calibration curve for the NNS system. Following calibration, parents/caregivers were instructed on how to offer the infant the pacifier, which consisted of demonstration by the research assistant to cradle the infants and offer the infants the pacifier. Researchers encouraged a quiet environment for data collection; however, since the study was completed in the home this was not always possible. Ideally, infants were in a quiet-alert state; however, data collection was discontinued if the infant began to cry, appeared distressed, or rejected the pacifier. Average time infants NNS suck was recorded was 3.09 minutes.During the 12 month visit, each infant was fitted with a Language Environment Analysis (LENA), a wearable recording device that is widely used in research to analyze early speech vocalizations in young infants [e.g., 33,34]. The LENA device is a small piece of hardware (3-3/8” x 2-3/16” x 1/2”) that houses an omnidirectional microphone with a flat 20 hertz (Hz)– 20,000 Hz frequency response and records acoustic data at 16,000 Hz [35]. For each child, the LENA recorder was placed in a dedicated LENA vest; the vest keeps the microphone a consistent distance from the infant and is designed with fabric that has minimal impact on the acoustic recordings [36]. Parents were instructed to leave the vest on their infant for the remainder of the day (with the exception of bath and nap times), continue with their typical routines, and document all activities done while the infant was wearing the vest.
Measures
Non-Nutritive Suck (NNS)
Trained experimenters identified NNS burst manually using the LabChart software. We created a study settings file in the LabChart software that consisted of a NNS sample rate of 1000 samples per second with a low-pass filter with a cut off frequency of 50 Hz. NNS physiology has a stereotypical burst-pause pattern, with an intraburst frequency of 2 Hz and each burst containing 6–12 suck cycles [3]. Bursts were defined two or more suck cycles in a row, with the cycles less than one second apart and each cycle’s amplitude at least one cmH20 (see example in Fig 1). This definition of burst is consistent with previous studies examining NNS in young infants [8,37-40]. Following manual selection of all bursts, the best two minutes of NNS data were selected based on cycle number, which is a common procedure used across studies [8,37-39] in an effort to examine the infant’s most active NNS sample. NNS measures were calculated with a custom-made NNS burst macro in LabChart. Then, the average of the two minute samples was taken to determine the following NNS minute rates: (1) burst amount, the number of NNS bursts in a minute, (2) burst duration in seconds (sec), the average length of the burst (3) cycle amount, the number of cycles per minute, (4) cycles/burst, average number of cycles in each burst per minute, (5) amplitude (cmH0), average amplitude of the pressure of the cycles, measured as peak-height minus peak-trough in cmH20, and (6) Frequency (Hz), the intraburst frequency between cycles.
Voice onset time (VOT)
Algorithms in the LENA Pro software were used to identify continuous speech spoken by the infant. The most voluble hour (i.e., the hour with the most infant vocalizations) was found for each infant and the activity log was examined to verify the infant was awake during the selected hour. The most voluble 20 minutes from each infant’s most voluble hour was exported to Praat [41] for acoustic analysis. An initial rater identified all stop consonants from the acoustic waveforms and spectrograms. Default spectrogram Praat settings were used, with a view range of 0–5000 Hz, dynamic range of 70 dB, and a window length of 0.005 seconds. VOT, defined as the time between the burst of the stop consonant and the start of the subsequent vowel, was measured for each stop consonant-vowel pair identified (Fig 2). To control for differences in the amount of stop consonant-vowel productions between infants, only the first ten stop consonant-vowel productions were selected for evaluation. Each stop consonant selected was identified based on both the rater’s auditory-perception of the production and the presence/absence of features in the spectrogram (e.g., voicing bar). The majority of productions were perceived as a voiced /d/ production, with voiced phonemes identified more frequently than voiceless consonants (Table 1). Stop consonants were identified from segments of reduplicated (e.g., baba) and variegated (e.g., baga) babbling. No identifiable words were noted with stop consonants. A second rater (one of the senior authors) reviewed all VOTs and adjusted when needed. Each rater repeated analysis on 20% of the participants, for a total of 50 VOTs repeated. The average absolute difference in VOT ratings were calculated to assess intra-rater reliability (rater 1: average VOT difference = 0.97 ms, standard deviation = 2.0 ms; rater 2: average VOT difference = 0.91 ms, standard deviation = 1.3 ms).
Fig 2
LENA vest (left) and Voice onset time example (Right). Left: An infant wearing the LENAvest during recording (The parent of the infant pictured in this manuscript has given written informed consent (as outlined in PLOS consent form) to publish these case details). Right: Praat window with waveform and spectrogram from a stop consonant-vowel production. Voice onset time is measured from the burst to the start of the subsequent vowel, indicated by the highlighted portion.
Table 1
Perception of stop consonants during initial calculation of voice onset time.
Phoneme
Phoneme Count
Infant Count
Voiced
b
46
15
d
138
23
g
26
6
Total Voiced
210
25
Voiceless
p
2
2
t
32
16
k
2
2
Total Voiceless
36
16
Ambiguous
d/t
4
2
Total (Voiced and Voiceless)
250
25
LENA vest (left) and Voice onset time example (Right). Left: An infant wearing the LENAvest during recording (The parent of the infant pictured in this manuscript has given written informed consent (as outlined in PLOS consent form) to publish these case details). Right: Praat window with waveform and spectrogram from a stop consonant-vowel production. Voice onset time is measured from the burst to the start of the subsequent vowel, indicated by the highlighted portion.Three measures were used to evaluate VOT productions. Average VOT and VOT range (maximum VOT–minimum VOT) were calculated to provide information on the infant’s stage of VOT development. Longer VOTs and larger VOT ranges were interpreted as advanced development as the infants were prolonging VOT productions. To evaluate the variability of the infants’ productions, the coefficient of variation (CoV) of VOT was calculated for each infant. CoV, that is the standard deviation VOT divided by the mean VOT, providing a metric of infant variability of productions around their mean production. Therefore, using CoV allows for evaluation of infant VOT variability, while adjusting for individual differences in average VOT.
Data analyses
Descriptive statistics on NNS measures at 3 months and VOT measures at 12 months were completed. All measures were converted to rank order for the subsequent analyses to account for the lack of normality in the measures. Correlations examined the relationships among NNS measures; a Bonferroni corrected alpha level of 0.0033 (0.05/15 correlations = 0.0033) was used to correct for multiple comparisons. Three multiple linear regressions examined whether NNS measures at 3 months predicted either average VOT, range of VOT, or CoV of VOT at 12 months. All analyses were completed in JMP Pro [42].
Results
Analysis of individual measures
Average VOT at 12 months was 7.82 ms across all infants, with individual infant averages ranging from 0 ms to 33.87 ms. Examination of individual productions indicated infants produced a large range of VOT values (Fig 3), with average CoV of VOT measured at 1.22.
Fig 3
Infant voice onset times.
Voice onset time medians (blue dots) and ranges for each infant.
Infant voice onset times.
Voice onset time medians (blue dots) and ranges for each infant.Infants at 3 months produced NNS an average of 4.10 bursts (range: 1.50–9.50) per minute, with an intraburst frequency average of 2.06 Hz (range: 1.36–2.75), average burst duration of 4.93 seconds (range: .94–11.97). Infants produced an average of 10.02 cycles per burst (range: 2.25–27.17), 42.24 cycles per minute (range: 3.5–109.50), and an average cycle amplitude of 12.32 cmH20 (range: 1.19–28.03).
Relationships between NNS measures at 3 months and VOT measures at 12 months
The NNS measure of burst duration was highly correlated with NNS measures of cycle amount (r = .84, p <0.001) and cycles/burst (r = .97, p <0.001). The NNS measure of cycles amount was highly correlated with burst amount (r = .71, p <0.001) and cycles/burst (r = .87, p <0.001, Table 2). Based on the high association between cycle amount and cycles/burst with other NNS measures, only the NNS measures of burst duration, frequency, amplitude, and burst amount were included in the regression model. There was no significant effect of NNS measures of burst duration, frequency, amplitude, burst amount on either average VOT or range of VOT (all p > 0.05). A regression model including the NNS measures of burst duration, frequency, amplitude, and burst amount significantly predicted CoV of VOT (F(4,18) = 3.613, p = 0.02), with an R2 = 0.45 (Fig 4). Increased variability of VOT was driven by increased NNS burst duration (β = 0.53, p = 0.008) and increased NNS intraburst frequency (β = 0.50, p = 0.01). Measures of NNS height (β = 0.21, p = 0.32) and decreased NNS burst amount (β = -0.34, p = 0.10) did not reach significance in this model.
Table 2
Correlation matrix of non-nutritive suck (NNS) measures.
Burst Duration (sec)
Frequency (Hz)
Amplitude (CmH20)
Burst Amount
Cycles/Burst
Frequency (Hz)
-.13
--
--
--
--
Amplitude (CmH20)
.18
-.26
--
--
--
Burst Amount
.37
.001
.47
--
--
Cycles/Burst
.97*
.06
.14
.38
--
Cycle amount
.84*
.09
.34
.71*
.87*
* correlations significant at p < 0.0033.
Bolding indicates significant correlations.
Fig 4
Relationships between voice onset time variability and non-nutritive suck.
Relationship between the actual ranked coefficient of variation (CoV) of voice onset time and the CoV of voice onset time predicted by the regression model that included NNS measures of burst duration, frequency, amplitude, and burst amount. NNS measures of burst duration and intraburst frequency were significant predictors. Shaded area represents 95% confidence interval around the regression line.
Relationships between voice onset time variability and non-nutritive suck.
Relationship between the actual ranked coefficient of variation (CoV) of voice onset time and the CoV of voice onset time predicted by the regression model that included NNS measures of burst duration, frequency, amplitude, and burst amount. NNS measures of burst duration and intraburst frequency were significant predictors. Shaded area represents 95% confidence interval around the regression line.* correlations significant at p < 0.0033.Bolding indicates significant correlations.
Discussion
This study provided a novel look into the relationships between two early oromotor actions, NNS and babbling. Findings from the current study indicated a relationship between NNS at 3-months and VOT variability at 12-months. The metric of variability used in the current work was coefficient of variation (CoV), a measure that examines variability while controlling for average VOT. Previous work examining VOT productions suggests that VOT variability decreases during maturation [26,43,44]; however, this reduction in VOT variability occurs after two years of age [45]. Although the decrease in variability in older children has been associated with improved accuracy of productions [44,46,47], it is unlikely that differences in variability of babbling in 12-month-old infants is due to refining movements in order to accurately reach a target. Instead, we suggest that the increased variability evident during this period may be due to increased exploration while infants are acquiring new articulatory movements. This interpretation is based on Dynamic Systems Theory, which posits that learning a new skill is preceded by increased variability of movement [48,49]. Therefore, we speculate that increased VOT variability in the current study may be indicative of a more advanced stage of speech production acquisition.Findings from this study indicated that infants with increased VOT variability at 12 months had increased NNS at 3 months, driven by increased NNS burst duration and increased NNS intraburst frequency. Put simply, infants who produced longer bursts, which likely consisted of more NNS cycles per burst, produced at a faster rate had increased VOT variability during babbling. Both NNS and babbling involve jaw and tongue movements; the jaw and tongue movements required to produce multiple, rapid cycles within a burst mirrors the jaw and tongue movements during babbling [20,50-52]. Interpreted within the frame/content theory of evolution of speech production [25], we postulate that ability to produce longer bursts at a faster rate at 3-months may allow the infant’s frame to be primed for the development of the content of babbling at 12-months. That is, infants are building this later content (babbling) on the earlier frame (NNS). However, further work is needed to examine this relationship beyond the scope of this pilot study. The current work used the measure of VOT, which allows inference about jaw movement, yet this temporal measure does not provide significant information about tongue control. As multiple orofacial structures are involved in both infant sucking and babbling (e.g., jaw, lip, tongue), future work measuring the kinematic movement of all structures involved in both NNS and speech production is needed to further evaluate how these early motor actions are related.Although relationships were found between NNS and CoV of VOT, the current study did not find any relationship between VOT average or range of VOT productions and NNS measures. Based on the proposed stages of VOT development [30,32], an average VOT value of 7.82 ms suggests infants are in stage one of VOT development, in which the vowel phonation begins almost immediately after the stop consonant closure is released. One possible reason we did not find a relationship between NNS measures and VOT averages is that the small differences in VOT averages at stage one may not provide a meaningful metric of development. Thus, to evaluate the relationship between VOT development further, we examined whether VOT range would relate to NNS measures. Consistent with previous studies, individual infant averages ranging from 0 ms to 33.87 ms [32,45,53-56] indicating that infants were beginning to lengthen the time between stop closure offset and vowel onset [30,32]. However, VOT range did not meaningfully relate to NNS productions. A potential reason for the absence of findings may be the lack of a clear target for babbling, thus making interpretation of VOT range more complex. Previous work examining VOT in the first few years of life has mainly focused on production of words [32,45,53,55], with the few articles that discussed VOTs of babbling focusing primarily on cross-linguistic differences [54,56]. The evaluation of VOT in words allows for judgements on accuracy of productions as the intended target is known, whereas babbling does not have to have a clear target. Therefore, future work is needed to longitudinally examine the relationship between the NNS reflex and more intentional early speech production produced at later time-points and across patient populations.It should be noted that although we have discussed our results within the frame/content theory, there are other potential explanations that require consideration. First, it is possible that the measurements of NNS and VOTs are capturing development of a single skill, rather than the building of the content (babbling) on the frame (NNS). Further work is needed to determine if the skill of babbling is built on the skill of NNS, or if these are purely two types of oromotor movement measurements captured at different points in time. Second, it is important to acknowledge that in focusing on the oromotor relationships between NNS and VOT productions, the current work does not address the impact of vocal fold movement on VOTs. As VOT depends on the coordination of the articulatory and vocal fold movement, some of the findings of the current work may be related to vocal fold changes. During the first few years, the vocal folds are undergoing significant structural changes that may impact their flexibility and movement [57,58]. Thus, further work is needed to clarify the potential impact of vocal fold development on VOTs during babbling. This future work should also include an examination of vowel token as emerging work suggests that, unlike adults [59-61], the relationship between VOTs and vowels in children is not clear [62]. As these differences in vowels may be related to intrinsic fundamental frequency differences, as well as differences in vocal tract positioning (and subsequent formant measurements), understanding this relationship may provide valuable information about VOT development. Lastly, although the consonants were labeled during the initial identification of VOT instances, they were not part of a larger perceptual study and therefore we considered them as preliminary labels for the consonants. Due to the absence of a clear consonantal target and the sparsity of different consonants identified during preliminary labeling, we did not pursue additional analysis of any potential relationships between specific consonants and NNS measures. As there is evidence that VOT may vary by place [e.g., 32,62–66], future work that includes methodology to confirm place of articulation (e.g., electromagnetic articulography, video analysis) is needed to examine any potential relationship between place of articulation and NNS measures.Overall, findings from this pilot study suggests the relationship between NNS and VOT in babbling and/or early words needs further exploration. Understanding this relationship could provide valuable information on the development of the motor control system as a whole and provide a marker for children who may be at higher risk for later difficulty with speech motor control. Previous work examining VOT in older children has shown differences in VOT control. For instance, evaluation of VOTs in children at high-risk for developing autism spectrum disorder had deceased distinction in voice and voiceless productions [67]. In addition, children who were later diagnosed with childhood apraxia of speech were found to use less voiceless sounds in early productions [68]. Future work should evaluate whether these later appearing differences in VOTs are related to variability in infant babbling, and thereby related to NNS outcomes sampled soon after birth. In this way, NNS and early VOT metrics could potentially serve as early biomarkers for subsequent speech development.
Limitations
Limitations in the study included a small sample size, as only 25 total infants were included in the study. Due to this small sample size, this study did not examine potential differences in other variables such as infant sex, feeding method, or birthweight on NNS and VOT productions. Further work on a larger sample size is needed to examine whether other relevant variables impact the relationship between early oromotor actions. The current study evaluated NNS measures at 3-months and babbling productions at 12-months and thus, if the infant had a difficult day (e.g., tired or fussy), their data may not be truly representative of their NNS and VOT productions. Future work should also assess both NNS and VOT measures at multiple time points to more comprehensively measure these early motor actions. Finally, infants were recruited from the Northeast through flyers, online parent communities, and word of mouth. This recruitment strategy resulted in only middle to high SES participants; future work is needed with infants from a broader range of backgrounds to increase the generalizability of these findings. Lastly, this study was completed in the home environment. Future work is needed to examine babbling in a more controlled environment, allowing for examination of other factors that may influence babbling measurements (e.g., rate of speech, use of infant directed speech, number of people present in the room). While use of the LENA system in the home provides a window into the infant’s natural environment, future work can examine situations where the babbling productions are elicited and more closely controlled.
Conclusion
The results of this pilot study reveal that there is a relationship between VOT and NNS measures in infants. Increased variability of VOT productions at 12 months was related to NNS measures, with the relationship driven by increased NNS intraburst frequency and increased NNS burst duration at 3-months. No relationships were found between the average or range of VOT productions at 12 months and NNS measures at 3 months. Findings suggest a link between infant vocal development and oromotor movements evident in NNS productions, motivating the need for future work to continue to examine this relationship.22 Mar 2021PONE-D-21-02855Non-nutritive suck and voice onset time: Examining infant oromotor coordinationPLOS ONEDear Dr. Zimmerman,Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.Please submit your revised manuscript by April 1. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.Please include the following items when submitting your revised manuscript:A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocolsWe look forward to receiving your revised manuscript.Kind regards,Michael Döllinger, Ph.D.Academic EditorPLOS ONEJournal Requirements:When submitting your revision, we need you to address these additional requirements.1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf andhttps://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf2. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.In your revised cover letter, please address the following prompts:a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.We will update your Data Availability statement on your behalf to reflect the information you provide.Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.3. We note that Figure 2 includes an image of a [patient / participant / in the study].As per the PLOS ONE policy (http://journals.plos.org/plosone/s/submission-guidelines#loc-human-subjects-research) on papers that include identifying, or potentially identifying, information, the individual(s) or parent(s)/guardian(s) must be informed of the terms of the PLOS open-access (CC-BY) license and provide specific permission for publication of these details under the terms of this license. Please download the Consent Form for Publication in a PLOS Journal (http://journals.plos.org/plosone/s/file?id=8ce6/plos-consent-form-english.pdf). The signed consent form should not be submitted with the manuscript, but should be securely filed in the individual's case notes. Please amend the methods section and ethics statement of the manuscript to explicitly state that the patient/participant has provided consent for publication: “The individual in this manuscript has given written informed consent (as outlined in PLOS consent form) to publish these case details”.If you are unable to obtain consent from the subject of the photograph, you will need to remove the figure and any other textual identifying information or case descriptions for this individual.Additional Editor Comments (if provided):[Note: HTML markup is below. Please do not edit.]Reviewers' comments:Reviewer's Responses to QuestionsComments to the Author1. Is the manuscript technically sound, and do the data support the conclusions?The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.Reviewer #1: YesReviewer #2: Yes**********2. Has the statistical analysis been performed appropriately and rigorously?Reviewer #1: YesReviewer #2: Yes**********3. Have the authors made all data underlying the findings in their manuscript fully available?The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: NoReviewer #2: Yes**********4. Is the manuscript presented in an intelligible fashion and written in standard English?PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.Reviewer #1: YesReviewer #2: Yes**********5. Review Comments to the AuthorPlease use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)Reviewer #1: Nicely designed and completed study that is clearly described. Please consider these few minor points:1) It would help readers to depict the various measures as part of Fig 1 (or in addition to fig 1).2) On line 205, the CoV is assigned units (ms). Isn't it a ratio and unit-less?3) Please describe the consonants - are these consonant-like sounds or would they perceived as actual consonants? Do they generally sound voiced or a mix of voiced and voiceless? Are they primarily reduplicated babbled syllable trains?4) Discussion - lines 257-260: it sounds as if you're suggesting this is causal (with the sucking, they are building the frame needed for increased babbling). You might want to acknowledge that they might both be consequences of the same underlying process / skill / advantage.5) I didn't see the NNS individual data at the level I think PLOS requests. Perhaps it is in an appendix that I missed.Reviewer #2: Thankyou for giving me the opportunity to review this very well written manuscript. It was a pleasure to review.The manuscript presents the results of a very challenging study comparing the activity of non nutritive sucking in infants at 3 months and the variability of voice onset time in babbling in the same infants at age 12 months.The authors are to be congratulated for achieving difficult data collection.All aspects of the study appear to have been well conducted with good scientific rigour.I have only a few comments that I think the authors should consider.The stop consonants (if determinable) that VOT was calculated on is not reported. Given that VOT varies across phonemes, it is important that these be identified and categorised if possible and then VOT values in each phoneme group be compared to child values reported in the literature to assess if the infants variability is consistent with phoneme variability ranges. I would be interested to see if any of the NNS measures were more or less correlated with different phonemes. If this is not possible, the authors should provided more detail regarding whether a phoneme target is identifiable or not.Interpretation of the results seem appropriate regarding the movement of jaw and tongue between NNS and VOT in babbling.The authors fail, however, to discuss movement of the vocal folds, and the relationship or not of vocal fold movement to jaw and tongue movement. This is a problematic as VOT is a measure of co-ordination of movements of jaw and tongue vocal fold closure with closure and vibration of the vocal folds. They should mention how the onset of phonation may be affected by the articulatory movements common to NNS and VOT. Some comment on vocal fold activity or posture in NNS should also be mentioned for context.Thankyou again for the opportunity to review this manuscript.**********6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.If you choose “no”, your identity will remain anonymous but your review may still be made public.Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.Reviewer #1: NoReviewer #2: No[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.27 Mar 2021We would like to thank both the reviewers and the editor for this review of our article Non-Nutritive Suck and Voice Onset Time: Examining Infant Oromotor Coordination. We have addressed each point below and have highlighted the changes in the ‘Revised Manuscript with Track Changes’ document. We are grateful for the time and effort put into these reviews and believe that these changes make the manuscript stronger. Thank you for the opportunity to resubmit.1. A minimally anonymized data set with subject-level data for voice onset time and non-nutritive suck variables has been made public through the open science framework platform available here: https://osf.io/f2sk7/2. We have updated the methods to explicitly say “The parent of the infant pictured in this manuscript has given written informed consent (as outlined in PLOS consent form) to publish these case details.” We have added this information in our ethics statement and have saved a copy of this form securely in the individual’s case notes.3. In re-checking all of our analyses prior to uploading the final anonymized dataset, we realized we had reported numbers that were not the final analysis. These resulted in very minor changes to the variables reported in the text and figure 4. These minor corrections did not change the outcomes of the statistical tests. We have highlighted these changes in the manuscript and apologize for this oversight.Below are the responses to the individual reviewer comments.Reviewer #1:1. Nicely designed and completed study that is clearly described. Please consider these few minor pointsResponse: Thank you for your review.2. It would help readers to depict the various measures as part of Fig 1 (or in addition to fig 1).Response: We have added additional text to the figure caption to explain these measures in more detail.3. On line 205, the CoV is assigned units (ms). Isn't it a ratio and unit-less?Response: Thank you for noticing this oversight. You are correct CoV is unitless and we have updated the text.4. Please describe the consonants - are these consonant-like sounds or would they perceived as actual consonants? Do they generally sound voiced or a mix of voiced and voiceless? Are they primarily reduplicated babbled syllable trains?Response: Thank you for this excellent point. We have added additional text in the methods and discussion to address this comment. The consonants were analyzed were from single syllable productions, or instances of reduplicated or variegated babbling. No identifiable words with stop consonants were found in these recordings. We have also added a table detailing the consonants preliminarily identified during the initial VOT analysis.5. Discussion - lines 257-260: it sounds as if you're suggesting this is causal (with the sucking, they are building the frame needed for increased babbling). You might want to acknowledge that they might both be consequences of the same underlying process / skill / advantage.Response: Thank you for this note – we agree this important to acknowledge. We have updated the discussion to include an acknowledgement that these could be the same process developing.6. I didn't see the NNS individual data at the level I think PLOS requests. Perhaps it is in an appendix that I missed.Response: Thank you for pointing this out. We have added the individual data to an ‘open science framework’ data repository https://osf.io/f2sk7/Reviewer #2:1. Thank you for giving me the opportunity to review this very well written manuscript. It was a pleasure to review. The manuscript presents the results of a very challenging study comparing the activity of non nutritive sucking in infants at 3 months and the variability of voice onset time in babbling in the same infants at age 12 months. The authors are to be congratulated for achieving difficult data collection.Response: Thank you for your review.2. The stop consonants (if determinable) that VOT was calculated on is not reported. Given that VOT varies across phonemes, it is important that these be identified and categorized if possible and then VOT values in each phoneme group be compared to child values reported in the literature to assess if the infants variability is consistent with phoneme variability ranges. I would be interested to see if any of the NNS measures were more or less correlated with different phonemes. If this is not possible, the authors should provided more detail regarding whether a phoneme target is identifiable or notResponse: Thank you for this excellent point. We have also added a table detailing the consonants preliminarily identified during the initial VOT analysis and added text in the discussion. Although we agree that analyzing the relationship between NNS measures is a very interesting idea, we are hesitant to do this in the current paper due to the sample size and the subjective nature of consonant productions in babbling. We have added text to addressed this directly in the discussion and to highlight the importance of doing this in future work.3. Interpretation of the results seem appropriate regarding the movement of jaw and tongue between NNS and VOT in babbling. The authors fail, however, to discuss movement of the vocal folds, and the relationship or not of vocal fold movement to jaw and tongue movement. This is a problematic as VOT is a measure of co-ordination of movements of jaw and tongue vocal fold closure with closure and vibration of the vocal folds. They should mention how the onset of phonation may be affected by the articulatory movements common to NNS and VOT. Some comment on vocal fold activity or posture in NNS should also be mentioned for context.Response: Thank you for this comment. We agree that not speaking about vocal fold motion was an oversight and we have now added additional text in the discussion.Submitted filename: ResponseToReviews.docxClick here for additional data file.8 Apr 2021Non-nutritive suck and voice onset time: Examining infant oromotor coordinationPONE-D-21-02855R1Dear Dr. Zimmerman,We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.Kind regards,Michael Döllinger, Ph.D.Academic EditorPLOS ONEAdditional Editor Comments (optional):Dear authors,I recommend acceptance, However, please check on following rather small errors:line 224: "1.190" or "11.90"; if "1.190" is correct, please change to "1.19"Line 389: please provide pagesline458: "2012;2012" twice? correct?line472: please provide pagesreference 50: If this is a book, than there is information missing, i guess.Figure 3: units of x-axis are missin, I guess it is msFigure4: units for x-y-axes?13 Apr 2021PONE-D-21-02855R1Non-nutritive suck and voice onset time: Examining infant oromotor coordinationDear Dr. Zimmerman:I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.If we can help with anything else, please email us at plosone@plos.org.Thank you for submitting your work to PLOS ONE and supporting open access.Kind regards,PLOS ONE Editorial Office Staffon behalf ofDr. Michael DöllingerAcademic EditorPLOS ONE
Authors: Mechteld I Wolthuis-Stigter; Margreet R Luinge; Saakje P da Costa; Wim P Krijnen; Cees P van der Schans; Arend F Bos Journal: J Pediatr Date: 2014-10-11 Impact factor: 4.406