Literature DB >> 25254069

On the Links Among Face Processing, Language Processing, and Narrowing During Development.

Olivier Pascalis1, Hélène Loevenbruck2, Paul C Quinn3, Sonia Kandel1, James W Tanaka4, Kang Lee5.   

Abstract

From the beginning of life, face and language processing are crucial for establishing social communication. Studies on the development of systems for processing faces and language have yielded such similarities as perceptual narrowing across both domains. In this article, we review several functions of human communication, and then describe how the tools used to accomplish those functions are modified by perceptual narrowing. We conclude that narrowing is common to all forms of social communication. We argue that during evolution, social communication engaged different perceptual and cognitive systems-face, facial expression, gesture, vocalization, sound, and oral language-that emerged at different times. These systems are interactive and linked to some extent. In this framework, narrowing can be viewed as a way infants adapt to their native social group.

Entities:  

Keywords:  face; narrowing; speech

Year:  2014        PMID: 25254069      PMCID: PMC4164271          DOI: 10.1111/cdep.12064

Source DB:  PubMed          Journal:  Child Dev Perspect        ISSN: 1750-8592


Social life requires relationships with other group members, acknowledgment of their status, and communication between individuals. Depending on the species studied, communication occurs through vocalization, language, faces and their expressions, or some combination of these. Similarities observed across species may provide insights into the relation between different social communication tools and networks. Based on these observations, we argue here that communicative tools emerged during evolutionary time and that current systems reflect aspects of this evolution. In humans, faces and language are essential for communication, but they have been studied traditionally as separate areas with little interaction between the two domains, even when their links are acknowledged. In some frameworks, they even have been conceived of as independent cognitive modules. If faces provide an early channel of communication for newborns prior to comprehending gestural or oral language, postnatal exposure to the mother's voice–face combination is required to recognize the mother's face (Sai, 2005). In one study, moving faces were recognized only when sound was present (Coulon, Guellai, & Streri, 2011). Thus, face processing seems to be facilitated by voice processing, even at an early age. Later, in early childhood, most conversations take place face-to-face. Although auditory information alone is sufficient to understand speech, we rely systematically and unconsciously on visual information provided by a speaker's face. Seeing oro-facial gestures of the speaker accelerates recognition of core words (Fort et al., 2012) and enhances intelligibility in noisy environments (Benoît, Mohamadi, & Kandel, 1994). Therefore, most human conversations—except when we are on the phone—invoke analyzing facial configurations to locate cues relevant to decode speech. Thus, the integration of audio and facial information is crucial to speech perception. These observations point to a close link between face and language processing that, we argue, may reflect how social communication evolved and how it develops in infants and children. More specifically, functional links between gestural and oral communication in nonhuman primates as well as infants suggest that social communication is a multimodal system, involving manual and visuo-facial gestures as well as vocalization. This multimodal system is gradually tuned during development, with narrowing occurring in all the different modalities of communication.

Face Processing, Language Processing, and Development

Human adults can recognize familiar faces easily and are said to process faces expertly. Faces form a category of stimuli that are homogenous in terms of the positioning of their internal elements, and humans have developed a signature way to discriminate them based on configural (i.e., relational) information, such as the distance between the eyes or between lips and chin. Experience likely plays a critical role in acquiring face expertise (Lee, Anzures, Quinn, Pascalis, & Slater, 2011). Language is a key tool for social communication because it allows for transmitting complex information that facial expressions cannot. It is a complex cognitive skill requiring recursion and displacement (Chomsky, 1965), yet children acquire it swiftly and without instruction, whereas most adults find learning a second language challenging. Studies of language acquisition have discovered crucial milestones: Vocalizations are observable at birth, babbling emerges at around 6–8 months, children utter their first words at 10–12 months, and they begin to make word combinations and form proto-sentences at around 20–24 months (Vihman, 1996). Studies of the development of the systems that process faces and language have identified similarities between the two. Face processing develops during the first years of life from a broad nonspecific system to a human-tuned face processor (Nelson, 2001). Faces observed within the infants' visual environment shape and influence the developing face system through a process known as perceptual narrowing: a progression whereby infants maintain the ability to discriminate stimuli to which they are exposed, but lose the ability to discriminate stimuli to which they are not exposed. This course of responsiveness is similar for language development. In the first year, initial discriminatory ability reflecting a universal sensitivity to the sounds of all human languages narrows as a consequence of predominant exposure to one's native language and scarce exposure to other languages (Werker & Tees, 1999). During this time, infants become tuned to their native language and the distribution of phonetic information in the ambient language at the expense of discriminating nonnative contrasts. In other words, infants become experts at processing frequently experienced faces and native sounds. Narrowing cuts across both visual and auditory modalities, possibly reflecting the development of a common neural architecture (Scott, Pascalis, & Nelson, 2007). Narrowing could be a pan-sensory process; that is, the same phenomenon is observed in various senses during the same period and is part of the development of our multisensory representation of the world (Lewkowicz & Ghazanfar, 2009). This line of thinking raises questions such as: Is perceptual narrowing amodal? Is auditory narrowing linked to visual narrowing? One argument for the link between the development of face and language processing comes from neuroanatomy. The superior temporal sulcus (STS) is associated with face processing and auditory representation of speech components (Démonet, Thierry, & Cardebat, 2005; Haxby, Hoffman, & Gobbini, 2000). The posterior part of the STS may be considered an amodal convergence zone that plays a key role in integrating face and voice information (Belin, Bestelmeyer, Latinus, & Watson, 2011). These findings suggest similar, interacting, and common brain circuits for processing faces and speech. Descriptions of narrowing fail to consider the evolution and timing of when face and language processing emerged. What drives or motivates the development of both face and language processing is the urge to communicate. In the rest of the article, we describe several functions of human communication, then explain how perceptual narrowing modifies each of these, and conclude that narrowing is a common characteristic of all social communication.

Gestural and Oral Communication

Human language is described as unique even though some form of communication exists in other species. Understanding the emergence of language during evolution is a challenge, as fossil evidence does not provide much insight into oral language. Two means of communication are seen as potential precursors to human language—vocal calls and gestures—although it is debatable whether language originated in manual gestures or evolved exclusively in the vocal domain. The former hypothesis considers pointing as the initial means to communicate, which later developed into a gestural language. Language may have evolved from manual gestures, and then gradually incorporated vocal elements, so that language involves reciprocity in the actions of partners (Corballis, 2003). The mechanism could be supported by mirror neurons, located in Broca's area in humans (Buccino et al., 2001). This area is involved with vocalization as well as manual action and could have been used as a neural substrate for interspecific communication and then to process speech. In addition, gestures, and more specifically pointing, are associated closely with language development (Kita, 2003). Ocular pointing (or deictic gaze, at 6–9 months) and later index finger pointing (deictic gesture, at 9–11 months) are key stages in cognitive development that are correlated with stages in speech development. Finger pointing is associated with learning new word forms and their associated meanings, and when accompanied by word production (at 16–20 months), fosters the emergence of sentences. At later stages, children start using prosodic focus, that is, vocal pointing (Ménard, Lœvenbruck, & Savariaux, 2006), or constructions involving a deictic pronoun (Diessel & Tomasello, 2000). Different pointing modalities may share a common cerebral network: Ocular, digital, and prosodic pointing are associated with left parietal activation (Lœvenbruck, Dohen, & Vilain, 2009). These findings suggest a link between gesture and language. However, the referential and combinatorial properties of primate vocal communication suggest that language is also rooted in vocalization (Arnold & Zuberbühler, 2008): Chimpanzees produce and understand functionally referential calls, such as an alarm call for a snake, and monkeys can combine existing calls into higher order meaningful sequences. Furthermore, syllables may derive from cycles of rhythmic opening and closing of the jaw involved in chewing, sucking, and licking, which take on communicative significance as lip smacks, tongue smacks, and teeth chatters (MacNeilage, 1998). These observations suggest a direct evolutionary trajectory from primate vocalizations to human speech rather than a complex route requiring an intermediate stage of gestural communication. Our view is that functional links between gestural and oral communication, observed in nonhuman primates and infants, suggest that communication is a multimodal system involving manual and visuo-facial gestures as well as vocalization. Human communication may have switched to oral-dominant language for several reasons, including accessibility without seeing the other person (e.g., at night or from a distance) and accessibility while doing something else with the forelimbs (e.g., carrying or using tools; Corballis, 2003). Humans would have gradually used the oro-facial region more than the hand in communicating. Clearly, different kinds of communication existed before oral language, including vocalizations, facial expressions, and visuo-facial gestures. These findings highlight the strong phylogenetic and ontogenetic links between face and language processing.

Narrowing Across Domains That Involve Social Communication

Faces

Although 6-month-olds recognize different races of human faces as well as different monkey faces, 9- to 10-month-olds recognize reliably only faces of their own species and race (for a review, see Lee et al., 2011). Successful social communication relies on our ability to process information that allows us to identify people with whom we interact, such as identity, age, and gender. Specialization for faces of our own race improves our ability to extract such information. Regarding voice recognition, 7-month-olds detected changes in voice only when the language was in their native tongue (Johnson, Westrek, Nazzi, & Cutler, 2011), suggesting that voice recognition develops in pace with increasing competence in language processing. However, younger infants' ability has not yet been reported and we, therefore, cannot conclude that narrowing has occurred in this domain. In addition to recognizing faces, infants also learn to recognize facial expressions, which further feeds into their abilities to communicate socially (Quinn et al., 2011). Perceptual narrowing has been found for recognizing emotions in 9-month-old infants, but only for faces of their own race (Vogel, Monesson, & Scott, 2012), suggesting that perceptual narrowing affects stimuli that are important for communication with conspecifics and in-groups.

Audiovisual Speech

By the end of the first year of life, responsiveness to nonnative audiovisual inputs declines both in sound–face matching for other species and in nonnative language (Lewkowicz & Ghazanfar, 2009; Pons, Lewkowicz, Soto-Faraco, & Sebastián-Gallés, 2009). In a study that used silent video clips of a bilingual speaker telling a story in two languages, monolingual 4- and 6-month-olds discriminated visually between the two languages, whereas monolingual 8-month-olds did not (Weikum et al., 2007). The link between face and language processing is also illustrated by research in which infants watched and listened to a female speaking their native language or a nonnative language. Four-month-olds looked more at the eyes, 6-month-olds looked equally at the eyes and mouth, and by 8 months, infants shifted their attention to the mouth, regardless of the language spoken. These findings suggest that infants begin to focus on the mouth of a talker precisely when they start babbling (Lewkowicz & Hansen-Tift, 2012). In contrast, 12-month-olds no longer focused on the mouth when exposed to native speech, but continued to look more at the mouth when exposed to nonnative speech (Kubicek et al., 2013; Lewkowicz & Hansen-Tift, 2012).

Music–Rhythm

Music is important for communication and may be involved in comforting, courtship, movement coordination, and social cohesion (Brown, 2003). It requires social skills, such as vocal/gestural imitation, and involves cultural transmission. It may even be considered a form of oral communication that emerged before language (Fitch, 2006). If narrowing happens for any form of communication, it should also occur for music. Indeed, in one study, 6-month-olds were able to discriminate rhythms specific to their culture and those unfamiliar to them; however, 12-month-olds could do so only with a rhythm specific to their culture (Hannon & Trehub, 2005). Furthermore, early and active exposure to culture-specific music rhythms and tonalities may accelerate perceptual narrowing in music (Trainor, Marie, Gerry, Whiskin, & Unrau, 2012).

Auditory Speech

Narrowing of speech perception is also well documented. Infants' speech perception becomes tuned toward their native language at around 10–12 months. Young infants discriminate fine phonetic differences, such as differences in voice onset time, between consonants such as /pa/ and /ba/ (Eimas, Siqueland, Jusczyk, & Vigorito, 1971). Infants are also able to discriminate vowels (e.g., between /a/ and /i/ or /i/ and /u/; Trehub, 1973). Not only can infants younger than 6–8 months discriminate categorically native phonetic contrasts, they can also discriminate those that fall outside their native language. For example, 6- to 8-month-olds who are learning English can discriminate the nonnative dental/retroflex contrasts such as the Hindi /Ta/ versus /ta/ (Werker & Tees, 1999). However, a decline in cross-language consonant perception occurs at 10–12 months. Younger children can discriminate many phonetic differences, whereas older children lose this ability for contrasts that fall outside their native language. Therefore, phonetic discrimination starts as language general but gradually narrows, showing language-specific tuning.

Sign Language

Narrowing has also been observed in perceiving sign language (Palmer, Fais, Golinkoff, & Werker, 2012). Hearing infants are able to discriminate American Sign Language (ASL) signs at 4 months but not at 14 months, whereas infants learning ASL are still able to discriminate signs at the later age. This result suggests that narrowing happens for language regardless of the whether the support is gestural or oral.

Narrowing as a Categorization Process Serving Social Needs

Our view is that narrowing occurs for different cognitive abilities commonly involved in communication, even though not all evidence uniformly shows that narrowing occurs simultaneously across different domains (see, e.g., Hayden, Bhatt, Kangas, Zieber, & Joseph, 2012, for evidence of own-race specialization several months before language narrowing). Therefore, the underlying mechanism might not be specific to one cognitive ability, but common to all communicative tools. In terms of evolution, it emerged first for processing faces and facial expressions, and therefore, should have been part of primitive language involving rhythm and gestures before becoming part of oral language. Concomitant occurrence in multiple modalities does not explain why narrowing happens. Our take is that infants are born into a social group that has developed a culture of communication that is unique, opaque (i.e., association between an oral/gestural sign and a referent may be arbitrary), and subject to evolution. The most effective way to integrate within the group may be to adapt rapidly to the group's social habits and communication traditions. During the first 12 months, when infants mainly interact with the mother/caregiver, they have to learn rapidly the appropriate way of communicating when interacting within the social group. The mother/caregiver transmits the basic aspects of communication that are crucial to being part of the community: smiles, language characteristics, and recognition of specific faces. The child then calibrates its communication systems using learning abilities including imitation. If the child is exposed to several individuals, he or she uses convergence mechanisms to calibrate the system and ends up with finely tuned representations of the faces in the environment as well as detailed representations of the phonemes and prosodic patterns in the ambient language(s). By this account, narrowing is a categorization process that serves social needs. In the language domain, infants build a broad category including the nonnative contrasts that are lost, and retain tightly tuned categories for native contrasts. In the same way, in the face domain, infants build a large category for other-race faces including multiple other-race face categories (e.g., for infants exposed mainly to Caucasian faces, this category would include Asian and African faces), and build tightly tuned categories organized around subordinate-level identity information for same-race faces (i.e., Olivier vs. Helene vs. Paul). Therefore, narrowing can be conceived of as a system that allows the infant to become more efficient or specialized for the social stimuli at hand in the close environment.

Conclusion

In this article, we have argued that perceptual narrowing should be observed for all forms of social communication. During evolution, our social communication used different perceptual and cognitive systems—face, facial expression, gesture, vocalization, sound, and oral language—that emerged at different times. These systems are interactive in adults and their neural mechanisms are linked to some extent. Their development presents similarities as infants adjust to their native social group. We suggest that the adaptation is accomplished through a specific mechanism dedicated to social cognition, which encompasses the different modalities of communication, including manual and visuo-facial gesture processing, as well as vocalization processing abilities. However, we are uncommitted to whether such a mechanism is part of the core endowment present at birth or is a product of increasing specialization that occurs with development. Behavioral and neuroimaging studies should look at the intertwining of the development of these social abilities. Our suggestion also pertains to the field of neurological or developmental disorders: We predict that deficits in either the development of manual gesture processing, facial gesture processing, or vocalization processing should result in disorders of social communication. This prediction is supported by work on autism spectrum disorders suggesting that social communication strongly relies on the healthy development of these different abilities (Adolphs, Sears, & Piven, 2001; Baron-Cohen, 1989). Although further work is needed to understand this multimodal adaptation process, our account is that the interplay of systems that process faces and language in the development of social communication underlies the occurrences of perceptual narrowing in different domains.
  23 in total

1.  Perceptual narrowing of linguistic sign occurs in the 1st year of life.

Authors:  Stephanie Baker Palmer; Laurel Fais; Roberta Michnick Golinkoff; Janet F Werker
Journal:  Child Dev       Date:  2012-01-25

2.  Infants deploy selective attention to the mouth of a talking face when learning speech.

Authors:  David J Lewkowicz; Amy M Hansen-Tift
Journal:  Proc Natl Acad Sci U S A       Date:  2012-01-17       Impact factor: 11.205

3.  Action observation activates premotor and parietal areas in a somatotopic manner: an fMRI study.

Authors:  G Buccino; F Binkofski; G R Fink; L Fadiga; L Fogassi; V Gallese; R J Seitz; K Zilles; G Rizzolatti; H J Freund
Journal:  Eur J Neurosci       Date:  2001-01       Impact factor: 3.386

Review 4.  Renewal of the neurophysiology of language: functional neuroimaging.

Authors:  Jean-François Démonet; Guillaume Thierry; Dominique Cardebat
Journal:  Physiol Rev       Date:  2005-01       Impact factor: 37.312

5.  Meaningful call combinations in a non-human primate.

Authors:  Kate Arnold; Klaus Zuberbühler
Journal:  Curr Biol       Date:  2008-03-11       Impact factor: 10.834

6.  Infant ability to tell voices apart rests on language experience.

Authors:  Elizabeth K Johnson; Ellen Westrek; Thierry Nazzi; Anne Cutler
Journal:  Dev Sci       Date:  2011-04-25

7.  Speech perception in infants.

Authors:  P D Eimas; E R Siqueland; P Jusczyk; J Vigorito
Journal:  Science       Date:  1971-01-22       Impact factor: 47.728

Review 8.  Influences on infant speech processing: toward a new synthesis.

Authors:  J F Werker; R C Tees
Journal:  Annu Rev Psychol       Date:  1999       Impact factor: 24.137

Review 9.  Effects of phonetic context on audio-visual intelligibility of French.

Authors:  C Benoît; T Mohamadi; S Kandel
Journal:  J Speech Hear Res       Date:  1994-10

10.  Narrowing of intersensory speech perception in infancy.

Authors:  Ferran Pons; David J Lewkowicz; Salvador Soto-Faraco; Núria Sebastián-Gallés
Journal:  Proc Natl Acad Sci U S A       Date:  2009-06-17       Impact factor: 11.205

View more
  16 in total

1.  Reduced perceptual narrowing in synesthesia.

Authors:  Daphne Maurer; Julian K Ghloum; Laura C Gibson; Marcus R Watson; Lawrence M Chen; Kathleen Akins; James T Enns; Takao K Hensch; Janet F Werker
Journal:  Proc Natl Acad Sci U S A       Date:  2020-04-22       Impact factor: 11.205

2.  Development of Preferences for Differently Aged Faces of Different Races.

Authors:  Michelle Heron-Delaney; Paul C Quinn; Fabrice Damon; Kang Lee; Olivier Pascalis
Journal:  Soc Dev       Date:  2017-07-03

3.  Visual selective attention biases contribute to the other-race effect among 9-month-old infants.

Authors:  Julie Markant; Lisa M Oakes; Dima Amso
Journal:  Dev Psychobiol       Date:  2015-10-21       Impact factor: 3.038

4.  Face race processing and racial bias in early development: A perceptual-social linkage.

Authors:  Kang Lee; Paul C Quinn; Olivier Pascalis
Journal:  Curr Dir Psychol Sci       Date:  2017-06-14

5.  The role of limited salience of speech in selective attention to faces in toddlers with autism spectrum disorders.

Authors:  Frederick Shic; Quan Wang; Suzanne L Macari; Katarzyna Chawarska
Journal:  J Child Psychol Psychiatry       Date:  2019-08-30       Impact factor: 8.982

Review 6.  Linking language and categorization in infancy.

Authors:  Brock Ferguson; Sandra Waxman
Journal:  J Child Lang       Date:  2016-11-10

7.  Why are faces denser in the visual experiences of younger than older infants?

Authors:  Swapnaa Jayaraman; Caitlin M Fausey; Linda B Smith
Journal:  Dev Psychol       Date:  2017-01

8.  Listening to the calls of the wild: The role of experience in linking language and cognition in young infants.

Authors:  Danielle R Perszyk; Sandra R Waxman
Journal:  Cognition       Date:  2016-05-19

9.  Bilingualism affects audiovisual phoneme identification.

Authors:  Sabine Burfin; Olivier Pascalis; Elisa Ruiz Tada; Albert Costa; Christophe Savariaux; Sonia Kandel
Journal:  Front Psychol       Date:  2014-10-21

Review 10.  A mechanistic approach to cross-domain perceptual narrowing in the first year of life.

Authors:  Hillary Hadley; Gwyneth C Rost; Eswen Fava; Lisa S Scott
Journal:  Brain Sci       Date:  2014-12-16
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.