Mapping Lexical Dialect Variation in British English Using Twitter.

Jack Grieve, Chris Montgomery, Andrea Nini, Akira Murakami, Diansheng Guo.

Abstract

There is a growing trend in regional dialectology to analyse large corpora of social media data, but it is unclear if the results of these studies can be generalized to language as a whole. To assess the generalizability of Twitter dialect maps, this paper presents the first systematic comparison of regional lexical variation in Twitter corpora and traditional survey data. We compare the regional patterns found in 139 lexical dialect maps based on a 1.8 billion word corpus of geolocated UK Twitter data and the BBC Voices dialect survey. A spatial analysis of these 139 map pairs finds a broad alignment between these two data sources, offering evidence that both approaches to data collection allow for the same basic underlying regional patterns to be identified. We argue that these results license the use of Twitter corpora for general inquiries into regional lexical variation and change.
Copyright © 2019 Grieve, Montgomery, Nini, Murakami and Guo.

Keywords:  British English; Twitter; big data; dialectology; lexical variation; social media; sociolinguistics; spatial analysis

Year:  2019        PMID: 33733100      PMCID: PMC7861259          DOI: 10.3389/frai.2019.00011

Source DB:  PubMed          Journal:  Front Artif Intell        ISSN: 2624-8212


  1 in total

1.  Diffusion of lexical change in social media.

Authors:  Jacob Eisenstein; Brendan O'Connor; Noah A Smith; Eric P Xing
Journal:  PLoS One       Date:  2014-11-19       Impact factor: 3.240

  5 in total

1.  Network Structured Kinetic Models of Social Interactions.

Authors:  Martin Burger
Journal:  Vietnam J Math       Date:  2021-05-18

2.  Social Networks of Lexical Innovation. Investigating the Social Dynamics of Diffusion of Neologisms on Twitter.

Authors:  Quirin Würschinger
Journal:  Front Artif Intell       Date:  2021-11-01

3.  Geolocation of multiple sociolinguistic markers in Buenos Aires.

Authors:  Olga Kellert; Nicholas H Matlis
Journal:  PLoS One       Date:  2022-09-09       Impact factor: 3.752

4.  Reduction of Survey Sites in Dialectology: A New Methodology Based on Clustering.

Authors:  Péter Jeszenszky; Carina Steiner; Adrian Leemann
Journal:  Front Artif Intell       Date:  2021-05-20

5.  Using Twitter Data for the Study of Language Change in Low-Resource Languages. A Panel Study of Relative Pronouns in Frisian.

Authors:  Jelske Dijkstra; Wilbert Heeringa; Lysbeth Jongbloed-Faber; Hans Van de Velde
Journal:  Front Artif Intell       Date:  2021-04-15
