| Literature DB >> 33732917 |
Meng Cai1.
Abstract
Natural language processing (NLP) has shown potential as a promising tool to exploit under-utilized urban data sources. This paper presents a systematic review of urban studies published in peer-reviewed journals and conference proceedings that adopted NLP. The review suggests that the application of NLP in studying cities is still in its infancy. Current applications fell into five areas: urban governance and management, public health, land use and functional zones, mobility, and urban design. NLP demonstrates the advantages of improving the usability of urban big data sources, expanding study scales, and reducing research costs. On the other hand, to take advantage of NLP, urban researchers face challenges of raising good research questions, overcoming data incompleteness, inaccessibility, and non-representativeness, immature NLP techniques, and computational skill requirements. This review is among the first efforts intended to provide an overview of existing applications and challenges for advancing urban research through the adoption of NLP.Entities:
Keywords: Natural language processing; Text mining; Urban big data; Urban research
Year: 2021 PMID: 33732917 PMCID: PMC7944036 DOI: 10.1016/j.heliyon.2021.e06322
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Literature search criteria.
| Database | Search term | Search field | Subject area | Source/document type | Other filter |
|---|---|---|---|---|---|
| EBSCO Urban Studies Abstracts | “natural language processing” | “Title, Abstract, or Keywords” | N/A | N/A | N/A |
| Scopus | “natural language processing” AND (city OR urban) | “Title, Abstract, or Keywords: | “Social sciences” | “Journals OR conference proceedings” | N/A |
| ProQuest | “natural language processing” AND (city OR urban) | “Anywhere except full text” | N/A | “Conference Papers & Proceedings OR Scholarly Journals” | “Peer reviewed” |
| Web of Science | “natural language processing” AND (city OR urban) | “Topic” (i.e. title, abstract, author keywords, and Keywords Plus) | N/A | “Article” | N/A |
Figure 1Amount of urban studies using NLP by year.
Summary of included literatures.
| Study | Topic | NLP application | Data | Study area |
|---|---|---|---|---|
| Detecting citizen problems and their locations | Urban governance and management | 100 tweets | Aegean Region, Turkey | |
| Gender mainstreaming in slum rehabilitation housing management in Mumbai, India | Urban governance and management | 12 interviews and 2 focus groups | Mumbai, India | |
| An application of people's sentiment from social media to smart cities | Urban governance and management | 200,000 tweets | New York City, NY, US | |
| Identifying spatiotemporal urban activities | Public health | 8,098,864 tweets | Baltimore, MD, Washington D.C., and New York City, NY, US | |
| Detecting urban prostitution activities | Urban governance and management | 3,387 policy department prostitution arrest records and 10 years' hotel reviews and price data | Phoenix, AZ, US | |
| Information needs and communication gaps between citizens and local governments online during natural disasters | Urban governance and management | 96,423 tweets | Maryland, US | |
| Understanding the perceptions of people toward their living environments based on online neighborhood reviews | Public health | 7,673 neighborhood reviews on Niche | New York City, NY, US | |
| A framework for harvesting local place names from geotagged housing advertisements | Urban governance and management | 35,852 housing advertisements from Craigslist | New York City, NY, Los Angeles, CA, Chicago, IL, Richmond, VI, Boise, ID, and Spokane, WA, US | |
| Quantifying the spatiotemporal dynamics of industrial land uses | Land use and functional zones | POIs data from Gaode Map and Google Earth images | Mega Hangzhou Bay Region, China | |
| Emotional landmarks in cities | Urban design | 61,516,961 posts from Facebook, Twitter, Instagram, Foursquare, and Yelp | Rome, Milan, and Turin, Italy; Berlin, Germany; Sao Paulo, Brazil; Montreal and Toronto, Canada; New York, NY and New Haven, CT, US; Hong Kong; Cairo, Egypt; and Istanbul, Turkey | |
| Extraction of disaster-relevant information from social media | Urban governance and management | 346,764 tweets | Joplin, MO and New York City, NY, US | |
| A crowd-sourced cognitive map to display people's cognitive perception of urban space | Urban design | 1,785,768 posts from Instagram | Bundang, Dongtan, Ilsan, and Songdo, Korea | |
| Thematic structure and spatial-temporal patterns of building renovation and adaptive reuse in cities | Urban governance and management | 2,500,000 building permits | New York City, NY, Los Angeles, CA, Chicago, IL, Austin, TX, San Francisco, CA, Seattle, WA, and Boston, MA, US | |
| A regionalization method for clustering and partitioning trajectories | Land use and functional zones | 27,000,000 trajectories of call records from mobile phones | Beijing, China | |
| Measuring traffic interactions in urban road system from massive travel routes | Mobility | Taxi GPS trajectories | Beijing, China | |
| Identifying spatial interaction patterns of vehicle movements | Mobility | Taxi trajectories collected by the Global Navigation Satellite System | Beijing, China | |
| Predicting taxi demand hotspots | Mobility | Taxi pickup/drop-off data and event data from event listing sites | New York City, NY, US | |
| The geography of taste and urban culture | Public health | 4,100,000 restaurant reviews on Yelp | Boston, MA, Charlotte, NC, Cleveland, OH, Washington D.C., Detroit, MI, Las Vegas, NV, Philadelphia, PA, Phoenix, AZ, and Pittsburgh, PA, US and Toronto, Canada | |
| Real-time observations for urban air quality and public health | Public health | 17,560 tweets and weather data from the European Centre for Medium-Range Weather Forecasts | Europe | |
| Sustainability analysis of urban mobility | Mobility | 43,251 comments from Minube | Bilbao, Valencia, and Madrid, Spain | |
| A platform to analyze social streams in smart city initiatives | Urban governance and management | 530,000 policy department emergency phone call records and 126 tweets | Natal, Brazil | |
| Characterization of citizens | Urban governance and management | 2,634,176 tweets | Bogotá, Colombia | |
| Influenza detection | Public health | Tweets spanning 5 years and influenza diagnostic records from the Infectious Disease Surveillance Center | Japan | |
| Sensing the spatial distribution of urban land use | Land use and functional zones | High spatial resolution remote-sensing images | Guangzhou, Guangdong, China | |
| Discovering urban functional zones | Land use and functional zones | GPS trajectory data generated by 12,000 taxis and public transit records of 1,500,000 trips from 300,000 card holders | Beijing, China | |
| Discovering regions of different functions in a city | Land use and functional zones | 2 POIs datasets and 2 3-month GPS trajectory datasets generated by over 12,000 taxis | Beijing, China |