| Literature DB >> 25970430 |
Samiul Hasan1, Satish V Ukkusuri2.
Abstract
Geo-location data from social media offers us information, in new ways, to understand people's attitudes and interests through their activity choices. In this paper, we explore the idea of inferring individual life-style patterns from activity-location choices revealed in social media. We present a model to understand life-style patterns using the contextual information (e. g. location categories) of user check-ins. Probabilistic topic models are developed to infer individual geo life-style patterns from two perspectives: i) to characterize the patterns of user interests to different types of places and ii) to characterize the patterns of user visits to different neighborhoods. The method is applied to a dataset of Foursquare check-ins of the users from New York City. The co-existence of several location contexts and the corresponding probabilities in a given pattern provide useful information about user interests and choices. It is found that geo life-style patterns have similar items-either nearby neighborhoods or similar location categories. The semantic and geographic proximity of the items in a pattern reflects the hidden regularity in user preferences and location choice behavior.Entities:
Mesh:
Year: 2015 PMID: 25970430 PMCID: PMC4430213 DOI: 10.1371/journal.pone.0124819
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Check-in distribution.
Dataset details.
| Original dataset | |
| Number of users | 20606 |
| Number of check-ins | 680564 |
| Study sample | |
| Number of geo-active users | 3256 |
| Number of check-ins from geo-active users | 504000 |
Fig 2Components of the proposed approach to enrich the location contexts of user check-ins and use it for modeling geo life-style patterns.
Fig 3Topic model of geo life-style pattern inference.
White circles represent random variables, shaded circles represent observed variables, rectangles represent the repetitiveness of the data, and arrows represent the dependency among the entities.
Fig 4Perplexity versus the number of life-style patterns.
Examples of location categories used and the list of the categories for which names are used.
| Few of the location categories used | Location categories for which names are used |
|---|---|
| American Restaurant | Burrito Place |
| Asian Restaurant | Burger Joint |
| Bar | Clothing Store |
| Baseball Stadium | Coffee Shop |
| Boat or Ferry | Cosmetics Shop |
| Bookstore | Department Store |
| Bridge | Drugstore or Pharmacy |
| Building | Electronics Store |
| College Quad | Fast Food Restaurant |
| Cupcake Shop | Furniture or Home Store |
| Dessert Shop | Grocery or Supermarket |
| Diner | Women’s Store |
| Donut Shop | Gift Shop |
| Event Space | Kids Store |
| Falafel Restaurant | Mall |
| Food Truck | Sandwich Place |
| French Restaurant | Toy or Game Store |
| Gastropub | Men’s Store |
| General College & University | Wings Joint |
| Harbor or Marina | |
| Hardware Store | |
| Home | |
| Hospital | |
| Hotel | |
| Ice Cream Shop | |
| Italian Restaurant | |
| Juice Bar | |
| Karaoke Bar | |
| Latin American Restaurant | |
| Lounge | |
| Mexican Restaurant | |
| Middle Eastern Restaurant | |
| Miscellaneous Shop | |
| Movie Theater |
Results of user interests pattern model.
|
|
|
|
|
|
|
| Activity Location | Prob. | Activity Location | Prob. | Activity Location | Prob. |
| Residential Building | 0.3315 | Doctor’s Office | 0.1836 | Food Truck | 0.5114 |
| Beach | 0.1642 | Post Office | 0.1655 | Indian Restaurant | 0.2006 |
| Courthouse | 0.0827 | Light Rail | 0.1122 | Shake Shack | 0.0217 |
| Park | 0.0529 | Movie Theater | 0.0718 | Asian Restaurant | 0.0154 |
| Theme Park | 0.0355 | Pathmark | 0.0356 | Temple | 0.0147 |
| Levi’s | 0.0226 | Grocery or Supermarket | 0.0341 | Restaurant | 0.0137 |
| Government Building | 0.0210 | Walmart | 0.0315 | FIKA espresso bar | 0.0130 |
| Hookah Bar | 0.0190 | Ice Cream Shop | 0.0282 | 4food | 0.0109 |
| American Restaurant | 0.0182 | Diner | 0.0271 | Brewery | 0.0102 |
| Multiplex | 0.0158 | Parking | 0.0182 | Dessert Shop | 0.0081 |
|
|
|
|
|
|
|
| Activity Location | Prob. | Activity Location | Prob. | Activity Location | Prob. |
| Pub | 0.6889 | Korean Restaurant | 0.1999 | Train Station | 0.9410 |
| Bar | 0.1501 | Ramen or Noodle House | 0.1934 | Train | 0.0031 |
| Sports Bar | 0.0432 | Vietnamese Restaurant | 0.1054 | Belmont Buy Any Drug | 0.0023 |
| Lounge | 0.0063 | Asian Restaurant | 0.0728 | Newport Centre Mall | 0.0021 |
| A&P | 0.0060 | Japanese Restaurant | 0.0579 | Cajun Restaurant | 0.0011 |
| Rogo | 0.0034 | Chinese Restaurant | 0.0470 | Historic Site | 0.0011 |
| Pool Hall | 0.0032 | Dumpling Restaurant | 0.0413 | My Way Cup | 0.0011 |
| Harry’s Burrito | 0.0032 | Malaysian Restaurant | 0.0225 | Light Rail | 0.0010 |
| Dive Bar | 0.0026 | Indian Restaurant | 0.0189 | Captain Caf | 0.0010 |
| Apple Store | 0.0021 | Dessert Shop | 0.0183 | Transit | 0.0009 |
|
|
|
|
|
|
|
| Activity Location | Prob. | Activity Location | Prob. | Activity Location | Prob. |
| Sushi Restaurant | 0.4740 | College & University | 0.2502 | Target | 0.1950 |
| Japanese Restaurant | 0.3324 | College Dorm | 0.2121 | McDonald’s | 0.1573 |
| Caf | 0.0151 | College Building | 0.0661 | Hookah Bar | 0.0921 |
| Indian Restaurant | 0.0096 | Synagogue | 0.0531 | Bes | 0.0621 |
| Chinese Restaurant | 0.0086 | Student Center | 0.0451 | Burger King | 0.0451 |
| Sunrise Mart | 0.0062 | Basketball Court | 0.0335 | Fast Food Restaurant | 0.0420 |
| Brewery | 0.0048 | Harbor or Marina | 0.0196 | Wendy’s | 0.0340 |
| Cemetery | 0.0038 | Park | 0.0186 | Video Game Store | 0.0331 |
| Seafood Restaurant | 0.0035 | Multiplex | 0.0159 | Walgreens | 0.0309 |
| Spanish Restaurant | 0.0031 | College Auditorium | 0.0135 | Toys R Us | 0.0288 |
|
|
|
|
|
|
|
| Activity Location | Prob. | Activity Location | Prob. | Activity Location | Prob. |
| Highway or Road | 0.6841 | Baseball Stadium | 0.4809 | Seafood Restaurant | 0.2439 |
| General Travel | 0.1416 | Stadium | 0.2119 | Trader Joe’s | 0.1757 |
| Bridge | 0.0378 | Football Stadium | 0.1043 | Italian Restaurant | 0.0850 |
| Waldbaums | 0.0088 | Sports Bar | 0.0294 | American Restaurant | 0.0732 |
| Stop & Shop | 0.0050 | Basketball Stadium | 0.0127 | Grey Dog | 0.0568 |
| Hot Spring | 0.0050 | Baseball Field | 0.0106 | French Restaurant | 0.0317 |
| Chinese Restaurant | 0.0047 | Five Guys Burgers | 0.0089 | Mud | 0.0309 |
| Bob’s Discount Furniture | 0.0032 | Racetrack | 0.0062 | Kaffe 1668 | 0.0237 |
| Neighborhood | 0.0024 | Plane | 0.0062 | Pizza Place | 0.0130 |
| Jersey Gardens | 0.0024 | Basketball Court | 0.0035 | Asian Restaurant | 0.0122 |
|
|
|
|
|
|
|
| Activity Location | Prob. | Activity Location | Prob. | Activity Location | Prob. |
| Airport | 0.4956 | Bus Line | 0.5598 | Chinese Restaurant | 0.4795 |
| Airport Terminal | 0.3586 | Subway | 0.0822 | Asian Restaurant | 0.2854 |
| Airport Gate | 0.0289 | Home | 0.0496 | Energy Kitchen | 0.0352 |
| Airport Tram | 0.0175 | Bus Station | 0.0364 | IKEA | 0.0107 |
| Plane | 0.0175 | Train Station | 0.0335 | Dim Sum Restaurant | 0.0103 |
| Airport Lounge | 0.0106 | Playground | 0.0174 | Zaro’s Bakery | 0.0103 |
| Government Building | 0.0033 | McDonald’s | 0.0139 | Apple Store | 0.0100 |
| New American Restaurant | 0.0016 | Train | 0.0133 | Italian Restaurant | 0.0071 |
| Candy Store | 0.0013 | PC Richard & Son | 0.0124 | Pizza Place | 0.0064 |
| Hotel | 0.0011 | Transit | 0.0089 | Cupcake Shop | 0.0043 |
Fig 5The distribution of location contexts of a user, the pattern proportion and the activity location distributions for the corresponding patterns.
(A)User 1 having few dominating activity location choice patterns. For the pattern proportion of the user, only the patterns with probabilities not less than 0.05 are shown. For the activity location distribution, only the locations with frequencies not less than five are presented; the data contains 617 check-ins of the user. (B) User 2 having a diverse activity location choice patterns. For the pattern proportion of the user, only the patterns with probabilities not less than 0.05 are shown. For the activity location distribution, only the locations with frequencies not less than two are presented; the data contains 145 check-ins of the user.
Results of user geo context pattern model.
|
|
|
|
|
| Neighborhood | Prob. | Neighborhood | Prob. |
| Prospect Heights | 0.3145 | Battery Park City-Lower Manhattan | 0.9292 |
| park-cemetery-etc-Brooklyn | 0.2866 | East Flushing | 0.0679 |
| Hudson Yards-Chelsea-Flat Iron-Union Square | 0.1194 | Clinton | 0.0005 |
| West Village | 0.1147 | West New Brighton-New Brighton-St. George | 0.0004 |
| SoHo-TriBeCa-Civic Center-Little Italy | 0.1119 | Far Rockaway-Bayswater | 0.0002 |
| East Village | 0.0169 | Midtown-Midtown South | 0.0002 |
| Fort Greene | 0.0146 | Erasmus | 0.0002 |
| Chinatown | 0.0095 | Pelham Bay-Country Club-City Island | 0.0002 |
| DUMBO-Vinegar Hill-Downtown Brooklyn-Boerum Hill | 0.0043 | East Harlem South | 0.0001 |
| Clinton | 0.0008 | Williamsburg | 0.0001 |
|
|
|
|
|
| Neighborhood | Prob. | Neighborhood | Prob. |
| Forest Hills | 0.3127 | Midtown-Midtown South | 0.3022 |
| East Harlem South | 0.1784 | Hamilton Heights | 0.1934 |
| Glendale | 0.1366 | Hudson Yards-Chelsea-Flat Iron-Union Square | 0.1893 |
| Middle Village | 0.1267 | West Village | 0.1349 |
| Kew Gardens | 0.1038 | East Village | 0.0876 |
| Maspeth | 0.0712 | Clinton | 0.0475 |
| park-cemetery-etc-Queens | 0.0247 | SoHo-TriBeCa-Civic Center-Little Italy | 0.0316 |
| Rego Park | 0.0199 | Turtle Bay-East Midtown | 0.0043 |
| Lindenwood-Howard Beach | 0.0146 | Battery Park City-Lower Manhattan | 0.0015 |
| Ridgewood | 0.0017 | Murray Hill-Kips Bay | 0.0013 |
|
|
|
|
|
| Neighborhood | Prob. | Neighborhood | Prob. |
| Washington Heights South | 0.3472 | Jamaica | 0.3723 |
| Seagate-Coney Island | 0.2169 | Manhattanville | 0.2139 |
| Hudson Yards-Chelsea-Flat Iron-Union Square | 0.1320 | Richmond Hill | 0.1017 |
| SoHo-TriBeCa-Civic Center-Little Italy | 0.1067 | Highbridge | 0.0784 |
| West Village | 0.0918 | Briarwood-Jamaica Hills | 0.0752 |
| Midtown-Midtown South | 0.0646 | St. Albans | 0.0663 |
| East Village | 0.0210 | Jamaica Estates-Holliswood | 0.0250 |
| Manhattanville | 0.0062 | Washington Heights North | 0.0198 |
| West Brighton | 0.0062 | Hollis | 0.0129 |
| park-cemetery-etc-Manhattan | 0.0011 | Cambria Heights | 0.0129 |
|
|
|
|
|
| Neighborhood | Prob. | Neighborhood | Prob. |
| DUMBO-Vinegar Hill-Downtown Brooklyn-Boerum Hill | 0.5179 | Flatbush | 0.1946 |
| Carroll Gardens-Columbia Street-Red Hook | 0.2996 | Prospect Lefferts Gardens-Wingate | 0.1832 |
| Brooklyn Heights-Cobble Hill | 0.1802 | Brownsville | 0.1727 |
| Fort Greene | 0.0002 | Canarsie | 0.1052 |
| North Side-South Side | 0.0002 | East Flatbush-Farragut | 0.0603 |
| Manhattanville | 0.0002 | Erasmus | 0.0598 |
| Hudson Yards-Chelsea-Flat Iron-Union Square | 0.0001 | Crown Heights South | 0.0553 |
| Flatbush | 0.0001 | Crown Heights North | 0.0529 |
| Murray Hill | 0.0001 | Flatlands | 0.0342 |
| Forest Hills | 0.0001 | Rugby-Remsen Village | 0.0235 |
|
|
|
|
|
| Neighborhood | Prob. | Neighborhood | Prob. |
| Midtown-Midtown South | 0.3743 | East Harlem North | 0.2153 |
| park-cemetery-etc-Queens | 0.3176 | Schuylerville-Throgs Neck-Edgewater Park | 0.1158 |
| Woodside | 0.1506 | East Tremont | 0.0965 |
| Hudson Yards-Chelsea-Flat Iron-Union Square | 0.0805 | Van Nest-Morris Park-Westchester Square | 0.0649 |
| Queens Village | 0.0599 | Soundview-Castle Hill-Clason Point-Harding Park | 0.0586 |
| Corona | 0.0048 | Westchester-Unionport | 0.0509 |
| Airport | 0.0041 | Parkchester | 0.0406 |
| park-cemetery-etc-Manhattan | 0.0037 | East Concourse-Concourse Village | 0.0404 |
| Lincoln Square | 0.0005 | Hunts Point | 0.0351 |
| East Williamsburg | 0.0004 | Mount Hope | 0.0351 |
|
|
|
|
|
| Neighborhood | Prob. | Neighborhood | Prob. |
| New Springville-Bloomfield-Travis | 0.1629 | Jackson Heights | 0.1970 |
| Stapleton-Rosebank | 0.1177 | Rego Park | 0.0967 |
| Westerleigh | 0.1077 | Jamaica Estates-Holliswood | 0.0700 |
| West New Brighton-New Brighton-St. George | 0.0965 | Woodhaven | 0.0620 |
| Todt Hill-Emerson Hill-Heartland Village-Lighthouse Hill | 0.0732 | South Jamaica | 0.0558 |
| Charleston-Richmond Valley-Tottenville | 0.0649 | Bellerose | 0.0556 |
| Old Town-Dongan Hills-South Beach | 0.0605 | Springfield Gardens South-Brookville | 0.0533 |
| Great Kills | 0.0546 | Rosedale | 0.0495 |
| New Dorp-Midland Beach | 0.0469 | East Elmhurst | 0.0492 |
| Mariner’s Harbor-Arlington-Port Ivory-Graniteville | 0.0407 | Ozone Park | 0.0488 |
Fig 6Geo context patterns in New York City.
(A)Pattern 6 (B) Pattern 8 (C)Pattern 21