| Literature DB >> 35372829 |
Chris Welty1, Lora Aroyo1, Flip Korn1, Sara M McCarthy1, Shubin Zhao1.
Abstract
Successful knowledge graphs (KGs) solved the historical knowledge acquisition bottleneck by supplanting the previous expert focus with a simple, crowd-friendly one: KG nodes represent popular people, places, organizations, etc., and the graph arcs represent common sense relations like affiliations, locations, etc. Techniques for more general, categorical, KG curation do not seem to have made the same transition: the KG research community is still largely focused on logic-based methods that belie the common-sense characteristics of successful KGs. In this paper, we propose a simple yet novel three-tier crowd approach to acquiring class-level attributes that represent broad common sense associations between categories, and can be used with the classic knowledge-base default & override technique, to address the early label sparsity problem faced by machine learning systems for problems that lack data for training. We demonstrate the effectiveness of our acquisition and reasoning approach on a pair of very real industrial-scale problems: how to augment an existing KG of places and offerings (e.g. stores and products, restaurants and dishes) with associations between them indicating the availability of the offerings at those places. Label sparsity is a general problem, and not specific to these use cases, that prevents modern AI and machine learning techniques from applying to many applications for which labeled data is not readily available. As a result, the study of how to acquire the knowledge and data needed for AI to work is as much a problem today as it was in the 1970s and 80s during the advent of expert systems. Our approach was a critical part of enabling a worldwide local search capability on Google Maps, with which users can find products and dishes that are available in most places on earth.Entities:
Keywords: class-level attributes; common sense; crowdsourcing; knowledge acquisition; knowledge graph; map
Year: 2022 PMID: 35372829 PMCID: PMC8967349 DOI: 10.3389/frai.2022.830299
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Figure 1Google Maps local shopping search results for umbrellas in NYC shows stores that sell them.
Figure 2Example subset of graph with a place instance i, a place category c, its parent category , a offering category c, its parent and the class- and instance- level offering availability relations between them.
Glossary of terms.
|
| |
| Place | An establishment (store or restaurant) on Google Maps |
| Offering | A product or dish available at a place |
| KBC | Knowledge Base Completion |
| GMB | Google my Business (source store categories) |
| GPT | Google Product Taxonomy (source product categories) |
| UGC | User Generated Content–user responses to yes/no questions |
| CS | Crowd Sense, our approach |
| WebIE | Information extraction of offering names from place web pages |
| WALS | Matrix factorization using WALS to predict 〈 |
|
| |
|
| Set of place instances |
|
| Set of place categories |
|
| Set of offering categories |
|
| Place subclass/superclass relation |
|
| Offering subclass/superclass relation |
|
| Place instance/class type relation |
|
| Class-level offering @ place availability relation |
|
| Instance-level offering @ place availability relation |
|
| Base KG of place/offering classes and place instances |
|
| |
| R | Likelihood that place instance |
|
| |
|
| Rater score for place (class or instance) x and offering class o |
| α | Number of “always” answers for class-level pair 〈 |
| ν | Number of “never answers for class-level pair 〈 |
|
| Number of “yes” answers for instance-level pair 〈 |
|
| Number of “no” answers for instance-level pair 〈 |
Figure 3Example question used to gather UGC.
Figure 4Partial view of the PRODCAT data collection template with example answers from one rater.
Example CrowdSense ratings on pairs.
|
|
|
|
|
|
|---|---|---|---|---|
| Auto parts store | Pita | 0 | 0 | 5 |
| Bakery | Longline Vests | 0 | 0 | 5 |
| Beauty supply store | Aromatherapy | 5 | 0 | 0 |
| Bicycle store | Home furnishings | 0 | 0 | 5 |
| Butcher shop | Quicklime | 0 | 0 | 5 |
| Chinaware store | Watches | 0 | 0 | 5 |
| Clothing store | Women's shirts | 5 | 0 | 0 |
| Clothing store | Petite negligee | 5 | 0 | 0 |
| Clothing store | Truck tailgate caps | 0 | 0 | 5 |
| Clothing store | Chameleon | 0 | 0 | 5 |
| Clothing store | Typewriter ribbon | 0 | 0 | 5 |
| Coffee store | Instant coffee | 4 | 0 | 1 |
| Cosmetics store | Non-dairy milk | 0 | 0 | 5 |
| Drugstore | tarragon | 0 | 0 | 5 |
| Electronics store | Canister vacuums | 5 | 0 | 0 |
| Feed store | cybex | 0 | 0 | 5 |
| Fresh food market | Work dresses | 0 | 0 | 5 |
| Fruits and vegetables | Turkey sausage | 0 | 1 | 4 |
| Furniture store | Canopy beds | 4 | 1 | 0 |
| Furniture store | Box springs | 4 | 0 | 1 |
| Grocery store | Smart light bulbs | 0 | 0 | 5 |
| Grocery store | Frozen clams | 5 | 0 | 0 |
| Grocery store | Soy nuts | 4 | 1 | 0 |
| Home goods store | Storage baskets | 4 | 1 | 0 |
Figure 5Histogram of Normalized-MAE on CrowdSense pairs for three shopping and one dining (Section 5.5) class-level crowd task designs. Bins to the left indicate the relative number of pairs with lower error, making Shopping-PRODCAT the clear leader. Dining-MATRIX performs better than shopping MATRIX.
Figure 6Distribution of CrowdSense errors (Normalized-MAE) for ratings in four countries, comparing CrowdSense predictions from raters in each country to raters outside that country. A shift of scores to the left indicates lower overall error; surprisingly, for all countries except Japan, out-of-country CrowdSense raters are more accurate than those within the country.
Figure 7Precision, Recall, and F-measure for different ways of predicting for shopping.
Figure 8Precision, Recall, and F-measure for different ways of predicting for dining.
Example gold standard pairs.
|
|
|
|
|
|
|---|---|---|---|---|
| 7-Eleven | Convenience store | US | Distilled water | FALSE |
| ALDI | Grocery store | US | Fruitcake | TRUE |
| AURORA MKT | Store | US | Men's Gloves | FALSE |
| Adams Pharmacy | Pharmacy | US | Kool aid | TRUE |
| Ag construcciones | Building materials | PY | Blinds | TRUE |
| Alanyurt Gıda | General store | TR | Razor blades | TRUE |
| Amorino | Ice cream shop | FR | Meat | FALSE |
| Barnes and Noble | Book store | US | Blankets | FALSE |
| Barstow Buick | Car dealer | US | Crown victoria | TRUE |
| Barstow Buick | Car dealer | US | Gears | TRUE |
| Bazar | bazar | BR | Mary kay | FALSE |
Figure 9CrowdSense search results in NYC for knapsacks.