| Literature DB >> 35432779 |
Bill Howe1, Jackson Maxfield Brown1, Bin Han1, Bernease Herman1, Nic Weber1, An Yan2, Sean Yang1, Yiwei Yang1.
Abstract
We consider the use of AI techniques to expand the coverage, access, and equity of urban data. We aim to enable holistic research on city dynamics, steering AI research attention away from profit-oriented, societally harmful applications (e.g., facial recognition) and toward foundational questions in mobility, participatory governance, and justice. By making available high-quality, multi-variate, cross-scale data for research, we aim to link the macrostudy of cities as complex systems with the reductionist view of cities as an assembly of independent prediction tasks. We identify four research areas in AI for cities as key enablers: interpolation and extrapolation of spatiotemporal data, using NLP techniques to model speech- and text-intensive governance activities, exploiting ontology modeling in learning tasks, and understanding the interaction of fairness and interpretability in sensitive contexts.Entities:
Year: 2022 PMID: 35432779 PMCID: PMC8994025 DOI: 10.1140/epjs/s11734-022-00475-z
Source DB: PubMed Journal: Eur Phys J Spec Top ISSN: 1951-6355 Impact factor: 2.891
Fig. 1Inpainting results of taxi trip data. From left to right, the columns are: ground truth images; the irregular masks; the masked ground truth; the final inpainting results
The effect of equal explanations on accuracy and fairness distances of the model
| Reg rate | Explanation loss | Accuracy | Equality of opportunity | Equalized odds |
|---|---|---|---|---|
| 0 | 1.68 | 70.40 | 0.19 | 0.50 |
| 0.2 | 0.05 | 70.11 | 0.22 | 0.56 |
| 0.4 | 0.03 | 70.16 | 0.22 | 0.58 |
| 0.6 | 0.02 | 70.90 | 0.21 | 0.56 |
| 0.8 | 0.01 | 70.65 | 0.23 | 0.56 |
| 1 | 0.02 | 69.70 | 0.24 | 0.60 |
Dataset statistics
| Programs | Organization | ||||
|---|---|---|---|---|---|
| Train | Val | Test | Train | Val | Test |
| 6412 | 801 | 802 | 4558 | 570 | 570 |
Fig. 2Illustration of our framework
Experimental results on the Program and Organization datasets
| Programs | Organization | |||||||
|---|---|---|---|---|---|---|---|---|
| Acc | P | R | F1 | Acc | P | R | F1 | |
| Majority | 0.075 | 0.006 | 0.075 | 0.01 | 0.056 | 0.003 | 0.056 | 0.006 |
| Random | 0.003 | 0.016 | 0.003 | 0.003 | 0.006 | 0.018 | 0.006 | 0.008 |
| TF-IDF + naive | 0.102 | 0.085 | 0.132 | 0.087 | 0.090 | 0.032 | 0.090 | 0.030 |
| Glove + naive | 0.202 | 0.143 | 0.201 | 0.158 | 0.249 | 0.141 | 0.249 | 0.171 |
| TF-IDF + Ontology | 0.145 | 0.134 | 0.167 | 0.159 | 0.143 | 0.095 | 0.123 | 0.117 |
| Glove + Ontology |
|
|
|
|
|
|
|
|
Bold indicates best scores (our method)
Acc accuracy, P precision, R recall, F1 F1 score. Because these are custom datasets that are not publicly available, we also provide results from two baseline methods, majority vote and random selection. We can observe that considering ontology improves the results significantly