| Literature DB >> 33982031 |
Abstract
A surprisingly sticky belief is that a machine learning model merely reflects existing algorithmic bias in the dataset and does not itself contribute to harm. Why, despite clear evidence to the contrary, does the myth of the impartial model still hold allure for so many within our research community? Algorithms are not impartial, and some design choices are better than others. Recognizing how model design impacts harm opens up new mitigation techniques that are less burdensome than comprehensive data collection.Entities:
Year: 2021 PMID: 33982031 PMCID: PMC8085589 DOI: 10.1016/j.patter.2021.100241
Source DB: PubMed Journal: Patterns (N Y) ISSN: 2666-3899
Figure 1Our model choices express a preference for model behavior. An example most students of machine learning will recognize is the plot between the degrees of a polynomial (a) and the degree of overfitting.
Figure 2Most natural image datasets exhibit a long-tail distribution with an unequal frequency of attributes in the training data. Notions of fairness often coincide with how underrepresented sensitive attributes are treated by the model. Our model design choices can excacerbate or curb disparate harm on the long-tail.