Georgina Curto, Mario Fernando Jojoa Acosta, Flavio Comim, Begoña Garcia-Zapirain.
Abstract
Among the myriad technical approaches and abstract guidelines proposed on the topic of AI bias, there has been an urgent call to translate the principle of fairness into operational AI reality, with the involvement of social sciences specialists to analyse the context of specific types of bias, since there is no generalizable solution. This article offers an interdisciplinary contribution to the topic of AI and societal bias, in particular bias against the poor, providing a conceptual framework of the issue and a tailor-made model from which meaningful data are obtained using Natural Language Processing word vectors in pretrained Google Word2Vec, Twitter and Wikipedia GloVe word embeddings. The results of the study offer the first set of data that evidences the existence of bias against the poor and suggest that Google Word2Vec shows a higher degree of bias when the terms are related to beliefs, whereas bias is higher in Twitter GloVe when the terms express behaviour. This article contributes to the body of work on bias, from both an AI and a social sciences perspective, by providing evidence of a transversal aggravating factor for historical types of discrimination. The evidence of bias against the poor also has important consequences in terms of human development, since it often leads to discrimination, which constitutes an obstacle to the effectiveness of poverty reduction policies.
Keywords: Artificial intelligence; Bias; Embeddings; Poverty
Year: 2022 PMID: 35789618 PMCID: PMC9243923 DOI: 10.1007/s00146-022-01494-z
Source DB: PubMed Journal: AI Soc ISSN: 0951-5666
Fig. 1 Block diagram of the proposed solution
Proximities and distances between unfavourable attributes and the key terms “poor” and “rich” and the ABI in Google News Word2vec pre-trained embeddings
| Negative attributes | Proximity to “poor” (cosine) | Proximity to “rich” (cosine) | Relative value: 1 suggests attribute closer to “poor” | Relative distance to “poor” (in radians) | Relative distance to “rich” (in radians) | Aporophobia bias indicator (ABI) |
|---|---|---|---|---|---|---|
| Substandard | 0.518799 | 0.065894 | 1 | 1.025350 | 1.504854 | 0.479503 |
| Dreadful | 0.496364 | 0.108623 | 1 | 1.051390 | 1.461958 | 0.410568 |
| Mediocre | 0.525181 | 0.157387 | 1 | 1.017868 | 1.412751 | 0.394883 |
| Inferior | 0.442338 | 0.154269 | 1 | 1.112590 | 1.415908 | 0.303316 |
| Indifference | 0.295424 | 0.049471 | 1 | 1.270896 | 1.521304 | 0.250408 |
| Displeasure | 0.181486 | −0.043921 | 1 | 1.388298 | 1.614732 | 0.226433 |
| Humiliating | 0.236273 | 0.013788 | 1 | 1.332267 | 1.557007 | 0.224740 |
| Abhorrent | 0.177211 | −0.034837 | 1 | 1.392643 | 1.605641 | 0.212997 |
| Disgust | 0.175618 | −0.033866 | 1 | 1.394262 | 1.604669 | 0.210406 |
| Disrespect | 0.178972 | −0.002676 | 1 | 1.390853 | 1.573472 | 0.182618 |
| Disregard | 0.165259 | −0.011534 | 1 | 1.404775 | 1.582331 | 0.177555 |
| Fear | 0.174980 | 0.019890 | 1 | 1.394910 | 1.550904 | 0.155994 |
| Irritation | 0.152907 | 0.011789 | 1 | 1.417287 | 1.559006 | 0.141719 |
| Hostile | 0.185884 | 0.045462 | 1 | 1.383824 | 1.525318 | 0.141493 |
| Rudeness | 0.176455 | 0.038615 | 1 | 1.393411 | 1.532171 | 0.138759 |
| Annoyance | 0.110991 | −0.026991 | 1 | 1.459575 | 1.597791 | 0.138215 |
| Disgusting | 0.259967 | 0.133528 | 1 | 1.307807 | 1.436867 | 0.129059 |
| Hostility | 0.132259 | 0.040978 | 1 | 1.438148 | 1.529806 | 0.091657 |
| Rejection | 0.100165 | 0.037907 | 1 | 1.470462 | 1.532879 | 0.062416 |
| Contempt | 0.091754 | 0.034602 | 1 | 1.478912 | 1.536186 | 0.057273 |
| Hate | 0.166657 | 0.111664 | 1 | 1.403357 | 1.458898 | 0.055540 |
| Insult | 0.150543 | 0.107800 | 1 | 1.419678 | 1.462786 | 0.043107 |
| Aversion | 0.169729 | 0.132875 | 1 | 1.400240 | 1.437526 | 0.037285 |
| Hate act | 0.143041 | 0.111930 | 1 | 1.427262 | 1.458631 | 0.031369 |
| Hate speech | 0.154789 | 0.134926 | 1 | 1.415381 | 1.435456 | 0.020075 |
| Antipathy | 0.082810 | 0.075422 | 1 | 1.487891 | 1.495302 | 0.007411 |
Source: authors’ creation
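The radian and ABI columns in the table above appear to follow directly from the two cosine columns: each radian value matches the arccosine of the corresponding cosine, and each ABI matches the angular distance to “rich” minus the angular distance to “poor”. The following is a minimal Python sketch under that reading. The function names are illustrative and do not come from the article.

```python
import math


def angular_distance(cosine: float) -> float:
    """Angle in radians whose cosine is the given similarity."""
    # Clamp to [-1, 1] so floating-point drift cannot break acos.
    return math.acos(max(-1.0, min(1.0, cosine)))


def abi(cos_poor: float, cos_rich: float) -> float:
    """Assumed ABI: distance to "rich" minus distance to "poor".

    A positive value means the attribute sits closer to "poor".
    """
    return angular_distance(cos_rich) - angular_distance(cos_poor)


def closer_to_poor(cos_poor: float, cos_rich: float) -> int:
    """Assumed 'relative value' flag: 1 if the attribute is closer to "poor"."""
    return 1 if cos_poor > cos_rich else 0


if __name__ == "__main__":
    # Cosines for "Substandard", copied from the table above.
    cos_poor, cos_rich = 0.518799, 0.065894
    print(angular_distance(cos_poor))
    print(angular_distance(cos_rich))
    print(abi(cos_poor, cos_rich), closer_to_poor(cos_poor, cos_rich))
```

Run on the “Substandard” row, the sketch should reproduce the tabulated distances and ABI up to rounding of the published cosines.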
Fig. 2 ABIs (difference in distance between how an attribute is associated with the term “poor” as compared with the term “rich”) for unfavourable attributes used by Cortina (2017) in Google News Word2vec pre-trained embeddings. Source: authors’ creation. Note: these words have been used by Cortina (2017) and identified by Comim, Borsi and Valerio (2019)
Fig. 3 ABIs for unfavourable attributes in Google News Word2vec pre-trained embeddings. Unfavourable attributes used by Cortina (2017) are shown in blue. Source: authors’ creation
Terms included in the study categorized according to Allport’s (1957) degree of action associated with prejudice
| Favourable | | Unfavourable | |
|---|---|---|---|
| Belief | superior, willpower, kind, courageous, calm, calmness, mildness, mild, innocuous, positive, dignified, delight, delightful, friend, friendship, courage, serenity, excellent, partner, pleasant, polite, brave, higher, adequate, true, happy, peace, peaceful (28) | Belief | inferior, mediocre, negative, rude, rudeness, lower, shame, shameful, shameless, substandard, slight, carelessness, unkind, inoffensive, distaste, repugnant, rival, scared, sicken, upset, adversary, enemy, opponent (23) |
| Attitude & action: | | Attitude & action: | |
| Communication | acknowledgement, empathy, patience, tolerate, attentiveness, respectful speech, patience, cordiality, agreement, endorsement, attestation, regard, taste, remember, interest, tolerance, contentment, politeness (19) | Antilocution | antipathy, disregard, no acknowledgement, denounce, denunciation, belligerence, belligerent, concern, denial, disagreement, derision, disregard, forget, ignore, indifference, absence of sympathy, refusal, defense, apathy, antagonism (20) |
| Acceptance | friendliness, friend, goodwill, kind, kindness, sympathy, acceptance, companionable, conciliate, fearless, cordiality, amicability, accord, self-assurance, attraction, desire, recommend, consonance, pleasure, pleasing, confidence, friendly, amity, affability, affection, benevolence, preservation, acquiescence, appetency, liking, becoming, pleasing, solace, love, love speech, liked, acceptance, accept, acceptation, like, complimentary, gentleness, attraction, attractive, approve, approval (46) | Avoidance | disgust, fear, impatience, afraid, alarmed, annoyance, annoying, anxiety, bitterness, challenger, corrupting, defense, defend, detestation, dislike, disgusting, disgust, disapprove, disapproval, detestation, displeasure, dread, dreadful, foe, ill feeling, ill will, irritating, irritation, loathe, loathing, opposition, repel, repugnance, repulse, repulsion, repulsive, resent, resentment, resistance, revulsion, unbecoming, undignified, upsetting, worry, calmness, independence, weighty, hate, abhorrence, abhorrent, hostile, hostility, neglect, unfriendliness, animosity, contempt (56) |
| Admiration | admiration, praise, approval, appreciation, delight, cherish, adore, flattery, pride, admirable, adulation, praise, dignified, appreciation, appreciate, respect (16) | Discrimination | degrading, rejection, affront, anger, animosity, aversion, conflict, degrading, demeaning, disrespect, enmity, hatred, intolerance, obstruction, offense, offend, offensive, scorn, slur, shamed, unsupportive, hostility, abandonment, humiliating, hate speech, insult (27) |
| Aid | aid, help, heal, support, love act, cooperation, comfort, facilitation, ally, shelter, encourage, encouraging (12) | Physical attack | hate act, physical aggression, abuse, abusive, aggression, assault, attack, bellicose, bellicosity, intimidate, intimidating, intimidation, violence, violent, harm, physical protection (16) |
Original expressions used by Cortina (2017) appear underlined
Source: authors’ creation
Proximities and distances between favourable attributes and the key terms “poor” and “rich” and the ABI in Google News Word2vec pre-trained embeddings
| Favourable attributes | Proximity to “poor” (cosine) | Proximity to “rich” (cosine) | Relative value: 1 suggests attribute closer to “poor” | Relative distance to “poor” (in radians) | Relative distance to “rich” (in radians) | Aporophobia bias indicator (ABI) |
|---|---|---|---|---|---|---|
| Sympathy | 0.169531 | 0.018321 | 1 | 1.400441 | 1.552474 | 0.152032 |
| Politeness | 0.132293 | 0.068439 | 1 | 1.438114 | 1.502303 | 0.064189 |
| Pleasing | 0.227241 | 0.174897 | 1 | 1.341551 | 1.394995 | 0.053443 |
| Goodwill | 0.088890 | 0.039868 | 1 | 1.481787 | 1.530918 | 0.049129 |
| Cordiality | 0.043623 | 0.007792 | 1 | 1.527159 | 1.563004 | 0.035845 |
| Happy | 0.212202 | 0.180576 | 1 | 1.356968 | 1.389223 | 0.032255 |
| Fearless | 0.100959 | 0.069186 | 1 | 1.469664 | 1.501554 | 0.031889 |
| Pride | 0.104457 | 0.088019 | 1 | 1.466148 | 1.482663 | 0.016514 |
| Friendliness | 0.178084 | 0.175157 | 1 | 1.391756 | 1.394731 | 0.002974 |
| Courageous | 1 | 1 | 0 | 0 | 0 | 0 |
| Self-assurance | 1 | 1 | 0 | 0 | 0 | 0 |
| Carelessness | 1 | 1 | 0 | 0 | 0 | 0 |
| Defence | 1 | 1 | 0 | 0 | 0 | 0 |
| Affection | 0.100301 | 0.10674 | 0 | 1.470325 | 1.463852 | −0.006474 |
| Liked | 0.125296 | 0.135883 | 0 | 1.445169 | 1.434491 | −0.010678 |
| Delight | 0.033640 | 0.045317 | 0 | 1.537149 | 1.525463 | −0.011687 |
| Desire | 0.085015 | 0.096916 | 0 | 1.485677 | 1.473728 | −0.011949 |
| Pleasant | 0.168783 | 0.187770 | 0 | 1.401201 | 1.381905 | −0.019297 |
| Acceptation | 0.049464 | 0.099845 | 0 | 1.521311 | 1.470784 | −0.050527 |
| Appreciation | 0.005268 | 0.075830 | 0 | 1.565527 | 1.494893 | −0.070635 |
| Independence | 0.067198 | 0.141933 | 0 | 1.503546 | 1.428382 | −0.075165 |
| Love | 0.107482 | 0.184401 | 0 | 1.463105 | 1.385334 | −0.077772 |
| Delightful | 0.131124 | 0.215119 | 0 | 1.439293 | 1.353983 | −0.085311 |
| Flattery | 0.054658 | 0.140086 | 0 | 1.516110 | 1.430247 | −0.085864 |
| Friendly | 0.184168 | 0.271432 | 0 | 1.385570 | 1.295916 | −0.089655 |
| Endorsement | −0.049720 | 0.057279 | 0 | 1.620537 | 1.513486 | −0.107052 |
| Taste | 0.147377 | 0.261997 | 0 | 1.422879 | 1.305705 | −0.117175 |
| Pleasure | −0.005007 | 0.120311 | 0 | 1.575803 | 1.450193 | −0.125610 |
| Attractive | 0.146302 | 0.282672 | 0 | 1.423967 | 1.284217 | −0.139750 |
Source: authors’ creation
Fig. 4 CABIs for unfavourable attributes in Google News Word2Vec vs Twitter GloVe, indicating the difference in the degree of bias per attribute between the two pre-trained embeddings. Source: authors’ creation
Fig. 5 CABIs for unfavourable attributes in Google News vs Wikipedia, indicating the difference in the degree of bias per attribute between the two pre-trained embeddings. Source: authors’ creation
Fig. 6 CABIs for unfavourable attributes in Twitter vs Wikipedia, indicating the difference in the degree of bias per attribute between the two pre-trained embeddings. Source: authors’ creation
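
The captions for Figs. 4 to 6 describe the CABI as the per-attribute difference in the degree of bias between two embeddings. One way to read this, sketched below, is as the ABI from the first embedding minus the ABI from the second. The exact definition and the direction of the subtraction are assumptions, since this excerpt does not state them. The function name `cabi` and the input values are hypothetical placeholders, not results from the study.

```python
from typing import Dict


def cabi(abi_first: Dict[str, float], abi_second: Dict[str, float]) -> Dict[str, float]:
    """Assumed CABI: per-attribute ABI difference between two embeddings.

    Only attributes present in both embeddings are compared.
    A positive value means the first embedding shows the larger ABI.
    """
    shared = abi_first.keys() & abi_second.keys()
    return {term: abi_first[term] - abi_second[term] for term in sorted(shared)}


if __name__ == "__main__":
    # Hypothetical placeholder ABIs, NOT values reported in the article.
    embedding_a = {"term_x": 0.30, "term_y": 0.10}
    embedding_b = {"term_x": 0.10, "term_y": 0.25, "term_z": 0.05}
    for term, value in cabi(embedding_a, embedding_b).items():
        print(term, value)
```

Restricting the comparison to shared attributes avoids treating a word that is missing from one vocabulary as having zero bias.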