Equal accuracy for Andrew and Abubakar—detecting and mitigating bias in name-ethnicity classification algorithms
Uncovering the world’s ethnic inequalities is hampered by a lack of ethnicity-annotated datasets. Name-ethnicity classifiers (NECs) can help, as they are able to infer people’s ethnicities from their names. However, since the latest generation of NECs rely on machine learning and artificial intellig...
Saved in:
| Published in | AI & society Vol. 39; no. 4; pp. 1605 - 1629 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
London
Springer London
01.08.2024
Springer Nature B.V |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0951-5666 1435-5655 1435-5655 |
| DOI | 10.1007/s00146-022-01619-4 |
Cover
| Summary: | Uncovering the world’s ethnic inequalities is hampered by a lack of ethnicity-annotated datasets. Name-ethnicity classifiers (NECs) can help, as they are able to infer people’s ethnicities from their names. However, since the latest generation of NECs rely on machine learning and artificial intelligence (AI), they may suffer from the same racist and sexist biases found in many AIs. Therefore, this paper offers an algorithmic fairness audit of three NECs. It finds that the UK-Census-trained
EthnicityEstimator
displays large accuracy biases with regards to ethnicity, but relatively less among gender and age groups. In contrast, the Twitter-trained
NamePrism
and the Wikipedia-trained
Ethnicolr
are more balanced among ethnicity, but less among gender and age. We relate these biases to global power structures manifested in naming conventions and NECs’ input distribution of names. To improve on the uncovered biases, we program a novel NEC,
N2E
, using fairness-aware AI techniques. We make
N2E
freely available at
www.name-to-ethnicity.com
. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 |
| ISSN: | 0951-5666 1435-5655 1435-5655 |
| DOI: | 10.1007/s00146-022-01619-4 |