Fairness hacking : the malicious practice of shrouding unfairness in algorithms

Meding, Kristof; Hagendorff, Thilo

dc.contributor.author	Meding, Kristof
dc.contributor.author	Hagendorff, Thilo
dc.date.accessioned	2025-06-14T06:52:20Z
dc.date.issued	2024
dc.date.updated	2025-01-26T19:37:00Z
dc.description.abstract	Fairness in machine learning (ML) is an ever-growing field of research due to the manifold potential for harm from algorithmic discrimination. To prevent such harm, a large body of literature develops new approaches to quantify fairness. Here, we investigate how one can divert the quantification of fairness by describing a practice we call “fairness hacking” for the purpose of shrouding unfairness in algorithms. This impacts end-users who rely on learning algorithms, as well as the broader community interested in fair AI practices. We introduce two different categories of fairness hacking in reference to the established concept of p-hacking. The first category, intra-metric fairness hacking, describes the misuse of a particular metric by adding or removing sensitive attributes from the analysis. In this context, countermeasures that have been developed to prevent or reduce p-hacking can be applied to similarly prevent or reduce fairness hacking. The second category of fairness hacking is inter-metric fairness hacking. Inter-metric fairness hacking is the search for a specific fair metric with given attributes. We argue that countermeasures to prevent or reduce inter-metric fairness hacking are still in their infancy. Finally, we demonstrate both types of fairness hacking using real datasets. Our paper intends to serve as guidance for discussions within the fair ML community to prevent or reduce the misuse of fairness metrics, and thus reduce overall harm from ML applications.	en
dc.description.sponsorship	Deutsche Forschungsgemeinschaft
dc.description.sponsorship	Ministerium für Wissenschaft, Forschung und Kunst Baden-Württemberg
dc.identifier.issn	2210-5441
dc.identifier.issn	2210-5433
dc.identifier.uri	https://elib.uni-stuttgart.de/handle/11682/16593
dc.language.iso	en
dc.relation.uri	doi:10.1007/s13347-023-00679-8
dc.rights	CC BY
dc.rights	info:eu-repo/semantics/openAccess
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject.ddc	004
dc.title	Fairness hacking : the malicious practice of shrouding unfairness in algorithms	en
dc.type	article
dc.type.version	publishedVersion
ubs.fakultaet	Cross-faculty and cross-university institutions
ubs.fakultaet	Cross-faculty / Other institution
ubs.institut	Stuttgart Research Focus „Interchange Forum for Reflecting on Intelligent Systems“ (SRF IRIS)
ubs.institut	Cross-faculty / Other institution
ubs.publikation.seiten	22
ubs.publikation.source	Philosophy & technology 37 (2024), No. 4
ubs.publikation.typ	Journal article

Files

Original bundle

Name: 13347_2024_Article_679.pdf
Size: 1.52 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 3.3 KB
Format: Item-specific license agreed upon to submission

Collections

11 Interfaculty Institutions