Fairness hacking : the malicious practice of shrouding unfairness in algorithms

dc.contributor.authorMeding, Kristof
dc.contributor.authorHagendorff, Thilo
dc.date.accessioned2025-06-14T06:52:20Z
dc.date.issued2024
dc.date.updated2025-01-26T19:37:00Z
dc.description.abstractFairness in machine learning (ML) is an ever-growing field of research due to the manifold potential for harm from algorithmic discrimination. To prevent such harm, a large body of literature develops new approaches to quantify fairness. Here, we investigate how one can divert the quantification of fairness by describing a practice we call “fairness hacking” for the purpose of shrouding unfairness in algorithms. This impacts end-users who rely on learning algorithms, as well as the broader community interested in fair AI practices. We introduce two different categories of fairness hacking in reference to the established concept of p-hacking. The first category, intra-metric fairness hacking, describes the misuse of a particular metric by adding or removing sensitive attributes from the analysis. In this context, countermeasures that have been developed to prevent or reduce p-hacking can be applied to similarly prevent or reduce fairness hacking. The second category of fairness hacking is inter-metric fairness hacking. Inter-metric fairness hacking is the search for a specific fair metric with given attributes. We argue that countermeasures to prevent or reduce inter-metric fairness hacking are still in their infancy. Finally, we demonstrate both types of fairness hacking using real datasets. Our paper intends to serve as a guidance for discussions within the fair ML community to prevent or reduce the misuse of fairness metrics, and thus reduce overall harm from ML applications.en
dc.description.sponsorshipDeutsche Forschungsgemeinschaft
dc.description.sponsorshipMinisterium für Wissenschaft, Forschung und Kunst Baden-Württembergh
dc.identifier.issn2210-5441
dc.identifier.issn2210-5433
dc.identifier.urihttps://elib.uni-stuttgart.de/handle/11682/16593
dc.language.isoen
dc.relation.uridoi:10.1007/s13347-023-00679-8
dc.rightsCC BY
dc.rightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subject.ddc004
dc.titleFairness hacking : the malicious practice of shrouding unfairness in algorithmsen
dc.typearticle
dc.type.versionpublishedVersion
ubs.fakultaetFakultäts- und hochschulübergreifende Einrichtungen
ubs.fakultaetFakultätsübergreifend / Sonstige Einrichtung
ubs.institutStuttgart Research Focus „Interchange Forum for Reflecting on Intelligent Systems“ (SRF IRIS)
ubs.institutFakultätsübergreifend / Sonstige Einrichtung
ubs.publikation.seiten22
ubs.publikation.sourcePhilosophy & technology 37 (2024), No. 4
ubs.publikation.typZeitschriftenartikel

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
13347_2024_Article_679.pdf
Size:
1.52 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.3 KB
Format:
Item-specific license agreed upon to submission
Description: