Feature Responsiveness Scores: Model-Agnostic Explanations for Recourse
Machine learning models routinely automate decisions in applications like lending and hiring. In such settings, consumer protection rules require companies that deploy models to explain predictions to decision subjects. These rules are motivated, in part, by the belief that explanations can promote...
Saved in:
Main Authors | , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
29.10.2024
|
Subjects | |
Online Access | Get full text |
DOI | 10.48550/arxiv.2410.22598 |
Cover
Summary: | Machine learning models routinely automate decisions in applications like
lending and hiring. In such settings, consumer protection rules require
companies that deploy models to explain predictions to decision subjects. These
rules are motivated, in part, by the belief that explanations can promote
recourse by revealing information that individuals can use to contest or
improve their outcomes. In practice, many companies comply with these rules by
providing individuals with a list of the most important features for their
prediction, which they identify based on feature importance scores from feature
attribution methods such as SHAP or LIME. In this work, we show how these
practices can undermine consumers by highlighting features that would not lead
to an improved outcome and by explaining predictions that cannot be changed. We
propose to address these issues by highlighting features based on their
responsiveness score -- i.e., the probability that an individual can attain a
target prediction by changing a specific feature. We develop efficient methods
to compute responsiveness scores for any model and any dataset. We conduct an
extensive empirical study on the responsiveness of explanations in lending. Our
results show that standard practices in consumer finance can backfire by
presenting consumers with reasons without recourse, and demonstrate how our
approach improves consumer protection by highlighting responsive features and
identifying fixed predictions. |
---|---|
DOI: | 10.48550/arxiv.2410.22598 |