Deterministic Local Interpretable Model-Agnostic Explanations for Stable Explainability


Bibliographic Details
Published in	Machine Learning and Knowledge Extraction, Vol. 3, no. 3, pp. 525-541
Main Authors	Zafar, Muhammad Rehman; Khan, Naimul
Format	Journal Article
Language	English
Published	Basel: MDPI AG, 01.09.2021
Subjects	Ablation; Algorithms; Artificial intelligence; Cluster analysis; Clustering; Datasets; Decision making; Decision trees; deterministic explanations; explainable artificial intelligence (XAI); Feature selection; interpretable machine learning; Lime; local explanations; Machine learning; model agnostic explanations; Perturbation methods; Regression analysis; stable explanations
ISSN	2504-4990
DOI	10.3390/make3030027

Summary:	Local Interpretable Model-Agnostic Explanations (LIME) is a popular technique for increasing the interpretability and explainability of black-box Machine Learning (ML) algorithms. LIME typically explains a single prediction of any ML model by learning a simpler interpretable model (e.g., a linear classifier) around the prediction: it generates simulated data around the instance by random perturbation and obtains feature importance by applying some form of feature selection. While LIME and similar local algorithms have gained popularity due to their simplicity, the random perturbation methods result in shifts in data and instability in the generated explanations, so that different explanations can be generated for the same prediction. These are critical issues that can prevent deployment of LIME in sensitive domains. We propose a deterministic version of LIME. Instead of random perturbation, we use Agglomerative Hierarchical Clustering (AHC) to group the training data and K-Nearest Neighbour (KNN) to select the cluster relevant to the new instance being explained. After finding the relevant cluster, a simple model (i.e., a linear model or decision tree) is trained over the selected cluster to generate the explanations. Experimental results on six public (three binary and three multi-class) and six synthetic datasets show the superiority of Deterministic Local Interpretable Model-Agnostic Explanations (DLIME), where we quantitatively compare the stability and faithfulness of DLIME with those of LIME.
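
The three steps named in the summary (cluster the training data with AHC, pick the relevant cluster with KNN, fit a simple model on that cluster) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes scikit-learn is available, and the helper `explain_instance`, the toy model `black_box_predict`, the synthetic data, and the parameter defaults are all hypothetical choices made for illustration. It also assumes the surrogate is fitted to the black-box outputs on the selected cluster.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering
from sklearn.linear_model import LinearRegression
from sklearn.neighbors import KNeighborsClassifier


def explain_instance(X_train, predict_fn, x, n_clusters=2, n_neighbors=1):
    """Hypothetical helper: local linear coefficients for instance x."""
    # Step 1: group the training data with agglomerative hierarchical clustering.
    labels = AgglomerativeClustering(n_clusters=n_clusters).fit_predict(X_train)
    # Step 2: train KNN on the cluster labels and pick the cluster nearest to x.
    knn = KNeighborsClassifier(n_neighbors=n_neighbors).fit(X_train, labels)
    cluster = knn.predict(x.reshape(1, -1))[0]
    X_local = X_train[labels == cluster]
    # Step 3: fit a linear surrogate to the black-box outputs on that cluster
    # (assumption: the surrogate targets are the black-box predictions).
    surrogate = LinearRegression().fit(X_local, predict_fn(X_local))
    return surrogate.coef_


def black_box_predict(X):
    # Toy stand-in for an opaque model: it behaves linearly only where x0 > 5.
    return np.where(X[:, 0] > 5, 3.0 * X[:, 0] - 2.0 * X[:, 1], -X[:, 0])


# Synthetic training data: two well-separated groups of points.
rng = np.random.default_rng(0)
X_train = np.vstack([
    rng.uniform(-1, 1, size=(40, 2)),        # group around (0, 0)
    10 + rng.uniform(-1, 1, size=(40, 2)),   # group around (10, 10)
])

x = np.array([10.2, 9.7])  # instance to explain, lies in the second group
coefs = explain_instance(X_train, black_box_predict, x)
print(coefs)
```

Because no step draws random perturbations, calling `explain_instance` again with the same training data and instance yields the same coefficients, which is the stability property the summary describes. On this toy data the coefficients are close to 3 and -2, the local slopes of the stand-in model.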