TY - JOUR
T1 - Evaluating XAI
T2 - A comparison of rule-based and example-based explanations
AU - van der Waa, Jasper
AU - Nieuwburg, Elisabeth
AU - Cremers, Anita
AU - Neerincx, Mark
PY - 2021
N2 - Current developments in Artificial Intelligence (AI) have led to a resurgence of Explainable AI (XAI). New methods are being researched to obtain information from AI systems in order to generate explanations for their output. However, there is an overall lack of valid and reliable evaluations of the effects explanations have on users' experience and behavior. New XAI methods are often based on an intuitive notion of what an effective explanation should be. Rule- and example-based contrastive explanations are two exemplary explanation styles. In this study we evaluate the effects of these two explanation styles on system understanding, persuasive power and task performance in the context of decision support in diabetes self-management. Furthermore, we provide three sets of recommendations, based on our experience designing this evaluation, to help improve future evaluations. Our results show that rule-based explanations have a small positive effect on system understanding, whereas both rule- and example-based explanations seem to persuade users to follow the advice even when it is incorrect. Neither explanation style improves task performance compared to no explanation. This can be explained by the fact that both explanation styles only provide details relevant to a single decision, not the underlying rationale or causality. These results show the importance of user evaluations in assessing current assumptions and intuitions about effective explanations.
KW - Artificial Intelligence (AI)
KW - Contrastive explanations
KW - Decision support systems
KW - Explainable Artificial Intelligence (XAI)
KW - Machine learning
KW - User evaluations
UR - http://www.scopus.com/inward/record.url?scp=85097186283&partnerID=8YFLogxK
DO - 10.1016/j.artint.2020.103404
M3 - Article
AN - SCOPUS:85097186283
VL - 291
JO - Artificial Intelligence
JF - Artificial Intelligence
SN - 0004-3702
M1 - 103404
ER -