Zorro: Valid, sparse, and stable explanations in graph neural networks

Thorben Funke; Megha Khosla; Mandeep Rathee; Avishek Anand

doi:10.1109/TKDE.2022.3201170

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

With the ever-increasing popularity and applications of graph neural networks, several proposals have been made to explain and understand the decisions of a graph neural network. Explanations for graph neural networks differ in principle from other input settings. It is important to attribute the decision to input features and other related instances connected by the graph structure. We find the previous explanation generation approaches, which maximize the mutual information between the label distribution produced by the model and the explanation, to be restrictive. Specifically, existing approaches do not enforce explanations to be valid, sparse, or robust to input perturbations. In this paper, we lay down some of the fundamental principles that an explanation method for graph neural networks should follow and introduce a metric, RDT-Fidelity, as a measure of the explanation's effectiveness. We propose a novel approach, Zorro, based on the principles from rate-distortion theory, that uses a simple combinatorial procedure to optimize for RDT-Fidelity. Extensive experiments on real and synthetic datasets reveal that Zorro produces sparser, more stable, and more faithful explanations than existing graph neural network explanation approaches.

Original language	English
Article number	9866587
Pages (from-to)	8687-8698
Journal	IEEE Transactions on Knowledge & Data Engineering
Volume	35
Issue number	8
DOIs	https://doi.org/10.1109/TKDE.2022.3201170
Publication status	Published - 2023

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Access to Document

10.1109/TKDE.2022.3201170

Zorro_Valid_Sparse_and_Stable_Explanations_in_Graph_Neural_Networks (Final published version, 698 KB)

Cite this

@article{996e61c7a5014b738b9706c582ee5e6f,

title = "Zorro: Valid, sparse, and stable explanations in graph neural networks",

abstract = "With the ever-increasing popularity and applications of graph neural networks, several proposals have been made to explain and understand the decisions of a graph neural network. Explanations for graph neural networks differ in principle from other input settings. It is important to attribute the decision to input features and other related instances connected by the graph structure. We find that the previous explanation generation approaches that maximize the mutual information between the label distribution produced by the model and the explanation to be restrictive. Specifically, existing approaches do not enforce explanations to be valid, sparse, or robust to input perturbations. In this paper, we lay down some of the fundamental principles that an explanation method for graph neural networks should follow and introduce a metric RDT-Fidelity as a measure of the explanation's effectiveness. We propose a novel approach Zorro based on the principles from rate-distortion theory that uses a simple combinatorial procedure to optimize for RDT-Fidelity. Extensive experiments on real and synthetic datasets reveal that Zorro produces sparser, stable, and more faithful explanations than existing graph neural network explanation approaches.",

author = "Thorben Funke and Megha Khosla and Mandeep Rathee and Avishek Anand",

note = "Green Open Access added to TU Delft Institutional Repository {\textquoteleft}You share, we take care!{\textquoteright} – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public. ",

year = "2023",

doi = "10.1109/TKDE.2022.3201170",

language = "English",

volume = "35",

pages = "8687--8698",

journal = "IEEE Transactions on Knowledge \& Data Engineering",

issn = "1041-4347",

publisher = "IEEE",

number = "8",

}

TY - JOUR

T1 - Zorro

T2 - Valid, sparse, and stable explanations in graph neural networks

AU - Funke, Thorben

AU - Khosla, Megha

AU - Rathee, Mandeep

AU - Anand, Avishek

N1 - Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2023

Y1 - 2023

N2 - With the ever-increasing popularity and applications of graph neural networks, several proposals have been made to explain and understand the decisions of a graph neural network. Explanations for graph neural networks differ in principle from other input settings. It is important to attribute the decision to input features and other related instances connected by the graph structure. We find that the previous explanation generation approaches that maximize the mutual information between the label distribution produced by the model and the explanation to be restrictive. Specifically, existing approaches do not enforce explanations to be valid, sparse, or robust to input perturbations. In this paper, we lay down some of the fundamental principles that an explanation method for graph neural networks should follow and introduce a metric RDT-Fidelity as a measure of the explanation's effectiveness. We propose a novel approach Zorro based on the principles from rate-distortion theory that uses a simple combinatorial procedure to optimize for RDT-Fidelity. Extensive experiments on real and synthetic datasets reveal that Zorro produces sparser, stable, and more faithful explanations than existing graph neural network explanation approaches.

AB - With the ever-increasing popularity and applications of graph neural networks, several proposals have been made to explain and understand the decisions of a graph neural network. Explanations for graph neural networks differ in principle from other input settings. It is important to attribute the decision to input features and other related instances connected by the graph structure. We find that the previous explanation generation approaches that maximize the mutual information between the label distribution produced by the model and the explanation to be restrictive. Specifically, existing approaches do not enforce explanations to be valid, sparse, or robust to input perturbations. In this paper, we lay down some of the fundamental principles that an explanation method for graph neural networks should follow and introduce a metric RDT-Fidelity as a measure of the explanation's effectiveness. We propose a novel approach Zorro based on the principles from rate-distortion theory that uses a simple combinatorial procedure to optimize for RDT-Fidelity. Extensive experiments on real and synthetic datasets reveal that Zorro produces sparser, stable, and more faithful explanations than existing graph neural network explanation approaches.

UR - http://www.scopus.com/inward/record.url?scp=85137574213&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2022.3201170

DO - 10.1109/TKDE.2022.3201170

M3 - Article

SN - 1041-4347

VL - 35

SP - 8687

EP - 8698

JO - IEEE Transactions on Knowledge & Data Engineering

JF - IEEE Transactions on Knowledge & Data Engineering

IS - 8

M1 - 9866587

ER -
