Abstract
Explainability is a key requirement for text classification in many application domains ranging from sentiment analysis to medical diagnosis or legal reviews. Existing methods often rely on "attention" mechanisms for explaining classification results by estimating the relative importance of input units. However, recent studies have shown that such mechanisms tend to mis-identify irrelevant input units in their explanation. In this work, we propose a hybrid human-AI approach that incorporates human rationales into attention-based text classification models to improve the explainability of classification results. Specifically, we ask workers to provide rationales for their annotation by selecting relevant pieces of text. We introduce MARTA, a Bayesian framework that jointly learns an attention-based model and the reliability of workers while injecting human rationales into model training. We derive a principled optimization algorithm based on variational inference with efficient updating rules for learning MARTA parameters. Extensive validation on real-world datasets shows that our framework significantly improves the state of the art both in terms of classification explainability and accuracy.
Original language | English |
---|---|
Title of host publication | Proceedings of the 35th AAAI Conference on Artificial Intelligence |
Publisher | Association for the Advancement of Artificial Intelligence (AAAI) |
Number of pages | 9 |
ISBN (Electronic) | 978-1-57735-866-4 |
Publication status | Published - 2021 |
Event | 35th AAAI Conference on Artificial Intelligence - Online Duration: 2 Feb 2021 → 9 Feb 2021 Conference number: 35 |
Publication series
Name | AAAI-21 Technical Tracks |
---|---|
Publisher | PKP Publishing Services |
Number | 7 |
Volume | 35 |
ISSN (Print) | 2159-5399 |
ISSN (Electronic) | 2374-3468 |
Conference
Conference | 35th AAAI Conference on Artificial Intelligence |
---|---|
Abbreviated title | AAAI 2021 |
Period | 2/02/21 → 9/02/21 |
Keywords
- Human-in-the-loop Machine Learning
- Learning of Cost
- Reliability
- Skill of Label
- Probabilistic Graphical Models
- Accountability
- Interpretability & Explainability