PDFA Distillation with Error Bound Guarantees

Robert Baumgartner*, Sicco Verwer

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

Abstract

Active learning algorithms to infer probabilistic finite automata (PFA) have gained interest recently, due to their ability to provide surrogate models for some types of neural networks. However, recent approaches either cannot guarantee determinism, which makes the automaton harder to understand and compute, or they rely on techniques that bound errors on individual transitions. In this work we propose a derivative of the recent L# algorithm to learn deterministic PFA (PDFA) from systems returning a distribution over a set of tokens given an input string. Along with determinism, we can give error bounds on probabilities assigned to whole strings with an easy to understand approach. We show formal correctness of our algorithm and test it on neural networks trained to model three datasets from computer- and network-systems respectively. We show that the algorithm can learn the network’s behaviour closely, and provide an example application of how the model can be used to interpret the network. We note that our approach is in theory applicable in general to learn deterministic weighted finite automata. We provide the source code of our algorithm and relevant scripts on our public repository.
Original languageEnglish
Title of host publicationImplementation and Application of Automata
Subtitle of host publication28th International Conference, CIAA 2024, Akita, Japan, September 3–6, 2024, Proceedings
EditorsSzilárd Zsolt Fazekas
Place of PublicationCham
PublisherSpringer
Pages51-65
Number of pages15
ISBN (Electronic)978-3-031-71112-1
ISBN (Print)978-3-031-71111-4
DOIs
Publication statusPublished - 2024
Event28th International Conference on Implementation and Application of Automata, CIAA 2024 - Akita, Japan
Duration: 3 Sept 20246 Sept 2024
http://www.math.akita-u.ac.jp/ciaa2024/

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer
Volume15015 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference28th International Conference on Implementation and Application of Automata, CIAA 2024
Country/TerritoryJapan
CityAkita
Period3/09/246/09/24
Internet address

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

  • Active Automata Learning
  • Explainable AI
  • PDFA distillation

Fingerprint

Dive into the research topics of 'PDFA Distillation with Error Bound Guarantees'. Together they form a unique fingerprint.

Cite this