Abstract
The article published in Knowledge and Information Systems examines the evaluation of a user-adaptive personal assistant agent designed to assist a busy knowledge worker in time management. The article examines the managerial and technical challenges of designing adequate evaluation and the tension of collecting adequate data without a fully functional, deployed system. The PTIME agent was part of the CALO project, a seminal multi-institution effort to develop a personalized cognitive assistant. The project included a significant attempt to rigorously quantify learning capability, which the article discusses for the first time, and ultimately the project led to multiple spin-outs including Siri. Retrospection on negative and positive experiences over the six years of the project underscores best practice in evaluating user-adaptive systems. Through the lessons illustrated from the case study of intelligent knowledge system evaluation, the article highlights how development and infusion of innovative technology must be supported by adequate evaluation of its efficacy.
Original language | English |
---|---|
Title of host publication | BNAIC 2017 pre-proceedings |
Subtitle of host publication | 29th Benelux Conference on Artificial Intelligence |
Editors | Bart Verheij, Marco Wiering |
Pages | 344-345 |
Number of pages | 2 |
ISBN (Electronic) | 978-94-034-0299-4 |
Publication status | Published - Nov 2017 |
Event | 29th Benelux Conference on Artificial Intelligence: 29th Benelux Conference on Artificial Intelligence - Groningen, Netherlands Duration: 8 Nov 2017 → 9 Nov 2017 Conference number: 29 http://bnaic2017.ai.rug.nl/ |
Conference
Conference | 29th Benelux Conference on Artificial Intelligence |
---|---|
Abbreviated title | BNAIC 2017 |
Country/Territory | Netherlands |
City | Groningen |
Period | 8/11/17 → 9/11/17 |
Internet address |