TY - GEN
T1 - Effective crowdsourced generation of training data for chatbots natural language understanding
AU - Bapat, Rucha
AU - Kucherbaev, Pavel
AU - Bozzon, Alessandro
N1 - Accepted Author Manuscript
PY - 2018
Y1 - 2018
N2 - Chatbots are text-based conversational agents. Natural Language Understanding (NLU) models are used to extract meaning and intention from user messages sent to chatbots. The user experience of chatbots largely depends on the performance of the NLU model, which itself largely depends on the initial dataset the model is trained with. The training data should cover the diversity of real user requests the chatbot will receive. Obtaining such data is a challenging task even for big corporations. We introduce a generic approach to generate training data with the help of crowd workers, we discuss the approach workflow and the design of crowdsourcing tasks assuring high quality. We evaluate the approach by running an experiment collecting data for 9 different intents. We use the collected training data to train a natural language understanding model. We analyse the performance of the model under different training set sizes for each intent. We provide recommendations on selecting an optimal confidence threshold for predicting intents, based on the cost model of incorrect and unknown predictions.
AB - Chatbots are text-based conversational agents. Natural Language Understanding (NLU) models are used to extract meaning and intention from user messages sent to chatbots. The user experience of chatbots largely depends on the performance of the NLU model, which itself largely depends on the initial dataset the model is trained with. The training data should cover the diversity of real user requests the chatbot will receive. Obtaining such data is a challenging task even for big corporations. We introduce a generic approach to generate training data with the help of crowd workers, we discuss the approach workflow and the design of crowdsourcing tasks assuring high quality. We evaluate the approach by running an experiment collecting data for 9 different intents. We use the collected training data to train a natural language understanding model. We analyse the performance of the model under different training set sizes for each intent. We provide recommendations on selecting an optimal confidence threshold for predicting intents, based on the cost model of incorrect and unknown predictions.
KW - Conversational agents
KW - Crowdsourcing
KW - Natural language understanding
UR - http://www.scopus.com/inward/record.url?scp=85047996275&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-91662-0_8
DO - 10.1007/978-3-319-91662-0_8
M3 - Conference contribution
SN - 978-3-319-91661-3
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 114
EP - 128
BT - Web Engineering - 18th International Conference, ICWE 2018, Proceedings
PB - Springer
T2 - 18th International Conference on Web Engineering, ICWE 2018
Y2 - 5 June 2018 through 8 June 2018
ER -