Key Insights from a Feature Discovery User Study

Andra Ionescu*, Zeger Mouw, Efthimia Aivaloglou, Asterios Katsifodimos

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

29 Downloads (Pure)

Abstract

Multiple works in data management research focus on automating the processes of data augmentation and feature discovery to save users from having to perform these tasks manually. Yet, this automation often leads to a disconnect with the users, as it fails to consider the specific needs and preferences of the actual end-users of data management systems for machine learning. To explore this issue further, we conducted 19 semi-structured, think-aloud use-case studies based on a scenario in which data specialists were tasked with augmenting a base table with additional features to train a machine learning model. In this paper, we share key insights into the practices of feature discovery on tabular data performed by real-world data specialists derived from our user study. Our research uncovered differences between the user assumptions reported in the literature and the actual practices, as well as some areas where literature and real-world practices align.

Original languageEnglish
Title of host publicationHILDA 2024 - Workshop on Human-In-the-Loop Data Analytics Co-located with SIGMOD 2024
PublisherACM
Number of pages5
ISBN (Electronic)9798400706936
DOIs
Publication statusPublished - 2024
Event2024 Workshop on Human-In-the-Loop Data Analytics, HILDA 2024, Co-located with SIGMOD 2024 - Santiago, Chile
Duration: 14 Jun 2024 → …

Publication series

NameHILDA 2024 - Workshop on Human-In-the-Loop Data Analytics Co-located with SIGMOD 2024

Conference

Conference2024 Workshop on Human-In-the-Loop Data Analytics, HILDA 2024, Co-located with SIGMOD 2024
Country/TerritoryChile
CitySantiago
Period14/06/24 → …

Fingerprint

Dive into the research topics of 'Key Insights from a Feature Discovery User Study'. Together they form a unique fingerprint.

Cite this