Faulty or Ready? Handling Failures in Deep-Learning Computer Vision Models until Deployment: A Study of Practices, Challenges, and Needs

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

56 Downloads (Pure)

Abstract

Handling failures in computer vision systems that rely on deep learning models remains a challenge. While an increasing number of methods for bug identification and correction are proposed, little is known about how practitioners actually search for failures in these models. We perform an empirical study to understand the goals and needs of practitioners, the workflows and artifacts they use, and the challenges and limitations in their process. We interview 18 practitioners by probing them with a carefully crafted failure handling scenario. We observe that there is a great diversity of failure handling workflows in which cooperations are often necessary, that practitioners overlook certain types of failures and bugs, and that they generally do not rely on potentially relevant approaches and tools originally stemming from research. These insights allow to draw a list of research opportunities, such as creating a library of best practices and more representative formalisations of practitioners' goals, developing interfaces to exploit failure handling artifacts, as well as providing specialized training.

Original languageEnglish
Title of host publicationCHI 2023 - Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems
Place of PublicationNew York
PublisherAssociation for Computing Machinery (ACM)
Number of pages20
ISBN (Print)978-1-4503-9421-5
DOIs
Publication statusPublished - 2023
Event2023 CHI Conference on Human Factors in Computing Systems - Congress Center Hamburg (CCH), Hamburg, Germany
Duration: 23 Apr 202328 Apr 2023
https://chi2023.acm.org/

Conference

Conference2023 CHI Conference on Human Factors in Computing Systems
Abbreviated titleCHI'23
Country/TerritoryGermany
CityHamburg
Period23/04/2328/04/23
Internet address

Keywords

  • debugging
  • explainability
  • machine learning testing
  • practices

Fingerprint

Dive into the research topics of 'Faulty or Ready? Handling Failures in Deep-Learning Computer Vision Models until Deployment: A Study of Practices, Challenges, and Needs'. Together they form a unique fingerprint.

Cite this