Data governance: Organizing data for trustworthy Artificial Intelligence

Marijn Janssen*, Paul Brous, Elsa Estevez, Luis S. Barbosa, Tomasz Janowski

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

210 Citations (Scopus)
352 Downloads (Pure)

Abstract

The rise of Big, Open and Linked Data (BOLD) enables Big Data Algorithmic Systems (BDAS) which are often based on machine learning, neural networks and other forms of Artificial Intelligence (AI). As such systems are increasingly requested to make decisions that are consequential to individuals, communities and society at large, their failures cannot be tolerated, and they are subject to stringent regulatory and ethical requirements. However, they all rely on data which is not only big, open and linked but varied, dynamic and streamed at high speeds in real-time. Managing such data is challenging. To overcome such challenges and utilize opportunities for BDAS, organizations are increasingly developing advanced data governance capabilities. This paper reviews challenges and approaches to data governance for such systems, and proposes a framework for data governance for trustworthy BDAS. The framework promotes the stewardship of data, processes and algorithms, the controlled opening of data and algorithms to enable external scrutiny, trusted information sharing within and between organizations, risk-based governance, system-level controls, and data control through shared ownership and self-sovereign identities. The framework is based on 13 design principles and is proposed incrementally, for a single organization and multiple networked organizations.

Original languageEnglish
Article number101493
JournalGovernment Information Quarterly
Volume37
Issue number3
DOIs
Publication statusPublished - 2020

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

  • AI
  • Algorithmic governance
  • Artificial Intelligence
  • Big data
  • Data governance
  • Information sharing
  • Trusted frameworks

Fingerprint

Dive into the research topics of 'Data governance: Organizing data for trustworthy Artificial Intelligence'. Together they form a unique fingerprint.

Cite this