TY - JOUR
T1 - A Process Pattern Model for Tackling and Improving Big Data Quality
AU - Wahyudi, Agung
AU - Kuk, George
AU - Janssen, Marijn
PY - 2018
Y1 - 2018
N2 - Data seldom create value by themselves. They need to be linked and combined from multiple sources, which can often come with variable data quality. The task of improving data quality is a recurring challenge. In this paper, we use a case study of a large telecom company to develop a generic process pattern model for improving data quality. The process pattern model is defined as a proven series of activities, aimed at improving the data quality given a certain context, a particular objective, and a specific set of initial conditions. Four different patterns are derived to deal with the variations in data quality of datasets. Instead of having to find the way to improve the quality of big data for each situation, the process model provides data users with generic patterns, which can be used as a reference model to improve big data quality.
AB - Data seldom create value by themselves. They need to be linked and combined from multiple sources, which can often come with variable data quality. The task of improving data quality is a recurring challenge. In this paper, we use a case study of a large telecom company to develop a generic process pattern model for improving data quality. The process pattern model is defined as a proven series of activities, aimed at improving the data quality given a certain context, a particular objective, and a specific set of initial conditions. Four different patterns are derived to deal with the variations in data quality of datasets. Instead of having to find the way to improve the quality of big data for each situation, the process model provides data users with generic patterns, which can be used as a reference model to improve big data quality.
KW - Big data
KW - Data processing
KW - Data quality
KW - Information quality
KW - Process patterns
KW - Reference model
KW - Telecom
UR - http://resolver.tudelft.nl/uuid:a410d5ed-8bef-41e8-8140-130ded9cb8ee
UR - http://www.scopus.com/inward/record.url?scp=85040922240&partnerID=8YFLogxK
U2 - 10.1007/s10796-017-9822-7
DO - 10.1007/s10796-017-9822-7
M3 - Article
AN - SCOPUS:85040922240
SN - 1387-3326
SP - 1
EP - 13
JO - Information Systems Frontiers: a journal of research and innovation
JF - Information Systems Frontiers: a journal of research and innovation
ER -