TY - JOUR
T1 - Solving bin-packing problems under privacy preservation
T2 - Possibilities and trade-offs
AU - Hoogervorst, Rowan
AU - Zhang, Yingqian
AU - Tillem, Gamze
AU - Erkin, Zekeriya
AU - Verwer, Sicco
PY - 2019
Y1 - 2019
N2 - We investigate the trade-off between privacy and solution quality that occurs when a k-anonymized database is used as input to the bin-packing optimization problem. To investigate the impact of the chosen anonymization method on this trade-off, we consider two recoding methods for k-anonymity: full-domain generalization and partition-based single-dimensional recoding. To deal with the uncertainty created by anonymization in the bin-packing problem, we utilize stochastic programming and robust optimization methods. Our computational results show that the trade-off is strongly dependent on both the anonymization and optimization method. On the anonymization side, we see that using single dimensional recoding leads to significantly better solution quality than using full domain generalization. On the optimization side, we see that using stochastic programming, where we use the multiset of values in an equivalence class, considerably improves the solutions. While publishing these multisets makes the database more vulnerable to a table linkage attack, we argue that it is up to the data publisher to reason if such a loss of anonymization weighs up to the increase in optimization performance.
AB - We investigate the trade-off between privacy and solution quality that occurs when a k-anonymized database is used as input to the bin-packing optimization problem. To investigate the impact of the chosen anonymization method on this trade-off, we consider two recoding methods for k-anonymity: full-domain generalization and partition-based single-dimensional recoding. To deal with the uncertainty created by anonymization in the bin-packing problem, we utilize stochastic programming and robust optimization methods. Our computational results show that the trade-off is strongly dependent on both the anonymization and optimization method. On the anonymization side, we see that using single dimensional recoding leads to significantly better solution quality than using full domain generalization. On the optimization side, we see that using stochastic programming, where we use the multiset of values in an equivalence class, considerably improves the solutions. While publishing these multisets makes the database more vulnerable to a table linkage attack, we argue that it is up to the data publisher to reason if such a loss of anonymization weighs up to the increase in optimization performance.
KW - Bin-packing
KW - Data anonymization
KW - k-anonymity
KW - Robust optimization
KW - Stochastic programming
UR - http://www.scopus.com/inward/record.url?scp=85066619466&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2019.05.011
DO - 10.1016/j.ins.2019.05.011
M3 - Article
SN - 0020-0255
VL - 500
SP - 203
EP - 216
JO - Information Sciences
JF - Information Sciences
ER -