Towards Minimal Necessary Data: The Case for Analyzing Training Data Requirements of Recommender Algorithms

Martha Larson; Alessandro Zito; Babak Loni; Paolo Cremonesi

doi:10.18122/B2VX12

Towards Minimal Necessary Data: The Case for Analyzing Training Data Requirements of Recommender Algorithms

Martha Larson, Alessandro Zito, Babak Loni, Paolo Cremonesi

Multimedia Computing

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

321 Downloads (Pure)

Abstract

This paper states the case for the principle of minimal necessary data: If two recommender algorithms achieve the same effectiveness, the better algorithm is the one that requires less user data. Applying this principle involves carrying out training data requirements analysis, which we argue should be adopted as best practice for the development and evaluation of recommender algorithms. We take
the position that responsible recommendation is recommendation that serves the people whose data it uses. To minimize the imposition on users’ privacy, it is important that a recommender system does not collect or store more user information than it absolutely needs. Further, algorithms using minimal necessary data reduce training time and address the cold start problem. To illustrate the trade-off between training data volume and accuracy, we carry out
a set of classic recommender system experiments. We conclude that
consistently applying training data requirements analysis would represent a relatively small change in researchers’ current practices, but a large step towards more responsible recommender systems.

Original language	English
Title of host publication	FATREC Workshop on Responsible Recommendation Proceedings
Pages	1-6
Number of pages	6
DOIs	https://doi.org/10.18122/B2VX12
Publication status	Published - 2017
Event	FATREC 2017: Workshop on Responsible Recommendation - Como, Italy Duration: 31 Aug 2017 → 31 Aug 2017 https://piret.gitlab.io/fatrec/

Workshop

Workshop	FATREC 2017
Country/Territory	Italy
City	Como
Period	31/08/17 → 31/08/17
Internet address	https://piret.gitlab.io/fatrec/

Access to Document

10.18122/B2VX12

35744357Final published version, 687 KBLicence: CC BY-SA

Cite this

@inproceedings{03b3462def44465bb02f011dc7a1574d,

title = "Towards Minimal Necessary Data: The Case for Analyzing Training Data Requirements of Recommender Algorithms",

abstract = "This paper states the case for the principle of minimal necessary data: If two recommender algorithms achieve the same effectiveness, the better algorithm is the one that requires less user data. Applying this principle involves carrying out training data requirements analysis, which we argue should be adopted as best practice for the development and evaluation of recommender algorithms. We takethe position that responsible recommendation is recommendation that serves the people whose data it uses. To minimize the imposition on users{\textquoteright} privacy, it is important that a recommender system does not collect or store more user information than it absolutely needs. Further, algorithms using minimal necessary data reduce training time and address the cold start problem. To illustrate the trade-off between training data volume and accuracy, we carry outa set of classic recommender system experiments. We conclude thatconsistently applying training data requirements analysis would represent a relatively small change in researchers{\textquoteright} current practices, but a large step towards more responsible recommender systems.",

author = "Martha Larson and Alessandro Zito and Babak Loni and Paolo Cremonesi",

year = "2017",

doi = "10.18122/B2VX12",

language = "English",

pages = "1--6",

booktitle = "FATREC Workshop on Responsible Recommendation Proceedings",

note = "FATREC 2017 : Workshop on Responsible Recommendation ; Conference date: 31-08-2017 Through 31-08-2017",

url = "https://piret.gitlab.io/fatrec/",

}

TY - GEN

T1 - Towards Minimal Necessary Data

T2 - FATREC 2017

AU - Larson, Martha

AU - Zito, Alessandro

AU - Loni, Babak

AU - Cremonesi, Paolo

PY - 2017

Y1 - 2017

N2 - This paper states the case for the principle of minimal necessary data: If two recommender algorithms achieve the same effectiveness, the better algorithm is the one that requires less user data. Applying this principle involves carrying out training data requirements analysis, which we argue should be adopted as best practice for the development and evaluation of recommender algorithms. We takethe position that responsible recommendation is recommendation that serves the people whose data it uses. To minimize the imposition on users’ privacy, it is important that a recommender system does not collect or store more user information than it absolutely needs. Further, algorithms using minimal necessary data reduce training time and address the cold start problem. To illustrate the trade-off between training data volume and accuracy, we carry outa set of classic recommender system experiments. We conclude thatconsistently applying training data requirements analysis would represent a relatively small change in researchers’ current practices, but a large step towards more responsible recommender systems.

AB - This paper states the case for the principle of minimal necessary data: If two recommender algorithms achieve the same effectiveness, the better algorithm is the one that requires less user data. Applying this principle involves carrying out training data requirements analysis, which we argue should be adopted as best practice for the development and evaluation of recommender algorithms. We takethe position that responsible recommendation is recommendation that serves the people whose data it uses. To minimize the imposition on users’ privacy, it is important that a recommender system does not collect or store more user information than it absolutely needs. Further, algorithms using minimal necessary data reduce training time and address the cold start problem. To illustrate the trade-off between training data volume and accuracy, we carry outa set of classic recommender system experiments. We conclude thatconsistently applying training data requirements analysis would represent a relatively small change in researchers’ current practices, but a large step towards more responsible recommender systems.

UR - http://resolver.tudelft.nl/uuid:03b3462d-ef44-465b-b02f-011dc7a1574d

U2 - 10.18122/B2VX12

DO - 10.18122/B2VX12

M3 - Conference contribution

SP - 1

EP - 6

BT - FATREC Workshop on Responsible Recommendation Proceedings

Y2 - 31 August 2017 through 31 August 2017

ER -

Towards Minimal Necessary Data: The Case for Analyzing Training Data Requirements of Recommender Algorithms

Abstract

Workshop

Access to Document

Other files and links

Fingerprint

Cite this