New Insights into Metric Optimization for Ranking-based Recommendation

Roger Zhe Li; Julián Urbano; Alan Hanjalic

doi:10.1145/3404835.3462973

New Insights into Metric Optimization for Ranking-based Recommendation

Roger Zhe Li, Julián Urbano, Alan Hanjalic

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

3 Citations (Scopus)

39 Downloads (Pure)

Abstract

Direct optimization of IR metrics has often been adopted as an approach to devise and develop ranking-based recommender systems. Most methods following this approach (e.g. TFMAP, CLiMF, Top-N-Rank) aim at optimizing the same metric being used for evaluation, under the assumption that this will lead to the best performance. A number of studies of this practice bring this assumption, however, into question. In this paper, we dig deeper into this issue in order to learn more about the effects of the choice of the metric to optimize on the performance of a ranking-based recommender system. We present an extensive experimental study conducted on different datasets in both pairwise and listwise learning-to-rank (LTR) scenarios, to compare the relative merit of four popular IR metrics, namely RR, AP, nDCG and RBP, when used for optimization and assessment of recommender systems in various combinations. For the first three, we follow the practice of loss function formulation available in literature. For the fourth one, we propose novel loss functions inspired by RBP for both the pairwise and listwise scenario. Our results confirm that the best performance is indeed not necessarily achieved when optimizing the same metric being used for evaluation. In fact, we find that RBP-inspired losses perform at least as well as other metrics in a consistent way, and offer clear benefits in several cases. Interesting to see is that RBP-inspired losses, while improving the recommendation performance for all uses, may lead to an individual performance gain that is correlated with the activity level of a user in interacting with items. The more active the users, the more they benefit. Overall, our results challenge the assumption behind the current research practice of optimizing and evaluating the same metric, and point to RBP-based optimization instead as a promising alternative when learning to rank in the recommendation context.

Original language	English
Title of host publication	SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
Publisher	Association for Computing Machinery (ACM)
Pages	932-941
Number of pages	10
ISBN (Electronic)	9781450380379
DOIs	https://doi.org/10.1145/3404835.3462973
Publication status	Published - 2021
Event	44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021 - Virtual, Online, Canada Duration: 11 Jul 2021 → 15 Jul 2021

Publication series

Name	SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

Conference

Conference	44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021
Country/Territory	Canada
City	Virtual, Online
Period	11/07/21 → 15/07/21

Keywords

evaluation metrics
learning to rank
recommender systems

Access to Document

10.1145/3404835.3462973

3404835.3462973Final published version, 1.52 MBLicence: CC BY

Cite this

Li, R. Z., Urbano, J., & Hanjalic, A. (2021). New Insights into Metric Optimization for Ranking-based Recommendation. In SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 932-941). Article 3462973 (SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval). Association for Computing Machinery (ACM). https://doi.org/10.1145/3404835.3462973

Li, Roger Zhe ; Urbano, Julián ; Hanjalic, Alan. / New Insights into Metric Optimization for Ranking-based Recommendation. SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery (ACM), 2021. pp. 932-941 (SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval).

@inproceedings{16e5a6703fd04ce3a35f54ca6fc8cc1e,

title = "New Insights into Metric Optimization for Ranking-based Recommendation",

abstract = "Direct optimization of IR metrics has often been adopted as an approach to devise and develop ranking-based recommender systems. Most methods following this approach (e.g. TFMAP, CLiMF, Top-N-Rank) aim at optimizing the same metric being used for evaluation, under the assumption that this will lead to the best performance. A number of studies of this practice bring this assumption, however, into question. In this paper, we dig deeper into this issue in order to learn more about the effects of the choice of the metric to optimize on the performance of a ranking-based recommender system. We present an extensive experimental study conducted on different datasets in both pairwise and listwise learning-to-rank (LTR) scenarios, to compare the relative merit of four popular IR metrics, namely RR, AP, nDCG and RBP, when used for optimization and assessment of recommender systems in various combinations. For the first three, we follow the practice of loss function formulation available in literature. For the fourth one, we propose novel loss functions inspired by RBP for both the pairwise and listwise scenario. Our results confirm that the best performance is indeed not necessarily achieved when optimizing the same metric being used for evaluation. In fact, we find that RBP-inspired losses perform at least as well as other metrics in a consistent way, and offer clear benefits in several cases. Interesting to see is that RBP-inspired losses, while improving the recommendation performance for all uses, may lead to an individual performance gain that is correlated with the activity level of a user in interacting with items. The more active the users, the more they benefit. Overall, our results challenge the assumption behind the current research practice of optimizing and evaluating the same metric, and point to RBP-based optimization instead as a promising alternative when learning to rank in the recommendation context.",

keywords = "evaluation metrics, learning to rank, recommender systems",

author = "Li, {Roger Zhe} and Juli{\'a}n Urbano and Alan Hanjalic",

year = "2021",

doi = "10.1145/3404835.3462973",

language = "English",

series = "SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval",

publisher = "Association for Computing Machinery (ACM)",

pages = "932--941",

booktitle = "SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval",

address = "United States",

note = "44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021 ; Conference date: 11-07-2021 Through 15-07-2021",

}

Li, RZ , Urbano, J & Hanjalic, A 2021, New Insights into Metric Optimization for Ranking-based Recommendation. in SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval., 3462973, SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Association for Computing Machinery (ACM), pp. 932-941, 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021, Virtual, Online, Canada, 11/07/21. https://doi.org/10.1145/3404835.3462973

New Insights into Metric Optimization for Ranking-based Recommendation. / Li, Roger Zhe ; Urbano, Julián ; Hanjalic, Alan.
SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery (ACM), 2021. p. 932-941 3462973 (SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - New Insights into Metric Optimization for Ranking-based Recommendation

AU - Li, Roger Zhe

AU - Urbano, Julián

AU - Hanjalic, Alan

PY - 2021

Y1 - 2021

N2 - Direct optimization of IR metrics has often been adopted as an approach to devise and develop ranking-based recommender systems. Most methods following this approach (e.g. TFMAP, CLiMF, Top-N-Rank) aim at optimizing the same metric being used for evaluation, under the assumption that this will lead to the best performance. A number of studies of this practice bring this assumption, however, into question. In this paper, we dig deeper into this issue in order to learn more about the effects of the choice of the metric to optimize on the performance of a ranking-based recommender system. We present an extensive experimental study conducted on different datasets in both pairwise and listwise learning-to-rank (LTR) scenarios, to compare the relative merit of four popular IR metrics, namely RR, AP, nDCG and RBP, when used for optimization and assessment of recommender systems in various combinations. For the first three, we follow the practice of loss function formulation available in literature. For the fourth one, we propose novel loss functions inspired by RBP for both the pairwise and listwise scenario. Our results confirm that the best performance is indeed not necessarily achieved when optimizing the same metric being used for evaluation. In fact, we find that RBP-inspired losses perform at least as well as other metrics in a consistent way, and offer clear benefits in several cases. Interesting to see is that RBP-inspired losses, while improving the recommendation performance for all uses, may lead to an individual performance gain that is correlated with the activity level of a user in interacting with items. The more active the users, the more they benefit. Overall, our results challenge the assumption behind the current research practice of optimizing and evaluating the same metric, and point to RBP-based optimization instead as a promising alternative when learning to rank in the recommendation context.

AB - Direct optimization of IR metrics has often been adopted as an approach to devise and develop ranking-based recommender systems. Most methods following this approach (e.g. TFMAP, CLiMF, Top-N-Rank) aim at optimizing the same metric being used for evaluation, under the assumption that this will lead to the best performance. A number of studies of this practice bring this assumption, however, into question. In this paper, we dig deeper into this issue in order to learn more about the effects of the choice of the metric to optimize on the performance of a ranking-based recommender system. We present an extensive experimental study conducted on different datasets in both pairwise and listwise learning-to-rank (LTR) scenarios, to compare the relative merit of four popular IR metrics, namely RR, AP, nDCG and RBP, when used for optimization and assessment of recommender systems in various combinations. For the first three, we follow the practice of loss function formulation available in literature. For the fourth one, we propose novel loss functions inspired by RBP for both the pairwise and listwise scenario. Our results confirm that the best performance is indeed not necessarily achieved when optimizing the same metric being used for evaluation. In fact, we find that RBP-inspired losses perform at least as well as other metrics in a consistent way, and offer clear benefits in several cases. Interesting to see is that RBP-inspired losses, while improving the recommendation performance for all uses, may lead to an individual performance gain that is correlated with the activity level of a user in interacting with items. The more active the users, the more they benefit. Overall, our results challenge the assumption behind the current research practice of optimizing and evaluating the same metric, and point to RBP-based optimization instead as a promising alternative when learning to rank in the recommendation context.

KW - evaluation metrics

KW - learning to rank

KW - recommender systems

UR - http://www.scopus.com/inward/record.url?scp=85111656074&partnerID=8YFLogxK

U2 - 10.1145/3404835.3462973

DO - 10.1145/3404835.3462973

M3 - Conference contribution

AN - SCOPUS:85111656074

T3 - SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

SP - 932

EP - 941

BT - SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

PB - Association for Computing Machinery (ACM)

T2 - 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021

Y2 - 11 July 2021 through 15 July 2021

ER -

Li RZ , Urbano J , Hanjalic A. New Insights into Metric Optimization for Ranking-based Recommendation. In SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery (ACM). 2021. p. 932-941. 3462973. (SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval). doi: 10.1145/3404835.3462973

New Insights into Metric Optimization for Ranking-based Recommendation

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Metric Optimization and Mainstream Bias Mitigation in Recommender Systems

Cite this