Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking

Gustavo Penha; Claudia Hauff

doi:10.1007/978-3-030-45439-5_46

Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking

^*Corresponding author for this work

Web Information Systems

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

22 Citations (Scopus)

Abstract

Neural ranking models are traditionally trained on a series of random batches, sampled uniformly from the entire training set. Curriculum learning has recently been shown to improve neural models’ effectiveness by sampling batches non-uniformly, going from easy to difficult instances during training. In the context of neural Information Retrieval (IR) curriculum learning has not been explored yet, and so it remains unclear (1) how to measure the difficulty of training instances and (2) how to transition from easy to difficult instances during training. To address both challenges and determine whether curriculum learning is beneficial for neural ranking models, we need large-scale datasets and a retrieval task that allows us to conduct a wide range of experiments. For this purpose, we resort to the task of conversation response ranking: ranking responses given the conversation history. In order to deal with challenge (1), we explore scoring functions to measure the difficulty of conversations based on different input spaces. To address challenge (2) we evaluate different pacing functions, which determine the velocity in which we go from easy to difficult instances. We find that, overall, by just intelligently sorting the training data (i.e., by performing curriculum learning) we can improve the retrieval effectiveness by up to 2% (The source code is available at https://github.com/Guzpenha/transformers_cl.).

Original language	English
Title of host publication	Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020
Subtitle of host publication	Proceedings
Editors	Joemon M. Jose, Emine Yilmaz, João Magalhães, Flávio Martins, Pablo Castells, Nicola Ferro, Mário J. Silva
Place of Publication	Cham
Publisher	Springer
Pages	699-713
Number of pages	15
ISBN (Electronic)	978-3-030-45439-5
ISBN (Print)	978-3-030-45438-8
DOIs	https://doi.org/10.1007/978-3-030-45439-5_46
Publication status	Published - 2020
Event	42nd European Conference on IR Research, ECIR 2020 - Lisbon, Portugal Duration: 14 Apr 2020 → 17 Apr 2020

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	12035
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	42nd European Conference on IR Research, ECIR 2020
Country/Territory	Portugal
City	Lisbon
Period	14/04/20 → 17/04/20

Keywords

Conversation response ranking
Curriculum learning

Access to Document

10.1007/978-3-030-45439-5_46

Cite this

Penha, G., & Hauff, C. (2020). Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking. In J. M. Jose, E. Yilmaz, J. Magalhães, F. Martins, P. Castells, N. Ferro, & M. J. Silva (Eds.), Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020: Proceedings (pp. 699-713). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12035 ). Springer. https://doi.org/10.1007/978-3-030-45439-5_46

Penha, Gustavo ; Hauff, Claudia. / Curriculum Learning Strategies for IR : An Empirical Study on Conversation Response Ranking. Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020: Proceedings. editor / Joemon M. Jose ; Emine Yilmaz ; João Magalhães ; Flávio Martins ; Pablo Castells ; Nicola Ferro ; Mário J. Silva. Cham : Springer, 2020. pp. 699-713 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{3d6216fbd31f47d4af0d911d3bccaccb,

title = "Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking",

abstract = "Neural ranking models are traditionally trained on a series of random batches, sampled uniformly from the entire training set. Curriculum learning has recently been shown to improve neural models{\textquoteright} effectiveness by sampling batches non-uniformly, going from easy to difficult instances during training. In the context of neural Information Retrieval (IR) curriculum learning has not been explored yet, and so it remains unclear (1) how to measure the difficulty of training instances and (2) how to transition from easy to difficult instances during training. To address both challenges and determine whether curriculum learning is beneficial for neural ranking models, we need large-scale datasets and a retrieval task that allows us to conduct a wide range of experiments. For this purpose, we resort to the task of conversation response ranking: ranking responses given the conversation history. In order to deal with challenge (1), we explore scoring functions to measure the difficulty of conversations based on different input spaces. To address challenge (2) we evaluate different pacing functions, which determine the velocity in which we go from easy to difficult instances. We find that, overall, by just intelligently sorting the training data (i.e., by performing curriculum learning) we can improve the retrieval effectiveness by up to 2% (The source code is available at https://github.com/Guzpenha/transformers_cl.).",

keywords = "Conversation response ranking, Curriculum learning",

author = "Gustavo Penha and Claudia Hauff",

year = "2020",

doi = "10.1007/978-3-030-45439-5_46",

language = "English",

isbn = "978-3-030-45438-8",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer",

pages = "699--713",

editor = "Jose, {Joemon M.} and Emine Yilmaz and Jo{\~a}o Magalh{\~a}es and Fl{\'a}vio Martins and Pablo Castells and Nicola Ferro and Silva, {M{\'a}rio J.}",

booktitle = "Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020",

note = "42nd European Conference on IR Research, ECIR 2020 ; Conference date: 14-04-2020 Through 17-04-2020",

}

Penha, G & Hauff, C 2020, Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking. in JM Jose, E Yilmaz, J Magalhães, F Martins, P Castells, N Ferro & MJ Silva (eds), Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020: Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12035 , Springer, Cham, pp. 699-713, 42nd European Conference on IR Research, ECIR 2020, Lisbon, Portugal, 14/04/20. https://doi.org/10.1007/978-3-030-45439-5_46

Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking. / Penha, Gustavo; Hauff, Claudia.
Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020: Proceedings. ed. / Joemon M. Jose; Emine Yilmaz; João Magalhães; Flávio Martins; Pablo Castells; Nicola Ferro; Mário J. Silva. Cham: Springer, 2020. p. 699-713 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12035 ).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Curriculum Learning Strategies for IR

T2 - 42nd European Conference on IR Research, ECIR 2020

AU - Penha, Gustavo

AU - Hauff, Claudia

PY - 2020

Y1 - 2020

N2 - Neural ranking models are traditionally trained on a series of random batches, sampled uniformly from the entire training set. Curriculum learning has recently been shown to improve neural models’ effectiveness by sampling batches non-uniformly, going from easy to difficult instances during training. In the context of neural Information Retrieval (IR) curriculum learning has not been explored yet, and so it remains unclear (1) how to measure the difficulty of training instances and (2) how to transition from easy to difficult instances during training. To address both challenges and determine whether curriculum learning is beneficial for neural ranking models, we need large-scale datasets and a retrieval task that allows us to conduct a wide range of experiments. For this purpose, we resort to the task of conversation response ranking: ranking responses given the conversation history. In order to deal with challenge (1), we explore scoring functions to measure the difficulty of conversations based on different input spaces. To address challenge (2) we evaluate different pacing functions, which determine the velocity in which we go from easy to difficult instances. We find that, overall, by just intelligently sorting the training data (i.e., by performing curriculum learning) we can improve the retrieval effectiveness by up to 2% (The source code is available at https://github.com/Guzpenha/transformers_cl.).

AB - Neural ranking models are traditionally trained on a series of random batches, sampled uniformly from the entire training set. Curriculum learning has recently been shown to improve neural models’ effectiveness by sampling batches non-uniformly, going from easy to difficult instances during training. In the context of neural Information Retrieval (IR) curriculum learning has not been explored yet, and so it remains unclear (1) how to measure the difficulty of training instances and (2) how to transition from easy to difficult instances during training. To address both challenges and determine whether curriculum learning is beneficial for neural ranking models, we need large-scale datasets and a retrieval task that allows us to conduct a wide range of experiments. For this purpose, we resort to the task of conversation response ranking: ranking responses given the conversation history. In order to deal with challenge (1), we explore scoring functions to measure the difficulty of conversations based on different input spaces. To address challenge (2) we evaluate different pacing functions, which determine the velocity in which we go from easy to difficult instances. We find that, overall, by just intelligently sorting the training data (i.e., by performing curriculum learning) we can improve the retrieval effectiveness by up to 2% (The source code is available at https://github.com/Guzpenha/transformers_cl.).

KW - Conversation response ranking

KW - Curriculum learning

UR - http://www.scopus.com/inward/record.url?scp=85083955385&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-45439-5_46

DO - 10.1007/978-3-030-45439-5_46

M3 - Conference contribution

AN - SCOPUS:85083955385

SN - 978-3-030-45438-8

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 699

EP - 713

BT - Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020

A2 - Jose, Joemon M.

A2 - Yilmaz, Emine

A2 - Magalhães, João

A2 - Martins, Flávio

A2 - Castells, Pablo

A2 - Ferro, Nicola

A2 - Silva, Mário J.

PB - Springer

CY - Cham

Y2 - 14 April 2020 through 17 April 2020

ER -

Penha G, Hauff C. Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking. In Jose JM, Yilmaz E, Magalhães J, Martins F, Castells P, Ferro N, Silva MJ, editors, Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020: Proceedings. Cham: Springer. 2020. p. 699-713. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-45439-5_46

Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this