Minimizers of the empirical risk and risk monotonicity

M. Loog; T.J. Viering; A. Mey

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Abstract

Plotting a learner’s average performance against the number of training samples results in a learning curve. Studying such curves on one or more data sets is a way to better understand the generalization properties of this learner. Learning curves are, however, not very well understood and can display behavior that most researchers find quite unexpected. Our work introduces the formal notion of risk monotonicity, which asks that the risk not deteriorate, in expectation over the training samples, as the training set size increases. We then present the surprising result that various standard learners, specifically those that minimize the empirical risk, can act nonmonotonically irrespective of the training sample size. We provide a theoretical underpinning for specific instantiations from classification, regression, and density estimation. Altogether, the proposed monotonicity notion opens up a whole new direction of research.
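To make the notion in the abstract concrete, the following minimal sketch computes the exact expected risk of an empirical risk minimizer on a tiny discrete problem and compares two consecutive training set sizes. Everything in it is an illustrative assumption rather than the paper's own construction: the two-point distribution `POINTS`/`PROBS`, the model h(x) = w·x without intercept, the squared loss, and the helper names `true_risk`, `erm_slope`, and `expected_risk` are all hypothetical. It only exhibits a single increase in expected risk from one to two training samples, not the stronger result stated in the abstract.

```python
from math import comb

# Hypothetical two-point distribution over (x, y) pairs (an assumption,
# chosen for illustration; not taken from the paper).
POINTS = [(1.0, 1.0), (0.1, 1.0)]
PROBS = [0.001, 0.999]


def true_risk(w):
    """Expected squared loss of h(x) = w * x under the distribution above."""
    return sum(p * (w * x - y) ** 2 for (x, y), p in zip(POINTS, PROBS))


def erm_slope(k, n):
    """Least-squares slope (no intercept) for a training set of size n
    containing k copies of the first point and n - k of the second."""
    (xa, ya), (xb, yb) = POINTS
    numerator = k * xa * ya + (n - k) * xb * yb
    denominator = k * xa ** 2 + (n - k) * xb ** 2
    return numerator / denominator


def expected_risk(n):
    """Exact expectation of the true risk over all i.i.d. samples of size n.
    Only the count k of first-point draws matters, so we sum over k."""
    pa = PROBS[0]
    return sum(
        comb(n, k) * pa ** k * (1 - pa) ** (n - k) * true_risk(erm_slope(k, n))
        for k in range(n + 1)
    )


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, expected_risk(n))
```

Under these assumptions the expected risk at n = 2 exceeds the expected risk at n = 1, so this learner is not risk monotonic at that step: adding a training sample makes the learner worse on average.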

Original language	English
Title of host publication	Neural Information Processing Systems
Number of pages	11
Publication status	Published - 2019
Event	33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019 - Vancouver, Canada
Duration	8 Dec 2019 → 14 Dec 2019

Publication series

Name	Advances in Neural Information Processing Systems

Conference

Conference	33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019
Country/Territory	Canada
City	Vancouver
Period	8/12/19 → 14/12/19

Access to Document

NeurIPS-2019-minimizers-of-the-empirical-risk-and-risk-monotonicity-Paper (Accepted author manuscript, 337 KB)

https://proceedings.neurips.cc/paper/2019/hash/0f9cafd014db7a619ddb4276af0d692c-Abstract.html

Cite this

@inproceedings{ccd5f21cb35441d285f768f10fe20db8,

title = "Minimizers of the empirical risk and risk monotonicity",

abstract = "Plotting a learner{\textquoteright}s average performance against the number of training samples results in a learning curve. Studying such curves on one or more data sets is a way to get to a better understanding of the generalization properties of this learner. The behavior of learning curves is, however, not very well understood and can display (for most researchers) quite unexpected behavior. Our work introduces the formal notion of risk monotonicity, which asks the risk to not deteriorate with increasing training set sizes in expectation over the training samples. We then present the surprising result that various standard learners, specifically those that minimize the empirical risk, can act nonmonotonically irrespective of the training sample size. We provide a theoretical underpinning for specific instantiations from classification, regression, and density estimation. Altogether, the proposed monotonicity notion opens up a whole new direction of research. ",

author = "M. Loog and T.J. Viering and A. Mey",

year = "2019",

language = "English",

series = "Advances in Neural Information Processing Systems",

booktitle = "Neural Information Processing Systems",

note = "33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019 ; Conference date: 08-12-2019 Through 14-12-2019",

}

Loog, M, Viering, TJ & Mey, A 2019, Minimizers of the empirical risk and risk monotonicity. in Neural Information Processing Systems. Advances in Neural Information Processing Systems, 33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019, Vancouver, Canada, 8/12/19. <https://proceedings.neurips.cc/paper/2019/hash/0f9cafd014db7a619ddb4276af0d692c-Abstract.html>

TY - GEN

T1 - Minimizers of the empirical risk and risk monotonicity

AU - Loog, M.

AU - Viering, T.J.

AU - Mey, A.

PY - 2019

Y1 - 2019

AB - Plotting a learner’s average performance against the number of training samples results in a learning curve. Studying such curves on one or more data sets is a way to get to a better understanding of the generalization properties of this learner. The behavior of learning curves is, however, not very well understood and can display (for most researchers) quite unexpected behavior. Our work introduces the formal notion of risk monotonicity, which asks the risk to not deteriorate with increasing training set sizes in expectation over the training samples. We then present the surprising result that various standard learners, specifically those that minimize the empirical risk, can act nonmonotonically irrespective of the training sample size. We provide a theoretical underpinning for specific instantiations from classification, regression, and density estimation. Altogether, the proposed monotonicity notion opens up a whole new direction of research.

M3 - Conference contribution

T3 - Advances in Neural Information Processing Systems

BT - Neural Information Processing Systems

T2 - 33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019

Y2 - 8 December 2019 through 14 December 2019

ER -
