The Peaking Phenomenon in Semi-supervised Learning

Jesse Krijthe; Marco Loog

doi:10.1007/978-3-319-49055-7_27

The Peaking Phenomenon in Semi-supervised Learning

Pattern Recognition and Bioinformatics

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

3 Citations (Scopus)

Abstract

For the supervised least squares classifier, when the number of training objects is smaller than the dimensionality of the data, adding more data to the training set may first increase the error rate before decreasing it. This, possibly counterintuitive, phenomenon is known as peaking. In this work, we observe that a similar but more pronounced version of this phenomenon also occurs in the semi-supervised setting, where instead of labeled objects, unlabeled objects are added to the training set. We explain why the learning curve has a more steep incline and a more gradual decline in this setting through simulation studies and by applying an approximation of the learning curve based on the work by Raudys and Duin.

Original language	English
Title of host publication	Structural, Syntactic, and Statistical Pattern Recognition
Subtitle of host publication	Joint IAPR International Workshop, S+SSPR 2016, proceedings
Editors	A. Robles-Kelly, Marco Loog, B. Biggio, F. Escolano, R. Wilson
Place of Publication	Cham
Publisher	Springer
Pages	299-309
Number of pages	11
ISBN (Electronic)	978-3-319-49055-7
ISBN (Print)	978-3-319-49054-0
DOIs	https://doi.org/10.1007/978-3-319-49055-7_27
Publication status	Published - 2016
Event	SSPR Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR) - Mérida, Mexico Duration: 29 Nov 2016 → 2 Dec 2016

Publication series

Name	Lecture Notes in Computer Science
Volume	10029
ISSN (Print)	0302-9743

Workshop

Workshop	SSPR Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)
Country/Territory	Mexico
City	Mérida
Period	29/11/16 → 2/12/16

Keywords

Semi-supervised learning
Peaking
Least squares classfier
Pseudo-inverse

Access to Document

10.1007/978-3-319-49055-7_27

Cite this

Krijthe, J., & Loog, M. (2016). The Peaking Phenomenon in Semi-supervised Learning. In A. Robles-Kelly, M. Loog, B. Biggio, F. Escolano, & R. Wilson (Eds.), Structural, Syntactic, and Statistical Pattern Recognition: Joint IAPR International Workshop, S+SSPR 2016, proceedings (pp. 299-309). (Lecture Notes in Computer Science; Vol. 10029). Springer. https://doi.org/10.1007/978-3-319-49055-7_27

@inproceedings{eedff80eb79c400e817fde76ef50e3d9,

title = "The Peaking Phenomenon in Semi-supervised Learning",

abstract = "For the supervised least squares classifier, when the number of training objects is smaller than the dimensionality of the data, adding more data to the training set may first increase the error rate before decreasing it. This, possibly counterintuitive, phenomenon is known as peaking. In this work, we observe that a similar but more pronounced version of this phenomenon also occurs in the semi-supervised setting, where instead of labeled objects, unlabeled objects are added to the training set. We explain why the learning curve has a more steep incline and a more gradual decline in this setting through simulation studies and by applying an approximation of the learning curve based on the work by Raudys and Duin.",

keywords = "Semi-supervised learning, Peaking, Least squares classfier, Pseudo-inverse",

author = "Jesse Krijthe and Marco Loog",

year = "2016",

doi = "10.1007/978-3-319-49055-7_27",

language = "English",

isbn = "978-3-319-49054-0",

series = "Lecture Notes in Computer Science",

publisher = "Springer",

pages = "299--309",

editor = "A. Robles-Kelly and Marco Loog and B. Biggio and F. Escolano and R. Wilson",

booktitle = "Structural, Syntactic, and Statistical Pattern Recognition",

note = "SSPR Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR) ; Conference date: 29-11-2016 Through 02-12-2016",

}

Krijthe, J & Loog, M 2016, The Peaking Phenomenon in Semi-supervised Learning. in A Robles-Kelly, M Loog, B Biggio, F Escolano & R Wilson (eds), Structural, Syntactic, and Statistical Pattern Recognition: Joint IAPR International Workshop, S+SSPR 2016, proceedings. Lecture Notes in Computer Science, vol. 10029, Springer, Cham, pp. 299-309, SSPR Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR), Mérida, Mexico, 29/11/16. https://doi.org/10.1007/978-3-319-49055-7_27

The Peaking Phenomenon in Semi-supervised Learning. / Krijthe, Jesse ; Loog, Marco.
Structural, Syntactic, and Statistical Pattern Recognition: Joint IAPR International Workshop, S+SSPR 2016, proceedings. ed. / A. Robles-Kelly; Marco Loog; B. Biggio; F. Escolano; R. Wilson. Cham: Springer, 2016. p. 299-309 (Lecture Notes in Computer Science; Vol. 10029).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - The Peaking Phenomenon in Semi-supervised Learning

AU - Krijthe, Jesse

AU - Loog, Marco

PY - 2016

Y1 - 2016

N2 - For the supervised least squares classifier, when the number of training objects is smaller than the dimensionality of the data, adding more data to the training set may first increase the error rate before decreasing it. This, possibly counterintuitive, phenomenon is known as peaking. In this work, we observe that a similar but more pronounced version of this phenomenon also occurs in the semi-supervised setting, where instead of labeled objects, unlabeled objects are added to the training set. We explain why the learning curve has a more steep incline and a more gradual decline in this setting through simulation studies and by applying an approximation of the learning curve based on the work by Raudys and Duin.

AB - For the supervised least squares classifier, when the number of training objects is smaller than the dimensionality of the data, adding more data to the training set may first increase the error rate before decreasing it. This, possibly counterintuitive, phenomenon is known as peaking. In this work, we observe that a similar but more pronounced version of this phenomenon also occurs in the semi-supervised setting, where instead of labeled objects, unlabeled objects are added to the training set. We explain why the learning curve has a more steep incline and a more gradual decline in this setting through simulation studies and by applying an approximation of the learning curve based on the work by Raudys and Duin.

KW - Semi-supervised learning

KW - Peaking

KW - Least squares classfier

KW - Pseudo-inverse

U2 - 10.1007/978-3-319-49055-7_27

DO - 10.1007/978-3-319-49055-7_27

M3 - Conference contribution

SN - 978-3-319-49054-0

T3 - Lecture Notes in Computer Science

SP - 299

EP - 309

BT - Structural, Syntactic, and Statistical Pattern Recognition

A2 - Robles-Kelly, A.

A2 - Loog, Marco

A2 - Biggio, B.

A2 - Escolano, F.

A2 - Wilson, R.

PB - Springer

CY - Cham

T2 - SSPR Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)

Y2 - 29 November 2016 through 2 December 2016

ER -

The Peaking Phenomenon in Semi-supervised Learning

Abstract

Publication series

Workshop

Keywords

Access to Document

Fingerprint

Cite this