Dissimilarity-based ensembles for multiple instance learning

VV Cheplygina; DMJ Tax; M Loog

doi:10.1109/TNNLS.2015.2424254

Dissimilarity-based ensembles for multiple instance learning

VV Cheplygina, DMJ Tax, M Loog

Pattern Recognition and Bioinformatics

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

In multiple instance learning, objects are sets (bags) of feature vectors (instances) rather than individual feature vectors. In this paper, we address the problem of how these bags can best be represented. Two standard approaches are to use (dis)similarities between bags and prototype bags, or between bags and prototype instances. The first approach results in a relatively low-dimensional representation, determined by the number of training bags, whereas the second approach results in a relatively high-dimensional representation, determined by the total number of instances in the training set. However, an advantage of the latter representation is that the informativeness of the prototype instances can be inferred. In this paper, a third, intermediate approach is proposed, which links the two approaches and combines their strengths. Our classifier is inspired by a random subspace ensemble, and considers subspaces of the dissimilarity space, defined by subsets of instances, as prototypes. We provide insight into the structure of some popular multiple instance problems and show state-of-the-art performances on these data sets.

Original language	English
Pages (from-to)	1379-1391
Number of pages	13
Journal	IEEE Transactions on Neural Networks and Learning Systems
Volume	27
Issue number	6
DOIs	https://doi.org/10.1109/TNNLS.2015.2424254
Publication status	Published - 2016

Bibliographical note

harvest

Keywords

Combining classifiers
dissimilarity representation
multiple instance learning (MIL)
random subspacemethod (RSM)

Access to Document

10.1109/TNNLS.2015.2424254

Cite this

@article{1629ec1047ad4ba782420527c26c46fe,

title = "Dissimilarity-based ensembles for multiple instance learning",

abstract = "In multiple instance learning, objects are sets (bags) of feature vectors (instances) rather than individual feature vectors. In this paper, we address the problem of how these bags can best be represented. Two standard approaches are to use (dis)similarities between bags and prototype bags, or between bags and prototype instances. The first approach results in a relatively low-dimensional representation, determined by the number of training bags, whereas the second approach results in a relatively high-dimensional representation, determined by the total number of instances in the training set. However, an advantage of the latter representation is that the informativeness of the prototype instances can be inferred. In this paper, a third, intermediate approach is proposed, which links the two approaches and combines their strengths. Our classifier is inspired by a random subspace ensemble, and considers subspaces of the dissimilarity space, defined by subsets of instances, as prototypes. We provide insight into the structure of some popular multiple instance problems and show state-of-the-art performances on these data sets.",

keywords = "Combining classifiers, dissimilarity representation, multiple instance learning (MIL), random subspacemethod (RSM) ",

author = "VV Cheplygina and DMJ Tax and M Loog",

note = "harvest",

year = "2016",

doi = "10.1109/TNNLS.2015.2424254",

language = "English",

volume = "27",

pages = "1379--1391",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "3162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "6",

}

TY - JOUR

T1 - Dissimilarity-based ensembles for multiple instance learning

AU - Cheplygina, VV

AU - Tax, DMJ

AU - Loog, M

N1 - harvest

PY - 2016

Y1 - 2016

N2 - In multiple instance learning, objects are sets (bags) of feature vectors (instances) rather than individual feature vectors. In this paper, we address the problem of how these bags can best be represented. Two standard approaches are to use (dis)similarities between bags and prototype bags, or between bags and prototype instances. The first approach results in a relatively low-dimensional representation, determined by the number of training bags, whereas the second approach results in a relatively high-dimensional representation, determined by the total number of instances in the training set. However, an advantage of the latter representation is that the informativeness of the prototype instances can be inferred. In this paper, a third, intermediate approach is proposed, which links the two approaches and combines their strengths. Our classifier is inspired by a random subspace ensemble, and considers subspaces of the dissimilarity space, defined by subsets of instances, as prototypes. We provide insight into the structure of some popular multiple instance problems and show state-of-the-art performances on these data sets.

AB - In multiple instance learning, objects are sets (bags) of feature vectors (instances) rather than individual feature vectors. In this paper, we address the problem of how these bags can best be represented. Two standard approaches are to use (dis)similarities between bags and prototype bags, or between bags and prototype instances. The first approach results in a relatively low-dimensional representation, determined by the number of training bags, whereas the second approach results in a relatively high-dimensional representation, determined by the total number of instances in the training set. However, an advantage of the latter representation is that the informativeness of the prototype instances can be inferred. In this paper, a third, intermediate approach is proposed, which links the two approaches and combines their strengths. Our classifier is inspired by a random subspace ensemble, and considers subspaces of the dissimilarity space, defined by subsets of instances, as prototypes. We provide insight into the structure of some popular multiple instance problems and show state-of-the-art performances on these data sets.

KW - Combining classifiers

KW - dissimilarity representation

KW - multiple instance learning (MIL)

KW - random subspacemethod (RSM)

U2 - 10.1109/TNNLS.2015.2424254

DO - 10.1109/TNNLS.2015.2424254

M3 - Article

SN - 3162-237X

VL - 27

SP - 1379

EP - 1391

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 6

ER -

Dissimilarity-based ensembles for multiple instance learning

Abstract

Bibliographical note

Keywords

Access to Document

Fingerprint

Cite this