The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier

CJ Veenman; MJT Reinders

doi:10.1109/TPAMI.2005.187

The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier

CJ Veenman, MJT Reinders

Research output: Contribution to journal › Article › Scientific › peer-review

125 Citations (Scopus)

Abstract

We present the Nearest Subclass Classifier (NSC), which is a classification algorithm that unifies the flexibility of the nearest neighbor classifier with the robustness of the nearest mean classifier. The algorithm is based on the Maximum Variance Cluster algorithm and, as such, it belongs to the class of prototype-based classifiers. The variance constraint parameter of the cluster algorithm serves to regularize the classifier, that is, to prevent overfitting. With a low variance constraint value, the classifier turns into the nearest neighbor classifier and, with a high variance parameter, it becomes the nearest mean classifier with the respective properties. In other words, the number of prototypes ranges from the whole training set to only one per class. In the experiments, we compared the NSC with regard to its performance and data set compression ratio to several other prototype-based methods. On several data sets, the NSC performed similarly to the k-nearest neighbor classifier, which is a well-established classifier in many domains. Also concerning storage requirements and classification speed, the NSC has favorable properties, so it gives a good compromise between classification performance and efficiency.

Original language	Undefined/Unknown
Pages (from-to)	1417-1429
Number of pages	13
Journal	IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume	27
Issue number	9
DOIs	https://doi.org/10.1109/TPAMI.2005.187
Publication status	Published - 2005

Keywords

academic journal papers
ZX CWTS 1.00 <= JFIS < 3.00

Access to Document

10.1109/TPAMI.2005.187

Cite this

@article{5b320f029cc141bf8d25103710238dca,

title = "The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier",

abstract = "We present the Nearest Subclass Classifier (NSC), which is a classification algorithm that unifies the flexibility of the nearest neighbor classifier with the robustness of the nearest mean classifier. The algorithm is based on the Maximum Variance Cluster algorithm and, as such, it belongs to the class of prototype-based classifiers. The variance constraint parameter of the cluster algorithm serves to regularize the classifier, that is, to prevent overfitting. With a low variance constraint value, the classifier turns into the nearest neighbor classifier and, with a high variance parameter, it becomes the nearest mean classifier with the respective properties. In other words, the number of prototypes ranges from the whole training set to only one per class. In the experiments, we compared the NSC with regard to its performance and data set compression ratio to several other prototype-based methods. On several data sets, the NSC performed similarly to the k-nearest neighbor classifier, which is a well-established classifier in many domains. Also concerning storage requirements and classification speed, the NSC has favorable properties, so it gives a good compromise between classification performance and efficiency.",

keywords = "academic journal papers, ZX CWTS 1.00 <= JFIS < 3.00",

author = "CJ Veenman and MJT Reinders",

year = "2005",

doi = "10.1109/TPAMI.2005.187",

language = "Undefined/Unknown",

volume = "27",

pages = "1417--1429",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE",

number = "9",

}

TY - JOUR

T1 - The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier

AU - Veenman, CJ

AU - Reinders, MJT

PY - 2005

Y1 - 2005

N2 - We present the Nearest Subclass Classifier (NSC), which is a classification algorithm that unifies the flexibility of the nearest neighbor classifier with the robustness of the nearest mean classifier. The algorithm is based on the Maximum Variance Cluster algorithm and, as such, it belongs to the class of prototype-based classifiers. The variance constraint parameter of the cluster algorithm serves to regularize the classifier, that is, to prevent overfitting. With a low variance constraint value, the classifier turns into the nearest neighbor classifier and, with a high variance parameter, it becomes the nearest mean classifier with the respective properties. In other words, the number of prototypes ranges from the whole training set to only one per class. In the experiments, we compared the NSC with regard to its performance and data set compression ratio to several other prototype-based methods. On several data sets, the NSC performed similarly to the k-nearest neighbor classifier, which is a well-established classifier in many domains. Also concerning storage requirements and classification speed, the NSC has favorable properties, so it gives a good compromise between classification performance and efficiency.

AB - We present the Nearest Subclass Classifier (NSC), which is a classification algorithm that unifies the flexibility of the nearest neighbor classifier with the robustness of the nearest mean classifier. The algorithm is based on the Maximum Variance Cluster algorithm and, as such, it belongs to the class of prototype-based classifiers. The variance constraint parameter of the cluster algorithm serves to regularize the classifier, that is, to prevent overfitting. With a low variance constraint value, the classifier turns into the nearest neighbor classifier and, with a high variance parameter, it becomes the nearest mean classifier with the respective properties. In other words, the number of prototypes ranges from the whole training set to only one per class. In the experiments, we compared the NSC with regard to its performance and data set compression ratio to several other prototype-based methods. On several data sets, the NSC performed similarly to the k-nearest neighbor classifier, which is a well-established classifier in many domains. Also concerning storage requirements and classification speed, the NSC has favorable properties, so it gives a good compromise between classification performance and efficiency.

KW - academic journal papers

KW - ZX CWTS 1.00 <= JFIS < 3.00

U2 - 10.1109/TPAMI.2005.187

DO - 10.1109/TPAMI.2005.187

M3 - Article

SN - 0162-8828

VL - 27

SP - 1417

EP - 1429

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

IS - 9

ER -