An Alternative Exploitation of Isolation Forests for Outlier Detection

Antonella Mensi*, Alessio Franzoni, David M.J. Tax, Manuele Bicego

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

1 Citation (Scopus)

Abstract

Isolation Forests are one of the most successful outlier detection techniques: they isolate outliers by performing random splits in each node. It has been recently shown that a trained Random Forest-based model can also be used to define and extract informative distance measures between objects. Although their success has been shown mainly in the clustering field, we propose to extract these pairwise distances between the objects from an Isolation Forest and use them as input to a distance or density-based outlier detector. We show that the extracted distances from Isolation Forests are able to describe outliers meaningfully. We evaluate our technique on ten benchmark datasets for outlier detection: we employ three different distance measures and evaluate the obtained representation using a density-based classifier, the Local Outlier Factor. We also compare the methodology to the standard Isolation Forests scheme.

Original languageEnglish
Title of host publicationStructural, Syntactic, and Statistical Pattern Recognition - Joint IAPR International Workshops, S+SSPR 2020, Proceedings
EditorsAndrea Torsello, Luca Rossi, Marcello Pelillo, Battista Biggio, Antonio Robles-Kelly
PublisherSpringer
Pages34-44
Number of pages11
ISBN (Print)9783030739720
DOIs
Publication statusPublished - 2021
EventJoint IAPR International Workshops on Structural, Syntactic and Statistical Techniques in Pattern Recognition, S+SSPR 2020 - Padua, Italy
Duration: 21 Jan 202122 Jan 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12644 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceJoint IAPR International Workshops on Structural, Syntactic and Statistical Techniques in Pattern Recognition, S+SSPR 2020
Country/TerritoryItaly
CityPadua
Period21/01/2122/01/21

Keywords

  • Isolation forests
  • Outlier detection
  • Random forest-based similarity

Fingerprint

Dive into the research topics of 'An Alternative Exploitation of Isolation Forests for Outlier Detection'. Together they form a unique fingerprint.

Cite this