TY - GEN
T1 - A novel surrogate-assisted evolutionary algorithm applied to partition-based ensemble learning
AU - Dushatskiy, Arkadiy
AU - Alderliesten, Tanja
AU - Bosman, Peter A.N.
PY - 2021
Y1 - 2021
N2 - We propose a novel surrogate-assisted Evolutionary Algorithm for solving expensive combinatorial optimization problems. We integrate a surrogate model, which is used for fitness value estimation, into a state-of-the-art P3-like variant of the Gene-Pool Optimal Mixing Algorithm (GOMEA) and adapt the resulting algorithm for solving non-binary combinatorial problems. We test the proposed algorithm on an ensemble learning problem. Ensembling several models is a common Machine Learning technique to achieve better performance. We consider ensembles of several models trained on disjoint subsets of a dataset. Finding the best dataset partitioning is naturally a combinatorial non-binary optimization problem. Fitness function evaluations can be extremely expensive if complex models, such as Deep Neural Networks, are used as learners in an ensemble. Therefore, the number of fitness function evaluations is typically limited, necessitating expensive optimization techniques. In our experiments we use five classification datasets from the OpenML-CC18 benchmark and Support-vector Machines as learners in an ensemble. The proposed algorithm demonstrates better performance than alternative approaches, including Bayesian optimization algorithms. It manages to find better solutions using just several thousand fitness function evaluations for an ensemble learning problem with up to 500 variables.
KW - Ensemble learning
KW - Expensive combinatorial optimization
KW - Surrogate-assisted evolutionary algorithms
UR - http://www.scopus.com/inward/record.url?scp=85110096162&partnerID=8YFLogxK
U2 - 10.1145/3449639.3459306
DO - 10.1145/3449639.3459306
M3 - Conference contribution
AN - SCOPUS:85110096162
T3 - GECCO 2021 - Proceedings of the 2021 Genetic and Evolutionary Computation Conference
SP - 583
EP - 591
BT - GECCO 2021 - Proceedings of the 2021 Genetic and Evolutionary Computation Conference
PB - ACM
T2 - 2021 Genetic and Evolutionary Computation Conference, GECCO 2021
Y2 - 10 July 2021 through 14 July 2021
ER -