Skip to main navigation Skip to search Skip to main content

Exploring the Search Space of Neural Network Combinations obtained with Efficient Model Stitching

Arthur Guijt, Dirk Thierens, Tanja Alderliesten, Peter A.N. Bosman

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

26 Downloads (Pure)

Abstract

Machine learning models can be made more performant and their predictions more consistent by creating an ensemble. Each neural network in an ensemble commonly performs its own feature extraction. These features are often highly similar, leading to potentially many redundant calculations. Unifying these calculations (i.e., reusing some of them) would be desirable to reduce computational cost. However, splicing two trained networks is non-trivial because architectures and feature representations typically differ, leading to a performance breakdown. To overcome this issue, we propose to employ stitching, which introduces new layers at crossover points. Essentially, a new network consisting of the two basis networks is constructed. In this network, new links between the two basis networks are created through the introduction and training of stitches. New networks can then be created by choosing which stitching layers to (not) use, thereby selecting a subnetwork. Akin to a supernetwork, assessing the performance of a selected subnetwork is efficient, as only their evaluation on data is required. We experimentally show that our proposed approach enables finding networks that represent novel trade-offs between performance and computational cost compared to classical ensembles, with some new networks even dominating the original networks.
Original languageEnglish
Title of host publicationGECCO '24 Companion
Subtitle of host publicationProceedings of the Genetic and Evolutionary Computation Conference Companion
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery (ACM)
Pages1914-1923
Number of pages10
ISBN (Electronic)979-8-4007-0495-6
DOIs
Publication statusPublished - 2024
Event2024 Genetic and Evolutionary Computation Conference - Melbourne Convention and Exhibition Centre (MCEC), Melbourne, Australia
Duration: 14 Jul 202418 Jul 2024
https://gecco-2024.sigevo.org/HomePage

Conference

Conference2024 Genetic and Evolutionary Computation Conference
Abbreviated titleGECCO 2024
Country/TerritoryAustralia
CityMelbourne
Period14/07/2418/07/24
Internet address

Keywords

  • ensembles
  • neural architecture search
  • neuroevolution
  • stitching

Fingerprint

Dive into the research topics of 'Exploring the Search Space of Neural Network Combinations obtained with Efficient Model Stitching'. Together they form a unique fingerprint.

Cite this