A Study of Learning Search Approximation in Mixed Integer Branch and Bound: Node Selection in SCIP

Kaan Yilmaz; Neil Yorke-Smith

doi:10.3390/ai2020010

Algorithmics

Research output: Contribution to journal › Article › Scientific › peer-review

9 Citations (Scopus)

41 Downloads (Pure)

Abstract

In line with the growing trend of using machine learning to help solve combinatorial optimisation problems, one promising idea is to improve node selection within a mixed integer programming (MIP) branch-and-bound tree by using a learned policy. Previous work using imitation learning indicates the feasibility of acquiring a node selection policy, by learning an adaptive node searching order. In contrast, our imitation learning policy is focused solely on learning which of a node’s children to select. We present an offline method to learn such a policy in two settings: one that comprises a heuristic by committing to pruning of nodes; one that is exact and backtracks from a leaf to guarantee finding the optimal integer solution. The former setting corresponds to a child selector during plunging, while the latter is akin to a diving heuristic. We apply the policy within the popular open-source solver SCIP, in both heuristic and exact settings. Empirical results on five MIP datasets indicate that our node selection policy leads to solutions significantly more quickly than the state-of-the-art precedent in the literature. While we do not beat the highly-optimised SCIP state-of-practice baseline node selector in terms of solving time on exact solutions, our heuristic policies have a consistently better optimality gap than all baselines, if the accuracy of the predictive model is sufficient. Further, the results also indicate that, when a time limit is applied, our heuristic method finds better solutions than all baselines in the majority of problems tested. We explain the results by showing that the learned policies have imitated the SCIP baseline, but without the latter’s early plunge abort. Our recommendation is that, despite the clear improvements over the literature, this kind of MIP child selector is better seen in a broader approach to using learning in MIP branch-and-bound tree decisions.
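The two settings described in the abstract can be illustrated on a toy problem. The following is a minimal sketch, not the authors' implementation and not SCIP code: it runs a depth-first branch and bound on a small 0-1 knapsack instance, and every name in it (`prefer_higher_value`, `search`, the instance data) is hypothetical. The hand-written `prefer_higher_value` function stands in for the learned child selector. In the heuristic setting the non-selected child is pruned for good; in the exact setting it is kept for backtracking, so the optimum is still found.

```python
from typing import Callable, List, Tuple

# Toy 0-1 knapsack instance (illustrative data, not from the paper).
VALUES = [6, 10, 12]
WEIGHTS = [1, 2, 3]
CAPACITY = 5

# A node is (index of next item to decide, weight used, value collected).
Node = Tuple[int, int, int]


def children(node: Node) -> List[Node]:
    """Return the feasible children: take the next item (if it fits), or skip it."""
    i, w, v = node
    kids = []
    if w + WEIGHTS[i] <= CAPACITY:
        kids.append((i + 1, w + WEIGHTS[i], v + VALUES[i]))
    kids.append((i + 1, w, v))
    return kids


def bound(node: Node) -> int:
    """Optimistic bound: collected value plus the value of all undecided items."""
    i, _, v = node
    return v + sum(VALUES[i:])


def prefer_higher_value(kids: List[Node]) -> int:
    """Hypothetical stand-in for a learned policy: index of the preferred child."""
    return max(range(len(kids)), key=lambda k: kids[k][2])


def search(policy: Callable[[List[Node]], int], exact: bool) -> int:
    """Depth-first branch and bound guided by a child-selection policy.

    exact=False: commit to the selected child and prune its sibling (heuristic).
    exact=True:  keep the sibling on the stack and backtrack to it later.
    """
    best = -1
    stack: List[Node] = [(0, 0, 0)]
    while stack:
        node = stack.pop()
        if bound(node) <= best:
            continue  # cannot improve on the incumbent
        if node[0] == len(VALUES):
            best = node[2]  # leaf: new incumbent (bound check ensures it is better)
            continue
        kids = children(node)
        pick = policy(kids)
        if exact:
            # Non-selected children go on the stack first, so they are visited later.
            stack.extend(k for j, k in enumerate(kids) if j != pick)
        stack.append(kids[pick])
    return best


heuristic_value = search(prefer_higher_value, exact=False)
exact_value = search(prefer_higher_value, exact=True)
print(heuristic_value, exact_value)
```

On this instance the single committed dive returns a feasible but suboptimal solution, while the backtracking variant recovers the optimum at the cost of visiting more nodes. This mirrors the trade-off between solution quality and search effort that the abstract reports.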

Original language	English
Pages (from-to)	150-178
Number of pages	29
Journal	AI
Volume	2
Issue number	2
DOIs	https://doi.org/10.3390/ai2020010
Publication status	Published - 2021

Keywords

mixed integer programming
node selection
machine learning
approximate pruning
imitation learning
SCIP

Access to Document

10.3390/ai2020010

ai-02-00010: Final published version, 673 KB, Licence: CC BY

Cite this

@article{1471f318861140cfa1477ed3fa346a5d,

title = "A Study of Learning Search Approximation in Mixed Integer Branch and Bound: Node Selection in SCIP",

abstract = "In line with the growing trend of using machine learning to help solve combinatorial optimisation problems, one promising idea is to improve node selection within a mixed integer programming (MIP) branch-and-bound tree by using a learned policy. Previous work using imitation learning indicates the feasibility of acquiring a node selection policy, by learning an adaptive node searching order. In contrast, our imitation learning policy is focused solely on learning which of a node{\textquoteright}s children to select. We present an offline method to learn such a policy in two settings: one that comprises a heuristic by committing to pruning of nodes; one that is exact and backtracks from a leaf to guarantee finding the optimal integer solution. The former setting corresponds to a child selector during plunging, while the latter is akin to a diving heuristic. We apply the policy within the popular open-source solver SCIP, in both heuristic and exact settings. Empirical results on five MIP datasets indicate that our node selection policy leads to solutions significantly more quickly than the state-of-the-art precedent in the literature. While we do not beat the highly-optimised SCIP state-of-practice baseline node selector in terms of solving time on exact solutions, our heuristic policies have a consistently better optimality gap than all baselines, if the accuracy of the predictive model is sufficient. Further, the results also indicate that, when a time limit is applied, our heuristic method finds better solutions than all baselines in the majority of problems tested. We explain the results by showing that the learned policies have imitated the SCIP baseline, but without the latter{\textquoteright}s early plunge abort. Our recommendation is that, despite the clear improvements over the literature, this kind of MIP child selector is better seen in a broader approach to using learning in MIP branch-and-bound tree decisions.",

keywords = "mixed integer programming, node selection, machine learning, approximate pruning, imitation learning, SCIP",

author = "Kaan Yilmaz and Neil Yorke-Smith",

year = "2021",

doi = "10.3390/ai2020010",

language = "English",

volume = "2",

pages = "150--178",

journal = "AI",

issn = "2673-2688",

publisher = "MDPI",

number = "2",

}

TY - JOUR

T1 - A Study of Learning Search Approximation in Mixed Integer Branch and Bound

T2 - Node Selection in SCIP

AU - Yilmaz, Kaan

AU - Yorke-Smith, Neil

PY - 2021

Y1 - 2021

N2 - In line with the growing trend of using machine learning to help solve combinatorial optimisation problems, one promising idea is to improve node selection within a mixed integer programming (MIP) branch-and-bound tree by using a learned policy. Previous work using imitation learning indicates the feasibility of acquiring a node selection policy, by learning an adaptive node searching order. In contrast, our imitation learning policy is focused solely on learning which of a node’s children to select. We present an offline method to learn such a policy in two settings: one that comprises a heuristic by committing to pruning of nodes; one that is exact and backtracks from a leaf to guarantee finding the optimal integer solution. The former setting corresponds to a child selector during plunging, while the latter is akin to a diving heuristic. We apply the policy within the popular open-source solver SCIP, in both heuristic and exact settings. Empirical results on five MIP datasets indicate that our node selection policy leads to solutions significantly more quickly than the state-of-the-art precedent in the literature. While we do not beat the highly-optimised SCIP state-of-practice baseline node selector in terms of solving time on exact solutions, our heuristic policies have a consistently better optimality gap than all baselines, if the accuracy of the predictive model is sufficient. Further, the results also indicate that, when a time limit is applied, our heuristic method finds better solutions than all baselines in the majority of problems tested. We explain the results by showing that the learned policies have imitated the SCIP baseline, but without the latter’s early plunge abort. Our recommendation is that, despite the clear improvements over the literature, this kind of MIP child selector is better seen in a broader approach to using learning in MIP branch-and-bound tree decisions.

KW - mixed integer programming

KW - node selection

KW - machine learning

KW - approximate pruning

KW - imitation learning

KW - SCIP

UR - http://www.scopus.com/inward/record.url?scp=85111409446&partnerID=8YFLogxK

U2 - 10.3390/ai2020010

DO - 10.3390/ai2020010

M3 - Article

SN - 2673-2688

VL - 2

SP - 150

EP - 178

JO - AI

JF - AI

IS - 2

ER -

Datasets

Imitation learning model and datasets: "A Study of Learning Search Approximation in Mixed Integer Branch and Bound: Node Selection in SCIP"
