Resolution learning in deep convolutional networks using scale-space theory

Silvia L. Pintea; Nergis Tömen; Stanley F.  Goes; Marco Loog; Jan van Gemert

doi:10.1109/TIP.2021.3115001

Resolution learning in deep convolutional networks using scale-space theory

Silvia L. Pintea, Nergis Tömen, Stanley F. Goes, Marco Loog, Jan van Gemert

Pattern Recognition and Bioinformatics

Research output: Contribution to journal › Article › Scientific › peer-review

11 Citations (Scopus)

Abstract

Resolution in deep convolutional neural networks (CNNs) is typically bounded by the receptive field size through filter sizes, and subsampling layers or strided convolutions on feature maps. The optimal resolution may vary significantly depending on the dataset. Modern CNNs hard-code their resolution hyper-parameters in the network architecture which makes tuning such hyper-parameters cumbersome. We propose to do away with hard-coded resolution hyper-parameters and aim to learn the appropriate resolution from data. We use scale-space theory to obtain a self-similar parametrization of filters and make use of the N-Jet: a truncated Taylor series to approximate a filter by a learned combination of Gaussian derivative filters. The parameter σ of the Gaussian basis controls both the amount of detail the filter encodes and the spatial extent of the filter. Since σ is a continuous parameter, we can optimize it with respect to the loss. The proposed N-Jet layer achieves comparable performance when used in state-of-the art architectures, while learning the correct resolution in each layer automatically. We evaluate our N-Jet layer on both classification and segmentation, and we show that learning σ is especially beneficial when dealing with inputs at multiple sizes.

Original language	English
Article number	9552550
Pages (from-to)	8342 - 8353
Number of pages	12
Journal	IEEE Transactions on Image Processing
Volume	30
DOIs	https://doi.org/10.1109/TIP.2021.3115001
Publication status	Published - 2021

Keywords

Gaussian basis approximation
Scale-space theory
resolution learning in deep networks

Access to Document

10.1109/TIP.2021.3115001

Cite this

@article{b3065b516d294f8c8cccb8769e6b01d0,

title = "Resolution learning in deep convolutional networks using scale-space theory",

abstract = "Resolution in deep convolutional neural networks (CNNs) is typically bounded by the receptive field size through filter sizes, and subsampling layers or strided convolutions on feature maps. The optimal resolution may vary significantly depending on the dataset. Modern CNNs hard-code their resolution hyper-parameters in the network architecture which makes tuning such hyper-parameters cumbersome. We propose to do away with hard-coded resolution hyper-parameters and aim to learn the appropriate resolution from data. We use scale-space theory to obtain a self-similar parametrization of filters and make use of the N-Jet: a truncated Taylor series to approximate a filter by a learned combination of Gaussian derivative filters. The parameter σ of the Gaussian basis controls both the amount of detail the filter encodes and the spatial extent of the filter. Since σ is a continuous parameter, we can optimize it with respect to the loss. The proposed N-Jet layer achieves comparable performance when used in state-of-the art architectures, while learning the correct resolution in each layer automatically. We evaluate our N-Jet layer on both classification and segmentation, and we show that learning σ is especially beneficial when dealing with inputs at multiple sizes.",

keywords = "Gaussian basis approximation, Scale-space theory, resolution learning in deep networks",

author = "Pintea, {Silvia L.} and Nergis T{\"o}men and Goes, {Stanley F.} and Marco Loog and {van Gemert}, Jan",

year = "2021",

doi = "10.1109/TIP.2021.3115001",

language = "English",

volume = "30",

pages = "8342 -- 8353",

journal = "IEEE Transactions on Image Processing",

issn = "1941-0042",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

}

TY - JOUR

T1 - Resolution learning in deep convolutional networks using scale-space theory

AU - Pintea, Silvia L.

AU - Tömen, Nergis

AU - Goes, Stanley F.

AU - Loog, Marco

AU - van Gemert, Jan

PY - 2021

Y1 - 2021

N2 - Resolution in deep convolutional neural networks (CNNs) is typically bounded by the receptive field size through filter sizes, and subsampling layers or strided convolutions on feature maps. The optimal resolution may vary significantly depending on the dataset. Modern CNNs hard-code their resolution hyper-parameters in the network architecture which makes tuning such hyper-parameters cumbersome. We propose to do away with hard-coded resolution hyper-parameters and aim to learn the appropriate resolution from data. We use scale-space theory to obtain a self-similar parametrization of filters and make use of the N-Jet: a truncated Taylor series to approximate a filter by a learned combination of Gaussian derivative filters. The parameter σ of the Gaussian basis controls both the amount of detail the filter encodes and the spatial extent of the filter. Since σ is a continuous parameter, we can optimize it with respect to the loss. The proposed N-Jet layer achieves comparable performance when used in state-of-the art architectures, while learning the correct resolution in each layer automatically. We evaluate our N-Jet layer on both classification and segmentation, and we show that learning σ is especially beneficial when dealing with inputs at multiple sizes.

AB - Resolution in deep convolutional neural networks (CNNs) is typically bounded by the receptive field size through filter sizes, and subsampling layers or strided convolutions on feature maps. The optimal resolution may vary significantly depending on the dataset. Modern CNNs hard-code their resolution hyper-parameters in the network architecture which makes tuning such hyper-parameters cumbersome. We propose to do away with hard-coded resolution hyper-parameters and aim to learn the appropriate resolution from data. We use scale-space theory to obtain a self-similar parametrization of filters and make use of the N-Jet: a truncated Taylor series to approximate a filter by a learned combination of Gaussian derivative filters. The parameter σ of the Gaussian basis controls both the amount of detail the filter encodes and the spatial extent of the filter. Since σ is a continuous parameter, we can optimize it with respect to the loss. The proposed N-Jet layer achieves comparable performance when used in state-of-the art architectures, while learning the correct resolution in each layer automatically. We evaluate our N-Jet layer on both classification and segmentation, and we show that learning σ is especially beneficial when dealing with inputs at multiple sizes.

KW - Gaussian basis approximation

KW - Scale-space theory

KW - resolution learning in deep networks

UR - http://www.scopus.com/inward/record.url?scp=85116921643&partnerID=8YFLogxK

U2 - 10.1109/TIP.2021.3115001

DO - 10.1109/TIP.2021.3115001

M3 - Article

SN - 1941-0042

VL - 30

SP - 8342

EP - 8353

JO - IEEE Transactions on Image Processing

JF - IEEE Transactions on Image Processing

M1 - 9552550

ER -

Resolution learning in deep convolutional networks using scale-space theory

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this