Structured Receptive Fields in CNNs

Jörn-Henrik Jacobsen; Jan van Gemert; Zhongyou Lou; Arnold W.M. Smeulders

doi:10.1109/CVPR.2016.286

Pattern Recognition and Bioinformatics

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

61 Citations (Scopus)

Abstract

Learning powerful feature representations with CNNs is hard when training data are limited. Pre-training is one way to overcome this, but it requires large datasets sufficiently similar to the target domain. Another option is to design priors into the model, which can range from tuned hyperparameters to fully engineered representations like Scattering Networks. We combine these ideas into structured receptive field networks, a model which has a fixed filter basis and yet retains the flexibility of CNNs. This flexibility is achieved by expressing receptive fields in CNNs as a weighted sum over a fixed basis which is similar in spirit to Scattering Networks. The key difference is that we learn arbitrary effective filter sets from the basis rather than modeling the filters. This approach explicitly connects classical multiscale image analysis with general CNNs. With structured receptive field networks, we improve considerably over unstructured CNNs for small and medium dataset scenarios as well as over Scattering for large datasets. We validate our findings on ILSVRC2012, Cifar-10, Cifar-100 and MNIST. As a realistic small dataset example, we show state-of-the-art classification results on popular 3D MRI brain-disease datasets where pre-training is difficult due to a lack of large public datasets in a similar domain.
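
The weighted-sum idea described in the abstract can be sketched roughly as follows. This is a minimal NumPy illustration, not the authors' implementation. The Gaussian derivative basis, the value of sigma, the 7x7 filter size, and the function names gaussian_derivative_basis and effective_filters are assumptions introduced here. The abstract itself only states that the basis is fixed and that effective filters are learned as weighted sums over it.

```python
import numpy as np


def gaussian_derivative_basis(sigma=1.0, radius=3):
    """Build a fixed basis of separable 2D Gaussian derivative filters.

    Returns an array of shape (6, k, k) with k = 2 * radius + 1, covering
    all derivative orders with total order <= 2. The basis family, sigma
    and maximum order are illustrative assumptions.
    """
    x = np.arange(-radius, radius + 1, dtype=float)
    g0 = np.exp(-x**2 / (2 * sigma**2))
    g0 /= g0.sum()                           # normalized 1D Gaussian
    g1 = -x / sigma**2 * g0                  # first derivative
    g2 = (x**2 - sigma**2) / sigma**4 * g0   # second derivative
    one_d = [g0, g1, g2]
    orders = [(0, 0), (0, 1), (1, 0), (0, 2), (1, 1), (2, 0)]
    return np.stack([np.outer(one_d[i], one_d[j]) for i, j in orders])


def effective_filters(basis, weights):
    """Combine the fixed basis into effective filters via a weighted sum.

    basis:   (n_basis, k, k), kept fixed
    weights: (n_out, n_in, n_basis), the only learnable parameters
    returns: (n_out, n_in, k, k)
    """
    return np.einsum("oib,bhw->oihw", weights, basis)


rng = np.random.default_rng(0)
basis = gaussian_derivative_basis(sigma=1.0, radius=3)
weights = rng.normal(size=(8, 3, basis.shape[0]))  # stand-in for learned weights
filters = effective_filters(basis, weights)
print(filters.shape)  # (8, 3, 7, 7)
```

In this sketch only the 8 x 3 x 6 = 144 combination weights would be trained, compared with 8 x 3 x 7 x 7 = 1176 free parameters for unstructured 7x7 filters of the same shape. The resulting array could be used as the kernel of an ordinary convolution layer.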

Original language	English
Title of host publication	Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016
Editors	Lisa O'Conner
Place of Publication	Los Alamitos, CA
Publisher	IEEE
Pages	2610-2619
Number of pages	10
ISBN (Electronic)	978-1-4673-8851-1
DOIs	https://doi.org/10.1109/CVPR.2016.286
Publication status	Published - 2016
Event	CVPR 2016: 29th IEEE Conference on Computer Vision and Pattern Recognition - Las Vegas, United States
Duration	26 Jun 2016 → 1 Jul 2016

Conference

Conference	CVPR 2016
Country/Territory	United States
City	Las Vegas
Period	26/06/16 → 1/07/16

Keywords

Scattering
Convolution
Kernel
Image resolution
Spatial coherence
Training data
Wavelet transforms

Access to Document

10.1109/CVPR.2016.286

Cite this

@inproceedings{21f482ea6cbe4fbbbde4b96c2b280c35,

title = "Structured Receptive Fields in CNNs",

abstract = "Learning powerful feature representations with CNNs is hard when training data are limited. Pre-training is one way to overcome this, but it requires large datasets sufficiently similar to the target domain. Another option is to design priors into the model, which can range from tuned hyperparameters to fully engineered representations like Scattering Networks. We combine these ideas into structured receptive field networks, a model which has a fixed filter basis and yet retains the flexibility of CNNs. This flexibility is achieved by expressing receptive fields in CNNs as a weighted sum over a fixed basis which is similar in spirit to Scattering Networks. The key difference is that we learn arbitrary effective filter sets from the basis rather than modeling the filters. This approach explicitly connects classical multiscale image analysis with general CNNs. With structured receptive field networks, we improve considerably over unstructured CNNs for small and medium dataset scenarios as well as over Scattering for large datasets. We validate our findings on ILSVRC2012, Cifar-10, Cifar-100 and MNIST. As a realistic small dataset example, we show state-of-the-art classification results on popular 3D MRI brain-disease datasets where pre-training is difficult due to a lack of large public datasets in a similar domain.",

keywords = "Scattering, Convolution, Kernel, Image resolution, Spatial coherence, Training data, Wavelet transforms",

author = "J{\"o}rn-Henrik Jacobsen and {van Gemert}, Jan and Zhongyou Lou and Smeulders, {Arnold W.M.}",

year = "2016",

doi = "10.1109/CVPR.2016.286",

language = "English",

pages = "2610--2619",

editor = "O'Conner, Lisa",

booktitle = "Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016",

publisher = "IEEE",

address = "Los Alamitos, CA",

note = "CVPR 2016: 29th IEEE Conference on Computer Vision and Pattern Recognition; Conference date: 26-06-2016 through 01-07-2016",

}

Structured Receptive Fields in CNNs. / Jacobsen, Jörn-Henrik; van Gemert, Jan; Lou, Zhongyou et al.
Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016. ed. / Lisa O'Conner. Los Alamitos, CA: IEEE, 2016. p. 2610-2619.

TY - GEN

T1 - Structured Receptive Fields in CNNs

AU - Jacobsen, Jörn-Henrik

AU - van Gemert, Jan

AU - Lou, Zhongyou

AU - Smeulders, Arnold W.M.

PY - 2016

Y1 - 2016

N2 - Learning powerful feature representations with CNNs is hard when training data are limited. Pre-training is one way to overcome this, but it requires large datasets sufficiently similar to the target domain. Another option is to design priors into the model, which can range from tuned hyperparameters to fully engineered representations like Scattering Networks. We combine these ideas into structured receptive field networks, a model which has a fixed filter basis and yet retains the flexibility of CNNs. This flexibility is achieved by expressing receptive fields in CNNs as a weighted sum over a fixed basis which is similar in spirit to Scattering Networks. The key difference is that we learn arbitrary effective filter sets from the basis rather than modeling the filters. This approach explicitly connects classical multiscale image analysis with general CNNs. With structured receptive field networks, we improve considerably over unstructured CNNs for small and medium dataset scenarios as well as over Scattering for large datasets. We validate our findings on ILSVRC2012, Cifar-10, Cifar-100 and MNIST. As a realistic small dataset example, we show state-of-the-art classification results on popular 3D MRI brain-disease datasets where pre-training is difficult due to a lack of large public datasets in a similar domain.

KW - Scattering

KW - Convolution

KW - Kernel

KW - Image resolution

KW - Spatial coherence

KW - Training data

KW - Wavelet transforms

U2 - 10.1109/CVPR.2016.286

DO - 10.1109/CVPR.2016.286

M3 - Conference contribution

SP - 2610

EP - 2619

BT - Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016

A2 - O'Conner, Lisa

PB - IEEE

CY - Los Alamitos, CA

T2 - CVPR 2016

Y2 - 26 June 2016 through 1 July 2016

ER -
