Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers

Saikiran Bulusu; G. Joseph; M. Cenk Gursoy; Pramod K. Varshney

Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers

Saikiran Bulusu, G. Joseph, M. Cenk Gursoy, Pramod K. Varshney

Signal Processing Systems

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

18 Downloads (Pure)

Abstract

We consider a set of data samples such that a fraction of the samples are arbitrary outliers, and the rest are the output samples of a single-layer neural network with rectified linear unit (ReLU) activation. Our goal is to estimate the parameters (weight matrix and bias vector) of the neural network, assuming the bias vector to be non-negative. We estimate the network parameters using the gradient descent algorithm combined with either the median- or trimmed mean-based filters to mitigate the effect of the arbitrary outliers. We then prove that $\tilde{O}( \frac{1}{p^2}+\frac{1}{\epsilon^2p})$ samples and $\tilde{O} ( \frac{d^2}{p^2}+ \frac{d^2}{\epsilon^2p})$ time are sufficient for our algorithm to estimate the neural network parameters within an error of $\epsilon$ when the outlier probability is $1-p$, {where $2/3< p \leq 1$} and the problem dimension is $d$ (with log factors being ignored here). Our theoretical and simulation results provide insights into the training complexity of ReLU neural networks in terms of the probability of outliers and problem dimension.

Original language	English
Title of host publication	36th Conference on Neural Information Processing Systems 2022
Editors	S. Koyejo
Number of pages	11
ISBN (Electronic)	9781713871088
Publication status	Published - 2022
Event	36th Conference on Neural Information Processing Systems - Hybrid Conference, New Orleans, United States Duration: 28 Nov 2022 → 9 Dec 2022 Conference number: 36

Conference

Conference	36th Conference on Neural Information Processing Systems
Abbreviated title	NeurIPS 2022
Country/Territory	United States
City	New Orleans
Period	28/11/22 → 9/12/22

Access to Document

NeurIPS-2022-learning-distributions-generated-by-single-layer-relu-networks-in-the-presence-of-arbitrary-outliers-Paper-ConferenceFinal published version, 384 KB

Cite this

@inproceedings{bec9f025e74f4854879a1062979a3e01,

title = "Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers",

abstract = "We consider a set of data samples such that a fraction of the samples are arbitrary outliers, and the rest are the output samples of a single-layer neural network with rectified linear unit (ReLU) activation. Our goal is to estimate the parameters (weight matrix and bias vector) of the neural network, assuming the bias vector to be non-negative. We estimate the network parameters using the gradient descent algorithm combined with either the median- or trimmed mean-based filters to mitigate the effect of the arbitrary outliers. We then prove that $\tilde{O}( \frac{1}{p^2}+\frac{1}{\epsilon^2p})$ samples and $\tilde{O} ( \frac{d^2}{p^2}+ \frac{d^2}{\epsilon^2p})$ time are sufficient for our algorithm to estimate the neural network parameters within an error of $\epsilon$ when the outlier probability is $1-p$, {where $2/3< p \leq 1$} and the problem dimension is $d$ (with log factors being ignored here). Our theoretical and simulation results provide insights into the training complexity of ReLU neural networks in terms of the probability of outliers and problem dimension. ",

author = "Saikiran Bulusu and G. Joseph and Gursoy, {M. Cenk} and Varshney, {Pramod K.}",

year = "2022",

language = "English",

editor = "Koyejo, {S. }",

booktitle = "36th Conference on Neural Information Processing Systems 2022",

note = "36th Conference on Neural Information Processing Systems, NeurIPS 2022 ; Conference date: 28-11-2022 Through 09-12-2022",

}

Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers. / Bulusu, Saikiran; Joseph, G.; Gursoy, M. Cenk et al.
36th Conference on Neural Information Processing Systems 2022. ed. / S. Koyejo. 2022.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers

AU - Bulusu, Saikiran

AU - Joseph, G.

AU - Gursoy, M. Cenk

AU - Varshney, Pramod K.

N1 - Conference code: 36

PY - 2022

Y1 - 2022

N2 - We consider a set of data samples such that a fraction of the samples are arbitrary outliers, and the rest are the output samples of a single-layer neural network with rectified linear unit (ReLU) activation. Our goal is to estimate the parameters (weight matrix and bias vector) of the neural network, assuming the bias vector to be non-negative. We estimate the network parameters using the gradient descent algorithm combined with either the median- or trimmed mean-based filters to mitigate the effect of the arbitrary outliers. We then prove that $\tilde{O}( \frac{1}{p^2}+\frac{1}{\epsilon^2p})$ samples and $\tilde{O} ( \frac{d^2}{p^2}+ \frac{d^2}{\epsilon^2p})$ time are sufficient for our algorithm to estimate the neural network parameters within an error of $\epsilon$ when the outlier probability is $1-p$, {where $2/3< p \leq 1$} and the problem dimension is $d$ (with log factors being ignored here). Our theoretical and simulation results provide insights into the training complexity of ReLU neural networks in terms of the probability of outliers and problem dimension.

AB - We consider a set of data samples such that a fraction of the samples are arbitrary outliers, and the rest are the output samples of a single-layer neural network with rectified linear unit (ReLU) activation. Our goal is to estimate the parameters (weight matrix and bias vector) of the neural network, assuming the bias vector to be non-negative. We estimate the network parameters using the gradient descent algorithm combined with either the median- or trimmed mean-based filters to mitigate the effect of the arbitrary outliers. We then prove that $\tilde{O}( \frac{1}{p^2}+\frac{1}{\epsilon^2p})$ samples and $\tilde{O} ( \frac{d^2}{p^2}+ \frac{d^2}{\epsilon^2p})$ time are sufficient for our algorithm to estimate the neural network parameters within an error of $\epsilon$ when the outlier probability is $1-p$, {where $2/3< p \leq 1$} and the problem dimension is $d$ (with log factors being ignored here). Our theoretical and simulation results provide insights into the training complexity of ReLU neural networks in terms of the probability of outliers and problem dimension.

M3 - Conference contribution

BT - 36th Conference on Neural Information Processing Systems 2022

A2 - Koyejo, S.

T2 - 36th Conference on Neural Information Processing Systems

Y2 - 28 November 2022 through 9 December 2022

ER -

Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers

Abstract

Conference

Access to Document

Fingerprint

Cite this