EAD-GAN: A Generative Adversarial Network for Disentangling Affine Transforms in Images

Letao Liu; Xudong Jiang; Martin Saerbeck; Justin Dauwels

doi:10.1109/TNNLS.2022.3195533

Letao Liu^*, Xudong Jiang, Martin Saerbeck, Justin Dauwels

^*Corresponding author for this work

Signal Processing Systems

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

This article proposes a generative adversarial network called the explicit affine disentangled generative adversarial network (EAD-GAN), which explicitly disentangles affine transforms in a self-supervised manner. We propose an affine transform regularizer that forces the InfoGAN to have the explicit properties of an affine transform. To facilitate training an affine transform encoder, we decompose the affine matrix into two separate matrices and infer the explicit transform parameters by the least-squares method. Unlike existing approaches, the representations learned by the proposed EAD-GAN have a clear physical meaning: transforms such as rotation, horizontal and vertical zooms, skews, and translations are explicitly learned from the training data. Thus, we can set each transform parameter individually to generate specifically affine-transformed data with the learned network. We show that the proposed EAD-GAN successfully disentangles these attributes on the MNIST, CelebA, and dSprites datasets. On the dSprites dataset, EAD-GAN achieves disentanglement scores that are higher by a large margin than those of the state-of-the-art methods: MIG and DCI scores of 0.59 and 0.96, respectively, compared with 0.37 and 0.71 for the state-of-the-art methods.
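To make the idea of explicit affine parameters concrete, the following is a minimal sketch rather than the method of the article: it assumes one particular parameterization (rotation times skew times zoom, plus translation), fits a 2x3 affine matrix to point correspondences by ordinary least squares, and then reads the parameters back from the fitted matrix. The abstract does not specify the two-matrix decomposition it uses, so the factorization and every function name here (`compose_affine`, `fit_affine`, `decompose_affine`, `solve3`) are illustrative assumptions.

```python
import math

def compose_affine(theta, sx, sy, shear, tx, ty):
    """Build a 2x3 affine matrix as rotation @ skew @ zoom, plus translation.

    Assumed parameterization (not taken from the article):
    skew @ zoom = [[sx, shear * sy], [0, sy]].
    """
    c, s = math.cos(theta), math.sin(theta)
    u00, u01, u11 = sx, shear * sy, sy
    return [
        [c * u00, c * u01 - s * u11, tx],
        [s * u00, s * u01 + c * u11, ty],
    ]

def apply_affine(A, pt):
    """Map a 2D point through the 2x3 affine matrix A."""
    x, y = pt
    return (A[0][0] * x + A[0][1] * y + A[0][2],
            A[1][0] * x + A[1][1] * y + A[1][2])

def solve3(M, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    aug = [row[:] + [bi] for row, bi in zip(M, b)]
    n = 3
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= f * aug[col][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(aug[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x

def fit_affine(src, dst):
    """Least-squares fit of the 2x3 affine matrix that maps src points to dst."""
    rows = [(x, y, 1.0) for x, y in src]
    # Normal equations (M^T M) p = M^T target, solved once per output coordinate.
    mtm = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    A = []
    for coord in range(2):
        mtb = [sum(r[i] * d[coord] for r, d in zip(rows, dst)) for i in range(3)]
        A.append(solve3(mtm, mtb))
    return A

def decompose_affine(A):
    """Recover the assumed parameters from a 2x3 affine matrix (requires sx > 0)."""
    a, b, tx = A[0]
    c, d, ty = A[1]
    theta = math.atan2(c, a)   # the first column is sx * (cos theta, sin theta)
    sx = math.hypot(a, c)
    cs, sn = math.cos(theta), math.sin(theta)
    u01 = cs * b + sn * d      # entries of R^T @ A_linear (upper triangular)
    u11 = -sn * b + cs * d
    return {"theta": theta, "sx": sx, "sy": u11,
            "shear": u01 / u11, "tx": tx, "ty": ty}

# Round trip: transform a few non-collinear points, fit, then decompose.
true_params = dict(theta=0.3, sx=1.2, sy=0.8, shear=0.25, tx=2.0, ty=-1.0)
A_true = compose_affine(**true_params)
src_pts = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (2.0, -1.0)]
dst_pts = [apply_affine(A_true, p) for p in src_pts]
recovered = decompose_affine(fit_affine(src_pts, dst_pts))
```

With noise-free correspondences the recovered parameters match the ones used to build the matrix up to floating-point error. Under this assumed factorization each parameter can be changed independently before recomposing the matrix, which is the sense in which such a representation has a direct physical reading.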

Original language	English
Pages (from-to)	3652-3662
Number of pages	11
Journal	IEEE Transactions on Neural Networks and Learning Systems
Volume	35
Issue number	3
DOIs	https://doi.org/10.1109/TNNLS.2022.3195533
Publication status	Published - 2022

Keywords

Affine transform
disentanglement
generative adversarial network (GAN)

Access to Document

10.1109/TNNLS.2022.3195533

EAD-GAN_A_Generative_Adversarial_Network_for_Disentangling_Affine_Transforms_in_Images (Final published version, 4.42 MB, Licence: CC BY)

Cite this

@article{53a21bfbe8ee4be2bc684ed14837f3bc,

title = "EAD-GAN: A Generative Adversarial Network for Disentangling Affine Transforms in Images",

abstract = "This article proposes a generative adversarial network called explicit affine disentangled generative adversarial network (EAD-GAN), which explicitly disentangles affine transform in a self-supervised manner. We propose an affine transform regularizer to force the InfoGAN to have explicit properties of affine transform. To facilitate training an affine transform encoder, we decompose the affine matrix into two separate matrices and infer the explicit transform parameters by the least-squares method. Unlike the existing approaches, representations learned by the proposed EAD-GAN have clear physical meaning, where transforms, such as rotation, horizontal and vertical zooms, skews, and translations, are explicitly learned from training data. Thus, we set different values of each transform parameter individually to generate specifically affine transformed data by the learned network. We show that the proposed EAD-GAN successfully disentangles these attributes on the MNIST, CelebA, and dSprites datasets. EAD-GAN achieves higher disentanglement scores with a large margin compared to the state-of-the-art methods on the dSprites dataset. For example, on the dSprites dataset, EAD-GAN achieves the MIG and DCI score of 0.59 and 0.96 respectively, compared to 0.37 and 0.71, respectively, for the state-of-the-art methods.",

keywords = "Affine transform, disentanglement, generative adversarial network (GAN)",

author = "Letao Liu and Xudong Jiang and Martin Saerbeck and Justin Dauwels",

year = "2022",

doi = "10.1109/TNNLS.2022.3195533",

language = "English",

volume = "35",

pages = "3652--3662",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "3",

}

TY - JOUR

T1 - EAD-GAN

T2 - A Generative Adversarial Network for Disentangling Affine Transforms in Images

AU - Liu, Letao

AU - Jiang, Xudong

AU - Saerbeck, Martin

AU - Dauwels, Justin

PY - 2022

Y1 - 2022

N2 - This article proposes a generative adversarial network called explicit affine disentangled generative adversarial network (EAD-GAN), which explicitly disentangles affine transform in a self-supervised manner. We propose an affine transform regularizer to force the InfoGAN to have explicit properties of affine transform. To facilitate training an affine transform encoder, we decompose the affine matrix into two separate matrices and infer the explicit transform parameters by the least-squares method. Unlike the existing approaches, representations learned by the proposed EAD-GAN have clear physical meaning, where transforms, such as rotation, horizontal and vertical zooms, skews, and translations, are explicitly learned from training data. Thus, we set different values of each transform parameter individually to generate specifically affine transformed data by the learned network. We show that the proposed EAD-GAN successfully disentangles these attributes on the MNIST, CelebA, and dSprites datasets. EAD-GAN achieves higher disentanglement scores with a large margin compared to the state-of-the-art methods on the dSprites dataset. For example, on the dSprites dataset, EAD-GAN achieves the MIG and DCI score of 0.59 and 0.96 respectively, compared to 0.37 and 0.71, respectively, for the state-of-the-art methods.

AB - This article proposes a generative adversarial network called explicit affine disentangled generative adversarial network (EAD-GAN), which explicitly disentangles affine transform in a self-supervised manner. We propose an affine transform regularizer to force the InfoGAN to have explicit properties of affine transform. To facilitate training an affine transform encoder, we decompose the affine matrix into two separate matrices and infer the explicit transform parameters by the least-squares method. Unlike the existing approaches, representations learned by the proposed EAD-GAN have clear physical meaning, where transforms, such as rotation, horizontal and vertical zooms, skews, and translations, are explicitly learned from training data. Thus, we set different values of each transform parameter individually to generate specifically affine transformed data by the learned network. We show that the proposed EAD-GAN successfully disentangles these attributes on the MNIST, CelebA, and dSprites datasets. EAD-GAN achieves higher disentanglement scores with a large margin compared to the state-of-the-art methods on the dSprites dataset. For example, on the dSprites dataset, EAD-GAN achieves the MIG and DCI score of 0.59 and 0.96 respectively, compared to 0.37 and 0.71, respectively, for the state-of-the-art methods.

KW - Affine transform

KW - disentanglement

KW - generative adversarial network (GAN)

UR - http://www.scopus.com/inward/record.url?scp=85136140446&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2022.3195533

DO - 10.1109/TNNLS.2022.3195533

M3 - Article

AN - SCOPUS:85136140446

SN - 2162-237X

VL - 35

SP - 3652

EP - 3662

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 3

ER -
