Unified Binary Generative Adversarial Network for Image Retrieval and Compression

Jingkuan Song; Tao He; Lianli Gao; Xing Xu; Alan Hanjalic; Heng Tao Shen

doi:10.1007/s11263-020-01305-2

Corresponding author: Lianli Gao

Intelligent Systems

Research output: Contribution to journal › Article › Scientific › peer-review

59 Citations (Scopus)

335 Downloads (Pure)

Abstract

Binary codes have often been deployed to facilitate large-scale retrieval tasks, but far less often for image compression. In this paper, we propose a unified framework, BGAN+, that restricts the input noise variable of generative adversarial networks to be binary and conditioned on the features of each input image, and simultaneously learns two binary representations per image: one for image retrieval and the other for image compression. Compared to related methods that attempt to learn a single binary code serving both purposes, we demonstrate that using two codes leads to more effective representations because fewer concessions are needed when balancing the requirements of the two tasks. The added value of a unified framework over two separate frameworks lies in the synergy in data representation, which benefits both learning processes. When devising this framework, we also address another challenge in learning binary codes, namely that of supervision. While the most striking successes in image retrieval using binary codes have mostly involved discriminative models requiring labels, the proposed BGAN+ framework learns the binary codes in an unsupervised fashion, yet more effectively than the state-of-the-art supervised approaches. The proposed BGAN+ framework is evaluated on three benchmark datasets for image retrieval and two datasets for image compression. The experimental results show that BGAN+ outperforms existing retrieval methods by significant margins and achieves promising performance for image compression, especially at low bit rates.
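As a rough illustration of why binary codes suit large-scale retrieval, the sketch below binarizes feature vectors by sign and ranks database items by Hamming distance to a query. This is a generic technique, not the BGAN+ training procedure described in the paper; the function names, the thresholding rule, and the toy feature values are all assumptions made for the example.

```python
# Illustrative only: generic sign binarization and Hamming ranking,
# not the BGAN+ method. All names and values here are hypothetical.

def binarize(features):
    """Map a real-valued feature vector to a binary code (1 if value > 0, else 0)."""
    return [1 if value > 0 else 0 for value in features]


def hamming_distance(code_a, code_b):
    """Count the positions at which two equal-length binary codes differ."""
    if len(code_a) != len(code_b):
        raise ValueError("codes must have the same length")
    return sum(bit_a != bit_b for bit_a, bit_b in zip(code_a, code_b))


def rank_by_hamming(query_code, database_codes):
    """Return database indices sorted by increasing Hamming distance to the query."""
    return sorted(
        range(len(database_codes)),
        key=lambda index: hamming_distance(query_code, database_codes[index]),
    )


# Hypothetical 4-dimensional features for three database images and one query.
database_features = [
    [0.9, -0.2, 0.4, -0.7],
    [-0.5, 0.3, -0.1, 0.8],
    [0.6, -0.4, -0.3, -0.9],
]
database_codes = [binarize(features) for features in database_features]
query_code = binarize([0.7, -0.1, 0.5, -0.6])
ranking = rank_by_hamming(query_code, database_codes)
```

Comparing short binary codes needs only bit comparisons and a count, which is what makes this kind of representation attractive when the database is large.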

Original language	English
Pages (from-to)	2243-2264
Number of pages	22
Journal	International Journal of Computer Vision
Volume	128
Issue number	8-9
DOIs	https://doi.org/10.1007/s11263-020-01305-2
Publication status	Published - 2020

Bibliographical note

Accepted author manuscript

Keywords

Binary codes
Generative adversarial network
Image compression
Image retrieval

Access to Document

10.1007/s11263-020-01305-2

ijcv2020-binary (Accepted author manuscript, 1.39 MB)

Cite this

@article{e95be336afd4447f84554e55b1af134a,

title = "Unified Binary Generative Adversarial Network for Image Retrieval and Compression",

abstract = "Binary codes have often been deployed to facilitate large-scale retrieval tasks, but far less often for image compression. In this paper, we propose a unified framework, BGAN+, that restricts the input noise variable of generative adversarial networks to be binary and conditioned on the features of each input image, and simultaneously learns two binary representations per image: one for image retrieval and the other for image compression. Compared to related methods that attempt to learn a single binary code serving both purposes, we demonstrate that using two codes leads to more effective representations because fewer concessions are needed when balancing the requirements of the two tasks. The added value of a unified framework over two separate frameworks lies in the synergy in data representation, which benefits both learning processes. When devising this framework, we also address another challenge in learning binary codes, namely that of supervision. While the most striking successes in image retrieval using binary codes have mostly involved discriminative models requiring labels, the proposed BGAN+ framework learns the binary codes in an unsupervised fashion, yet more effectively than the state-of-the-art supervised approaches. The proposed BGAN+ framework is evaluated on three benchmark datasets for image retrieval and two datasets for image compression. The experimental results show that BGAN+ outperforms existing retrieval methods by significant margins and achieves promising performance for image compression, especially at low bit rates.",

keywords = "Binary codes, Generative adversarial network, Image compression, Image retrieval",

author = "Jingkuan Song and Tao He and Lianli Gao and Xing Xu and Alan Hanjalic and Shen, {Heng Tao}",

note = "Accepted author manuscript",

year = "2020",

doi = "10.1007/s11263-020-01305-2",

language = "English",

volume = "128",

pages = "2243--2264",

journal = "International Journal of Computer Vision",

issn = "0920-5691",

publisher = "Springer",

number = "8-9",

}

TY - JOUR

T1 - Unified Binary Generative Adversarial Network for Image Retrieval and Compression

AU - Song, Jingkuan

AU - He, Tao

AU - Gao, Lianli

AU - Xu, Xing

AU - Hanjalic, Alan

AU - Shen, Heng Tao

N1 - Accepted author manuscript

PY - 2020

Y1 - 2020

N2 - Binary codes have often been deployed to facilitate large-scale retrieval tasks, but far less often for image compression. In this paper, we propose a unified framework, BGAN+, that restricts the input noise variable of generative adversarial networks to be binary and conditioned on the features of each input image, and simultaneously learns two binary representations per image: one for image retrieval and the other for image compression. Compared to related methods that attempt to learn a single binary code serving both purposes, we demonstrate that using two codes leads to more effective representations because fewer concessions are needed when balancing the requirements of the two tasks. The added value of a unified framework over two separate frameworks lies in the synergy in data representation, which benefits both learning processes. When devising this framework, we also address another challenge in learning binary codes, namely that of supervision. While the most striking successes in image retrieval using binary codes have mostly involved discriminative models requiring labels, the proposed BGAN+ framework learns the binary codes in an unsupervised fashion, yet more effectively than the state-of-the-art supervised approaches. The proposed BGAN+ framework is evaluated on three benchmark datasets for image retrieval and two datasets for image compression. The experimental results show that BGAN+ outperforms existing retrieval methods by significant margins and achieves promising performance for image compression, especially at low bit rates.

KW - Binary codes

KW - Generative adversarial network

KW - Image compression

KW - Image retrieval

UR - http://www.scopus.com/inward/record.url?scp=85079780056&partnerID=8YFLogxK

U2 - 10.1007/s11263-020-01305-2

DO - 10.1007/s11263-020-01305-2

M3 - Article

AN - SCOPUS:85079780056

SN - 0920-5691

VL - 128

SP - 2243

EP - 2264

JO - International Journal of Computer Vision

JF - International Journal of Computer Vision

IS - 8-9

ER -
