Journal Articles

GEN: Generative Equivariant Networks for Diverse Image-to-Image Translation

Pourya Shamsolmoali, East China Normal University
Masoumeh Zareapoor, Shanghai Jiao Tong University
Swagatam Das, Indian Statistical Institute, Kolkata
Salvador Garcia, Universidad de Granada
Eric Granger, École de Technologie Supérieure
Jie Yang, Shanghai Jiao Tong University

Article Type

Research Article

Publication Title

IEEE Transactions on Cybernetics

Abstract

Image-to-image (I2I) translation has become a key asset for generative adversarial networks. Convolutional neural networks (CNNs), despite having a significant performance, are not able to capture the spatial relationships among different parts of an object and, thus, do not qualify as the ideal representative model for image translation tasks. As a remedy to this problem, capsule networks have been proposed to represent patterns for a visual object in such a way that preserves hierarchical spatial relationships. The training of capsules is constrained by learning all pairwise relationships between capsules of consecutive layers. This design would be prohibitively expensive both in time and memory. In this article, we present a new framework for capsule networks to provide a full description of the input components at various levels of semantics, which can successfully be applied to the generator-discriminator architectures without incurring computational overhead compared to the CNNs. To successfully apply the proposed capsules in the generative adversarial network, we put forth a novel Gromov-Wasserstein (GW) distance as a differentiable loss function that compares the dissimilarity between two distributions and then guides the learned distribution toward target properties, using optimal transport (OT) discrepancy. The proposed method - which is called generative equivariant network (GEN) - is an alternative architecture for GANs with equivariance capsule layers. The proposed model is evaluated through a comprehensive set of experiments on I2I translation and image generation tasks and compared with several state-of-the-art models. Results indicate that there is a principled connection between generative and capsule models that allows extracting discriminant and invariant information from image data.

First Page

874

Last Page

886

DOI

https://10.1109/TCYB.2022.3166761

Publication Date

2-1-2023

Recommended Citation

Shamsolmoali, Pourya; Zareapoor, Masoumeh; Das, Swagatam; Garcia, Salvador; Granger, Eric; and Yang, Jie, "GEN: Generative Equivariant Networks for Diverse Image-to-Image Translation" (2023). Journal Articles. 3871.
https://digitalcommons.isical.ac.in/journal-articles/3871

Link to Full Text

COinS

Journal Articles

GEN: Generative Equivariant Networks for Diverse Image-to-Image Translation

Article Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Browse

Search

Author Corner

Links

Journal Articles

GEN: Generative Equivariant Networks for Diverse Image-to-Image Translation

Authors

Article Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Share

Browse

Search

Author Corner

Links