Publications – Laboratory for Machine Intelligence

2023

Pernuš, Martin; Štruc, Vitomir; Dobrišek, Simon

MaskFaceGAN: High Resolution Face Editing With Masked GAN Latent Code Optimization Journal Article

In: IEEE Transactions on Image Processing, 2023, ISSN: 1941-0042.

Abstract | Links | BibTeX | Tags: CNN, computer vision, deep learning, face editing, face image processing, GAN, GAN inversion, generative models, StyleGAN

@article{MaskFaceGAN,

title = {MaskFaceGAN: High Resolution Face Editing With Masked GAN Latent Code Optimization},

author = {Martin Pernuš and Vitomir Štruc and Simon Dobrišek},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10299582

https://lmi.fe.uni-lj.si/wp-content/uploads/2023/02/MaskFaceGAN_compressed.pdf

https://arxiv.org/pdf/2103.11135.pdf},

doi = {10.1109/TIP.2023.3326675},

issn = {1941-0042},

year  = {2023},

date = {2023-10-27},

urldate = {2023-01-02},

journal = {IEEE Transactions on Image Processing},

abstract = {Face editing represents a popular research topic within the computer vision and image processing communities. While significant progress has been made recently in this area, existing solutions: ( i ) are still largely focused on low-resolution images, ( ii ) often generate editing results with visual artefacts, or ( iii ) lack fine-grained control over the editing procedure and alter multiple (entangled) attributes simultaneously, when trying to generate the desired facial semantics. In this paper, we aim to address these issues through a novel editing approach, called MaskFaceGAN that focuses on local attribute editing. The proposed approach is based on an optimization procedure that directly optimizes the latent code of a pre-trained (state-of-the-art) Generative Adversarial Network (i.e., StyleGAN2) with respect to several constraints that ensure: ( i ) preservation of relevant image content, ( ii ) generation of the targeted facial attributes, and ( iii ) spatially–selective treatment of local image regions. The constraints are enforced with the help of an (differentiable) attribute classifier and face parser that provide the necessary reference information for the optimization procedure. MaskFaceGAN is evaluated in extensive experiments on the FRGC, SiblingsDB-HQf, and XM2VTS datasets and in comparison with several state-of-the-art techniques from the literature. Our experimental results show that the proposed approach is able to edit face images with respect to several local facial attributes with unprecedented image quality and at high-resolutions (1024×1024), while exhibiting considerably less problems with attribute entanglement than competing solutions. The source code is publicly available from: https://github.com/MartinPernus/MaskFaceGAN.},

keywords = {CNN, computer vision, deep learning, face editing, face image processing, GAN, GAN inversion, generative models, StyleGAN},

pubstate = {published},

tppubtype = {article}

}

2022

Tomašević, Darian; Peer, Peter; Štruc, Vitomir

BiOcularGAN: Bimodal Synthesis and Annotation of Ocular Images Proceedings Article

In: IEEE/IAPR International Joint Conference on Biometrics (IJCB 2022) , pp. 1-10, 2022.

Abstract | Links | BibTeX | Tags: biometrics, CNN, data synthesis, deep learning, ocular, segmentation, StyleGAN, synthetic data