Publications – Laboratory for Machine Intelligence

Ivanovska, Marija; Štruc, Vitomir

Y-GAN: Learning Dual Data Representations for Anomaly Detection in Images Journal Article

In: Expert Systems with Applications (ESWA), vol. 248, no. 123410, pp. 1-7, 2024.

Abstract | Links | BibTeX | Tags: anomaly detection, CNN, deep learning, one-class learning, y-gan

@article{ESWA2024,

title = {Y-GAN: Learning Dual Data Representations for Anomaly Detection in Images},

author = {Marija Ivanovska and Vitomir Štruc},

url = {https://www.sciencedirect.com/science/article/pii/S0957417424002756

https://lmi.fe.uni-lj.si/wp-content/uploads/2024/02/YGAN_Marija.pdf},

doi = {https://doi.org/10.1016/j.eswa.2024.123410},

year  = {2024},

date = {2024-03-01},

urldate = {2024-03-01},

journal = {Expert Systems with Applications (ESWA)},

volume = {248},

number = {123410},

pages = {1-7},

abstract = {We propose a novel reconstruction-based model for anomaly detection in image data, called 'Y-GAN'. The model consists of a Y-shaped auto-encoder and represents images in two separate latent spaces. The first captures meaningful image semantics, which are key for representing (normal) training data, whereas the second encodes low-level residual image characteristics. To ensure the dual representations encode mutually exclusive information, a disentanglement procedure is designed around a latent (proxy) classifier. Additionally, a novel representation-consistency mechanism is proposed to prevent information leakage between the latent spaces. The model is trained in a one-class learning setting using only normal training data. Due to the separation of semantically-relevant and residual information, Y-GAN is able to derive informative data representations that allow for efficacious anomaly detection across a diverse set of anomaly detection tasks. The model is evaluated in comprehensive experiments with several recent anomaly detection models using four popular image datasets, i.e., MNIST, FMNIST, CIFAR10, and PlantVillage. Experimental results show that Y-GAN outperforms all tested models by a considerable margin and yields state-of-the-art results. The source code for the model is made publicly available at https://github.com/MIvanovska/Y-GAN. },

keywords = {anomaly detection, CNN, deep learning, one-class learning, y-gan},

pubstate = {published},

tppubtype = {article}

}

Close

Larue, Nicolas; Vu, Ngoc-Son; Štruc, Vitomir; Peer, Peter; Christophides, Vassilis

SeeABLE: Soft Discrepancies and Bounded Contrastive Learning for Exposing Deepfakes Proceedings Article

In: Proceedings of the International Conference on Computer Vision (ICCV), pp. 21011 - 21021, IEEE 2023.

Abstract | Links | BibTeX | Tags: CNN, deepfake detection, deepfakes, face, media forensics, one-class learning, representation learning

@inproceedings{NicolasCCV,

title = {SeeABLE: Soft Discrepancies and Bounded Contrastive Learning for Exposing Deepfakes},

author = {Nicolas Larue and Ngoc-Son Vu and Vitomir Štruc and Peter Peer and Vassilis Christophides},

url = {https://openaccess.thecvf.com/content/ICCV2023/papers/Larue_SeeABLE_Soft_Discrepancies_and_Bounded_Contrastive_Learning_for_Exposing_Deepfakes_ICCV_2023_paper.pdf

https://lmi.fe.uni-lj.si/wp-content/uploads/2024/01/SeeABLE_compressed.pdf

https://lmi.fe.uni-lj.si/wp-content/uploads/2024/01/SeeABLE_supplementary_compressed.pdf},

year  = {2023},

date = {2023-10-01},

urldate = {2023-10-01},

booktitle = {Proceedings of the International Conference on Computer Vision (ICCV)},

pages = {21011 - 21021},

organization = {IEEE},

abstract = {Modern deepfake detectors have achieved encouraging results, when training and test images are drawn from the same data collection. However, when these detectors are applied to images produced with unknown deepfake-generation techniques, considerable performance degradations are commonly observed. In this paper, we propose a novel deepfake detector, called SeeABLE, that formalizes the detection problem as a (one-class) out-of-distribution detection task and generalizes better to unseen deepfakes. Specifically, SeeABLE first generates local image perturbations (referred to as soft-discrepancies) and then pushes the perturbed faces towards predefined prototypes using a novel regression-based bounded contrastive loss. To strengthen the generalization performance of SeeABLE to unknown deepfake types, we generate a rich set of soft discrepancies and train the detector: (i) to localize, which part of the face was modified, and (ii) to identify the alteration type. To demonstrate the capabilities of SeeABLE, we perform rigorous experiments on several widely-used deepfake datasets and show that our model convincingly outperforms competing state-of-the-art detectors, while exhibiting highly encouraging generalization capabilities. The source code for SeeABLE is available from: https://github.com/anonymous-author-sub/seeable.

},

keywords = {CNN, deepfake detection, deepfakes, face, media forensics, one-class learning, representation learning},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Ivanovska, Marija; Štruc, Vitomir

A Comparative Study on Discriminative and One--Class Learning Models for Deepfake Detection Proceedings Article

In: Proceedings of ERK 2021, pp. 1–4, 2021.

Abstract | Links | BibTeX | Tags: biometrics, comparative study, computer vision, deepfake detection, deepfakes, detection, face, one-class learning