Publications

Darian Tomašević; Fadi Boutros; Naser Damer; Peter Peer; Vitomir Štruc

Generating bimodal privacy-preserving data for face recognition Journal Article

In: Engineering Applications of Artificial Intelligence, vol. 133, iss. E, pp. 1-25, 2024.

@article{Darian2024,

title = {Generating bimodal privacy-preserving data for face recognition},

author = {Darian Tomašević and Fadi Boutros and Naser Damer and Peter Peer and Vitomir Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2024/05/PapersDarian.pdf},

doi = {https://doi.org/10.1016/j.engappai.2024.108495},

year  = {2024},

date = {2024-05-01},

journal = {Engineering Applications of Artificial Intelligence},

volume = {133},

issue = {E},

pages = {1-25},

abstract = {The performance of state-of-the-art face recognition systems depends crucially on the availability of large-scale training datasets. However, increasing privacy concerns nowadays accompany the collection and distribution of biometric data, which has already resulted in the retraction of valuable face recognition datasets. The use of synthetic data represents a potential solution, however, the generation of privacy-preserving facial images useful for training recognition models is still an open problem. Generative methods also remain bound to the visible spectrum, despite the benefits that multispectral data can provide. To address these issues, we present a novel identity-conditioned generative framework capable of producing large-scale recognition datasets of visible and near-infrared privacy-preserving face images. The framework relies on a novel identity-conditioned dual-branch style-based generative adversarial network to enable the synthesis of aligned high-quality samples of identities determined by features of a pretrained recognition model. In addition, the framework incorporates a novel filter to prevent samples of privacy-breaching identities from reaching the generated datasets and improve both identity separability and intra-identity diversity. Extensive experiments on six publicly available datasets reveal that our framework achieves competitive synthesis capabilities while preserving the privacy of real-world subjects. The synthesized datasets also facilitate training more powerful recognition models than datasets generated by competing methods or even small-scale real-world datasets. Employing both visible and near-infrared data for training also results in higher recognition accuracy on real-world visible spectrum benchmarks. Therefore, training with multispectral data could potentially improve existing recognition systems that utilize only the visible spectrum, without the need for additional sensors.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Žiga Babnik; Peter Peer; Vitomir Štruc

eDifFIQA: Towards Efficient Face Image Quality Assessment based on Denoising Diffusion Probabilistic Models Journal Article

In: IEEE Transactions on Biometrics, Behavior, and Identity Science (TBIOM), pp. 1-16, 2024, ISSN: 2637-6407.

Abstract | Links | BibTeX

@article{BabnikTBIOM2024,

title = {eDifFIQA: Towards Efficient Face Image Quality Assessment based on Denoising Diffusion Probabilistic Models},

author = {Žiga Babnik and Peter Peer and Vitomir Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2024/03/TBIOM___DifFIQAv2.pdf

https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10468647&tag=1},

doi = {10.1109/TBIOM.2024.3376236},

issn = {2637-6407},

year  = {2024},

date = {2024-03-07},

urldate = {2024-03-07},

journal = {IEEE Transactions on Biometrics, Behavior, and Identity Science (TBIOM)},

pages = {1-16},

abstract = {State-of-the-art Face Recognition (FR) models perform well in constrained scenarios, but frequently fail in difficult real-world scenarios, when no quality guarantees can be made for face samples. For this reason, Face Image Quality Assessment (FIQA) techniques are often used by FR systems, to provide quality estimates of captured face samples. The quality estimate provided by FIQA techniques can be used by the FR system to reject samples of low-quality, in turn improving the performance of the system and reducing the number of critical false-match errors. However, despite steady improvements, ensuring a good trade-off between the performance and computational complexity of FIQA methods across diverse face samples remains challenging. In this paper, we present DifFIQA, a powerful unsupervised approach for quality assessment based on the popular denoising diffusion probabilistic models (DDPMs) and the extended (eDifFIQA) approach. The main idea of the base DifFIQA approach is to utilize the forward and backward processes of DDPMs to perturb facial images and quantify the impact of these perturbations on the corresponding image embeddings for quality prediction. Because of the iterative nature of DDPMs the base DifFIQA approach is extremely computationally expensive. Using eDifFIQA we are able to improve on both the performance and computational complexity of the base DifFIQA approach, by employing label optimized knowledge distillation. In this process, quality information inferred by DifFIQA is distilled into a quality-regression model. During the distillation process, we use an additional source of quality information hidden in the relative position of the embedding to further improve the predictive capabilities of the underlying regression model. By choosing different feature extraction backbone models as the basis for the quality-regression eDifFIQA model, we are able to control the trade-off between the predictive capabilities and computational complexity of the final model. We evaluate three eDifFIQA variants of varying sizes in comprehensive experiments on 7 diverse datasets containing static-images and a separate video-based dataset, with 4 target CNN-based FR models and 2 target Transformer-based FR models and against 10 state-of-the-art FIQA techniques, as well as against the initial DifFIQA baseline and a simple regression-based predictor DifFIQA(R), distilled from DifFIQA without any additional optimization. The results show that the proposed label optimized knowledge distillation improves on the performance and computationally complexity of the base DifFIQA approach, and is able to achieve state-of-the-art performance in several distinct experimental scenarios. Furthermore, we also show that the distilled model can be used directly for face recognition and leads to highly competitive results.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

State-of-the-art Face Recognition (FR) models perform well in constrained scenarios, but frequently fail in difficult real-world scenarios, when no quality guarantees can be made for face samples. For this reason, Face Image Quality Assessment (FIQA) techniques are often used by FR systems, to provide quality estimates of captured face samples. The quality estimate provided by FIQA techniques can be used by the FR system to reject samples of low-quality, in turn improving the performance of the system and reducing the number of critical false-match errors. However, despite steady improvements, ensuring a good trade-off between the performance and computational complexity of FIQA methods across diverse face samples remains challenging. In this paper, we present DifFIQA, a powerful unsupervised approach for quality assessment based on the popular denoising diffusion probabilistic models (DDPMs) and the extended (eDifFIQA) approach. The main idea of the base DifFIQA approach is to utilize the forward and backward processes of DDPMs to perturb facial images and quantify the impact of these perturbations on the corresponding image embeddings for quality prediction. Because of the iterative nature of DDPMs the base DifFIQA approach is extremely computationally expensive. Using eDifFIQA we are able to improve on both the performance and computational complexity of the base DifFIQA approach, by employing label optimized knowledge distillation. In this process, quality information inferred by DifFIQA is distilled into a quality-regression model. During the distillation process, we use an additional source of quality information hidden in the relative position of the embedding to further improve the predictive capabilities of the underlying regression model. By choosing different feature extraction backbone models as the basis for the quality-regression eDifFIQA model, we are able to control the trade-off between the predictive capabilities and computational complexity of the final model. We evaluate three eDifFIQA variants of varying sizes in comprehensive experiments on 7 diverse datasets containing static-images and a separate video-based dataset, with 4 target CNN-based FR models and 2 target Transformer-based FR models and against 10 state-of-the-art FIQA techniques, as well as against the initial DifFIQA baseline and a simple regression-based predictor DifFIQA(R), distilled from DifFIQA without any additional optimization. The results show that the proposed label optimized knowledge distillation improves on the performance and computationally complexity of the base DifFIQA approach, and is able to achieve state-of-the-art performance in several distinct experimental scenarios. Furthermore, we also show that the distilled model can be used directly for face recognition and leads to highly competitive results.

Close

Meiling Fang; Wufei Yang; Arjan Kuijper; Vitomir S̆truc; Naser Damer

Fairness in Face Presentation Attack Detection Journal Article

In: Pattern Recognition, vol. 147 , iss. 110002, pp. 1-14, 2024.

Abstract | Links | BibTeX

@article{PR_Fairness2024,

title = {Fairness in Face Presentation Attack Detection},

author = {Meiling Fang and Wufei Yang and Arjan Kuijper and Vitomir S̆truc and Naser Damer},

url = {https://www.sciencedirect.com/science/article/pii/S0031320323007008?dgcid=coauthor},

year  = {2024},

date = {2024-03-01},

urldate = {2024-03-01},

journal = {Pattern Recognition},

volume = {147 },

issue = {110002},

pages = {1-14},

abstract = {Face recognition (FR) algorithms have been proven to exhibit discriminatory behaviors against certain demographic and non-demographic groups, raising ethical and legal concerns regarding their deployment in real-world scenarios. Despite the growing number of fairness studies in FR, the fairness of face presentation attack detection (PAD) has been overlooked, mainly due to the lack of appropriately annotated data. To avoid and mitigate the potential negative impact of such behavior, it is essential to assess the fairness in face PAD and develop fair PAD models. To enable fairness analysis in face PAD, we present a Combined Attribute Annotated PAD Dataset (CAAD-PAD), offering seven human-annotated attribute labels. Then, we comprehensively analyze the fairness of PAD and its relation to the nature of the training data and the Operational Decision Threshold Assignment (ODTA) through a set of face PAD solutions. Additionally, we propose a novel metric, the Accuracy Balanced Fairness (ABF), that jointly represents both the PAD fairness and the absolute PAD performance. The experimental results pointed out that female and faces with occluding features (e.g. eyeglasses, beard, etc.) are relatively less protected than male and non-occlusion groups by all PAD solutions. To alleviate this observed unfairness, we propose a plug-and-play data augmentation method, FairSWAP, to disrupt the identity/semantic information and encourage models to mine the attack clues. The extensive experimental results indicate that FairSWAP leads to better-performing and fairer face PADs in 10 out of 12 investigated cases.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Marija Ivanovska; Vitomir Štruc

Y-GAN: Learning Dual Data Representations for Anomaly Detection in Images Journal Article

In: Expert Systems with Applications (ESWA), vol. 248, no. 123410, pp. 1-7, 2024.

Abstract | Links | BibTeX

Chenquan Gan; Jiahao Zheng; Qingyi Zhu; Deepak Kumar Jain; Vitomir vStruc,

A graph neural network with context filtering and feature correction for conversational emotion recognition Journal Article

In: Information Sciences, vol. 658, no. 120017, pp. 1-21, 2024.

Abstract | Links | BibTeX

@article{InformSciences2024,

title = {A graph neural network with context filtering and feature correction for conversational emotion recognition},

author = {Chenquan Gan and Jiahao Zheng and Qingyi Zhu and Deepak Kumar Jain and Vitomir {v{S}}truc, },

url = {https://www.sciencedirect.com/science/article/pii/S002002552301602X?via%3Dihub

https://lmi.fe.uni-lj.si/wp-content/uploads/2023/12/InformationSciences.pdf},

doi = {https://doi.org/10.1016/j.ins.2023.120017},

year  = {2024},

date = {2024-02-01},

journal = {Information Sciences},

volume = {658},

number = {120017},

pages = {1-21},

abstract = {Conversational emotion recognition represents an important machine-learning problem with a wide variety of deployment possibilities. The key challenge in this area is how to properly capture the key conversational aspects that facilitate reliable emotion recognition, including utterance semantics, temporal order, informative contextual cues, speaker interactions as well as other relevant factors. In this paper, we present a novel Graph Neural Network approach for conversational emotion recognition at the utterance level. Our method addresses the outlined challenges and represents conversations in the form of graph structures that naturally encode temporal order, speaker dependencies, and even long-distance context. To efficiently capture the semantic content of the conversations, we leverage the zero-shot feature-extraction capabilities of pre-trained large-scale language models and then integrate two key contributions into the graph neural network to ensure competitive recognition results. The first is a novel context filter that establishes meaningful utterance dependencies for the graph construction procedure and removes low-relevance and uninformative utterances from being used as a source of contextual information for the recognition task. The second contribution is a feature-correction procedure that adjusts the information content in the generated feature representations through a gating mechanism to improve their discriminative power and reduce emotion-prediction errors. We conduct extensive experiments on four commonly used conversational datasets, i.e., IEMOCAP, MELD, Dailydialog, and EmoryNLP, to demonstrate the capabilities of the developed graph neural network with context filtering and error-correction capabilities. The results of the experiments point to highly promising performance, especially when compared to state-of-the-art competitors from the literature.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Janez Križaj; Richard O. Plesh; Mahesh Banavar; Stephanie Schuckers; Vitomir Štruc

Deep Face Decoder: Towards understanding the embedding space of convolutional networks through visual reconstruction of deep face templates Journal Article

In: Engineering Applications of Artificial Intelligence, vol. 132, iss. 107941, pp. 1-20, 2024.

Abstract | Links | BibTeX

@article{KrizajEAAI2024,

title = {Deep Face Decoder: Towards understanding the embedding space of convolutional networks through visual reconstruction of deep face templates},

author = {Janez Križaj and Richard O. Plesh and Mahesh Banavar and Stephanie Schuckers and Vitomir Štruc},

url = {https://www.sciencedirect.com/science/article/abs/pii/S095219762400099X

https://lmi.fe.uni-lj.si/wp-content/uploads/2024/02/DFD_Overleaf.pdf},

doi = {https://doi.org/10.1016/j.engappai.2024.107941},

year  = {2024},

date = {2024-01-30},

urldate = {2024-01-30},

journal = {Engineering Applications of Artificial Intelligence},

volume = {132},

issue = {107941},

pages = {1-20},

abstract = {Advances in deep learning and convolutional neural networks (ConvNets) have driven remarkable face recognition (FR) progress recently. However, the black-box nature of modern ConvNet-based face recognition models makes it challenging to interpret their decision-making process, to understand the reasoning behind specific success and failure cases, or to predict their responses to unseen data characteristics. It is, therefore, critical to design mechanisms that explain the inner workings of contemporary FR models and offer insight into their behavior. To address this challenge, we present in this paper a novel textit{template-inversion approach} capable of reconstructing high-fidelity face images from the embeddings (templates, feature-space representations) produced by modern FR techniques. Our approach is based on a novel Deep Face Decoder (DFD) trained in a regression setting to visualize the information encoded in the embedding space with the goal of fostering explainability. We utilize the developed DFD model in comprehensive experiments on multiple unconstrained face datasets, namely Visual Geometry Group Face dataset 2 (VGGFace2), Labeled Faces in the Wild (LFW), and Celebrity Faces Attributes Dataset High Quality (CelebA-HQ). Our analysis focuses on the embedding spaces of two distinct face recognition models with backbones based on the Visual Geometry Group 16-layer model (VGG-16) and the 50-layer Residual Network (ResNet-50). The results reveal how information is encoded in the two considered models and how perturbations in image appearance due to rotations, translations, scaling, occlusion, or adversarial attacks, are propagated into the embedding space. Our study offers researchers a deeper comprehension of the underlying mechanisms of ConvNet-based FR models, ultimately promoting advancements in model design and explainability. },

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Fadi Boutros; Vitomir Štruc; Naser Damer

AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition Proceedings Article

In: Proceedings of the European Conference on Computer Vision (ECCV 2024), pp. 1-20, 2024.

Abstract | Links | BibTeX

Ivan DeAndres-Tame; Ruben Tolosana; Pietro Melzi; Ruben Vera-Rodriguez; Minchul Kim; Christian Rathgeb; Xiaoming Liu; Aythami Morales; Julian Fierrez; Javier Ortega-Garcia; Zhizhou Zhong; Yuge Huang; Yuxi Mi; Shouhong Ding; Shuigeng Zhou; Shuai He; Lingzhi Fu; Heng Cong; Rongyu Zhang; Zhihong Xiao; Evgeny Smirnov; Anton Pimenov; Aleksei Grigorev; Denis Timoshenko; Kaleb Mesfin Asfaw; Cheng Yaw Low; Hao Liu; Chuyi Wang; Qing Zuo; Zhixiang He; Hatef Otroshi Shahreza; Anjith George; Alexander Unnervik; Parsa Rahimi; Sébastien Marcel; Pedro C Neto; Marco Huber; Jan Niklas Kolf; Naser Damer; Fadi Boutros; Jaime S Cardoso; Ana F Sequeira; Andrea Atzori; Gianni Fenu; Mirko Marras; Vitomir Štruc; Jiang Yu; Zhangjie Li; Jichun Li; Weisong Zhao; Zhen Lei; Xiangyu Zhu; Xiao-Yu Zhang; Bernardo Biesseck; Pedro Vidal; Luiz Coelho; Roger Granada; David Menotti

Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data Proceedings Article

In: Proceedings of CVPR Workshops (CVPRW 2024), pp. 1-11, 2024.

Abstract | Links | BibTeX

@inproceedings{CVPR_synth2024,

title = {Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data},

author = {Ivan DeAndres-Tame and Ruben Tolosana and Pietro Melzi and Ruben Vera-Rodriguez and Minchul Kim and Christian Rathgeb and Xiaoming Liu and Aythami Morales and Julian Fierrez and Javier Ortega-Garcia and Zhizhou Zhong and Yuge Huang and Yuxi Mi and Shouhong Ding and Shuigeng Zhou and Shuai He and Lingzhi Fu and Heng Cong and Rongyu Zhang and Zhihong Xiao and Evgeny Smirnov and Anton Pimenov and Aleksei Grigorev and Denis Timoshenko and Kaleb Mesfin Asfaw and Cheng Yaw Low and Hao Liu and Chuyi Wang and Qing Zuo and Zhixiang He and Hatef Otroshi Shahreza and Anjith George and Alexander Unnervik and Parsa Rahimi and Sébastien Marcel and Pedro C Neto and Marco Huber and Jan Niklas Kolf and Naser Damer and Fadi Boutros and Jaime S Cardoso and Ana F Sequeira and Andrea Atzori and Gianni Fenu and Mirko Marras and Vitomir Štruc and Jiang Yu and Zhangjie Li and Jichun Li and Weisong Zhao and Zhen Lei and Xiangyu Zhu and Xiao-Yu Zhang and Bernardo Biesseck and Pedro Vidal and Luiz Coelho and Roger Granada and David Menotti},

url = {https://openaccess.thecvf.com/content/CVPR2024W/FRCSyn/papers/Deandres-Tame_Second_Edition_FRCSyn_Challenge_at_CVPR_2024_Face_Recognition_Challenge_CVPRW_2024_paper.pdf},

year  = {2024},

date = {2024-06-17},

urldate = {2024-06-17},

booktitle = {Proceedings of CVPR Workshops (CVPRW 2024)},

pages = {1-11},

abstract = {Synthetic data is gaining increasing relevance for training machine learning models. This is mainly motivated due to several factors such as the lack of real data and intraclass variability, time and errors produced in manual labeling, and in some cases privacy concerns, among others. This paper presents an overview of the 2nd edition of the Face Recognition Challenge in the Era of Synthetic Data (FRCSyn) organized at CVPR 2024. FRCSyn aims to investigate the use of synthetic data in face recognition to address current technological limitations, including data privacy concerns, demographic biases, generalization to novel scenarios, and performance constraints in challenging situations such as aging, pose variations, and occlusions. Unlike the 1st edition, in which synthetic data from DCFace and GANDiffFace methods was only allowed to train face recognition systems, in this 2nd edition we propose new subtasks that allow participants to explore novel face generative methods. The outcomes of the 2nd FRCSyn Challenge, along with the proposed experimental protocol and benchmarking contribute significantly to the application of synthetic data to face recognition.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Peter Rot; Philipp Terhorst; Peter Peer; Vitomir Štruc

ASPECD: Adaptable Soft-Biometric Privacy-Enhancement Using Centroid Decoding for Face Verification Proceedings Article

In: Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition (FG), pp. 1-9, 2024.

Abstract | Links | BibTeX

@inproceedings{Rot_FG2024,

title = {ASPECD: Adaptable Soft-Biometric Privacy-Enhancement Using Centroid Decoding for Face Verification},

author = {Peter Rot and Philipp Terhorst and Peter Peer and Vitomir Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2024/03/PeterRot_FG2024.pdf},

year  = {2024},

date = {2024-05-28},

booktitle = {Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition (FG)},

pages = {1-9},

abstract = {State-of-the-art face recognition models commonly extract information-rich biometric templates from the input images that are then used for comparison purposes and identity inference. While these templates encode identity information in a highly discriminative manner, they typically also capture other potentially sensitive facial attributes, such as age, gender or ethnicity. To address this issue, Soft-Biometric Privacy-Enhancing Techniques (SB-PETs) were proposed in the literature that aim to suppress such attribute information, and, in turn, alleviate the privacy risks associated with the extracted biometric templates. While various SB-PETs were presented so far, existing   approaches do not provide dedicated mechanisms to determine which soft-biometrics to exclude and which to retain. In this paper, we address this gap and introduce ASPECD, a modular framework designed to selectively suppress binary and categorical soft-biometrics based on users' privacy preferences. ASPECD consists of multiple sequentially connected components, each dedicated for privacy-enhancement of an individual soft-biometric attribute.  The proposed framework suppresses attribute information using a Moment-based Disentanglement process coupled with a centroid decoding procedure, ensuring that the privacy-enhanced templates are directly comparable to the templates in the original embedding space, regardless of the soft-biometric modality being suppressed. 

To validate the performance of ASPECD, we conduct experiments on a large-scale face dataset and with five state-of-the-art face recognition models, demonstrating the effectiveness of the proposed approach in suppressing single and multiple soft-biometric attributes. Our approach achieves a competitive privacy-utility trade-off compared to the state-of-the-art methods in scenarios that involve enhancing privacy w.r.t. gender and ethnicity attributes. Source code will be made publicly available.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Ajda Lampe; Julija Stopar; Deepak Kumar Jain; Shinichiro Omachi; Peter Peer; Vitomir Struc

DiCTI: Diffusion-based Clothing Designer via Text-guided Input Proceedings Article

In: Proceedings of the18th International Conference on Automatic Face and Gesture Recognition (FG 2024), pp. 1-9, 2024.

Abstract | Links | BibTeX

@inproceedings{Ajda_Dicti,

title = {DiCTI: Diffusion-based Clothing Designer via Text-guided Input},

author = {Ajda Lampe and Julija Stopar and Deepak Kumar Jain and Shinichiro Omachi and Peter Peer and Vitomir Struc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2024/06/Dicti_FG2024_compressed.pdf},

year  = {2024},

date = {2024-05-27},

booktitle = {Proceedings of the18th International Conference on Automatic Face and Gesture Recognition (FG 2024)},

pages = {1-9},

abstract = {Recent developments in deep generative models have opened up a wide range of opportunities for image synthesis, leading to significant changes in various creative fields, including the fashion industry. While numerous methods have been proposed to benefit buyers, particularly in virtual try-on applications, there has been relatively less focus on facilitating fast prototyping for designers and customers seeking to order new designs. To address this gap, we introduce DiCTI (Diffusion-based Clothing Designer via Text-guided Input), a straightforward yet highly effective approach that allows designers to quickly visualize fashion-related ideas using text inputs only. 

Given an image of a person and a description of the desired garments as input, DiCTI automatically generates multiple high-resolution, photorealistic images that capture the expressed semantics.  

By leveraging a powerful diffusion-based inpainting model conditioned on text inputs, DiCTI is able to synthesize convincing, high-quality images with varied clothing designs that viably follow the provided text descriptions, while being able to process very diverse and challenging inputs, captured in completely unconstrained settings. We evaluate DiCTI in comprehensive experiments on two different datasets (VITON-HD and Fashionpedia) and in comparison to the state-of-the-art (SoTa). The results of our experiments show that DiCTI convincingly outperforms the SoTA competitor in generating higher quality images with more elaborate garments and superior text prompt adherence, both according to standard quantitative evaluation measures and human ratings, generated as part of a user study. The source code of DiCTI will be made publicly available.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Žiga Babnik; Fadi Boutros; Naser Damer; Peter Peer; Vitomir Štruc

AI-KD: Towards Alignment Invariant Face Image Quality Assessment Using Knowledge Distillation Proceedings Article

In: Proceedings of the International Workshop on Biometrics and Forensics (IWBF), pp. 1-6, 2024.

Abstract | Links | BibTeX

Peter Rot; Janez Križaj; Peter Peer; Vitomir Štruc

Enhancing Gender Privacy with Photo-realistic Fusion of Disentangled Spatial Segments Proceedings Article

In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1-5, 2024.

BibTeX

Marko Brodarič; Peter Peer; Vitomir Štruc

Cross-Dataset Deepfake Detection: Evaluating the Generalization Capabilities of Modern DeepFake Detectors Proceedings Article

In: Proceedings of the 27th Computer Vision Winter Workshop (CVWW), pp. 1-10, 2024.

Abstract | Links | BibTeX

@inproceedings{MarkoCVWW,

title = {Cross-Dataset Deepfake Detection: Evaluating the Generalization Capabilities of Modern DeepFake Detectors},

author = {Marko Brodarič and Peter Peer and Vitomir Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2024/01/MarkoCVWW24_compressed.pdf},

year  = {2024},

date = {2024-01-31},

booktitle = {Proceedings of the 27th Computer Vision Winter Workshop (CVWW)},

pages = {1-10},

abstract = {Due to the recent advances in generative deep learning, numerous techniques have been proposed in the literature that allow for the creation of so-called deepfakes, i.e., forged facial images commonly used for malicious purposes. These developments have triggered a need for effective deepfake detectors, capable of identifying forged and manipulated imagery as robustly as possible. While a considerable number of detection techniques has been proposed over the years, generalization across a wide spectrum of deepfake-generation techniques still remains an open problem. In this paper, we study a representative set of deepfake generation methods and analyze their performance in a cross-dataset setting with the goal of better understanding the reasons behind the observed generalization performance. To this end, we conduct a comprehensive analysis on the FaceForensics++ dataset and adopt Gradient-weighted Class Activation Mappings (Grad-CAM) to provide insights into the behavior of the evaluated detectors. Since a new class of deepfake generation techniques based on diffusion models recently appeared in the literature, we introduce a new subset of the FaceForensics++ dataset with diffusion-based deepfake and include it in our analysis. The results of our experiments show that most detectors overfit to the specific image artifacts induced by a given deepfake-generation model and mostly focus on local image areas where such artifacts can be expected. Conversely, good generalization appears to be correlated with class activations that cover a broad spatial area and hence capture different image artifacts that appear in various part of the facial region.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Marija Ivanovska; Vitomir Štruc

On the Vulnerability of Deepfake Detectors to Attacks Generated by Denoising Diffusion Models Proceedings Article

In: Proceedings of WACV Workshops, pp. 1051-1060, 2024.

Abstract | Links | BibTeX

Martin Pernuš; Vitomir Štruc; Simon Dobrišek

MaskFaceGAN: High Resolution Face Editing With Masked GAN Latent Code Optimization Journal Article

In: IEEE Transactions on Image Processing, 2023, ISSN: 1941-0042.

Abstract | Links | BibTeX

@article{MaskFaceGAN,

title = {MaskFaceGAN: High Resolution Face Editing With Masked GAN Latent Code Optimization},

author = {Martin Pernuš and Vitomir Štruc and Simon Dobrišek},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10299582

https://lmi.fe.uni-lj.si/wp-content/uploads/2023/02/MaskFaceGAN_compressed.pdf

https://arxiv.org/pdf/2103.11135.pdf},

doi = {10.1109/TIP.2023.3326675},

issn = {1941-0042},

year  = {2023},

date = {2023-10-27},

urldate = {2023-01-02},

journal = {IEEE Transactions on Image Processing},

abstract = {Face editing represents a popular research topic within the computer vision and image processing communities. While significant progress has been made recently in this area, existing solutions: ( i ) are still largely focused on low-resolution images, ( ii ) often generate editing results with visual artefacts, or ( iii ) lack fine-grained control over the editing procedure and alter multiple (entangled) attributes simultaneously, when trying to generate the desired facial semantics. In this paper, we aim to address these issues through a novel editing approach, called MaskFaceGAN that focuses on local attribute editing. The proposed approach is based on an optimization procedure that directly optimizes the latent code of a pre-trained (state-of-the-art) Generative Adversarial Network (i.e., StyleGAN2) with respect to several constraints that ensure: ( i ) preservation of relevant image content, ( ii ) generation of the targeted facial attributes, and ( iii ) spatially–selective treatment of local image regions. The constraints are enforced with the help of an (differentiable) attribute classifier and face parser that provide the necessary reference information for the optimization procedure. MaskFaceGAN is evaluated in extensive experiments on the FRGC, SiblingsDB-HQf, and XM2VTS datasets and in comparison with several state-of-the-art techniques from the literature. Our experimental results show that the proposed approach is able to edit face images with respect to several local facial attributes with unprecedented image quality and at high-resolutions (1024×1024), while exhibiting considerably less problems with attribute entanglement than competing solutions. The source code is publicly available from: https://github.com/MartinPernus/MaskFaceGAN.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Peter Rot; Klemen Grm; Peter Peer; Vitomir Štruc

PrivacyProber: Assessment and Detection of Soft–Biometric Privacy–Enhancing Techniques Journal Article

In: IEEE Transactions on Dependable and Secure Computing, pp. 1-18, 2023, ISBN: 1545-5971.

Abstract | Links | BibTeX

@article{PrivacProberRot,

title = {PrivacyProber: Assessment and Detection of Soft–Biometric Privacy–Enhancing Techniques},

author = {Peter Rot and Klemen Grm and Peter Peer and Vitomir Štruc},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10264192},

doi = {10.1109/TDSC.2023.3319500},

isbn = {1545-5971},

year  = {2023},

date = {2023-09-23},

journal = {IEEE Transactions on Dependable and Secure Computing},

pages = {1-18},

abstract = {Soft–biometric privacy–enhancing techniques represent machine learning methods that aim to: (i) mitigate privacy concerns associated with face recognition technology by suppressing selected soft–biometric attributes in facial images (e.g., gender, age, ethnicity) and (ii) make unsolicited extraction of sensitive personal information infeasible. Because such techniques are increasingly used in real–world applications, it is imperative to understand to what extent the privacy enhancement can be inverted and how much attribute information can be recovered from privacy–enhanced images. While these aspects are critical, they have not been investigated in the literature so far. In this paper, we, therefore, study the robustness of several state–of–the–art soft–biometric privacy–enhancing techniques to attribute recovery attempts. We propose PrivacyProber, a high–level framework for restoring soft–biometric information from privacy–enhanced facial images, and apply it for attribute recovery in comprehensive experiments on three public face datasets, i.e., LFW, MUCT and Adience. Our experiments show that the proposed framework is able to restore a considerable amount of suppressed information, regardless of the privacy–enhancing technique used (e.g., adversarial perturbations, conditional synthesis, etc.), but also that there are significant differences between the considered privacy models. These results point to the need for novel mechanisms that can improve the robustness of existing privacy–enhancing techniques and secure them against potential adversaries trying to restore suppressed information. Additionally, we demonstrate that PrivacyProber can also be used to detect privacy–enhancement in facial images (under black–box assumptions) with high accuracy. Specifically, we show that a detection procedure can be developed around the proposed framework that is learning free and, therefore, generalizes well across different data characteristics and privacy–enhancing techniques.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Soft–biometric privacy–enhancing techniques represent machine learning methods that aim to: (i) mitigate privacy concerns associated with face recognition technology by suppressing selected soft–biometric attributes in facial images (e.g., gender, age, ethnicity) and (ii) make unsolicited extraction of sensitive personal information infeasible. Because such techniques are increasingly used in real–world applications, it is imperative to understand to what extent the privacy enhancement can be inverted and how much attribute information can be recovered from privacy–enhanced images. While these aspects are critical, they have not been investigated in the literature so far. In this paper, we, therefore, study the robustness of several state–of–the–art soft–biometric privacy–enhancing techniques to attribute recovery attempts. We propose PrivacyProber, a high–level framework for restoring soft–biometric information from privacy–enhanced facial images, and apply it for attribute recovery in comprehensive experiments on three public face datasets, i.e., LFW, MUCT and Adience. Our experiments show that the proposed framework is able to restore a considerable amount of suppressed information, regardless of the privacy–enhancing technique used (e.g., adversarial perturbations, conditional synthesis, etc.), but also that there are significant differences between the considered privacy models. These results point to the need for novel mechanisms that can improve the robustness of existing privacy–enhancing techniques and secure them against potential adversaries trying to restore suppressed information. Additionally, we demonstrate that PrivacyProber can also be used to detect privacy–enhancement in facial images (under black–box assumptions) with high accuracy. Specifically, we show that a detection procedure can be developed around the proposed framework that is learning free and, therefore, generalizes well across different data characteristics and privacy–enhancing techniques.

Close

Matej Vitek; Matic Bizjak; Peter Peer; Vitomir Štruc

IPAD: Iterative Pruning with Activation Deviation for Sclera Biometrics Journal Article

In: Journal of King Saud University - Computer and Information Sciences, vol. 35, no. 8, pp. 1-21, 2023.

Abstract | Links | BibTeX

@article{VitekSaud2023,

title = {IPAD: Iterative Pruning with Activation Deviation for Sclera Biometrics},

author = {Matej Vitek and Matic Bizjak and Peter Peer and Vitomir Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2023/07/PublishedVersion.pdf},

doi = {https://doi.org/10.1016/j.jksuci.2023.101630},

year  = {2023},

date = {2023-07-10},

journal = {Journal of King Saud University - Computer and Information Sciences},

volume = {35},

number = {8},

pages = {1-21},

abstract = {The sclera has recently been gaining attention as a biometric modality due to its various desirable characteristics. A key step in any type of ocular biometric recognition, including sclera recognition, is the segmentation of the relevant part(s) of the eye. However, the high computational complexity of the (deep) segmentation models used in this task can limit their applicability on resource-constrained devices such as smartphones or head-mounted displays. As these devices are a common desired target for such biometric systems, lightweight solutions for ocular segmentation are critically needed. To address this issue, this paper introduces IPAD (Iterative Pruning with Activation Deviation), a novel method for developing lightweight convolutional networks, that is based on model pruning. IPAD uses a novel filter-activation-based criterion (ADC) to determine low-importance filters and employs an iterative model pruning procedure to derive the final lightweight model. To evaluate the proposed pruning procedure, we conduct extensive experiments with two diverse segmentation models, over four publicly available datasets (SBVPI, SLD, SMD and MOBIUS), in four distinct problem configurations and in comparison to state-of-the-art methods from the literature. The results of the experiments show that the proposed filter-importance criterion outperforms the standard L1 and L2 approaches from the literature. Furthermore, the results also suggest that: 1) the pruned models are able to retain (or even improve on) the performance of the unpruned originals, as long as they are not over-pruned, with RITnet and U-Net at 50% of their original FLOPs reaching up to 4% and 7% higher IoU values than their unpruned versions, respectively, 2) smaller models require more careful pruning, as the pruning process can hurt the model’s generalization capabilities, and 3) the novel criterion most convincingly outperforms the classic approaches when sufficient training data is available, implying that the abundance of data leads to more robust activation-based importance computation.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

The sclera has recently been gaining attention as a biometric modality due to its various desirable characteristics. A key step in any type of ocular biometric recognition, including sclera recognition, is the segmentation of the relevant part(s) of the eye. However, the high computational complexity of the (deep) segmentation models used in this task can limit their applicability on resource-constrained devices such as smartphones or head-mounted displays. As these devices are a common desired target for such biometric systems, lightweight solutions for ocular segmentation are critically needed. To address this issue, this paper introduces IPAD (Iterative Pruning with Activation Deviation), a novel method for developing lightweight convolutional networks, that is based on model pruning. IPAD uses a novel filter-activation-based criterion (ADC) to determine low-importance filters and employs an iterative model pruning procedure to derive the final lightweight model. To evaluate the proposed pruning procedure, we conduct extensive experiments with two diverse segmentation models, over four publicly available datasets (SBVPI, SLD, SMD and MOBIUS), in four distinct problem configurations and in comparison to state-of-the-art methods from the literature. The results of the experiments show that the proposed filter-importance criterion outperforms the standard L1 and L2 approaches from the literature. Furthermore, the results also suggest that: 1) the pruned models are able to retain (or even improve on) the performance of the unpruned originals, as long as they are not over-pruned, with RITnet and U-Net at 50% of their original FLOPs reaching up to 4% and 7% higher IoU values than their unpruned versions, respectively, 2) smaller models require more careful pruning, as the pruning process can hurt the model’s generalization capabilities, and 3) the novel criterion most convincingly outperforms the classic approaches when sufficient training data is available, implying that the abundance of data leads to more robust activation-based importance computation.

Close

Martin Pernuš; Mansi Bhatnagar; Badr Samad; Divyanshu Singh; Peter Peer; Vitomir Štruc; Simon Dobrišek

ChildNet: Structural Kinship Face Synthesis Model With Appearance Control Mechanisms Journal Article

In: IEEE Access, pp. 1-22, 2023, ISSN: 2169-3536.

Abstract | Links | BibTeX

@article{AccessMartin2023,

title = {ChildNet: Structural Kinship Face Synthesis Model With Appearance Control Mechanisms},

author = {Martin Pernuš and Mansi Bhatnagar and Badr Samad and Divyanshu Singh and Peter Peer and Vitomir Štruc and Simon Dobrišek},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10126110},

doi = {10.1109/ACCESS.2023.3276877},

issn = {2169-3536},

year  = {2023},

date = {2023-05-17},

journal = {IEEE Access},

pages = {1-22},

abstract = {Kinship face synthesis is an increasingly popular topic within the computer vision community, particularly the task of predicting the child appearance using parental images. Previous work has been limited in terms of model capacity and inadequate training data, which is comprised of low-resolution and tightly cropped images, leading to lower synthesis quality. In this paper, we propose ChildNet,  a method for kinship face synthesis that leverages the facial image generation capabilities of a state-of-the-art Generative Adversarial Network (GAN), and resolves the aforementioned problems. ChildNet is designed within the GAN latent space and is able to predict a child appearance that bears high resemblance to real parents’ children. To ensure fine-grained control, we propose an age and gender manipulation module that allows precise manipulation of the child synthesis result. ChildNet is capable of generating multiple child images per parent pair input, while providing a way to control the image generation variability. Additionally, we introduce a mechanism to control the dominant parent image. Finally, to facilitate the task of kinship face synthesis, we introduce a new kinship dataset, called Next of Kin. This dataset contains 3690 high-resolution face images with a diverse range of ethnicities and ages. We evaluate ChildNet in comprehensive experiments against three competing kinship face synthesis models, using two kinship datasets. The experiments demonstrate the superior performance of ChildNet in terms of identity similarity, while exhibiting high perceptual image quality. The source code for the model is publicly available at: https://github.com/MartinPernus/ChildNet.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Fadi Boutros; Vitomir Štruc; Julian Fierrez; Naser Damer

Synthetic data for face recognition: Current state and future prospects Journal Article

In: Image and Vision Computing, no. 104688, 2023.

Abstract | Links | BibTeX

Miha Grabner; Yi Wang; Qingsong Wen; Boštjan Blažič; Vitomir Štruc

A global modeling framework for load forecasting in distribution networks Journal Article

In: IEEE Transactions on Smart Grid, 2023, ISSN: 1949-3061.

Abstract | Links | BibTeX

@article{Grabner_TSG,

title = {A global modeling framework for load forecasting in distribution networks},

author = {Miha Grabner and Yi Wang and Qingsong Wen and Boštjan Blažič and Vitomir Štruc},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10092804},

doi = {10.1109/TSG.2023.3264525},

issn = {1949-3061},

year  = {2023},

date = {2023-04-05},

journal = {IEEE Transactions on Smart Grid},

abstract = {With the increasing numbers of smart meter installations, scalable and efficient load forecasting techniques are critically needed to ensure sustainable situation awareness within the distribution networks. Distribution networks include a large amount of different loads at various aggregation levels, such as individual consumers, low-voltage feeders, and transformer stations. It is impractical to develop individual (or so-called local) forecasting models for each load separately. Additionally, such local models also (i) (largely) ignore the strong dependencies between different loads that might be present due to their spatial proximity and the characteristics of the distribution network, (ii) require historical data for each load to be able to make forecasts, and (iii) are incapable of adjusting to changes in the load behavior without retraining. To address these issues, we propose a global modeling framework for load forecasting in distribution networks that, unlike its local competitors, relies on a single global model to generate forecasts for a large number of loads. The global nature of the framework, significantly reduces the computational burden typically required when training multiple local forecasting models, efficiently exploits the cross-series information shared among different loads, and facilitates forecasts even when historical data for a load is missing or the behavior of a load evolves over time. To further improve on the performance of the proposed framework, an unsupervised localization mechanism and optimal ensemble construction strategy are also proposed to localize/personalize the global forecasting model to different load characteristics. Our experimental results show that the proposed framework outperforms naive benchmarks by more than 25% (in terms of Mean Absolute Error) on real-world dataset while exhibiting highly desirable characteristics when compared to the local models that are predominantly used in the literature. All source code and data are made publicly available to enable reproducibility: https://github.com/mihagrabner/GlobalModelingFramework},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

With the increasing numbers of smart meter installations, scalable and efficient load forecasting techniques are critically needed to ensure sustainable situation awareness within the distribution networks. Distribution networks include a large amount of different loads at various aggregation levels, such as individual consumers, low-voltage feeders, and transformer stations. It is impractical to develop individual (or so-called local) forecasting models for each load separately. Additionally, such local models also (i) (largely) ignore the strong dependencies between different loads that might be present due to their spatial proximity and the characteristics of the distribution network, (ii) require historical data for each load to be able to make forecasts, and (iii) are incapable of adjusting to changes in the load behavior without retraining. To address these issues, we propose a global modeling framework for load forecasting in distribution networks that, unlike its local competitors, relies on a single global model to generate forecasts for a large number of loads. The global nature of the framework, significantly reduces the computational burden typically required when training multiple local forecasting models, efficiently exploits the cross-series information shared among different loads, and facilitates forecasts even when historical data for a load is missing or the behavior of a load evolves over time. To further improve on the performance of the proposed framework, an unsupervised localization mechanism and optimal ensemble construction strategy are also proposed to localize/personalize the global forecasting model to different load characteristics. Our experimental results show that the proposed framework outperforms naive benchmarks by more than 25% (in terms of Mean Absolute Error) on real-world dataset while exhibiting highly desirable characteristics when compared to the local models that are predominantly used in the literature. All source code and data are made publicly available to enable reproducibility: https://github.com/mihagrabner/GlobalModelingFramework

Close

Blaž Meden; Manfred Gonzalez-Hernandez; Peter Peer; Vitomir Štruc

Face deidentification with controllable privacy protection Journal Article

In: Image and Vision Computing, vol. 134, no. 104678, pp. 1-19, 2023.

Abstract | Links | BibTeX

@article{MedenDeID2023,

title = {Face deidentification with controllable privacy protection},

author = {Blaž Meden and Manfred Gonzalez-Hernandez and Peter Peer and Vitomir Štruc},

url = {https://reader.elsevier.com/reader/sd/pii/S0262885623000525?token=BC1E21411C50118E666720B002A89C9EB3DB4CFEEB5EB18D7BD7B0613085030A96621C8364583BFE7BAE025BE3646096&originRegion=eu-west-1&originCreation=20230516115322},

doi = {https://doi.org/10.1016/j.imavis.2023.104678},

year  = {2023},

date = {2023-04-01},

journal = {Image and Vision Computing},

volume = {134},

number = {104678},

pages = {1-19},

abstract = {Privacy protection has become a crucial concern in today’s digital age. Particularly sensitive here are facial images, which typically not only reveal a person’s identity, but also other sensitive personal information. To address this problem, various face deidentification techniques have been presented in the literature. These techniques try to remove or obscure personal information from facial images while still preserving their usefulness for further analysis. While a considerable amount of work has been proposed on face deidentification, most state-of-theart solutions still suffer from various drawbacks, and (a) deidentify only a narrow facial area, leaving potentially important contextual information unprotected, (b) modify facial images to such degrees, that image naturalness and facial diversity is suffering in the deidentify images, (c) offer no flexibility in the level of privacy protection ensured, leading to suboptimal deployment in various applications, and (d) often offer an unsatisfactory tradeoff between the ability to obscure identity information, quality and naturalness of the deidentified images, and sufficient utility preservation. In this paper, we address these shortcomings with a novel controllable face deidentification technique that balances image quality, identity protection, and data utility for further analysis. The proposed approach utilizes a powerful generative model (StyleGAN2), multiple auxiliary classification models, and carefully designed constraints to guide the deidentification process. The approach is validated across four diverse datasets (CelebA-HQ, RaFD, XM2VTS, AffectNet) and in comparison to 7 state-of-the-art competitors. The results of the experiments demonstrate that the proposed solution leads to: (a) a considerable level of identity protection, (b) valuable preservation of data utility, (c) sufficient diversity among the deidentified faces, and (d) encouraging overall performance.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Matej Vitek; Abhijit Das; Diego Rafael Lucio; Luiz Antonio Zanlorensi Jr.; David Menotti; Jalil Nourmohammadi Khiarak; Mohsen Akbari Shahpar; Meysam Asgari-Chenaghlu; Farhang Jaryani; Juan E. Tapia; Andres Valenzuela; Caiyong Wang; Yunlong Wang; Zhaofeng He; Zhenan Sun; Fadi Boutros; Naser Damer; Jonas Henry Grebe; Arjan Kuijper; Kiran Raja; Gourav Gupta; Georgios Zampoukis; Lazaros Tsochatzidis; Ioannis Pratikakis; S. V. Aruna Kumar; B. S. Harish; Umapada Pal; Peter Peer; Vitomir Štruc

Exploring Bias in Sclera Segmentation Models: A Group Evaluation Approach Journal Article

In: IEEE Transactions on Information Forensics and Security, vol. 18, pp. 190-205, 2023, ISSN: 1556-6013.

Abstract | Links | BibTeX

@article{TIFS_Sclera2022,

title = {Exploring Bias in Sclera Segmentation Models: A Group Evaluation Approach},

author = {Matej Vitek and Abhijit Das and Diego Rafael Lucio and Luiz Antonio Zanlorensi Jr. and David Menotti and Jalil Nourmohammadi Khiarak and Mohsen Akbari Shahpar and Meysam Asgari-Chenaghlu and Farhang Jaryani and Juan E. Tapia and Andres Valenzuela and Caiyong Wang and Yunlong Wang and Zhaofeng He and Zhenan Sun and Fadi Boutros and Naser Damer and Jonas Henry Grebe and Arjan Kuijper and Kiran Raja and Gourav Gupta and Georgios Zampoukis and Lazaros Tsochatzidis and Ioannis Pratikakis and S. V. Aruna Kumar and B. S. Harish and Umapada Pal and Peter Peer and Vitomir Štruc},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9926136},

doi = {10.1109/TIFS.2022.3216468},

issn = {1556-6013},

year  = {2023},

date = {2023-01-18},

urldate = {2022-10-18},

journal = {IEEE Transactions on Information Forensics and Security},

volume = {18},

pages = {190-205},

abstract = {Bias and fairness of biometric algorithms have been key topics of research in recent years, mainly due to the societal, legal and ethical implications of potentially unfair decisions made by automated decision-making models. A considerable amount of work has been done on this topic across different biometric modalities, aiming at better understanding the main sources of algorithmic bias or devising mitigation measures. In this work, we contribute to these efforts and present the first study investigating bias and fairness of sclera segmentation models. Although sclera segmentation techniques represent a key component of sclera-based biometric systems with a considerable impact on the overall recognition performance, the presence of different types of biases in sclera segmentation methods is still underexplored. To address this limitation, we describe the results of a group evaluation effort (involving seven research groups), organized to explore the performance of recent sclera segmentation models within a common experimental framework and study performance differences (and bias), originating from various demographic as well as environmental factors. Using five diverse datasets, we analyze seven independently developed sclera segmentation models in different experimental configurations. The results of our experiments suggest that there are significant differences in the overall segmentation performance across the seven models and that among the considered factors, ethnicity appears to be the biggest cause of bias. Additionally, we observe that training with representative and balanced data does not necessarily lead to less biased results. Finally, we find that in general there appears to be a negative correlation between the amount of bias observed (due to eye color, ethnicity and acquisition device) and the overall segmentation performance, suggesting that advances in the field of semantic segmentation may also help with mitigating bias.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Fevziye Irem Eyiokur; Alperen Kantarci; Mustafa Ekrem Erakin; Naser Damer; Ferda Ofli; Muhammad Imran; Janez Križaj; Albert Ali Salah; Alexander Waibel; Vitomir Štruc; Hazim K. Ekenel

A Survey on Computer Vision based Human Analysis in the COVID-19 Era Journal Article

In: Image and Vision Computing, vol. 130, no. 104610, pp. 1-19, 2023.

Abstract | Links | BibTeX

@article{IVC2023,

title = {A Survey on Computer Vision based Human Analysis in the COVID-19 Era},

author = {Fevziye Irem Eyiokur and Alperen Kantarci and Mustafa Ekrem Erakin and Naser Damer and Ferda Ofli and Muhammad Imran and Janez Križaj and Albert Ali Salah and Alexander Waibel and Vitomir Štruc and Hazim K. Ekenel },

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2023/01/FG4COVID19_PAPER_compressed.pdf

https://authors.elsevier.com/a/1gKOyxnVK7RBS},

doi = {https://doi.org/10.1016/j.imavis.2022.104610},

year  = {2023},

date = {2023-01-01},

journal = {Image and Vision Computing},

volume = {130},

number = {104610},

pages = {1-19},

abstract = {The emergence of COVID-19 has had a global and profound impact, not only on society as a whole, but also on the lives of individuals. Various prevention measures were introduced around the world to limit the transmission of the disease, including 

face masks, mandates for social distancing and regular disinfection in public spaces, and the use of screening applications. These developments also triggered the need for novel and improved computer vision techniques capable of (i) providing support to the prevention measures through an automated analysis of visual data, on the one hand, and (ii) facilitating normal operation of existing vision-based services, such as biometric authentication schemes, on the other. Especially important here, are computer vision techniques that focus on the analysis of people and faces in visual data and have been affected the most by the partial occlusions introduced by the mandates for facial masks. 

Such computer vision based human analysis techniques include face and face-mask detection approaches, face recognition techniques, crowd counting solutions, age and expression estimation procedures, models for detecting face-hand interactions and many others, and have seen considerable attention over recent years. The goal of this survey is to provide an introduction to the problems induced by COVID-19 into such research and to present a comprehensive review of the work done in the computer vision based human analysis field. Particular attention is paid to the impact of facial masks on the performance of various methods and recent solutions to mitigate this problem. Additionally, a detailed review of existing datasets useful for the development and evaluation of methods for COVID-19 related applications is also provided. Finally, to help advance the field further, a discussion on the main open challenges and future research direction is given at the end of the survey. This work is intended to have a broad appeal and be useful not only for computer vision researchers but also the general public.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

The emergence of COVID-19 has had a global and profound impact, not only on society as a whole, but also on the lives of individuals. Various prevention measures were introduced around the world to limit the transmission of the disease, including
face masks, mandates for social distancing and regular disinfection in public spaces, and the use of screening applications. These developments also triggered the need for novel and improved computer vision techniques capable of (i) providing support to the prevention measures through an automated analysis of visual data, on the one hand, and (ii) facilitating normal operation of existing vision-based services, such as biometric authentication schemes, on the other. Especially important here, are computer vision techniques that focus on the analysis of people and faces in visual data and have been affected the most by the partial occlusions introduced by the mandates for facial masks.
Such computer vision based human analysis techniques include face and face-mask detection approaches, face recognition techniques, crowd counting solutions, age and expression estimation procedures, models for detecting face-hand interactions and many others, and have seen considerable attention over recent years. The goal of this survey is to provide an introduction to the problems induced by COVID-19 into such research and to present a comprehensive review of the work done in the computer vision based human analysis field. Particular attention is paid to the impact of facial masks on the performance of various methods and recent solutions to mitigate this problem. Additionally, a detailed review of existing datasets useful for the development and evaluation of methods for COVID-19 related applications is also provided. Finally, to help advance the field further, a discussion on the main open challenges and future research direction is given at the end of the survey. This work is intended to have a broad appeal and be useful not only for computer vision researchers but also the general public.

Close

Anja Hrovatič; Peter Peer; Vitomir Štruc; Žiga Emeršič

Efficient ear alignment using a two-stack hourglass network Journal Article

In: IET Biometrics , pp. 1-14, 2023, ISSN: 2047-4938.

Abstract | Links | BibTeX

@article{UhljiIETZiga,

title = {Efficient ear alignment using a two-stack hourglass network},

author = {Anja Hrovatič and Peter Peer and Vitomir Štruc and Žiga Emeršič},

url = {https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/bme2.12109},

doi = {10.1049/bme2.12109},

issn = {2047-4938},

year  = {2023},

date = {2023-01-01},

journal = {IET Biometrics },

pages = {1-14},

abstract = {Ear images have been shown to be a reliable modality for biometric recognition with desirable characteristics, such as high universality, distinctiveness, measurability and permanence. While a considerable amount of research has been directed towards ear recognition techniques, the problem of ear alignment is still under-explored in the open literature. Nonetheless, accurate alignment of ear images, especially in unconstrained acquisition scenarios, where the ear appearance is expected to vary widely due to pose and view point variations, is critical for the performance of all downstream tasks, including ear recognition. Here, the authors address this problem and present a framework for ear alignment that relies on a two-step procedure: (i) automatic landmark detection and (ii) fiducial point alignment. For the first (landmark detection) step, the authors implement and train a Two-Stack Hourglass model (2-SHGNet) capable of accurately predicting 55 landmarks on diverse ear images captured in uncontrolled conditions. For the second (alignment) step, the authors use the Random Sample Consensus (RANSAC) algorithm to align the estimated landmark/fiducial points with a pre-defined ear shape (i.e. a collection of average ear landmark positions). The authors evaluate the proposed framework in comprehensive experiments on the AWEx and ITWE datasets and show that the 2-SHGNet model leads to more accurate landmark predictions than competing state-of-the-art models from the literature. Furthermore, the authors also demonstrate that the alignment step significantly improves recognition accuracy with ear images from unconstrained environments compared to unaligned imagery.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Nicolas Larue; Ngoc-Son Vu; Vitomir Štruc; Peter Peer; Vassilis Christophides

SeeABLE: Soft Discrepancies and Bounded Contrastive Learning for Exposing Deepfakes Proceedings Article

In: Proceedings of the International Conference on Computer Vision (ICCV), pp. 21011 - 21021, IEEE 2023.

Abstract | Links | BibTeX

@inproceedings{NicolasCCV,

title = {SeeABLE: Soft Discrepancies and Bounded Contrastive Learning for Exposing Deepfakes},

author = {Nicolas Larue and Ngoc-Son Vu and Vitomir Štruc and Peter Peer and Vassilis Christophides},

url = {https://openaccess.thecvf.com/content/ICCV2023/papers/Larue_SeeABLE_Soft_Discrepancies_and_Bounded_Contrastive_Learning_for_Exposing_Deepfakes_ICCV_2023_paper.pdf

https://lmi.fe.uni-lj.si/wp-content/uploads/2024/01/SeeABLE_compressed.pdf

https://lmi.fe.uni-lj.si/wp-content/uploads/2024/01/SeeABLE_supplementary_compressed.pdf},

year  = {2023},

date = {2023-10-01},

urldate = {2023-10-01},

booktitle = {Proceedings of the International Conference on Computer Vision (ICCV)},

pages = {21011 - 21021},

organization = {IEEE},

abstract = {Modern deepfake detectors have achieved encouraging results, when training and test images are drawn from the same data collection. However, when these detectors are applied to images produced with unknown deepfake-generation techniques, considerable performance degradations are commonly observed. In this paper, we propose a novel deepfake detector, called SeeABLE, that formalizes the detection problem as a (one-class) out-of-distribution detection task and generalizes better to unseen deepfakes. Specifically, SeeABLE first generates local image perturbations (referred to as soft-discrepancies) and then pushes the perturbed faces towards predefined prototypes using a novel regression-based bounded contrastive loss. To strengthen the generalization performance of SeeABLE to unknown deepfake types, we generate a rich set of soft discrepancies and train the detector: (i) to localize, which part of the face was modified, and (ii) to identify the alteration type. To demonstrate the capabilities of SeeABLE, we perform rigorous experiments on several widely-used deepfake datasets and show that our model convincingly outperforms competing state-of-the-art detectors, while exhibiting highly encouraging generalization capabilities. The source code for SeeABLE is available from: https://github.com/anonymous-author-sub/seeable.

},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Žiga Babnik; Peter Peer; Vitomir Štruc

DifFIQA: Face Image Quality Assessment Using Denoising Diffusion Probabilistic Models Proceedings Article

In: IEEE International Joint Conference on Biometrics , pp. 1-10, IEEE, Ljubljana, Slovenia, 2023.

Abstract | Links | BibTeX

@inproceedings{Diffiqa_2023,

title = {DifFIQA: Face Image Quality Assessment Using Denoising Diffusion Probabilistic Models},

author = {Žiga Babnik and Peter Peer and Vitomir Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2023/09/121.pdf

https://lmi.fe.uni-lj.si/wp-content/uploads/2023/09/121-supp.pdf},

year  = {2023},

date = {2023-09-01},

booktitle = {IEEE International Joint Conference on Biometrics },

pages = {1-10},

publisher = {IEEE},

address = {Ljubljana, Slovenia},

abstract = {Modern face recognition (FR) models excel in constrained

scenarios, but often suffer from decreased performance

when deployed in unconstrained (real-world) environments

due to uncertainties surrounding the quality

of the captured facial data. Face image quality assessment

(FIQA) techniques aim to mitigate these performance

degradations by providing FR models with sample-quality

predictions that can be used to reject low-quality samples

and reduce false match errors. However, despite steady improvements,

ensuring reliable quality estimates across facial

images with diverse characteristics remains challenging.

In this paper, we present a powerful new FIQA approach,

named DifFIQA, which relies on denoising diffusion

probabilistic models (DDPM) and ensures highly competitive

results. The main idea behind the approach is to utilize

the forward and backward processes of DDPMs to perturb

facial images and quantify the impact of these perturbations

on the corresponding image embeddings for quality

prediction. Because the diffusion-based perturbations are

computationally expensive, we also distill the knowledge

encoded in DifFIQA into a regression-based quality predictor,

called DifFIQA(R), that balances performance and

execution time. We evaluate both models in comprehensive

experiments on 7 diverse datasets, with 4 target FR models

and against 10 state-of-the-art FIQA techniques with

highly encouraging results. The source code is available

from: https://github.com/LSIbabnikz/DifFIQA.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Bo Peng; Xianyun Sun; Caiyong Wang; Wei Wang; Jing Dong; Zhenan Sun; Rongyu Zhang; Heng Cong; Lingzhi Fu; Hao Wang; Yusheng Zhang; HanYuan Zhang; Xin Zhang; Boyuan Liu; Hefei Ling; Luka Dragar; Borut Batagelj; Peter Peer; Vitomir Struc; Xinghui Zhou; Kunlin Liu; Weitao Feng; Weiming Zhang; Haitao Wang; Wenxiu Diao

DFGC-VRA: DeepFake Game Competition on Visual Realism Assessment Proceedings Article

In: IEEE International Joint Conference on Biometrics (IJCB 2023), pp. 1-9, Ljubljana, Slovenia, 2023.

Abstract | Links | BibTeX

@inproceedings{Deepfake_comp2023,

title = {DFGC-VRA: DeepFake Game Competition on Visual Realism Assessment},

author = {Bo Peng and Xianyun Sun and Caiyong Wang and Wei Wang and Jing Dong and Zhenan Sun and Rongyu Zhang and Heng Cong and Lingzhi Fu and Hao Wang and Yusheng Zhang and HanYuan Zhang and Xin Zhang and Boyuan Liu and Hefei Ling and Luka Dragar and Borut Batagelj and Peter Peer and Vitomir Struc and Xinghui Zhou and Kunlin Liu and Weitao Feng and Weiming Zhang and Haitao Wang and Wenxiu Diao},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2023/09/CameraReady-225.pdf},

year  = {2023},

date = {2023-09-01},

booktitle = {IEEE International Joint Conference on Biometrics (IJCB 2023)},

pages = {1-9},

address = {Ljubljana, Slovenia},

abstract = {This paper presents the summary report on the DeepFake

Game Competition on Visual Realism Assessment (DFGCVRA).

Deep-learning based face-swap videos, also known

as deepfakes, are becoming more and more realistic and

deceiving. The malicious usage of these face-swap videos

has caused wide concerns. There is a ongoing deepfake

game between its creators and detectors, with the human in

the loop. The research community has been focusing on

the automatic detection of these fake videos, but the assessment

of their visual realism, as perceived by human

eyes, is still an unexplored dimension. Visual realism assessment,

or VRA, is essential for assessing the potential

impact that may be brought by a specific face-swap video,

and it is also useful as a quality metric to compare different

face-swap methods. This is the third edition of DFGC

competitions, which focuses on the new visual realism assessment

topic, different from previous ones that compete

creators versus detectors. With this competition, we conduct

a comprehensive study of the SOTA performance on

the new task. We also release our MindSpore codes to fur-

*Jing Dong (jdong@nlpr.ia.ac.cn) is the corresponding author.

ther facilitate research in this field (https://github.

com/bomb2peng/DFGC-VRA-benckmark).},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Jan Niklas Kolf; Fadi Boutros; Jurek Elliesen; Markus Theuerkauf; Naser Damer; Mohamad Y Alansari; Oussama Abdul Hay; Sara Yousif Alansari; Sajid Javed; Naoufel Werghi; Klemen Grm; Vitomir Struc; Fernando Alonso-Fernandez; Kevin Hernandez-Diaz; Josef Bigun; Anjith George; Christophe Ecabert; Hatef Otroshi Shahreza; Ketan Kotwal; Sébastien Marcel; Iurii Medvedev; Jin Bo; Diogo Nunes; Ahmad Hassanpour; Pankaj Khatiwada; Aafan Ahmad Toor; Bian Yang

EFaR 2023: Efficient Face Recognition Competition Proceedings Article

In: IEEE International Joint Conference on Biometrics (IJCB 2023), pp. 1-12, Ljubljana, Slovenia, 2023.

Abstract | Links | BibTeX

Abhijit Das; Saurabh K Atreya; Aritra Mukherjee; Matej Vitek; Haiqing Li; Caiyong Wang; Zhao Guangzhe; Fadi Boutros; Patrick Siebke; Jan Niklas Kolf; Naser Damer; Ye Sun; Lu Hexin; Fab Aobo; You Sheng; Sabari Nathan; Suganya Ramamoorthy; Rampriya R S; Geetanjali G; Prinaka Sihag; Aditya Nigam; Peter Peer; Umapada Pal; Vitomir Struc

Sclera Segmentation and Joint Recognition Benchmarking Competition: SSRBC 2023 Proceedings Article

In: IEEE International Joint Conference on Biometrics (IJCB 2023), pp. 1-10, Ljubljana, Slovenia, 2023.

Abstract | Links | BibTeX

Ziga Emersic; Tetsushi Ohki; Muku Akasaka; Takahiko Arakawa; Soshi Maeda; Masora Okano; Yuya Sato; Anjith George; Sébastien Marcel; Iyyakutti Iyappan Ganapathi; Syed Sadaf Ali; Sajid Javed; Naoufel Werghi; Selin Gök Işık; Erdi Sarıtaş; Hazim Kemal Ekenel; Valter Hudovernik; Jan Niklas Kolf; Fadi Boutros; Naser Damer; Geetanjali Sharma; Aman Kamboj; Aditya Nigam; Deepak Kumar Jain; Guillermo Cámara; Peter Peer; Vitomir Struc

The Unconstrained Ear Recognition Challenge 2023: Maximizing Performance and Minimizing Bias Proceedings Article

In: IEEE International Joint Conference on Biometrics (IJCB 2023), pp. 1-10, Ljubljana, Slovenia, 2023.

Abstract | Links | BibTeX

@inproceedings{UERC2023,

title = {The Unconstrained Ear Recognition Challenge 2023: Maximizing Performance and Minimizing Bias},

author = {Ziga Emersic and Tetsushi Ohki and Muku Akasaka and Takahiko Arakawa and Soshi Maeda and Masora Okano and Yuya Sato and Anjith George and Sébastien Marcel and Iyyakutti Iyappan Ganapathi and Syed Sadaf Ali and Sajid Javed and Naoufel Werghi and Selin Gök Işık and Erdi Sarıtaş and Hazim Kemal Ekenel and Valter Hudovernik and Jan Niklas Kolf and Fadi Boutros and Naser Damer and Geetanjali Sharma and Aman Kamboj and Aditya Nigam and Deepak Kumar Jain and Guillermo Cámara and Peter Peer and Vitomir Struc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2023/09/CameraReady-234.pdf},

year  = {2023},

date = {2023-09-01},

booktitle = {IEEE International Joint Conference on Biometrics (IJCB 2023)},

pages = {1-10},

address = {Ljubljana, Slovenia},

abstract = {The paper provides a summary of the 2023 Unconstrained

Ear Recognition Challenge (UERC), a benchmarking

effort focused on ear recognition from images acquired

in uncontrolled environments. The objective of the challenge

was to evaluate the effectiveness of current ear recognition

techniques on a challenging ear dataset while analyzing

the techniques from two distinct aspects, i.e., verification

performance and bias with respect to specific demographic

factors, i.e., gender and ethnicity. Seven research

groups participated in the challenge and submitted

a seven distinct recognition approaches that ranged from

descriptor-based methods and deep-learning models to ensemble

techniques that relied on multiple data representations

to maximize performance and minimize bias. A comprehensive

investigation into the performance of the submitted

models is presented, as well as an in-depth analysis of

bias and associated performance differentials due to differences

in gender and ethnicity. The results of the challenge

suggest that a wide variety of models (e.g., transformers,

convolutional neural networks, ensemble models) is capable

of achieving competitive recognition results, but also

that all of the models still exhibit considerable performance

differentials with respect to both gender and ethnicity. To

promote further development of unbiased and effective ear

recognition models, the starter kit of UERC 2023 together

with the baseline model, and training and test data is made

available from: http://ears.fri.uni-lj.si/.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Marija Ivanovska; Vitomir Štruc; Janez Perš

TomatoDIFF: On–plant Tomato Segmentation with Denoising Diffusion Models Best Paper Proceedings Article

In: 18th International Conference on Machine Vision and Applications (MVA 2023), pp. 1-6, 2023.

Abstract | Links | BibTeX

Richard Plesh; Peter Peer; Vitomir Štruc

GlassesGAN: Eyewear Personalization using Synthetic Appearance Discovery and Targeted Subspace Modeling Proceedings Article

In: Proceedings of the IEEE/CVF International Conference on Computer Vision and Pattern Recognition (CVPR) , 2023.

Abstract | Links | BibTeX

Marija Ivanovska; Vitomir Štruc

Face Morphing Attack Detection with Denoising Diffusion Probabilistic Models Proceedings Article

In: Proceedings of the International Workshop on Biometrics and Forensics (IWBF), pp. 1-6, 2023.

Abstract | Links | BibTeX

Žiga Babnik; Naser Damer; Vitomir Štruc

Optimization-Based Improvement of Face Image Quality Assessment Techniques Proceedings Article

In: Proceedings of the International Workshop on Biometrics and Forensics (IWBF), 2023.

Abstract | Links | BibTeX

Klemen Grm; Berk Ozata; Vitomir Struc; Hazim K. Ekenel

Meet-in-the-middle: Multi-scale upsampling and matching for cross-resolution face recognition Proceedings Article

In: WACV workshops, pp. 120-129, 2023.

Abstract | Links | BibTeX

Chenquan Gan; Yucheng Yang; Qingyi Zhub; Deepak Kumar Jain; Vitomir Struc

DHF-Net: A hierarchical feature interactive fusion network for dialogue emotion recognition Journal Article

In: Expert Systems with Applications, vol. 210, 2022.

Abstract | Links | BibTeX

Darian Tomašecić; Peter Peer; Franc Solina; Aleš Jaklič; Vitomir Štruc

Reconstructing Superquadrics from Intensity and Color Images Journal Article

In: Sensors, vol. 22, iss. 4, no. 5332, 2022.

Abstract | Links | BibTeX

@article{TomasevicSensors,

title = {Reconstructing Superquadrics from Intensity and Color Images},

author = {Darian Tomašecić and Peter Peer and Franc Solina and Aleš Jaklič and Vitomir Štruc},

url = {https://www.mdpi.com/1424-8220/22/14/5332/pdf?version=1658380987},

doi = {https://doi.org/10.3390/s22145332},

year  = {2022},

date = {2022-07-16},

journal = {Sensors},

volume = {22},

number = {5332},

issue = {4},

abstract = {The task of reconstructing 3D scenes based on visual data represents a longstanding problem in computer vision. Common reconstruction approaches rely on the use of multiple volumetric primitives to describe complex objects. Superquadrics (a class of volumetric primitives) have shown great promise due to their ability to describe various shapes with only a few parameters. Recent research has shown that deep learning methods can be used to accurately reconstruct random superquadrics from both 3D point cloud data and simple depth images. In this paper, we extended these reconstruction methods to intensity and color images. Specifically, we used a dedicated convolutional neural network (CNN) model to reconstruct a single superquadric from the given input image. We analyzed the results in a qualitative and quantitative manner, by visualizing reconstructed superquadrics as well as observing error and accuracy distributions of predictions. We showed that a CNN model designed around a simple ResNet backbone can be used to accurately reconstruct superquadrics from images containing one object, but only if one of the spatial parameters is fixed or if it can be determined from other image characteristics, e.g., shadows. Furthermore, we experimented with images of increasing complexity, for example, by adding textures, and observed that the results degraded only slightly. In addition, we show that our model outperforms the current state-of-the-art method on the studied task. Our final result is a highly accurate superquadric reconstruction model, which can also reconstruct superquadrics from real images of simple objects, without additional training.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Daile Osorio-Roig; Christian Rathgeb; Pawel Drozdowski; Philipp Terhörst; Vitomir Štruc; Christoph Busch

An Attack on Feature Level-based Facial Soft-biometric Privacy Enhancement Journal Article

In: IEEE Transactions on Biometrics, Identity and Behavior (TBIOM), vol. 4, iss. 2, pp. 263-275, 2022.

Abstract | Links | BibTeX

@article{TBIOM_2022,

title = {An Attack on Feature Level-based Facial Soft-biometric Privacy Enhancement},

author = {Daile Osorio-Roig and Christian Rathgeb and Pawel Drozdowski and Philipp Terhörst and Vitomir Štruc and Christoph Busch},

url = {https://arxiv.org/pdf/2111.12405.pdf},

year  = {2022},

date = {2022-05-02},

urldate = {2022-05-02},

journal = {IEEE Transactions on Biometrics, Identity and Behavior (TBIOM)},

volume = {4},

issue = {2},

pages = {263-275},

abstract = {In the recent past, different researchers have proposed novel privacy-enhancing face recognition systems designed to conceal soft-biometric information at feature level. These works have reported impressive results, but usually do not consider specific attacks in their analysis of privacy protection. In most cases, the privacy protection capabilities of these schemes are tested through simple machine learning-based classifiers and visualisations of dimensionality reduction tools. In this work, we introduce an attack on feature level-based facial soft–biometric privacy-enhancement techniques. The attack is based on two observations: (1) to achieve high recognition accuracy, certain similarities between facial representations have to be retained in their privacy-enhanced versions; (2) highly similar facial representations usually originate from face images with similar soft-biometric attributes. Based on these observations, the proposed attack compares a privacy-enhanced face representation against a set of privacy-enhanced face representations with known soft-biometric attributes. Subsequently, the best obtained similarity scores are analysed to infer the unknown soft-biometric attributes of the attacked privacy-enhanced face representation. That is, the attack only requires a relatively small database of arbitrary face images and the privacy-enhancing face recognition algorithm as a black-box. In the experiments, the attack is applied to two representative approaches which have previously been reported to reliably conceal the gender in privacy-enhanced face representations. It is shown that the presented attack is able to circumvent the privacy enhancement to a considerable degree and is able to correctly classify gender with an accuracy of up to approximately 90% for both of the analysed privacy-enhancing face recognition systems. Future works on privacy-enhancing face recognition are encouraged to include the proposed attack in evaluations on privacy protection.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

In the recent past, different researchers have proposed novel privacy-enhancing face recognition systems designed to conceal soft-biometric information at feature level. These works have reported impressive results, but usually do not consider specific attacks in their analysis of privacy protection. In most cases, the privacy protection capabilities of these schemes are tested through simple machine learning-based classifiers and visualisations of dimensionality reduction tools. In this work, we introduce an attack on feature level-based facial soft–biometric privacy-enhancement techniques. The attack is based on two observations: (1) to achieve high recognition accuracy, certain similarities between facial representations have to be retained in their privacy-enhanced versions; (2) highly similar facial representations usually originate from face images with similar soft-biometric attributes. Based on these observations, the proposed attack compares a privacy-enhanced face representation against a set of privacy-enhanced face representations with known soft-biometric attributes. Subsequently, the best obtained similarity scores are analysed to infer the unknown soft-biometric attributes of the attacked privacy-enhanced face representation. That is, the attack only requires a relatively small database of arbitrary face images and the privacy-enhancing face recognition algorithm as a black-box. In the experiments, the attack is applied to two representative approaches which have previously been reported to reliably conceal the gender in privacy-enhanced face representations. It is shown that the presented attack is able to circumvent the privacy enhancement to a considerable degree and is able to correctly classify gender with an accuracy of up to approximately 90% for both of the analysed privacy-enhancing face recognition systems. Future works on privacy-enhancing face recognition are encouraged to include the proposed attack in evaluations on privacy protection.

Close

Janez Križaj; Simon Dobrišek; Vitomir Štruc

Making the most of single sensor information : a novel fusion approach for 3D face recognition using region covariance descriptors and Gaussian mixture models Journal Article

In: Sensors, iss. 6, no. 2388, pp. 1-26, 2022.

Abstract | Links | BibTeX

@article{KrizajSensors2022,

title = {Making the most of single sensor information : a novel fusion approach for 3D face recognition using region covariance descriptors and Gaussian mixture models},

author = {Janez Križaj and Simon Dobrišek and Vitomir Štruc},

url = {https://www.mdpi.com/1424-8220/22/6/2388},

doi = {10.3390/s22062388},

year  = {2022},

date = {2022-03-01},

journal = {Sensors},

number = {2388},

issue = {6},

pages = {1-26},

abstract = {Most commercially successful face recognition systems combine information from multiple sensors (2D and 3D, visible light and infrared, etc.) to achieve reliable recognition in various environments. When only a single sensor is available, the robustness as well as efficacy of the recognition process suffer. In this paper, we focus on face recognition using images captured by a single 3D sensor and propose a method based on the use of region covariance matrixes and Gaussian mixture models (GMMs). All steps of the proposed framework are automated, and no metadata, such as pre-annotated eye, nose, or mouth positions is required, while only a very simple clustering-based face detection is performed. The framework computes a set of region covariance descriptors from local regions of different face image representations and then uses the unscented transform to derive low-dimensional feature vectors, which are finally modeled by GMMs. In the last step, a support vector machine classification scheme is used to make a decision about the identity of the input 3D facial image. The proposed framework has several desirable characteristics, such as an inherent mechanism for data fusion/integration (through the region covariance matrixes), the ability to explore facial images at different levels of locality, and the ability to integrate a domain-specific prior knowledge into the modeling procedure. Several normalization techniques are incorporated into the proposed framework to further improve performance. Extensive experiments are performed on three prominent databases (FRGC v2, CASIA, and UMB-DB) yielding competitive results.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Marjan Stoimchev; Marija Ivanovska; Vitomir Štruc

Learning to Combine Local and Global Image Information for Contactless Palmprint Recognition Journal Article

In: Sensors, vol. 22, no. 1, pp. 1-26, 2022.

Abstract | Links | BibTeX

@article{Stoimchev2022,

title = {Learning to Combine Local and Global Image Information for Contactless Palmprint Recognition},

author = {Marjan Stoimchev and Marija Ivanovska and Vitomir Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2022/03/sensors-22-00073_reduced.pdf},

doi = {https://doi.org/10.3390/s22010073},

year  = {2022},

date = {2022-01-01},

journal = {Sensors},

volume = {22},

number = {1},

pages = {1-26},

abstract = {In the past few years, there has been a leap from traditional palmprint recognition methodologies, which use handcrafted features, to deep-learning approaches that are able to automatically learn feature representations from the input data. However, the information that is extracted from such deep-learning models typically corresponds to the global image appearance, where only the most discriminative cues from the input image are considered. This characteristic is especially problematic when data is acquired in unconstrained settings, as in the case of contactless palmprint recognition systems, where visual artifacts caused by elastic deformations of the palmar surface are typically present in spatially local parts of the captured images. In this study we address the problem of elastic deformations by introducing a new approach to contactless palmprint recognition based on a novel CNN model, designed as a two-path architecture, where one path processes the input in a holistic manner, while the second path extracts local information from smaller image patches sampled from the input image. As elastic deformations can be assumed to most significantly affect the global appearance, while having a lesser impact on spatially local image areas, the local processing path addresses the issues related to elastic deformations thereby supplementing the information from the global processing path. The model is trained with a learning objective that combines the Additive Angular Margin (ArcFace) Loss and the well-known center loss. By using the proposed model design, the discriminative power of the learned image representation is significantly enhanced compared to standard holistic models, which, as we show in the experimental section, leads to state-of-the-art performance for contactless palmprint recognition. Our approach is tested on two publicly available contactless palmprint datasets—namely, IITD and CASIA—and is demonstrated to perform favorably against state-of-the-art methods from the literature. The source code for the proposed model is made publicly available.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

In the past few years, there has been a leap from traditional palmprint recognition methodologies, which use handcrafted features, to deep-learning approaches that are able to automatically learn feature representations from the input data. However, the information that is extracted from such deep-learning models typically corresponds to the global image appearance, where only the most discriminative cues from the input image are considered. This characteristic is especially problematic when data is acquired in unconstrained settings, as in the case of contactless palmprint recognition systems, where visual artifacts caused by elastic deformations of the palmar surface are typically present in spatially local parts of the captured images. In this study we address the problem of elastic deformations by introducing a new approach to contactless palmprint recognition based on a novel CNN model, designed as a two-path architecture, where one path processes the input in a holistic manner, while the second path extracts local information from smaller image patches sampled from the input image. As elastic deformations can be assumed to most significantly affect the global appearance, while having a lesser impact on spatially local image areas, the local processing path addresses the issues related to elastic deformations thereby supplementing the information from the global processing path. The model is trained with a learning objective that combines the Additive Angular Margin (ArcFace) Loss and the well-known center loss. By using the proposed model design, the discriminative power of the learned image representation is significantly enhanced compared to standard holistic models, which, as we show in the experimental section, leads to state-of-the-art performance for contactless palmprint recognition. Our approach is tested on two publicly available contactless palmprint datasets—namely, IITD and CASIA—and is demonstrated to perform favorably against state-of-the-art methods from the literature. The source code for the proposed model is made publicly available.

Close

Peter Rot; Peter Peer; Vitomir Štruc

Detecting Soft-Biometric Privacy Enhancement Book Section

In: Rathgeb, Christian; Tolosana, Ruben; Vera-Rodriguez, Ruben; Busch, Christoph (Ed.): Handbook of Digital Face Manipulation and Detection, 2022.

Links | BibTeX

Ruben Tolosana; Christian Rathgeb; Ruben Vera-Rodriguez; Christoph Busch; Luisa Verdilova; Siwei Lyu; Huy H. Nguyen; Junichi Yamagishi; Isao Echizen; Peter Rot; Klemen Grm; Vitomir Štruc; Antitza Datcheva; Zahid Akhtar; Sergio Romero-Tapiador; Julian Fierrez; Aythami Morales; Javier Ortega-Garcia; Els Kindt; Catherine Jasserand; Tarmo Kalvet; Marek Tiits

Future Trends in Digital Face Manipulation and Detection Book Section

In: Rathgeb, Christian; Tolosana, Ruben; Vera-Rodriguez, Ruben; Busch, Christoph (Ed.): Handbook of Digital Face Manipulation and Detection, pp. 463–482, 2022, ISBN: 978-3-030-87663-0.

Abstract | Links | BibTeX

Darian Tomašević; Peter Peer; Vitomir Štruc

BiOcularGAN: Bimodal Synthesis and Annotation of Ocular Images Proceedings Article

In: IEEE/IAPR International Joint Conference on Biometrics (IJCB 2022) , pp. 1-10, 2022.

Abstract | Links | BibTeX

Marco Huber; Fadi Boutros; Anh Thi Luu; Kiran Raja; Raghavendra Ramachandra; Naser Damer; Pedro C. Neto; Tiago Goncalves; Ana F. Sequeira; Jaime S. Cardoso; João Tremoco; Miguel Lourenco; Sergio Serra; Eduardo Cermeno; Marija Ivanovska; Borut Batagelj; Andrej Kronovšek; Peter Peer; Vitomir Štruc

SYN-MAD 2022: Competition on Face Morphing Attack Detection based on Privacy-aware Synthetic Training Data Proceedings Article

In: IEEE International Joint Conference on Biometrics (IJCB), pp. 1-10, 2022, ISBN: 978-1-6654-6394-2.

Links | BibTeX

Marija Ivanovska; Andrej Kronovšek; Peter Peer; Vitomir Štruc; Borut Batagelj

Face Morphing Attack Detection Using Privacy-Aware Training Data Proceedings Article

In: Proceedings of ERK 2022, pp. 1-4, 2022.

Abstract | Links | BibTeX

Jaka Šircelj; Peter Peer; Franc Solina; Vitomir Štruc

Hierarchical Superquadric Decomposition with Implicit Space Separation Proceedings Article

In: Proceedings of ERK 2022, pp. 1-4, 2022.

Abstract | Links | BibTeX

Klemen Grm; Vitomir Štruc

Optimization-based Image Filter Design for Self-supervised Super-resolution Training Proceedings Article

In: Proceedings of ERK 2022, 2022.

Abstract | Links | BibTeX

Žiga Babnik; Vitomir Štruc

Iterativna optimizacija ocen kakovosti slikovnih podatkov v sistemih za razpoznavanje obrazov Proceedings Article

In: Proceedings of ERK 2022, pp. 1-4, 2022.

Abstract | Links | BibTeX

Žiga Babnik; Peter Peer; Vitomir Štruc

FaceQAN: Face Image Quality Assessment Through Adversarial Noise Exploration Proceedings Article

In: IAPR International Conference on Pattern Recognition (ICPR), 2022.

Abstract | Links | BibTeX

Žiga Babnik; Vitomir Štruc

Assessing Bias in Face Image Quality Assessment Proceedings Article

In: EUSIPCO 2022, 2022.

Abstract | Links | BibTeX

Grega Dvoršak; Ankita Dwivedi; Vitomir Štruc; Peter Peer; Žiga Emeršič

Kinship Verification from Ear Images: An Explorative Study with Deep Learning Models Proceedings Article

In: International Workshop on Biometrics and Forensics (IWBF), pp. 1–6, 2022.

Abstract | Links | BibTeX

Julijan Jug; Ajda Lampe; Peter Peer; Vitomir Štruc

Segmentacija telesa z uporabo večciljnega učenja Proceedings Article

In: Proceedings of Rosus 2022, 2022.

Abstract | Links | BibTeX

Julijan Jug; Ajda Lampe; Vitomir Štruc; Peter Peer

Body Segmentation Using Multi-task Learning Proceedings Article

In: International Conference on Artificial Intelligence in Information and Communication (ICAIIC), IEEE, 2022, ISBN: 978-1-6654-5818-4.

Abstract | Links | BibTeX

Benjamin Fele; Ajda Lampe; Peter Peer; Vitomir Štruc

C-VTON: Context-Driven Image-Based Virtual Try-On Network Proceedings Article

In: IEEE/CVF Winter Applications in Computer Vision (WACV), pp. 1–10, 2022.

Abstract | Links | BibTeX

Žiga Emeršič; Diego Sušanj; Blaž Meden; Peter Peer; Vitomir Štruc

ContexedNet : Context-Aware Ear Detection in Unconstrained Settings Journal Article

In: IEEE Access, pp. 1–17, 2021, ISSN: 2169-3536.

Abstract | Links | BibTeX

@article{ContexedNet_Emersic_2021,

title = {ContexedNet : Context-Aware Ear Detection in Unconstrained Settings},

author = {Žiga Emeršič and Diego Sušanj and Blaž Meden and Peter Peer and Vitomir Štruc},

editor = {ContexedNet : Context-Aware Ear Detection in Unconstrained Settings},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9583244},

issn = {2169-3536},

year  = {2021},

date = {2021-10-20},

urldate = {2021-10-20},

journal = {IEEE Access},

pages = {1--17},

abstract = {Ear detection represents one of the key components of contemporary ear recognition systems. While significant progress has been made in the area of ear detection over recent years, most of the improvements are direct results of advances in the field of visual object detection. Only a limited number of techniques presented in the literature are domain--specific and designed explicitly with ear detection in mind. In this paper, we aim to address this gap and present a novel detection approach that does not rely only on general ear (object) appearance, but also exploits contextual information, i.e., face--part locations, to ensure accurate and robust ear detection with images captured in a wide variety of imaging conditions. The proposed approach is based on a Context--aware Ear Detection Network (ContexedNet) and poses ear detection as a semantic image segmentation problem. ContexedNet consists of two processing paths: 1) a context--provider that extracts probability maps corresponding to the locations of facial parts from the input image, and 2) a dedicated ear segmentation model that integrates the computed probability maps into a context--aware segmentation-based ear detection procedure. ContexedNet is evaluated in rigorous experiments on the AWE and UBEAR datasets and shown to ensure competitive performance when evaluated against state--of--the--art ear detection models from the literature. Additionally, because the proposed contextualization is model agnostic, it can also be utilized with other ear detection techniques to improve performance.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Blaz Meden, Peter Rot, Philipp Terhorst, Naser Damer, Arjan Kuijper, Walter J. Scheirer, Arun Ross, Peter Peer, Vitomir Struc

Privacy-Enhancing Face Biometrics: A Comprehensive Survey Journal Article

In: IEEE Transactions on Information Forensics and Security, vol. 16, pp. 4147-4183, 2021.

Abstract | Links | BibTeX

@article{TIFS_PrivacySurveyb,

title = {Privacy-Enhancing Face Biometrics: A Comprehensive Survey},

author = {Blaz Meden, Peter Rot, Philipp Terhorst, Naser Damer, Arjan Kuijper, Walter J. Scheirer, Arun Ross, Peter Peer, Vitomir Struc},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9481149

https://lmi.fe.uni-lj.si/en/visual_privacy_of_faces__a_survey_preprint-compressed/},

doi = {10.1109/TIFS.2021.3096024},

year  = {2021},

date = {2021-07-12},

journal = {IEEE Transactions on Information Forensics and Security},

volume = {16},

pages = {4147-4183},

abstract = {Biometric recognition technology has made significant advances over the last decade and is now used across a number of services and applications. However, this widespread deployment has also resulted in privacy concerns and evolving societal expectations about the appropriate use of the technology. For example, the ability to automatically extract age, gender, race, and health cues from biometric data has heightened concerns about privacy leakage. Face recognition technology, in particular, has been in the spotlight, and is now seen by many as posing a considerable risk to personal privacy. In response to these and similar concerns, researchers have intensified efforts towards developing techniques and computational models capable of ensuring privacy to individuals, while still facilitating the utility of face recognition technology in several application scenarios.  These efforts have resulted in a multitude of privacy--enhancing techniques that aim at addressing privacy risks originating from biometric systems and providing technological solutions for legislative requirements set forth in privacy laws and regulations, such as GDPR. The goal of this overview paper is to provide a comprehensive introduction into privacy--related research in the area of biometrics and review existing work on textit{Biometric Privacy--Enhancing Techniques} (B--PETs) applied to face biometrics. To make this work useful for as wide of an audience as possible,  several key topics are covered as well, including evaluation strategies used with B--PETs, existing datasets, relevant standards, and regulations and critical open issues that will have to be addressed in the future. },

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Klemen Pevec; Klemen Grm; Vitomir Štruc

Benchmarking Crowd-Counting Techniques across Image Characteristics Journal Article

In: Elektorethniski Vestnik, vol. 88, iss. 5, pp. 227-235, 2021.

Abstract | Links | BibTeX

Borut Batagelj; Peter Peer; Vitomir Štruc; Simon Dobrišek

How to correctly detect face-masks for COVID-19 from visual information? Journal Article

In: Applied sciences, vol. 11, no. 5, pp. 1-24, 2021, ISBN: 2076-3417.

Abstract | Links | BibTeX

@article{Batagelj2021,

title = {How to correctly detect face-masks for COVID-19 from visual information?},

author = {Borut Batagelj and Peter Peer and Vitomir Štruc and Simon Dobrišek},

url = {https://www.mdpi.com/2076-3417/11/5/2070/pdf},

doi = {10.3390/app11052070},

isbn = {2076-3417},

year  = {2021},

date = {2021-03-01},

urldate = {2021-03-01},

journal = {Applied sciences},

volume = {11},

number = {5},

pages = {1-24},

abstract = {The new Coronavirus disease (COVID-19) has seriously affected the world. By the end of November 2020, the global number of new coronavirus cases had already exceeded 60 million and the number of deaths 1,410,378 according to information from the World Health Organization (WHO). To limit the spread of the disease, mandatory face-mask rules are now becoming common in public settings around the world. Additionally, many public service providers require customers to wear face-masks in accordance with predefined rules (e.g., covering both mouth and nose) when using public services. These developments inspired research into automatic (computer-vision-based) techniques for face-mask detection that can help monitor public behavior and contribute towards constraining the COVID-19 pandemic. Although existing research in this area resulted in efficient techniques for face-mask detection, these usually operate under the assumption that modern face detectors provide perfect detection performance (even for masked faces) and that the main goal of the techniques is to detect the presence of face-masks only. In this study, we revisit these common assumptions and explore the following research questions: (i) How well do existing face detectors perform with masked-face images? (ii) Is it possible to detect a proper (regulation-compliant) placement of facial masks? and (iii) How useful are existing face-mask detection techniques for monitoring applications during the COVID-19 pandemic? To answer these and related questions we conduct a comprehensive experimental evaluation of several recent face detectors for their performance with masked-face images. Furthermore, we investigate the usefulness of multiple off-the-shelf deep-learning models for recognizing correct face-mask placement. Finally, we design a complete pipeline for recognizing whether face-masks are worn correctly or not and compare the performance of the pipeline with standard face-mask detection models from the literature. To facilitate the study, we compile a large dataset of facial images from the publicly available MAFA and Wider Face datasets and annotate it with compliant and non-compliant labels. The annotation dataset, called Face-Mask-Label Dataset (FMLD), is made publicly available to the research community.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

The new Coronavirus disease (COVID-19) has seriously affected the world. By the end of November 2020, the global number of new coronavirus cases had already exceeded 60 million and the number of deaths 1,410,378 according to information from the World Health Organization (WHO). To limit the spread of the disease, mandatory face-mask rules are now becoming common in public settings around the world. Additionally, many public service providers require customers to wear face-masks in accordance with predefined rules (e.g., covering both mouth and nose) when using public services. These developments inspired research into automatic (computer-vision-based) techniques for face-mask detection that can help monitor public behavior and contribute towards constraining the COVID-19 pandemic. Although existing research in this area resulted in efficient techniques for face-mask detection, these usually operate under the assumption that modern face detectors provide perfect detection performance (even for masked faces) and that the main goal of the techniques is to detect the presence of face-masks only. In this study, we revisit these common assumptions and explore the following research questions: (i) How well do existing face detectors perform with masked-face images? (ii) Is it possible to detect a proper (regulation-compliant) placement of facial masks? and (iii) How useful are existing face-mask detection techniques for monitoring applications during the COVID-19 pandemic? To answer these and related questions we conduct a comprehensive experimental evaluation of several recent face detectors for their performance with masked-face images. Furthermore, we investigate the usefulness of multiple off-the-shelf deep-learning models for recognizing correct face-mask placement. Finally, we design a complete pipeline for recognizing whether face-masks are worn correctly or not and compare the performance of the pipeline with standard face-mask detection models from the literature. To facilitate the study, we compile a large dataset of facial images from the publicly available MAFA and Wider Face datasets and annotate it with compliant and non-compliant labels. The annotation dataset, called Face-Mask-Label Dataset (FMLD), is made publicly available to the research community.

Close

Tim Oblak; Jaka Šircelj; Vitomir Struc; Peter Peer; Franc Solina; Aleš Jaklic

Learning to predict superquadric parameters from depth images with explicit and implicit supervision Journal Article

In: IEEE Access, pp. 1-16, 2021, ISSN: 2169-3536.

Abstract | Links | BibTeX

@article{Oblak2021,

title = {Learning to predict superquadric parameters from depth images with explicit and implicit supervision},

author = {Tim Oblak and Jaka Šircelj and Vitomir Struc and Peter Peer and Franc Solina and Aleš Jaklic

},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9274424},

doi = {10.1109/ACCESS.2020.3041584},

issn = {2169-3536},

year  = {2021},

date = {2021-01-01},

journal = {IEEE Access},

pages = {1-16},

abstract = {Reconstruction of 3D space from visual data has always been a significant challenge in

the field of computer vision. A popular approach to address this problem can be found in the form of

bottom-up reconstruction techniques which try to model complex 3D scenes through a constellation of

volumetric primitives. Such techniques are inspired by the current understanding of the human visual

system and are, therefore, strongly related to the way humans process visual information, as suggested

by recent visual neuroscience literature. While advances have been made in recent years in the area of

3D reconstruction, the problem remains challenging due to the many possible ways of representing 3D

data, the ambiguity of determining the shape and general position in 3D space and the difficulty to train

efficient models for the prediction of volumetric primitives. In this paper, we address these challenges and

present a novel solution for recovering volumetric primitives from depth images. Specifically, we focus on

the recovery of superquadrics, a special type of parametric models able to describe a wide array of 3D

shapes using only a few parameters. We present a new learning objective that relies on the superquadric

(inside-outside) function and develop two learning strategies for training convolutional neural networks

(CNN) capable of predicting superquadric parameters. The first uses explicit supervision and penalizes the

difference between the predicted and reference superquadric parameters. The second strategy uses implicit

supervision and penalizes differences between the input depth images and depth images rendered from

the predicted parameters. CNN predictors for superquadric parameters are trained with both strategies and

evaluated on a large dataset of synthetic and real-world depth images. Experimental results show that both

strategies compare favourably to the existing state-of-the-art and result in high quality 3D reconstructions

of the modelled scenes at a much shorter processing time.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Reconstruction of 3D space from visual data has always been a significant challenge in
the field of computer vision. A popular approach to address this problem can be found in the form of
bottom-up reconstruction techniques which try to model complex 3D scenes through a constellation of
volumetric primitives. Such techniques are inspired by the current understanding of the human visual
system and are, therefore, strongly related to the way humans process visual information, as suggested
by recent visual neuroscience literature. While advances have been made in recent years in the area of
3D reconstruction, the problem remains challenging due to the many possible ways of representing 3D
data, the ambiguity of determining the shape and general position in 3D space and the difficulty to train
efficient models for the prediction of volumetric primitives. In this paper, we address these challenges and
present a novel solution for recovering volumetric primitives from depth images. Specifically, we focus on
the recovery of superquadrics, a special type of parametric models able to describe a wide array of 3D
shapes using only a few parameters. We present a new learning objective that relies on the superquadric
(inside-outside) function and develop two learning strategies for training convolutional neural networks
(CNN) capable of predicting superquadric parameters. The first uses explicit supervision and penalizes the
difference between the predicted and reference superquadric parameters. The second strategy uses implicit
supervision and penalizes differences between the input depth images and depth images rendered from
the predicted parameters. CNN predictors for superquadric parameters are trained with both strategies and
evaluated on a large dataset of synthetic and real-world depth images. Experimental results show that both
strategies compare favourably to the existing state-of-the-art and result in high quality 3D reconstructions
of the modelled scenes at a much shorter processing time.

Close

Martin Pernus; Vitomir Struc; Simon Dobrisek

High Resolution Face Editing with Masked GAN Latent Code Optimization Journal Article

In: CoRR, vol. abs/2103.11135, 2021.

Links | BibTeX

Marija Ivanovska; Vitomir Štruc

A Comparative Study on Discriminative and One--Class Learning Models for Deepfake Detection Proceedings Article

In: Proceedings of ERK 2021, pp. 1–4, 2021.

Abstract | Links | BibTeX

Klemen Grm; Štruc Vitomir

Frequency Band Encoding for Face Super-Resolution Proceedings Article

In: Proceedings of ERK 2021, pp. 1-4, 2021.

Abstract | Links | BibTeX

Fadi Boutros; Naser Damer; Jan Niklas Kolf; Kiran Raja; Florian Kirchbuchner; Raghavendra Ramachandra; Arjan Kuijper; Pengcheng Fang; Chao Zhang; Fei Wang; David Montero; Naiara Aginako; Basilio Sierra; Marcos Nieto; Mustafa Ekrem Erakin; Ugur Demir; Hazım Kemal Ekenel; Asaki Kataoka; Kohei Ichikawa; Shizuma Kubo; Jie Zhang; Mingjie He; Dan Han; Shiguang Shan; Klemen Grm; Vitomir Štruc; Sachith Seneviratne; Nuran Kasthuriarachchi; Sanka Rasnayaka; Pedro C. Neto; Ana F. Sequeira; Joao Ribeiro Pinto; Mohsen Saffari; Jaime S. Cardoso

MFR 2021: Masked Face Recognition Competition Proceedings Article

In: Proceedings of the IEEE International Joint Conference on Biometrics (IJCB 2021), 2021.

Abstract | Links | BibTeX

@inproceedings{MFR_IJCB2021,

title = {MFR 2021: Masked Face Recognition Competition},

author = {Fadi Boutros and Naser Damer and Jan Niklas Kolf and Kiran Raja and Florian Kirchbuchner and Raghavendra Ramachandra and Arjan Kuijper and Pengcheng Fang and Chao Zhang and Fei Wang and David Montero and Naiara Aginako and Basilio Sierra and Marcos Nieto and Mustafa Ekrem Erakin and Ugur Demir and Hazım Kemal Ekenel and Asaki Kataoka and Kohei Ichikawa and Shizuma Kubo and Jie Zhang and Mingjie He and Dan Han and Shiguang Shan and Klemen Grm and Vitomir Štruc and Sachith Seneviratne and Nuran Kasthuriarachchi and Sanka Rasnayaka and Pedro C. Neto and Ana F. Sequeira and Joao Ribeiro Pinto and Mohsen Saffari and Jaime S. Cardoso},

url = {https://ieeexplore.ieee.org/iel7/9484326/9484328/09484337.pdf?casa_token=OOL4s274P0YAAAAA:XE7ga2rP_wNom2Zeva75ZwNwN-HKz6kF1HZtkpzrdTdz36eaGcLffWkzOgIe3xU2PqaU30qTLws},

doi = {10.1109/IJCB52358.2021.9484337},

year  = {2021},

date = {2021-08-01},

booktitle = {Proceedings of the IEEE International Joint Conference on Biometrics (IJCB 2021)},

abstract = {This paper presents a summary of the Masked Face Recognition Competitions (MFR) held within the 2021 International Joint Conference on Biometrics (IJCB 2021). The competition attracted a total of 10 participating teams with valid submissions. The affiliations of these teams are diverse and associated with academia and industry in nine different countries. These teams successfully submitted 18 valid solutions. The competition is designed to motivate solutions aiming at enhancing the face recognition accuracy of masked faces. Moreover, the competition considered the deployability of the proposed solutions by taking the compactness of the face recognition models into account. A private dataset representing a collaborative, multisession, real masked, capture scenario is used to evaluate the submitted solutions. In comparison to one of the topperforming academic face recognition solutions, 10 out of the 18 submitted solutions did score higher masked face verification accuracy.

},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Caiyong Wang; Yunlong Wang; Kunbo Zhang; Jawad Muhammad; Tianhao Lu; Qi Zhang; Qichuan Tian; Zhaofeng He; Zhenan Sun; Yiwen Zhang; Tianbao Liu; Wei Yang; Dongliang Wu; Yingfeng Liu; Ruiye Zhou; Huihai Wu; Hao Zhang; Junbao Wang; Jiayi Wang; Wantong Xiong; Xueyu Shi; Shao Zeng; Peihua Li; Haodong Sun; Jing Wang; Jiale Zhang; Qi Wang; Huijie Wu; Xinhui Zhang; Haiqing Li; Yu Chen; Liang Chen; Menghan Zhang; Ye Sun; Zhiyong Zhou; Fadi Boutros; Naser Damer; Arjan Kuijper; Juan Tapia; Andres Valenzuela; Christoph Busch; Gourav Gupta; Kiran Raja; Xi Wu; Xiaojie Li; Jingfu Yang; Hongyan Jing; Xin Wang; Bin Kong; Youbing Yin; Qi Song; Siwei Lyu; Shu Hu; Leon Premk; Matej Vitek; Vitomir Štruc; Peter Peer; Jalil Nourmohammadi Khiarak; Farhang Jaryani; Samaneh Salehi Nasab; Seyed Naeim Moafinejad; Yasin Amini; Morteza Noshad

NIR Iris Challenge Evaluation in Non-cooperative Environments: Segmentation and Localization Proceedings Article

In: Proceedings of the IEEE International Joint Conference on Biometrics (IJCB 2021), 2021.

Abstract | Links | BibTeX

@inproceedings{NIR_IJCB2021,

title = {NIR Iris Challenge Evaluation in Non-cooperative Environments: Segmentation and Localization},

author = {Caiyong Wang and Yunlong Wang and Kunbo Zhang and Jawad Muhammad and Tianhao Lu and Qi Zhang and Qichuan Tian and Zhaofeng He and Zhenan Sun and Yiwen Zhang and Tianbao Liu and Wei Yang and Dongliang Wu and Yingfeng Liu and Ruiye Zhou and Huihai Wu and Hao Zhang and Junbao Wang and Jiayi Wang and Wantong Xiong and Xueyu Shi and Shao Zeng and Peihua Li and Haodong Sun and Jing Wang and Jiale Zhang and Qi Wang and Huijie Wu and Xinhui Zhang and Haiqing Li and Yu Chen and Liang Chen and Menghan Zhang and Ye Sun and Zhiyong Zhou and Fadi Boutros and Naser Damer and Arjan Kuijper and Juan Tapia and Andres Valenzuela and Christoph Busch and Gourav Gupta and Kiran Raja and Xi Wu and Xiaojie Li and Jingfu Yang and Hongyan Jing and Xin Wang and Bin Kong and Youbing Yin and Qi Song and Siwei Lyu and Shu Hu and Leon Premk and Matej Vitek and Vitomir Štruc and Peter Peer and Jalil Nourmohammadi Khiarak and Farhang Jaryani and Samaneh Salehi Nasab and Seyed Naeim Moafinejad and Yasin Amini and Morteza Noshad},

url = {https://ieeexplore.ieee.org/iel7/9484326/9484328/09484336.pdf?casa_token=FOKx4ltO-hYAAAAA:dCkNHfumDzPGkAipRdbppNWpzAiUYUrJL6OrAjNmimTxUA0Vmx311-3-J3ej7YQc_zONxEO-XKo},

doi = {10.1109/IJCB52358.2021.9484336},

year  = {2021},

date = {2021-08-01},

booktitle = {Proceedings of the IEEE International Joint Conference on Biometrics (IJCB 2021)},

abstract = {For iris recognition in non-cooperative environments, iris segmentation has been regarded as the first most important challenge still open to the biometric community, affecting all downstream tasks from normalization to recognition. In recent years, deep learning technologies have gained significant popularity among various computer vision tasks and also been introduced in iris biometrics, especially iris segmentation. To investigate recent developments and attract more interest of researchers in the iris segmentation method, we organized the 2021 NIR Iris Challenge Evaluation in Non-cooperative Environments: Segmentation and Localization (NIR-ISL 2021) at the 2021 International Joint Conference on Biometrics (IJCB 2021). The challenge was used as a public platform to assess the performance of iris segmentation and localization methods on Asian and African NIR iris images captured in non-cooperative environments. The three best-performing entries achieved solid and satisfactory iris segmentation and localization results in most cases, and their code and models have been made publicly available for reproducibility research.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Philipp Terhorst; Kevin Riehl; Naser Damer; Peter Rot; Blaz Bortolato; Florian Kirchbuchner; Vitomir Struc; Arjan Kuijper

PE-MIU: a training-free privacy-enhancing face recognition approach based on minimum information units Journal Article

In: IEEE Access, vol. 2020, 2020.

Abstract | Links | BibTeX

@article{PEMIU_Access2020,

title = {PE-MIU: a training-free privacy-enhancing face recognition approach based on minimum information units},

author = {Philipp Terhorst and Kevin Riehl and Naser Damer and Peter Rot and Blaz Bortolato and Florian Kirchbuchner and Vitomir Struc and Arjan Kuijper},

url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9094207},

year = {2020},

date = {2020-06-02},

journal = {IEEE Access},

volume = {2020},

abstract = {Research on soft-biometrics showed that privacy-sensitive information can be deduced from

biometric data. Utilizing biometric templates only, information about a persons gender, age, ethnicity,

sexual orientation, and health state can be deduced. For many applications, these templates are expected

to be used for recognition purposes only. Thus, extracting this information raises major privacy issues.

Previous work proposed two kinds of learning-based solutions for this problem. The first ones provide

strong privacy-enhancements, but limited to pre-defined attributes. The second ones achieve more comprehensive but weaker privacy-improvements. In this work, we propose a Privacy-Enhancing face recognition

approach based on Minimum Information Units (PE-MIU). PE-MIU, as we demonstrate in this work, is a

privacy-enhancement approach for face recognition templates that achieves strong privacy-improvements

and is not limited to pre-defined attributes. We exploit the structural differences between face recognition

and facial attribute estimation by creating templates in a mixed representation of minimal information

units. These representations contain pattern of privacy-sensitive attributes in a highly randomized form.

Therefore, the estimation of these attributes becomes hard for function creep attacks. During verification,

these units of a probe template are assigned to the units of a reference template by solving an optimal

best-matching problem. This allows our approach to maintain a high recognition ability. The experiments

are conducted on three publicly available datasets and with five state-of-the-art approaches. Moreover,

we conduct the experiments simulating an attacker that knows and adapts to the systems privacy mechanism.

The experiments demonstrate that PE-MIU is able to suppress privacy-sensitive information to a significantly

higher degree than previous work in all investigated scenarios. At the same time, our solution is able to

achieve a verification performance close to that of the unmodified recognition system. Unlike previous

works, our approach offers a strong and comprehensive privacy-enhancement without the need of training},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Research on soft-biometrics showed that privacy-sensitive information can be deduced from
biometric data. Utilizing biometric templates only, information about a persons gender, age, ethnicity,
sexual orientation, and health state can be deduced. For many applications, these templates are expected
to be used for recognition purposes only. Thus, extracting this information raises major privacy issues.
Previous work proposed two kinds of learning-based solutions for this problem. The first ones provide
strong privacy-enhancements, but limited to pre-defined attributes. The second ones achieve more comprehensive but weaker privacy-improvements. In this work, we propose a Privacy-Enhancing face recognition
approach based on Minimum Information Units (PE-MIU). PE-MIU, as we demonstrate in this work, is a
privacy-enhancement approach for face recognition templates that achieves strong privacy-improvements
and is not limited to pre-defined attributes. We exploit the structural differences between face recognition
and facial attribute estimation by creating templates in a mixed representation of minimal information
units. These representations contain pattern of privacy-sensitive attributes in a highly randomized form.
Therefore, the estimation of these attributes becomes hard for function creep attacks. During verification,
these units of a probe template are assigned to the units of a reference template by solving an optimal
best-matching problem. This allows our approach to maintain a high recognition ability. The experiments
are conducted on three publicly available datasets and with five state-of-the-art approaches. Moreover,
we conduct the experiments simulating an attacker that knows and adapts to the systems privacy mechanism.
The experiments demonstrate that PE-MIU is able to suppress privacy-sensitive information to a significantly
higher degree than previous work in all investigated scenarios. At the same time, our solution is able to
achieve a verification performance close to that of the unmodified recognition system. Unlike previous
works, our approach offers a strong and comprehensive privacy-enhancement without the need of training

Close

Klemen Grm; Walter J. Scheirer; Vitomir Štruc

Face hallucination using cascaded super-resolution and identity priors Journal Article

In: IEEE Transactions on Image Processing, 2020.

Abstract | Links | BibTeX

Matej Vitek; Peter Rot; Vitomir Struc; Peter Peer

A comprehensive investigation into sclera biometrics: a novel dataset and performance study Journal Article

In: Neural Computing and Applications, pp. 1-15, 2020.

Abstract | Links | BibTeX

@article{vitek2020comprehensive,

title = {A comprehensive investigation into sclera biometrics: a novel dataset and performance study},

author = {Matej Vitek and Peter Rot and Vitomir Struc and Peter Peer},

url = {https://link.springer.com/epdf/10.1007/s00521-020-04782-1},

doi = {https://doi.org/10.1007/s00521-020-04782-1},

year  = {2020},

date = {2020-01-01},

journal = {Neural Computing and Applications},

pages = {1-15},

abstract = {The area of ocular biometrics is among the most popular branches of biometric recognition technology. This area has long been dominated by iris recognition research, while other ocular modalities such as the periocular region or the vasculature of the sclera have received significantly less attention in the literature. Consequently, ocular modalities beyond the iris are not well studied and their characteristics are today still not as well understood. While recent needs for more secure authentication schemes have considerably increased the interest in competing ocular modalities, progress in these areas is still held back by the lack of publicly available datasets that would allow for more targeted research into specific ocular characteristics next to the iris. In this paper, we aim to bridge this gap for the case of sclera biometrics and introduce a novel dataset designed for research into ocular biometrics and most importantly for research into the vasculature of the sclera. Our dataset, called Sclera Blood Vessels, Periocular and Iris (SBVPI), is, to the best of our knowledge, the first publicly available dataset designed specifically with research in sclera biometrics in mind. The dataset contains high-quality RGB ocular images, captured in the visible spectrum, belonging to 55 subjects. Unlike competing datasets, it comes with manual markups of various eye regions, such as the iris, pupil, canthus or eyelashes and a detailed pixel-wise annotation of the complete sclera vasculature for a subset of the images. Additionally, the datasets ship with gender and age labels. The unique characteristics of the dataset allow us to study aspects of sclera biometrics technology that have not been studied before in the literature (e.g. vasculature segmentation techniques) as well as issues that are of key importance for practical recognition systems. Thus, next to the SBVPI dataset we also present in this paper a comprehensive investigation into sclera biometrics and the main covariates that affect the performance of sclera segmentation and recognition techniques, such as gender, age, gaze direction or image resolution. Our experiments not only demonstrate the usefulness of the newly introduced dataset, but also contribute to a better understanding of sclera biometrics in general.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

The area of ocular biometrics is among the most popular branches of biometric recognition technology. This area has long been dominated by iris recognition research, while other ocular modalities such as the periocular region or the vasculature of the sclera have received significantly less attention in the literature. Consequently, ocular modalities beyond the iris are not well studied and their characteristics are today still not as well understood. While recent needs for more secure authentication schemes have considerably increased the interest in competing ocular modalities, progress in these areas is still held back by the lack of publicly available datasets that would allow for more targeted research into specific ocular characteristics next to the iris. In this paper, we aim to bridge this gap for the case of sclera biometrics and introduce a novel dataset designed for research into ocular biometrics and most importantly for research into the vasculature of the sclera. Our dataset, called Sclera Blood Vessels, Periocular and Iris (SBVPI), is, to the best of our knowledge, the first publicly available dataset designed specifically with research in sclera biometrics in mind. The dataset contains high-quality RGB ocular images, captured in the visible spectrum, belonging to 55 subjects. Unlike competing datasets, it comes with manual markups of various eye regions, such as the iris, pupil, canthus or eyelashes and a detailed pixel-wise annotation of the complete sclera vasculature for a subset of the images. Additionally, the datasets ship with gender and age labels. The unique characteristics of the dataset allow us to study aspects of sclera biometrics technology that have not been studied before in the literature (e.g. vasculature segmentation techniques) as well as issues that are of key importance for practical recognition systems. Thus, next to the SBVPI dataset we also present in this paper a comprehensive investigation into sclera biometrics and the main covariates that affect the performance of sclera segmentation and recognition techniques, such as gender, age, gaze direction or image resolution. Our experiments not only demonstrate the usefulness of the newly introduced dataset, but also contribute to a better understanding of sclera biometrics in general.

Close

Dejan Stepec; Ziga Emersic; Peter Peer; Vitomir Struc

Constellation-Based Deep Ear Recognition Book Section

In: Jiang, R.; Li, CT.; Crookes, D.; Meng, W.; Rosenberger, C. (Ed.): Deep Biometrics: Unsupervised and Semi-Supervised Learning, Springer, 2020, ISBN: 978-3-030-32582-4.

Abstract | Links | BibTeX

Blaž Bortolato; Marija Ivanovska; Peter Rot; Janez Križaj; Philipp Terhorst; Naser Damer; Peter Peer; Vitomir Štruc

Learning privacy-enhancing face representations through feature disentanglement Proceedings Article

In: Proceedings of FG 2020, IEEE, 2020.

Abstract | Links | BibTeX

@inproceedings{BortolatoFG2020,

title = {Learning privacy-enhancing face representations through feature disentanglement},

author = {Blaž Bortolato and Marija Ivanovska and Peter Rot and Janez Križaj and Philipp Terhorst and Naser Damer and Peter Peer and Vitomir Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2020/07/FG2020___Learning_privacy_enhancing_face_representations_through_feature_disentanglement-1.pdf

},

year  = {2020},

date = {2020-11-04},

booktitle = {Proceedings of FG 2020},

publisher = {IEEE},

abstract = {Convolutional Neural Networks (CNNs) are today the de-facto standard for extracting compact and discriminative face representations (templates) from images in automatic face recognition systems. Due to the characteristics of CNN models, the generated representations typically encode a multitude of information ranging from identity to soft-biometric attributes, such as age, gender or ethnicity. However, since these representations were computed for the purpose of identity recognition only, the soft-biometric information contained in the templates represents a serious privacy risk. To mitigate this problem, we present in this paper a privacy-enhancing approach capable of suppressing potentially sensitive soft-biometric information in face representations without significantly compromising identity information. Specifically, we introduce a Privacy-Enhancing Face-Representation learning Network (PFRNet) that disentangles identity from attribute information in face representations and consequently allows to efficiently suppress soft-biometrics in face templates. We demonstrate the feasibility of PFRNet on the problem of gender suppression and show through rigorous experiments on the CelebA, Labeled Faces in the Wild (LFW) and Adience datasets that the proposed disentanglement-based approach is highly effective and improves significantly on the existing state-of-the-art.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

M. Vitek; A. Das; Y. Pourcenoux; A. Missler; C. Paumier; S. Das; I. De Ghosh; D. R. Lucio; L. A. Zanlorensi Jr.; D. Menotti; F. Boutros; N. Damer; J. H. Grebe; A. Kuijper; J. Hu; Y. He; C. Wang; H. Liu; Y. Wang; Z. Sun; D. Osorio-Roig; C. Rathgeb; C. Busch; J. Tapia; A.~Valenzuela; G. Zampoukis; L. Tsochatzidis; I. Pratikakis; S. Nathan; R. Suganya; V. Mehta; A. Dhall; K. Raja; G. Gupta; J. N. Khiarak; M. Akbari-Shahper; F. Jaryani; M. Asgari-Chenaghlu; R. Vyas; S. Dakshit; S. Dakshit; P. Peer; U. Pal; V. Štruc

SSBC 2020: Sclera Segmentation Benchmarking Competition in the Mobile Environment Proceedings Article

In: International Joint Conference on Biometrics (IJCB 2020), pp. 1–10, 2020.

Abstract | Links | BibTeX

@inproceedings{SSBC2020,

title = {SSBC 2020: Sclera Segmentation Benchmarking Competition in the Mobile Environment},

author = {M. Vitek and A. Das and Y. Pourcenoux and A. Missler and C. Paumier and S. Das and I. De Ghosh and D. R. Lucio and L. A. Zanlorensi Jr. and D. Menotti and F. Boutros and N. Damer and J. H. Grebe and A. Kuijper and J. Hu and Y. He and C. Wang and H. Liu and Y. Wang and Z. Sun and D. Osorio-Roig and C. Rathgeb and C. Busch and J. Tapia and A.~Valenzuela and G. Zampoukis and L. Tsochatzidis and I. Pratikakis and S. Nathan and R. Suganya and V. Mehta and A. Dhall and K. Raja and G. Gupta and J. N. Khiarak and M. Akbari-Shahper and F. Jaryani and M. Asgari-Chenaghlu and R. Vyas and S. Dakshit and S. Dakshit and P. Peer and U. Pal and V. Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2020/11/IJCB_SSBC_2020.pdf},

year  = {2020},

date = {2020-09-28},

booktitle = {International Joint Conference on Biometrics (IJCB 2020)},

pages = {1--10},

abstract = {The paper presents a summary of the 2020 Sclera Segmentation Benchmarking Competition (SSBC), the 7th in the series of group benchmarking efforts centred around the problem of sclera segmentation. Different from previous editions, the goal of SSBC 2020 was to evaluate the performance of sclera-segmentation models on images captured with mobile devices. The competition was used as a platform to assess the sensitivity of existing models to i) differences in mobile devices used for image capture and ii) changes in the ambient acquisition conditions. 26 research groups registered for SSBC 2020, out of which 13 took part in the final round and submitted a total of 16 segmentation models for scoring. These included a wide variety of deep-learning solutions as well as one approach based on standard image processing techniques. Experiments were conducted with three recent datasets. Most of the segmentation models achieved relatively consistent  performance across images captured with different mobile devices (with slight differences across devices), but struggled most with  low-quality images captured in challenging ambient conditions, i.e., in an indoor environment and with poor lighting. },

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Philipp Terhörst, Marco Huber, Naser Damer, Peter Rot, Florian Kirchbuchner, Vitomir Struc, Arjan Kuijper

Privacy Evaluation Protocols for the Evaluation of Soft-Biometric Privacy-Enhancing Technologies Proceedings Article

In: Proceedings of the International Conference of the Biometrics Special Interest Group (BIOSIG) 2020, pp. 1-5, IEEE, 2020, ISSN: 1617-5468.

Abstract | Links | BibTeX

Andraž Puc; Vitomir Štruc; Klemen Grm

Analysis of Race and Gender Bias in Deep Age Estimation Model Proceedings Article

In: Proceedings of EUSIPCO 2020, 2020.

Abstract | Links | BibTeX

Jaka Šircelj; Tim Oblak; Klemen Grm; Uroš Petković; Aleš Jaklič; Peter Peer; Vitomir Štruc; Franc Solina

Segmentation and Recovery of Superquadric Models using Convolutional Neural Networks Proceedings Article

In: 25th Computer Vision Winter Workshop (CVWW 2020), 2020.

Abstract | Links | BibTeX

Janez Krizaj; Peter Peer; Vitomir Struc; Simon Dobrisek

Simultaneous multi-decent regression and feature learning for landmarking in depth image Journal Article

In: Neural Computing and Applications, 2019, ISBN: 0941-0643.

Abstract | Links | BibTeX

@article{Krizaj3Docalization,

title = {Simultaneous multi-decent regression and feature learning for landmarking in depth image},

author = {Janez Krizaj and Peter Peer and Vitomir Struc and Simon Dobrisek},

url = {https://link.springer.com/content/pdf/10.1007%2Fs00521-019-04529-7.pdf},

doi = {https://doi.org/10.1007/s00521-019-04529-7},

isbn = {0941-0643},

year  = {2019},

date = {2019-10-01},

journal = {Neural Computing and Applications},

abstract = {Face alignment (or facial landmarking) is an important task in many face-related applications, ranging from registration, tracking, and animation to higher-level classification problems such as face, expression, or attribute recognition. While several solutions have been presented in the literature for this task so far, reliably locating salient facial features across a wide range of posses still remains challenging. To address this issue, we propose in this paper a novel method for automatic facial landmark localization in 3D face data designed specifically to address appearance variability caused by significant pose variations. Our method builds on recent cascaded regression-based methods to facial landmarking and uses a gating mechanism to incorporate multiple linear cascaded regression models each trained for a limited range of poses into a single powerful landmarking model capable of processing arbitrary-posed input data. We develop two distinct approaches around the proposed gating mechanism: (1) the first uses a gated multiple ridge descent mechanism in conjunction with established (hand-crafted) histogram of gradients features for face alignment and achieves state-of-the-art landmarking performance across a wide range of facial poses and (2) the second simultaneously learns multiple-descent directions as well as binary features that are optimal for the alignment tasks and in addition to competitive landmarking results also ensures extremely rapid processing. We evaluate both approaches in rigorous experiments on several popular datasets of 3D face images, i.e., the FRGCv2 and Bosphorus 3D face datasets and image collections F and G from the University of Notre Dame. The results of our evaluation show that both approaches compare favorably to the state-of-the-art, while exhibiting considerable robustness to pose variations.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Jure Kovač; Vitomir Štruc; Peter Peer

Frame-based classification for cross-speed gait recognition Journal Article

In: Multimedia Tools and Applications, vol. 78, no. 5, pp. 5621–5643, 2019, ISSN: 1573-7721.

Abstract | Links | BibTeX

Janez Križaj; Janez Perš; Simon Dobrišek; Vitomir Štruc

Sistem nadgrajene resničnosti za verifikacijo predmetov v skladiščnih okoljih Journal Article

In: Elektrotehniski Vestnik, vol. 86, no. 1/2, pp. 1–6, 2019.

Abstract | Links | BibTeX

Peter Rot; Matej Vitek; Klemen Grm; Žiga Emeršič; Peter Peer and Vitomir Štruc

Deep Sclera Segmentation and Recognition Book Section

In: Uhl, Andreas; Busch, Christoph; Marcel, Sebastien; Veldhuis, Rainer (Ed.): Handbook of Vascular Biometrics, pp. 395-432, Springer, 2019, ISBN: 978-3-030-27731-4.

Abstract | Links | BibTeX

@incollection{ScleraNetChapter,

title = {Deep Sclera Segmentation and Recognition},

author = {Peter Rot and Matej Vitek and Klemen Grm and Žiga Emeršič and Peter Peer

and Vitomir Štruc},

editor = {Andreas Uhl and Christoph Busch and Sebastien Marcel and Rainer Veldhuis},

url = {https://link.springer.com/content/pdf/10.1007%2F978-3-030-27731-4_13.pdf},

doi = {https://doi.org/10.1007/978-3-030-27731-4_13},

isbn = {978-3-030-27731-4},

year  = {2019},

date = {2019-11-14},

booktitle = {Handbook of Vascular Biometrics},

pages = {395-432},

publisher = {Springer},

chapter = {13},

series = {Advances in Computer Vision and Pattern Recognition},

abstract = {In this chapter, we address the problem of biometric identity recognition from the vasculature of the human sclera. Specifically, we focus on the challenging task of multi-view sclera recognition, where the visible part of the sclera vasculature changes from image to image due to varying gaze (or view) directions. We propose a complete solution for this task built around Convolutional Neural Networks (CNNs) and make several contributions that result in state-of-the-art recognition performance, i.e.: (i) we develop a cascaded CNN assembly that is able to robustly segment the sclera vasculature from the input images regardless of gaze direction, and (ii) we present ScleraNET, a CNN model trained in a multi-task manner (combining losses pertaining to identity and view-direction recognition) that allows for the extraction of discriminative vasculature descriptors that can be used for identity inference. To evaluate the proposed contributions, we also introduce a new dataset of ocular images, called the Sclera Blood Vessels, Periocular and Iris (SBVPI) dataset, which represents one of the few publicly available datasets suitable for research in multi-view sclera segmentation and recognition. The datasets come with a rich set of annotations, such as a per-pixel markup of various eye parts (including the sclera vasculature), identity, gaze-direction and gender labels. We conduct rigorous experiments on SBVPI with competing techniques from the literature and show that the combination of the proposed segmentation and descriptor-computation models results in highly competitive recognition performance.},

keywords = {},

pubstate = {published},

tppubtype = {incollection}

}

Close

Emersic Ziga; Krizaj Janez; Struc Vitomir; Peer Peter

Deep ear recognition pipeline Book Section

In: Mahmoud, Hassaballah; M., Hosny Khalid (Ed.): Recent advances in computer vision : theories and applications, vol. 804, Springer, 2019, ISBN: 1860-9503.

Abstract | Links | BibTeX

Tim Oblak; Klemen Grm; Aleš Jaklič; Peter Peer; Vitomir Štruc; Franc Solina

Recovery of Superquadrics from Range Images using Deep Learning: A Preliminary Study Proceedings Article

In: 2019 IEEE International Work Conference on Bioinspired Intelligence (IWOBI), pp. 45-52, IEEE, 2019.

Abstract | Links | BibTeX

@inproceedings{oblak2019recovery,

title = {Recovery of Superquadrics from Range Images using Deep Learning: A Preliminary Study},

author = {Tim Oblak and Klemen Grm and Aleš Jaklič and Peter Peer and Vitomir Štruc and Franc Solina},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2019/08/Superkvadriki_draft.pdf},

year  = {2019},

date = {2019-06-01},

booktitle = {2019 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)},

journal = {arXiv preprint arXiv:1904.06585},

pages = {45-52},

publisher = {IEEE},

abstract = {It has been a longstanding goal in computer vision to describe the 3D physical space in terms of parameterized volumetric models that would allow autonomous machines to understand and interact with their surroundings. Such models are typically motivated by human visual perception and aim to represents all elements of the physical word ranging from individual objects to complex scenes using a small set of parameters. One of the de facto standards to approach this problem are superquadrics - volumetric models that define various 3D shape primitives and can be fitted to actual 3D data (either in the form of point clouds or range images). However, existing solutions to superquadric recovery involve costly iterative fitting procedures, which limit the applicability of such techniques in practice. To alleviate this problem, we explore in this paper the possibility to recover superquadrics from range images without time consuming iterative parameter estimation techniques by using contemporary deep-learning models, more specifically, convolutional neural networks (CNNs). We pose the superquadric recovery problem as a regression task and develop a CNN regressor that is able to estimate the parameters of a superquadric model from a given range image. We train the regressor on a large set of synthetic range images, each containing a single (unrotated) superquadric shape and evaluate the learned model in comparative experiments with the current state-of-the-art. Additionally, we also present a qualitative analysis involving a dataset of real-world objects. The results of our experiments show that the proposed regressor not only outperforms the existing state-of-the-art, but also ensures a 270x faster  execution time.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Žiga Emeršič; A. Kumar S. V.; B. S. Harish; W. Gutfeter; J. N. Khiarak; A. Pacut; E. Hansley; M. Pamplona Segundo; S. Sarkar; H. Park; G. Pyo Nam; I. J. Kim; S.G. Sangodkar; U. Kacar; M. Kirci; L. Yuan; J. Yuan; H. Zhao; F. Lu; J. Mao; X. Zhang; D. Yaman; F. I. Eyiokur; K. B. Ozler; H. K. Ekenel; D. Paul Chowdhury; S. Bakshi; P. K. Sa; B. Majhni; P. Peer; V. Štruc

The Unconstrained Ear Recognition Challenge 2019 Proceedings Article

In: International Conference on Biometrics (ICB 2019), 2019.

Abstract | Links | BibTeX

@inproceedings{emervsivc2019unconstrained,

title = {The Unconstrained Ear Recognition Challenge 2019},

author = {Žiga Emeršič and A. Kumar S. V. and B. S. Harish and W. Gutfeter and J. N. Khiarak and A. Pacut and E. Hansley and M. Pamplona Segundo and S. Sarkar and H. Park and G. Pyo Nam and I. J. Kim and S.G. Sangodkar and U. Kacar and M. Kirci and L. Yuan and J. Yuan and H. Zhao and F. Lu and J. Mao and X. Zhang and D. Yaman and F. I. Eyiokur and K. B. Ozler and H. K. Ekenel and D. Paul Chowdhury and S. Bakshi and P. K. Sa and B. Majhni and P. Peer and V. Štruc},

url = {https://arxiv.org/pdf/1903.04143.pdf},

year  = {2019},

date = {2019-06-01},

booktitle = {International Conference on Biometrics (ICB 2019)},

journal = {arXiv preprint arXiv:1903.04143},

abstract = {This paper presents a summary of the 2019 Unconstrained Ear Recognition Challenge (UERC), the second in a series of group benchmarking efforts centered around the problem of person recognition from ear images captured in uncontrolled settings. The goal of the challenge is to assess the performance of existing ear recognition techniques on a challenging large-scale ear dataset and to analyze performance of the technology from various viewpoints, such as generalization abilities to unseen data characteristics, sensitivity to rotations, occlusions and image resolution and performance bias on sub-groups of subjects, selected based on demographic criteria, i.e. gender and ethnicity. Research groups from 12 institutions entered the competition and submitted a total of 13 recognition approaches ranging from descriptor-based methods to deep-learning models. The majority of submissions focused on ensemble based methods combining either representations from multiple deep models or hand-crafted with learned image descriptors. Our analysis shows that methods incorporating deep learning models clearly outperform techniques relying solely on hand-crafted descriptors, even though both groups of techniques exhibit similar behaviour when it comes to robustness to various covariates, such presence of occlusions, changes in (head) pose, or variability in image resolution. The results of the challenge also show that there has been considerable progress since the first UERC in 2017, but that there is still ample room for further research in this area.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Klemen Grm; Martin Pernus; Leo Cluzel; Walter J. Scheirer; Simon Dobrisek; Vitomir Struc

Face Hallucination Revisited: An Exploratory Study on Dataset Bias Proceedings Article

In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019.

Abstract | Links | BibTeX

Juš Lozej; Dejan Štepec; Vitomir Štruc; Peter Peer

Influence of segmentation on deep iris recognition performance Proceedings Article

In: 7th IAPR/IEEE International Workshop on Biometrics and Forensics (IWBF 2019), 2019.

Abstract | Links | BibTeX

Klemen Grm; Vitomir Štruc

Deep face recognition for surveillance applications Journal Article

In: IEEE Intelligent Systems, vol. 33, no. 3, pp. 46–50, 2018.

Abstract | Links | BibTeX

Žiga Emeršič; Blaž Meden; Peter Peer; Vitomir Štruc

Evaluation and analysis of ear recognition models: performance, complexity and resource requirements Journal Article

In: Neural Computing and Applications, pp. 1–16, 2018, ISBN: 0941-0643.

Abstract | Links | BibTeX

@article{emervsivc2018evaluation,

title = {Evaluation and analysis of ear recognition models: performance, complexity and resource requirements},

author = {Žiga Emeršič and Blaž Meden and Peter Peer and Vitomir Štruc},

url = {https://rdcu.be/Os7a},

doi = {https://doi.org/10.1007/s00521-018-3530-1},

isbn = {0941-0643},

year  = {2018},

date = {2018-05-01},

journal = {Neural Computing and Applications},

pages = {1--16},

publisher = {Springer},

abstract = {Ear recognition technology has long been dominated by (local) descriptor-based techniques due to their formidable recognition performance and robustness to various sources of image variability. While deep-learning-based techniques have started to appear in this field only recently, they have already shown potential for further boosting the performance of ear recognition technology and dethroning descriptor-based methods as the current state of the art. However, while recognition performance is often the key factor when selecting recognition models for biometric technology, it is equally important that the behavior of the models is understood and their sensitivity to different covariates is known and well explored. Other factors, such as the train- and test-time complexity or resource requirements, are also paramount and need to be consider when designing recognition systems. To explore these issues, we present in this paper a comprehensive analysis of several descriptor- and deep-learning-based techniques for ear recognition. Our goal is to discover weak points of contemporary techniques, study the characteristics of the existing technology and identify open problems worth exploring in the future. We conduct our analysis through identification experiments on the challenging Annotated Web Ears (AWE) dataset and report our findings. The results of our analysis show that the presence of accessories and high degrees of head movement significantly impacts the identification performance of all types of recognition models, whereas mild degrees of the listed factors and other covariates such as gender and ethnicity impact the identification performance only to a limited extent. From a test-time-complexity point of view, the results suggest that lightweight deep models can be equally fast as descriptor-based methods given appropriate computing hardware, but require significantly more resources during training, where descriptor-based methods have a clear advantage. As an additional contribution, we also introduce a novel dataset of ear images, called AWE Extended (AWEx), which we collected from the web for the training of the deep models used in our experiments. AWEx contains 4104 images of 346 subjects and represents one of the largest and most challenging (publicly available) datasets of unconstrained ear images at the disposal of the research community.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Ear recognition technology has long been dominated by (local) descriptor-based techniques due to their formidable recognition performance and robustness to various sources of image variability. While deep-learning-based techniques have started to appear in this field only recently, they have already shown potential for further boosting the performance of ear recognition technology and dethroning descriptor-based methods as the current state of the art. However, while recognition performance is often the key factor when selecting recognition models for biometric technology, it is equally important that the behavior of the models is understood and their sensitivity to different covariates is known and well explored. Other factors, such as the train- and test-time complexity or resource requirements, are also paramount and need to be consider when designing recognition systems. To explore these issues, we present in this paper a comprehensive analysis of several descriptor- and deep-learning-based techniques for ear recognition. Our goal is to discover weak points of contemporary techniques, study the characteristics of the existing technology and identify open problems worth exploring in the future. We conduct our analysis through identification experiments on the challenging Annotated Web Ears (AWE) dataset and report our findings. The results of our analysis show that the presence of accessories and high degrees of head movement significantly impacts the identification performance of all types of recognition models, whereas mild degrees of the listed factors and other covariates such as gender and ethnicity impact the identification performance only to a limited extent. From a test-time-complexity point of view, the results suggest that lightweight deep models can be equally fast as descriptor-based methods given appropriate computing hardware, but require significantly more resources during training, where descriptor-based methods have a clear advantage. As an additional contribution, we also introduce a novel dataset of ear images, called AWE Extended (AWEx), which we collected from the web for the training of the deep models used in our experiments. AWEx contains 4104 images of 346 subjects and represents one of the largest and most challenging (publicly available) datasets of unconstrained ear images at the disposal of the research community.

Close

Žiga Emeršič; Luka Gabriel; Vitomir Štruc; Peter Peer

Convolutional encoder--decoder networks for pixel-wise ear detection and segmentation Journal Article

In: IET Biometrics, vol. 7, no. 3, pp. 175–184, 2018.

Abstract | Links | BibTeX

Blaž Meden; Žiga Emeršič; Vitomir Štruc; Peter Peer

k-Same-Net: k-Anonymity with Generative Deep Neural Networks for Face Deidentification Journal Article

In: Entropy, vol. 20, no. 1, pp. 60, 2018.

Abstract | Links | BibTeX

@article{meden2018k,

title = {k-Same-Net: k-Anonymity with Generative Deep Neural Networks for Face Deidentification},

author = {Blaž Meden and Žiga Emeršič and Vitomir Štruc and Peter Peer},

url = {https://www.mdpi.com/1099-4300/20/1/60/pdf},

year  = {2018},

date = {2018-01-01},

journal = {Entropy},

volume = {20},

number = {1},

pages = {60},

publisher = {Multidisciplinary Digital Publishing Institute},

abstract = {Image and video data are today being shared between government entities and other relevant stakeholders on a regular basis and require careful handling of the personal information contained therein. A popular approach to ensure privacy protection in such data is the use of deidentification techniques, which aim at concealing the identity of individuals in the imagery while still preserving certain aspects of the data after deidentification. In this work, we propose a novel approach towards face deidentification, called k-Same-Net, which combines recent Generative Neural Networks (GNNs) with the well-known k-Anonymitymechanism and provides formal guarantees regarding privacy protection on a closed set of identities. Our GNN is able to generate synthetic surrogate face images for deidentification by seamlessly combining features of identities used to train the GNN model. Furthermore, it allows us to control the image-generation process with a small set of appearance-related parameters that can be used to alter specific aspects (e.g., facial expressions, age, gender) of the synthesized surrogate images. We demonstrate the feasibility of k-Same-Net in comprehensive experiments on the XM2VTS and CK+ datasets. We evaluate the efficacy of the proposed approach through reidentification experiments with recent recognition models and compare our results with competing deidentification techniques from the literature. We also present facial expression recognition experiments to demonstrate the utility-preservation capabilities of k-Same-Net. Our experimental results suggest that k-Same-Net is a viable option for facial deidentification that exhibits several desirable characteristics when compared to existing solutions in this area.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Robert Šket; Tadej Debevec; Susanne Kublik; Michael Schloter; Anne Schoeller; Boštjan Murovec; Katarina Vogel Mikuš; Damjan Makuc; Klemen Pečnik; Janez Plavec; Igor B Mekjavić; Ola Eiken; Zala Prevoršek; Blaž Stres

Intestinal Metagenomes and Metabolomes in Healthy Young Males: Inactivity and Hypoxia Generated Negative Physiological Symptoms Precede Microbial Dysbiosis Journal Article

In: Frontiers in Physiology, vol. 9, pp. 198, 2018, ISSN: 1664-042X.

Abstract | Links | BibTeX

@article{10.3389/fphys.2018.00198,

title = {Intestinal Metagenomes and Metabolomes in Healthy Young Males: Inactivity and Hypoxia Generated Negative Physiological Symptoms Precede Microbial Dysbiosis},

author = {Robert Šket and Tadej Debevec and Susanne Kublik and Michael Schloter and Anne Schoeller and Boštjan Murovec and Katarina Vogel Mikuš and Damjan Makuc and Klemen Pečnik and Janez Plavec and Igor B Mekjavić and Ola Eiken and Zala Prevoršek and Blaž Stres},

url = {https://www.frontiersin.org/article/10.3389/fphys.2018.00198},

doi = {10.3389/fphys.2018.00198},

issn = {1664-042X},

year  = {2018},

date = {2018-01-01},

urldate = {2018-01-01},

journal = {Frontiers in Physiology},

volume = {9},

pages = {198},

abstract = {We explored the metagenomic, metabolomic and trace metal makeup of intestinal microbiota and environment in healthy male participants during the run-in (5 day) and the following three 21-day interventions: normoxic bedrest (NBR), hypoxic bedrest (HBR) and hypoxic ambulation (HAmb) which were carried out within a controlled laboratory environment (circadian rhythm, fluid and dietary intakes, microbial bioburden, oxygen level, exercise). The fraction of inspired O2 (FiO2) and partial pressure of inspired O2 (PiO2) were 0.209 and 133.1 ± 0.3 mmHg for the NBR and 0.141 ± 0.004 and 90.0 ± 0.4 mmHg (~4000 m simulated altitude) for HBR and HAmb interventions, respectively. Shotgun metagenomes were analyzed at various taxonomic and functional levels, 1H- and 13C -metabolomes were processed using standard quantitative and human expert approaches, whereas metals were assessed using X-ray fluorescence spectrometry. Inactivity and hypoxia resulted in a significant increase in the genus Bacteroides in HBR, in genes coding for proteins involved in iron acquisition and metabolism, cell wall, capsule, virulence, defense and mucin degradation, such as beta-galactosidase (EC3.2.1.23), α-L-fucosidase (EC3.2.1.51), Sialidase (EC3.2.1.18) and α-N-acetylglucosaminidase (EC3.2.1.50). In contrast, the microbial metabolomes, intestinal element and metal profiles, the diversity of bacterial, archaeal and fungal microbial communities were not significantly affected. The observed progressive decrease in defecation frequency and concomitant increase in the electrical conductivity (EC) preceded or took place in absence of significant changes at the taxonomic, functional gene, metabolome and intestinal metal profile levels. The fact that the genus Bacteroides and proteins involved in iron acquisition and metabolism, cell wall, capsule, virulence and mucin degradation were enriched at the end of HBR suggest that both constipation and EC decreased intestinal metal availability leading to modified expression of co-regulated genes in Bacteroides genomes. Bayesian network analysis was used to derive the first hierarchical model of initial inactivity mediated deconditioning steps over time. The PlanHab wash-out period corresponded to a profound life-style change (i.e. reintroduction of exercise) that resulted in stepwise amelioration of the negative physiological symptoms, indicating that exercise apparently prevented the crosstalk between the microbial physiology, mucin degradation and proinflammatory immune activities in the host.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

We explored the metagenomic, metabolomic and trace metal makeup of intestinal microbiota and environment in healthy male participants during the run-in (5 day) and the following three 21-day interventions: normoxic bedrest (NBR), hypoxic bedrest (HBR) and hypoxic ambulation (HAmb) which were carried out within a controlled laboratory environment (circadian rhythm, fluid and dietary intakes, microbial bioburden, oxygen level, exercise). The fraction of inspired O2 (FiO2) and partial pressure of inspired O2 (PiO2) were 0.209 and 133.1 ± 0.3 mmHg for the NBR and 0.141 ± 0.004 and 90.0 ± 0.4 mmHg (~4000 m simulated altitude) for HBR and HAmb interventions, respectively. Shotgun metagenomes were analyzed at various taxonomic and functional levels, 1H- and 13C -metabolomes were processed using standard quantitative and human expert approaches, whereas metals were assessed using X-ray fluorescence spectrometry. Inactivity and hypoxia resulted in a significant increase in the genus Bacteroides in HBR, in genes coding for proteins involved in iron acquisition and metabolism, cell wall, capsule, virulence, defense and mucin degradation, such as beta-galactosidase (EC3.2.1.23), α-L-fucosidase (EC3.2.1.51), Sialidase (EC3.2.1.18) and α-N-acetylglucosaminidase (EC3.2.1.50). In contrast, the microbial metabolomes, intestinal element and metal profiles, the diversity of bacterial, archaeal and fungal microbial communities were not significantly affected. The observed progressive decrease in defecation frequency and concomitant increase in the electrical conductivity (EC) preceded or took place in absence of significant changes at the taxonomic, functional gene, metabolome and intestinal metal profile levels. The fact that the genus Bacteroides and proteins involved in iron acquisition and metabolism, cell wall, capsule, virulence and mucin degradation were enriched at the end of HBR suggest that both constipation and EC decreased intestinal metal availability leading to modified expression of co-regulated genes in Bacteroides genomes. Bayesian network analysis was used to derive the first hierarchical model of initial inactivity mediated deconditioning steps over time. The PlanHab wash-out period corresponded to a profound life-style change (i.e. reintroduction of exercise) that resulted in stepwise amelioration of the negative physiological symptoms, indicating that exercise apparently prevented the crosstalk between the microbial physiology, mucin degradation and proinflammatory immune activities in the host.

Close

Boštjan Murovec; Damjan Makuc; Sabina Kolbl Repinc; Zala Prevoršek; Domen Zavec; Robert Šket; Klemen Pečnik; Janez Plavec; Blaž Stres

1H NMR metabolomics of microbial metabolites in the four MW agricultural biogas plant reactors: A case study of inhibition mirroring the acute rumen acidosis symptoms Journal Article

In: Journal of Environmental Management, vol. 222, pp. 428 - 435, 2018, ISSN: 0301-4797.

Abstract | Links | BibTeX

@article{MUROVEC2018428,

title = {1H NMR metabolomics of microbial metabolites in the four MW agricultural biogas plant reactors: A case study of inhibition mirroring the acute rumen acidosis symptoms},

author = {Boštjan Murovec and Damjan Makuc and Sabina Kolbl Repinc and Zala Prevoršek and Domen Zavec and Robert Šket and Klemen Pečnik and Janez Plavec and Blaž Stres},

url = {http://www.sciencedirect.com/science/article/pii/S0301479718305991},

doi = {https://doi.org/10.1016/j.jenvman.2018.05.068},

issn = {0301-4797},

year  = {2018},

date = {2018-01-01},

journal = {Journal of Environmental Management},

volume = {222},

pages = {428 - 435},

abstract = {In this study, nuclear magnetic resonance (1H NMR) spectroscopic profiling was used to provide a more comprehensive view of microbial metabolites associated with poor reactor performance in a full-scale 4 MW mesophilic agricultural biogas plant under fully operational and also under inhibited conditions. Multivariate analyses were used to assess the significance of differences between reactors whereas artificial neural networks (ANN) were used to identify the key metabolites responsible for inhibition and their network of interaction. Based on the results of nm-MDS ordination the subsamples of each reactor were similar, but not identical, despite homogenization of the full-scale reactors before sampling. Hence, a certain extent of variability due to the size of the system under analysis was transferred into metabolome analysis. Multivariate analysis showed that fully active reactors were clustered separately from those containing inhibited reactor metabolites and were significantly different. Furthermore, the three distinct inhibited states were significantly different from each other. The inhibited metabolomes were enriched in acetate, caprylate, trimethylamine, thymine, pyruvate, alanine, xanthine and succinate. The differences in the metabolic fingerprint between inactive and fully active reactors observed in this study resembled closely the metabolites differentiating the (sub) acute rumen acidosis inflicted and healthy rumen metabolomes, creating thus favorable conditions for the growth and activity of pathogenic bacteria. The consistency of our data with those reported before for rumen ecosystems shows that 1H NMR based metabolomics is a reliable approach for the evaluation of metabolic events at full-scale biogas reactors.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Close

Janez Križaj; Žiga Emeršič; Simon Dobrišek; Peter Peer; Vitomir Štruc

Localization of Facial Landmarks in Depth Images Using Gated Multiple Ridge Descent Proceedings Article

In: 2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI), pp. 1–8, IEEE 2018.

Abstract | Links | BibTeX

Matej Kristan; Ales Leonardis; Jiri Matas; Michael Felsberg; Roman Pflugfelder; Luka Cehovin Zajc; Tomas Vojir; Goutam Bhat; Alan Lukezic; Abdelrahman Eldesokey; Vitomir Štruc; Klemen Grm; others

The sixth visual object tracking VOT2018 challenge results Proceedings Article

In: European Conference on Computer Vision Workshops (ECCV-W 2018), 2018.

Abstract | Links | BibTeX

Peter Rot; Žiga Emeršič; Vitomir Struc; Peter Peer

Deep multi-class eye segmentation for ocular biometrics Proceedings Article

In: 2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI), pp. 1–8, IEEE 2018.

Abstract | Links | BibTeX

Juš Lozej; Blaž Meden; Vitomir Struc; Peter Peer

End-to-end iris segmentation using U-Net Proceedings Article

In: 2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI), pp. 1–6, IEEE 2018.

Abstract | Links | BibTeX

Blaz Meden; Peter Peer; Vitomir Struc

Selective Face Deidentification with End-to-End Perceptual Loss Learning Proceedings Article

In: 2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI), pp. 1–7, IEEE 2018.

Abstract | Links | BibTeX

Sandipan Banerjee; Joel Brogan; Janez Krizaj; Aparna Bharati; Brandon RichardWebster; Vitomir Struc; Patrick J. Flynn; Walter J. Scheirer

To frontalize or not to frontalize: Do we really need elaborate pre-processing to improve face recognition? Proceedings Article

In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 20–29, IEEE 2018.

Abstract | Links | BibTeX

Žiga Emeršič; Nil Oleart Playa; Vitomir Štruc; Peter Peer

Towards Accessories-Aware Ear Recognition Proceedings Article

In: 2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI), pp. 1–8, IEEE 2018.

Abstract | Links | BibTeX

@inproceedings{emervsivc2018towards,

title = {Towards Accessories-Aware Ear Recognition},

author = {Žiga Emeršič and Nil Oleart Playa and Vitomir Štruc and Peter Peer},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2019/08/iwobi-2018-inpaint-1.pdf},

doi = {10.1109/IWOBI.2018.8464138},

year  = {2018},

date = {2018-03-01},

booktitle = {2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)},

pages = {1--8},

organization = {IEEE},

abstract = {Automatic ear recognition is gaining popularity within the research community due to numerous desirable properties, such as high recognition performance, the possibility of capturing ear images at a distance and in a covert manner, etc. Despite this popularity and the corresponding research effort that is being directed towards ear recognition technology, open problems still remain. One of the most important issues stopping ear recognition systems from being widely available are ear occlusions and accessories. Ear accessories not only mask biometric features and by this reduce the overall recognition performance, but also introduce new non-biometric features that can be exploited for spoofing purposes. Ignoring ear accessories during recognition can, therefore, present a security threat to ear recognition and also adversely affect performance. Despite the importance of this topic there has been, to the best of our knowledge, no ear recognition studies that would address these problems. In this work we try to close this gap and study the impact of ear accessories on the recognition performance of several state-of-the-art ear recognition techniques. We consider ear accessories as a tool for spoofing attacks and show that CNN-based recognition approaches are more susceptible to spoofing attacks than traditional descriptor-based approaches. Furthermore, we demonstrate that using inpainting techniques or average coloring can mitigate the problems caused by ear accessories and slightly outperforms (standard) black color to mask ear accessories.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Rosaura G. Vidal; Sreya Banerjee; Klemen Grm; Vitomir Struc; Walter J. Scheirer

UG^ 2: A Video Benchmark for Assessing the Impact of Image Restoration and Enhancement on Automatic Visual Recognition Proceedings Article

In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1597–1606, IEEE 2018.

Abstract | Links | BibTeX

Abhijit Das; Umapada Pal; Miguel A. Ferrer; Michael Blumenstein; Dejan Štepec; Peter Rot; Žiga Emeršič; Peter Peer; Vitomir Štruc

SSBC 2018: Sclera Segmentation Benchmarking Competition Proceedings Article

In: 2018 International Conference on Biometrics (ICB), 2018.

Abstract | Links | BibTeX

@inproceedings{Dasicb2018,

title = {SSBC 2018: Sclera Segmentation Benchmarking Competition},

author = {Abhijit Das and Umapada Pal and Miguel A. Ferrer and Michael Blumenstein and Dejan Štepec and Peter Rot and Žiga Emeršič and Peter Peer and Vitomir Štruc},

url = {https://lmi.fe.uni-lj.si/wp-content/uploads/2019/08/icb2018_sserbc.pdf},

year  = {2018},

date = {2018-02-01},

booktitle = {2018 International Conference on Biometrics (ICB)},

abstract = {This paper summarises the results of the Sclera Segmentation Benchmarking Competition (SSBC 2018). It was organised in the context of the 11th IAPR International Conference on Biometrics (ICB 2018). The aim of this competition was to record the developments on sclera segmentation in the cross-sensor environment (sclera trait captured using multiple acquiring sensors). Additionally, the competition also aimed to gain the attention of researchers on this subject of research. For the purpose of benchmarking, we have developed two datasets of sclera images captured using different sensors. The first dataset was collected using a DSLR camera and the second one was collected using a mobile phone camera. The first dataset is the Multi-Angle Sclera Dataset (MASD version 1), which was used in the context of the previous versions of sclera segmentation competitions. The images in the second dataset were captured using .a mobile phone rear camera of 8-megapixel. As baseline manual segmentation mask of the sclera images from both the datasets were developed. Precision and recall-based statistical measures were employed to evaluate the effectiveness of the submitted segmentation technique and to rank them. Six algorithms were submitted towards the segmentation task. This paper analyses the results produced by these algorithms/system and defines a way forward for this subject of research. Both the datasets along with some of the accompanying ground truth/baseline mask will be freely available for research purposes upon request to authors by email.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Žiga Emeršič; Vitomir Štruc; Peter Peer

Ear recognition: More than a survey Journal Article

In: Neurocomputing, vol. 255, pp. 26–39, 2017.

Abstract | Links | BibTeX

Blaž Meden; Refik Can Malli; Sebastjan Fabijan; Hazim Kemal Ekenel; Vitomir Štruc; Peter Peer

Face deidentification with generative deep neural networks Journal Article

In: IET Signal Processing, vol. 11, no. 9, pp. 1046–1054, 2017.

Abstract | Links | BibTeX

Klemen Grm; Vitomir Štruc; Anais Artiges; Matthieu Caron; Hazim K. Ekenel

Strengths and weaknesses of deep learning models for face recognition against image degradations Journal Article

In: IET Biometrics, vol. 7, no. 1, pp. 81–89, 2017.

Abstract | Links | BibTeX

2024

Journal Articles

Proceedings Articles

2023

Journal Articles

Proceedings Articles

2022

Journal Articles

Book Sections

Proceedings Articles

2021

Journal Articles

Proceedings Articles

2020

Journal Articles

Book Sections

Proceedings Articles

2019

Journal Articles

Book Sections

Proceedings Articles

2018

Journal Articles

Proceedings Articles

2017

Journal Articles