Workshops – The 18th IEEE International Conference on Automatic Face and Gesture Recognition

Seven workshops will be organized at FG’24. More details about each workshop are found below:

Synthetic Data for Face and Gesture Analysis

Organizers: Deepak Kumar Jain (Dalian University of Technology), Pourya Shamsolmoali (East China Normal University), Fadi Boutros (Fraunhofer IGD), Naser Damer (Fraunhofer Institute for Computer Graphics Research IGD and TU Darmstadt ), Vitomir Struc (University of Ljubljana)

Abstract: Recent advancements in generative models within the realms of computer vision and artificial intelligence have revolutionized the way researchers approach data-driven tasks. The advent of sophisticated generative models, such as GANs (Generative Adversarial Networks), VAEs (Variational Autoencoders), or more recently, diffusion models, has empowered practitioners to create synthetic data that closely mirrors real-world scenarios. These models enable the generation of high-fidelity images and sequences, laying the foundation for groundbreaking applications in face and gesture analysis. The significance of these generative models lies in their ability to produce synthetic data that is remarkably realistic, thereby mitigating challenges associated with data scarcity and privacy concerns. As a result, the utilization of synthetic data has become increasingly prevalent in various research domains, offering a versatile and ethical alternative for training and testing machine learning algorithms. This workshop aims to delve into the diverse applications of synthetic data in the realm of face and gesture analysis. Participants will explore how synthetic datasets have been instrumental in training facial recognition systems, enhancing emotion detection models, and refining gesture recognition algorithms. The workshop will showcase exemplary use cases where the integration of synthetic data has not only overcome data limitations but has also fostered the development of more robust and accurate models.

Website: https://sites.google.com/view/sd-fga2024/

Program (May 27 Morning, Room 2):

9:00 – 9:05: Opening Session

9:05 – 10:00: Keynote talk: Prof. Rama Chellappa

10:00 – 10:45: Session 1 – Applications of Synthetic Data

A Study of Video-based Human Representation for American Sign Language Alphabet Generation; Fei Xu; Lipisha Chaudhary; Lu Dong; Srirangaraj Setlur; Venu Govindaraju; Ifeoma Nwogu

Training Against Disguises: Addressing and Mitigating Bias in Facial Emotion Recognition with Synthetic Data; AAdith Sukumar; Aditya Desai; Peeyush Singhal ; Sai Gokhale; Deepak Kumar Jain; Rahee Walambe; Ketan V Kotecha

DiCTI: Diffusion-based Clothing Designer via Text-guided Input; Ajda Lampe; Julija Stopar; Deepak Kumar Jain; Shinichiro Omachi; Peter Peer; Vitomir Štruc

10:45 – 11:00: Coffee Break

11:00 – 12:15: Session 2 – Generation and Detection of Synthetic Data

Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration; Praveen Kumar Chandaliya; Kiran Raja; Raghavendra Ramachandra; Zahid Akhtar; Christoph Busch

Massively Annotated Datasets for Assessment of Synthetic and Real Data in Face Recognition; Pedro C. Neto; Rafael M Mamede; Carolina Albuquerque; Tiago FS Gonçalves; Ana F. Sequeira

Analyzing the Feature Extractor Networks for Face Image Synthesis; Erdi Sarıtaş; Hazim Kemal Ekenel

INDIFACE: Illuminating India’s Deepfake Landscape with a Comprehensive Dataset; Kartik Kuckreja; Ximi Hoque; Nishit Nilesh Poddar; Shukesh G Reddy; Abhinav Dhall; Abhijit Das

Real, fake and synthetic faces – does the coin have three sides? Shahzeb Naeem; Ramzi Al-Sharawi; Muhammad Riyyan Khan; Usman Tariq*; Abhinav Dhall; Hasan Al-Nashash 12:15 – 12:20 Closing session

Advancements in Facial Expression Analysis and Synthesis: Past, Present, and Future

Organizers: Itir Onal Ertugrul (Utrecht University), Laszlo A Jeni (Carnegie Mellon University)

Abstract: This workshop aims to bring together computer scientists, psychologists and behavioral scientists who have been working on automated analysis and synthesis of facial expressions and their application in several domains including assessment of pain, mental health, personality, and emotion among others. With the invited talks by distinguished researchers in the field, we aim to shed light on the past, present, and future of face analysis and synthesis. The workshop will conclude with a dynamic panel discussion, featuring interdisciplinary researchers and their valuable insights into the multidimensional aspects of facial expression analysis and synthesis.

Website: https://sites.google.com/view/afeas-24/home

Program (May 27 Afternoon, Room 2):

14:00 – 14:10: Opening

14:10 – 14:30: Keynote talk by Takeo Kanade

14:30 – 14:50: Keynote talk by Lijun Yin

14:50 – 15:10: Keynote talk by Carlos Busso

15:10 – 15:30: Keynote talk by Michel Valstar

15:30 – 16:10: Coffee Break

16:10 – 16:30: Keynote talk by Hamdi Dibeklioğlu

16:30 – 16:50: Keynote talk by Yingli Tian

17:00 – 17:20: Keynote talk by Iain Matthews

17:20 – 17:40: Keynote talk by Fernando De la Torre

17:40 – 17:50: Closing

First International Workshop on Responsible Face Image Processing (ReFIP 2024)

Organizers: Andrea Atzori (University of Cagliari), Fadi Boutros (Fraunhofer IGD), Lucia Cascone (University of Salerno), Naser Damer (Fraunhofer Institute for Computer Graphics Research IGD and TU Darmstadt ), Mirko Marras (University of Cagliari), Ruben Tolosana (Universidad Autonoma de Madrid), Ruben Vera-Rodriguez (Universidad Autónoma de Madrid)

Abstract: The consideration of ethical dimensions beyond mere accuracy is increasingly important in both industrial and academic spheres, given the pervasive influence of facial image processing systems in our daily lives. Despite this attention, crucial aspects such as fairness, accountability, transparency, and privacy remain under-explored in the domain of facial image processing systems. To have a better understanding of these aspects, our workshop on responsible face image processing (ReFIP) aims to gather high-quality, impactful, and original research in this emerging field, providing a shared platform for researchers and practitioners. This workshop seeks to go beyond domain-generic studies in the literature, fostering a deeper understanding of the ethical aspects associated with facial image processing, generating vivid community exchanges.

Website: https://responsiblefaceimageprocessing.github.io/fg2024/

Program (May 31 Morning, Room 1):

9:00 – 9:10: Workshop Presentation and Introduction

9:10 – 10:00: Keynote Speech (Vitomir Štruc: Face Image Quality Assessment (FIQA): Recent Advancements and Future Challenges)

10:00 – 10:15: Emircan Gündoğdu, Altay Ünal, Gozde Unal: “A Study Regarding Machine Unlearning on Facial Attribute Data”

10:15 – 10:30: Christian Rathgeb, Mathias Ibsen, Denise Hartmann, Simon Hradetzky, Berglind Ólafsdóttir: “Testing the Performance of Face Recognition for People with Down Syndrome”

10:30 – 11:00: Coffee Break

11:00 – 11:20: Pablo Augusto Negri, Isabelle Hupont, Emilia Gomez: “A Framework for Assessing Proportionate Intervention with Face Recognition Systems in Real-Life Scenarios”

11:20 – 11:40: Alaa Elobaid, Nathan Ramoly, Lara Younes, Symeon Papadopoulos, Eirini Ntoutsi, Yiannis Kompatsiaris: “Sum of Group Error Differences: A Critical Examination of Bias Evaluation Metrics in Biometric Verification and a Novel Dual-Metric Measure”

11:40 – 12:00: Marco Huber, Anh Thi Luu, Naser Damer: “Recognition Performance Variation Across Demographic Groups through the Eyes of Explainable Face Recognition”

12:00 – 12:20: Georgia Baltsou, Ioannis Sarridis, Christos Koutlis, Symeon Papadopoulos: “SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes” 12:20 – 12:30: Final Remarks; End of the Workshop

2nd Workshop on Learning with Few or without Annotated Face, Body and Gesture Data (LFA-FG2024)

Organizers: Maxime Devanne (Université de Haute Alsace), Mohamed Daoudi (IMT Nord Europe/CRIStAL (UMR 9189)), Germain Forestier (University of Haute Alsace), Jonathan Weber (University of Haute Alsace), Stefano Berretti (University of Florence, Italy)

Abstract: Since more than a decade, Deep Learning has been successfully employed for vision-based face, body and gesture analysis, both for static and dynamic granularities. This is particularly due to the development of effective deep architectures and the release of quite consequent datasets.

However, one of the main limitations of Deep Learning is that it requires large scale annotated datasets to train efficient models. Gathering such face, body or gesture data and annotating them can be time consuming and laborious. This is particularly the case in areas where experts from the field are required, like in the medical domain. In such a case, using crowdsourcing may not be suitable.

In addition, currently available face and/or gesture datasets cover a limited set of categories. This makes the adaptation of trained models to novel categories not straightforward. Finally, while most of the available datasets focus on classification problems with discretized labels, continuous annotations are required in many scenarios. Hence, this significantly complicates the annotation process.

The goal of this 2nd edition of the workshop is to explore approaches to overcome such limitations by investigating ways to learn from few annotated data, to transfer knowledge from similar domains or problems, to generate new data or to benefit from the community to gather novel large scale annotated datasets.

Website: https://sites.google.com/view/lfa-fg2024/home

Program (May 31 Afternoon, Room 1):

Program (May 31 Afternoon, Room 1):

14:00 – 14:10
Opening session

14:10 – 14:30
Gait Recognition from Highly Compressed Videos
Andrei Niculae, Andy Catruna, Adrian Cosma, Daniel Rosner, Emilian Radoi

14:30 – 14:50
Aligning Actions and Walking to LLM-Generated Textual Descriptions
Radu Chivereanu, Adrian Cosma, Andy Catruna, Razvan Rughinis, Emilian Rado

14:50 – 15:10
Exploring Radar Capabilities to Support Gesture-Based Interaction in Smart Environments
Gonçalo Aguiar, Ana P. Rocha, Samuel Silva, António Teixeira

15:10 – 15:30
Interactive Visualization and Dexterity Analysis of Human Movement: AIMove Platform
Brenda Elizabeth Olivas Padilla, Sotiris Manitsaris, Alina Glushkova

15:30 – 16:00
Coffee break

16:00 – 16:20
ENTIRe-ID: An Extensive and Diverse Dataset for Person Re-Identification
Serdar Yıldız, Ahmet Nezih Kasim

16:20 – 16:40
IMEmo: An Interpersonal Relation Multi-Emotion Dataset
Hajer Guerdelli, Claudio Ferrari, Stefano Berretti, Alberto Del Bimbo

16:40 – 17:00
Self-supervised Variational Contrastive Learning with Applications to Face Understanding
Mehmet Can Yavuz, Berrin Yanikoglu

17:00
Closing session

The Second Workshop on Privacy-aware and Acceptable Video-based Assistive Technologies

Organizers: Sara Colantonio (Institute of Information Science and Technologies of the National Research Council of Italy), Francisco Flórez-Revuelta (University of Alicante), Martin Kampel (Vienna University of Technology, Computer Vision Lab)

Abstract: The quest for responsible research is a cornerstone of an ethical, legal and social-aware approach to the development of assistive technologies. As technology advances – driven by the huge and rapidly evolving innovations through modern information and communication technologies – it penetrates private domains and interacts with personal, private, and intimate activities. It is a necessary requirement that any technology development should be carefully designed and balanced within societal, cultural and individual values, and norms.

Assistive technologies based on computer vision, multimedia data processing and understanding, and machine intelligence present several advantages in terms of unobtrusiveness and information richness. Indeed, camera sensors are far less obtrusive with respect to the hindrance that other wearable sensors may cause to people’s activities. Currently, video-based applications are effective in recognising and monitoring face expressions, activities, movements, and overall conditions of the assisted individuals as well as to assess their vital parameters (e.g., heart rate, respiratory rate). However, cameras are often perceived as the most intrusive technologies from the viewpoint of the privacy of the monitored individuals. This is due to the richness of the information that this technology conveys and the intimate setting where it may be deployed in. Therefore, solutions able to ensure privacy preservation by context and design as well as to ensure high legal and ethical standards are in high demand.

This workshop aims to create a forum for contributions presenting and discussing image- and video-based applications for active assisted living as well as initiatives proposing ethical and privacy-aware solutions.

The workshop is supported by the visuAAL Marie Skłodowska-Curie Innovative Training Network and the GoodBrother COST Action, which aims to bridge the gap between users’ requirements and the safe and secure use of video-based AAL.

Website: https://goodbrother.eu/conferences/privaal2024/

Program (May 31 Morning, Room 2):

9:00-9:10
Opening Session

9:10-9:50
Keynote: Albert Ali Salah – Computational Approaches for Behavioral and Clinical Science

Session 1

9:50-10:10
Maksymilian Kuźmicz – What should we care about in AAL? Unveiling the main interests of the users in the legal context

10:10-10:30
Irene Ballester and Martin Kampel – Ethical Impact Identification of a Dementia Behaviour Monitoring System

10:30-11:00
Coffee break

11:00-11:40
Keynote: Krzysztof Krejtz – Attention Biases in Emotion Recognition – A Dynamical Approach (remote)

Session 2

11:40-12:00
Giulio Del Corso, Danila Germanese, Maria Antonietta Pascali, Serena Bardelli, Armando Cuttano, Fabrizia Festante, Andrea Guzzetta, Lucia Rocchitelli and Sara Colantonio – Facial landmark identification and data preparation can significantly improve the extraction of newborns’ facial features

12:00-12:20
Nursena Boluk and Hatice Kose – Evaluating Gaze Detection for Children with Autism Using the ChildPlay-R Dataset

12:20-12:40
Erhan Bicer and Hatice Kose – LITE-FER: A lightweight facial expression recognition framework for children in resource-limited devices

12:40-13:00
Erdi Sarıtaş and Hazım Kemal Ekenel – Analyzing the Effect of Combined Degradations on Face Recognition

13:00-13:20
Round table

13:20-13:30
Closing remarks

SkatingVerse: Segmentation and Assessment of Continuous Video in Figure Skating Competition and the 1st SkatingVerse Workshop & Challenge

Organizers: Jian Zhao (Institute of North Electronic Equipment), Lei Jin (Beijing University of Posts and Telecommunications), Zheng Zhu (Tsinghua University), Yinglei Teng (Beijing University of Posts and Telecommunications), Jiaojiao Zhao (University of Amsterdam), Sadaf Gulshad (University of Amsterdam), Zheng Wang (Wuhan University), Bo Zhao (Bank of Montreal), Xiangbo Shu (Nanjing University of Science and Technology), Xuecheng Nie (Meitu Inc.), Xiaojie Jin (Bytedance Inc. USA), Xiaodan Liang (Sun Yat-sen University), Yunchao Wei (UTS), Jianshu Li (Ant Group), Shin’ichi Satoh (National Institute of Informatics), Yandong Guo (AI^2 Robotics), Cewu Lu (Shanghai Jiao Tong University), Junliang Xing (Tsinghua University), Shen Jane (Pensees Technology)

Abstract: Human action understanding in computer vision focuses on locating, classifying, and assessing human actions in videos. However, the current tasks are inadequate for practical application such as fine-grained action segmentation and assessment. To address this, we construct a dataset comprising 1,687 continuous videos from figure skating competitions, encouraging the development of algorithms that can accurately analyze each action. We chose the figure skating task, because of its difficulty, presence of challenging actions, and availability of fine-grained labels. This workshop encourages participants to submit their contributions, surveys, and case studies that address human action perception and understanding problems.

Website: https://skatingverse.github.io/

Program (May 31 Afternoon, https://zoom.us/j/96879621760 projected in Room 2):

14:00 – 14:10 Opening session

14:15 – 14:30 1st place in Challenge Presentation: Beijing DeepGlint

14:35 – 14:45 2nd place in Challenge Presentation: China Mobile (Suzhou) Software Technology Co., Ltd.

14:50 – 14:55 3rd place in Challenge Presentation: Chengdu University of Technology

15:00 – 15:30 Keynote: Shanghang Zhang, Professor at Peking University, Embodied AI for Autonomous Driving

15:30 – 16:00 Coffee Break

16:00 – 16:20 Best Paper Presentation: DCAPose: Improve One-stage Multi-Person Pose Estimation with Dynamic Center Assignment, Wei Zhang, Huiru Xie , Qi Li, Zhenan Sun)

16:25 – 16:55 Keynote: Cewu Lu, Professor at Shanghai Jiaotong University, Action Recognition and Embodied AI.

Fourth Workshop on Applied Multimodal Affect Recognition (AMAR)

Organizers: Shaun Canavan (University of South Florida), Tempestt Neal (USF), Marvin Andujar (University of South Florida), Saurabh Hinduja (University of Pittsburgh), Lijun Yin (State University of New York at Binghamton)

Abstract: Novel applications of affective computing have emerged in recent years in domains ranging from health care to the 5th generation mobile network. Many of these have found improved emotion classification performance when fusing multiple sources of data (e.g., audio, video, brain, face, thermal, physiological, environmental, positional, text, etc.). Multimodal affect recognition has the potential to revolutionize the way various industries and sectors utilize information gained from recognition of a person’s emotional state, particularly considering the flexibility in the choice of modalities and measurement tools (e.g., surveillance versus mobile device cameras). Multimodal classification methods have been proven highly effective at minimizing misclassification error in practice and in dynamic conditions. Further, multi-modal classification models tend to be more stable over time compared to relying on a single modality, increasing their reliability in sensitive applications such as mental health monitoring and automobile driver state recognition. To continue the trend of lab to practice within the field and encourage new applications of affective computing, this workshop will provide a forum for researchers to exchange ideas on future directions, including novel fusion methods and databases, innovations through interdisciplinary research, and emerging emotion sensing devices. Also, this workshop will address the ethical use of novel applications of affective computing in real world scenarios. More specifically, it will discuss topics including, but not limited to, privacy, manipulation of users, and public fears and misconceptions regarding affective computing.

Website: https://cse.usf.edu/~tjneal/AMAR2024/

Program (May 31 Morning, Room 3):

09:15 – 9:30
Welcome and Opening Remarks from Organizers

9:30 – 10:30
Keynote

10:30 – 10:45
Workshop Paper Presentation

10:45 – 11:00
Coffee Break

11:00 – 11:15
Workshop Paper Presentation

11:15 – 12:15
Keynote

12:15 – 12:30
Closing Remarks