The 18th IEEE International Conference on Automatic Face and Gesture Recognition

Conference Program

Pre- and Post-Conference Workshops
Morning: from 9am to 1:30pm (end time fixed)
Afternoon: from 2pm to 6pm (start time fixed)
Monday, 27 May 2024
Room 1
Morning: 8:30 – 11:30 Doctoral Consortium (DC)
Noon: 11:30 – 12:30 IEEE Transactions Associate Editor Training for the Next Generation (AE)
Noon: 12:30 – 14:00 DC Lunch
Afternoon: 14:00 – 19:00 Advancements in Facial Expression Analysis and Synthesis: Past, Present, and Future (AFEAS)
Room 2
Morning workshop Synthetic Data for Face and Gesture Analysis (SDA-FGA)
Afternoon workshop FG 2024 Competitions
Room 3
Morning tutorial Bias Assessment, Explanation, and Mitigation in Deep Face Recognition (BIAS)
Afternoon tutorial Generation of Synthetic Data for Remote Verification System (SYNTH)
Friday, 31 May 2024
Room 1
Morning workshop Responsible Face Image Processing (REFIP)
Afternoon workshop Learning with Few or without Annotated Face, Body and Gesture Data (LFA)
Room 2
Morning workshop Privacy-aware and Acceptable Video-based Assistive Technologies (PrivAAL)
Afternoon workshop Segmentation and Assessment of Continuous Video in Figure Skating Workshop & Challenge (SkatingVerse)
Room 3
Morning workshop Applied Multimodal Affect Recognition (AMAR)
Tuesday, 28 May 2024
8:00 – 8:45 Registration
8:45 – 9:00 Opening session
Chair: Vito Struc
9:00 – 10:00 Keynote 1
  Chair: Xilin Chen
Speaker: Prof. Shiguang Shan
Title: Gaze analysis and applications
10:00 – 10:30 Coffee break
10:30 – 11:30 Ask Me Anything Session
  Chair: Laszlo Jeni
Speaker: Prof. Takeo Kanade
11:30 – 12:30 Poster Spotlights
  Posters from Poster Session 1
12:30 – 14:00 Lunch Break
14:00 – 15:00 Oral Session 1 – Face biometrics
Chair: Nuno Goncalves
  Designing Cross-Race Tests for Forensic Facial Examiners, Super-recognizers, and Face Recognition Algorithm Géraldine Jeckeln (The University of Texas at Dallas)*; Selin Yavuzcan (The University of Texas at Dallas); Kate A. Marquis (The University of Texas at Dallas); Prajay S. Mehta (The University of Texas at Dallas); Amy N. Yates (National Institute of Standards and Technology); P Jonathon Phillips (NIST); Alice O’Toole (University of Texas at Dallas)
  TetraLoss: Improving the Robustness of Face Recognition against Morphing Attacks Mathias Ibsen (Hochschule Darmstadt)*; Lazaro Janier Gonzalez-Soler (Hochschule Darmstadt); Christian Rathgeb (Hochschule Darmstadt); Christoph Busch (Hochschule Darmstadt)
  Hierarchical Generative Network for Face Morphing Attacks Zuyuan He (Sichuan University); Zongyong Deng (Sichuan University); Qiaoyun He (Sichuan University); Qijun Zhao (Sichuan University)*
  Face Anti-spoofing via Interaction Learning with Face Image Quality Alignment Yongluo Liu (Beijing University of Technology); Zun Li (Beijing University of Technology); Shuyi Li (Beijing University of Technology); Zhuming Wang (Beijing University of Technology); Lifang Wu (Beijing University of Technology)*
15:00 – 15:15 Break
15:15 – 16:15 Oral Session 2 – Facial Expressions
Chair: Lijun Yin
  Multi-Scale Spatio-Temporal Graph Convolutional Network for Facial Expression Spotting Yicheng Deng (Osaka University)*; Hideaki Hayashi (Osaka University); Hajime Nagahara (Osaka University)
  epsilon-Mesh Attack: A Surface-based Adversarial Point Cloud Attack for Facial Expression Recognition Batuhan Cengiz (Istanbul Technical University)*; Mert Gülşen (Istanbul Technical University); Yusuf Hüseyin Şahin (İTÜ); Gozde Unal (Istanbul Technical University)
  Distilling Privileged Multimodal Information for Expression Recognition using Optimal Transport Muhammad Haseeb Aslam (ETS)*; Muhammad Osama Zeeshan (École de technologie supérieure); Soufiane Belharbi (ÉTS Montreal); Marco Pedersoli (École de technologie supérieure); Alessandro Lameiras Koerich (École de technologie supérieure); Simon Bacon (Concordia University); Eric Granger (ETS Montreal)
  CSTalk: Correlation Supervised Speech-driven 3D Emotional Facial Animation Generation Xiangyu Liang (Southeast University); Wenlin Zhuang (Southeast University); Tianyong Wang (Southeast University); Guangxing Geng (Nanjing 8:8 Digital Technology Co., Ltd); Guangyue Geng (Nanjing 8:8 Digital Technology Co., Ltd); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)*
16:15 – 18:00 Poster session 1 + Coffee break
Posters from Oral sessions 1 and 2
1 Designing Cross-Race Tests for Forensic Facial Examiners, Super-recognizers, and Face Recognition Algorithm Géraldine Jeckeln (The University of Texas at Dallas)*; Selin Yavuzcan (The University of Texas at Dallas); Kate A. Marquis (The University of Texas at Dallas); Prajay S. Mehta (The University of Texas at Dallas); Amy N. Yates (National Institute of Standards and Technology); P Jonathon Phillips (NIST); Alice O’Toole (University of Texas at Dallas)
2 TetraLoss: Improving the Robustness of Face Recognition against Morphing Attacks Mathias Ibsen (Hochschule Darmstadt)*; Lazaro Janier Gonzalez-Soler (Hochschule Darmstadt); Christian Rathgeb (Hochschule Darmstadt); Christoph Busch (Hochschule Darmstadt)
3 Hierarchical Generative Network for Face Morphing Attacks Zuyuan He (Sichuan University); Zongyong Deng (Sichuan University); Qiaoyun He (Sichuan University); Qijun Zhao (Sichuan University)*
4 Face Anti-spoofing via Interaction Learning with Face Image Quality Alignment Yongluo Liu (Beijing University of Technology); Zun Li (Beijing University of Technology); Shuyi Li (Beijing University of Technology); Zhuming Wang (Beijing University of Technology); Lifang Wu (Beijing University of Technology)*
5 Multi-Scale Spatio-Temporal Graph Convolutional Network for Facial Expression Spotting Yicheng Deng (Osaka University)*; Hideaki Hayashi (Osaka University); Hajime Nagahara (Osaka University)
6 epsilon-Mesh Attack: A Surface-based Adversarial Point Cloud Attack for Facial Expression Recognition Batuhan Cengiz (Istanbul Technical University)*; Mert Gülşen (Istanbul Technical University); Yusuf Hüseyin Şahin (İTÜ); Gozde Unal (Istanbul Technical University)
7 Distilling Privileged Multimodal Information for Expression Recognition using Optimal Transport Muhammad Haseeb Aslam (ETS)*; Muhammad Osama Zeeshan (École de technologie supérieure); Soufiane Belharbi (ÉTS Montreal); Marco Pedersoli (École de technologie supérieure); Alessandro Lameiras Koerich (École de technologie supérieure); Simon Bacon (Concordia University); Eric Granger (ETS Montreal)
8 CSTalk: Correlation Supervised Speech-driven 3D Emotional Facial Animation Generation Xiangyu Liang (Southeast University); Wenlin Zhuang (Southeast University); Tianyong Wang (Southeast University); Guangxing Geng (Nanjing 8:8 Digital Technology Co., Ltd); Guangyue Geng (Nanjing 8:8 Digital Technology Co., Ltd); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)*
Posters Only
9 Efficient Verification-Based Face Identification Barak Battash (Intel)*; Amit Rozner (Bar-Ilan University); Ofir Lindenbaum (Yale); Lior Wolf (Tel Aviv University, Israel)
10 Dataset Infant Anonymization with Pose and Emotion Retention Mason Lary (SUNY Buffalo)*; Matthew M Klawonn (US Air Force Research Laboratory); Daniel Messinger (University of Miami); Ifeoma Nwogu (University at Buffalo, SUNY)
11 Face the Needle: Predicting risk of fear and fainting during blood donation through video analysis Judita Rudokaite (Tilburg University)*; Itir Onal Ertugrul (Utrecht University); Sharon Ong (Tilburg University); Mart Janssen (Sanquin); Elisabeth Huis in ‘t Veld (Tilburg University)
12 Intra-Person Camera Adversarial for Intra-Camera Supervised Person Re-identification Ruochen tang (Southwest Jiaotong University)*; Xun Gong (Southwest Jiaotong University)
13 Adaptive Cross-architecture Mutual Knowledge Distillation Jianyuan Ni (Texas State University)*; Hao Tang (ETH Zurich & CMU); Yuzhang Shang (Illinois Institute of Technology); Bin Duan (Illinois Institute of Technology); Yan Yan (Illinois Institute of Technology)
14 ASPECD: Adaptable Soft-Biometric Privacy-Enhancement Using Centroid Decoding for Face Verification Peter Rot (Univerza v Ljubljani, Fakulteta za Elektrotehniko)*; Philipp Terhörst (Paderborn University); Peter Peer (University of Ljubljana); Vitomir Struc (University of Ljubljana)
15 Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces Recognition Iurii Medvedev (University of Coimbra)*; Farhad Shadmand (University of Coimbra); Nuno Gonçalves (University of Coimbra)
16 Deepfake: Classifiers, Fairness, and Demographically Robust Algorithm Akshay Agarwal (IISER Bhopal)*; Nalini Ratha (SUNY Buffalo)
17 PointFaceFormer: local and global attention based transformer for 3D point cloud face recognition Ziqi Gao (Shenzhen University); Qiufu Li (Shenzhen University); Gui Wang (WKU); Linlin Shen (Shenzhen University)*
18 Subject-Based Domain Adaptation for Facial Expression Recognition Muhammad Osama Zeeshan (École de technologie supérieure)*; Muhammad Haseeb Aslam (ETS); Soufiane Belharbi (ÉTS Montreal); Alessandro Lameiras Koerich (École de technologie supérieure); Marco Pedersoli (École de technologie supérieure); Simon Bacon (Concordia University); Eric Granger (ETS Montreal)
19 Efficient Detection of Disguised Faces using Photos/Sketches from Low-Quality Surveillance Footage Nikhil Reddy Pottanigari (University of Montreal (MILA))*; Rithin Pullela (Texas A&M University); Abdul kalam azad Shaik (University of Florida); Rithik Reddy Katpally (San Jose State University)
20 Lip and speech synchronization using supervised contrastive learning and cross-modal attention Munender Varshney (Hitachi Research and Development Center)*; Mayurakshi Mukherji (Hitachi India Pvt. Ltd.); Senthil raja G (Hitachi India Pvt. Ltd); Ananth Ganesh (Hitachi India); Kingshuk Banerjee (Hitachi India Pvt. Ltd)
21 If It’s Not Enough, Make It So: Reducing Authentic Data Demand in Face Recognition through Synthetic Faces Andrea Atzori (University of Cagliari)*; Fadi Boutros (Fraunhofer IGD); Naser Damer (Fraunhofer Institute for Computer Graphics Research IGD and TU Darmstadt); Gianni Fenu (University of Cagliari); Mirko Marras (University of Cagliari)
22 Data Augmentation Techniques for Enhanced Facial Landmarks Detection in Patients with Repaired Cleft Lip and Palate Karen Rosero (University of Texas at Dallas)*; Ali N Salman (University of Texas at Dallas); Berrak Sisman (University of Texas at Dallas); Rami Hallac (University of Texas Southwestern Medical Center, Children’s Medical Center); Carlos Busso (University of Texas at Dallas)
23 Deep adaptative spectral zoom for improved remote heart rate estimation Joaquim Comas Martínez (Universitat Pompeu Fabra)*; Adria Ruiz (Pompeu Fabra University); Federico Sukno (Pompeu Fabra University)
24 Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis Guanyu Hu (Xi’an Jiaotong University); Eleni Papadopoulou (NTUA); Dimitrios Kollias (Queen Mary University London)*; Paraskevi Tzouveli (NTUA); JIE WEI (Xi’an Jiaotong University); Xinyu Yang (Xi’an Jiaotong University)
25 ONOT: a High-Quality ICAO-compliant Synthetic Mugshot Dataset Nicolò Di Domenico (University of Bologna); Guido Borghi (University of Bologna)*; Annalisa Franco (University of Bologna); Davide Maltoni (University of Bologna)
26 RFIS-FPI: Reversible Face Image Steganography Neural Network for Face Privacy Interactions Yubo Huang (Southwest Jiaotong University)*; Anran Zhu (Southwest Jiaotong University); Cheng Zeng (Southwest Jiaotong University); Cong Hu (Southwest Jiaotong University); Xin Lai (Southwest Jiaotong University); Wenhao Feng (Southwest Jiaotong University); Fan Chen (Southwest Jiaotong University)
27 Unlocking the Black Box: Concept-Based Modeling for Interpretable Affective Computing Applications Xinyu Li (University of Glasgow)*; Marwa Mahmoud (University of Glasgow)
28 Social-MAE: A Transformer-Based Multimodal Autoencoder for Face and Voice Hugo Bohy (University of Mons)*; Kevin El Haddad (University of Mons/The Big Projects); Minh Tran (ICT, USC); Thierry Dutoit (University of Mons); Mohammad Soleymani (University of Southern California)
29 Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues Soufiane Belharbi (ÉTS Montreal)*; Marco Pedersoli (École de technologie supérieure); Alessandro Lameiras Koerich (École de technologie supérieure); Simon Bacon (Concordia University); Eric Granger (ETS Montreal)
30 AerialFace: A Light Weight Framework for Unmanned Aerial Vehicle Face Recognition zhiquan ou (Hohai University); Liang Yao (Hohai University)*; Ting Wu (Hohai University); Fan Liu (Hohai University)
31 QGFace: Quality-Guided Joint Training for Mixed Quality Face Recognition Youzhe Song (East China Normal University)*; Feng Wang (East China Normal University)
32 EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition Niki M Foteinopoulou (SnT, University of Luxembourg)*; Ioannis Patras (Queen Mary University of London)
33 In-Domain Inversion for Improved 3D Face Alignment on Asymmetrical Expressions Jilliam M. Diaz Barros (German Research Center for Artificial Intelligence)*; Jason Rambach (DFKI); Pramod Murthy (DFKI); Didier Stricker (DFKI)
34 3D Face Modeling via Weakly-supervised Disentanglement Network joint Identity-consistency Prior Guohao Li (BUAA)*; Hongyu Yang (Beihang University); Di Huang (Beihang University, China); Yunhong Wang (State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China)
35 Expression-aware Masking and Progressive Decoupling for Cross-database Facial Expression Recognition Tao Zhong (Shenzhen University); Xiaole Xian (Shenzhen University); Zihan Wang (Shenzhen University); Weicheng Xie (Shenzhen University)*; Linlin Shen (Shenzhen University)
36 Explainable Face Verification via Feature-Guided Gradient Backpropagation Yuhang Lu (EPFL)*; Zewei Xu (EPFL); Touradj Ebrahimi (EPFL)
Demo presentation
Russian sign language learning simulator
Maxim Novopoltsev (SberAI), Aleksandr Tulenkov (SberAI), Roman Akhidov (SberAI), Ruslan Murtazin (SberAI), Dmitriy Milevich (SberAI), Iuliia Zemtsova (SberAI)
18:00 Welcome Reception
Wednesday, 29 May 2024
8:00 – 9:00 Registration
9:00 – 10:00 Keynote 2
  Chair: Lale Akarun
Speaker: Prof. Beatrice de Gelder
Title: Linking body movement analysis and brain activity
10:00 – 10:30 Coffee break
10:30 – 11:30 Oral Session 3 – Human pose and motion
Chair: Martin Kampel
  Uncalibrated Multi-view 3D Human Pose Estimation with Geometry Driven Attention Victor Galizzi (CEA)*; Bertrand Luvison (CEA LIST)
  Geometry-Biased Transformer for Robust Multi-View 3D Human Pose Reconstruction Olivier Moliner (Lund University)*; Sangxia Huang (Sony Research); Kalle Åström (Lund University)
  One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features Trung Thanh NGUYEN (Nagoya University)*; Yasutomo Kawanishi (RIKEN); Takahiro Komamizu (Nagoya University); Ichiro Ide (Nagoya University)
  CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras James Y Tang (University of British Columbia, Department of Computer Science)*; Shashwat Suri (University of British Columbia); Daniel Abidemi Ajisafe (The University of British Columbia); Bastian Wandt (Linköping University); Helge Rhodin (UBC)
11:30 – 12:30 Poster Spotlights
  Posters from Poster Session 2
12:30 – 14:00 Lunch Break
14:00 – 15:00 Oral Session 4 – Gait and Action
Chair: Yan Yan
  Unveiling Gender Effects in Gait Recognition using Conditional-Matched Bootstrap Analysis Azim Ibragimov (University of Florida)*; Mauricio Pamplona Segundo (University of South Florida); Sudeep Sarkar (University of South Florida, Tampa); Kevin W Bowyer (University of Notre Dame)
  GaitPT: Skeletons Are All You Need For Gait Recognition Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest)
  Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Action Recognition Wele Gedara Chaminda Bandara (Apple Inc)*; Vishal Patel (Johns Hopkins University)
  ViewDiffGait: View Pyramid Diffusion for Gait Recognition Rijun Liao (University of Missouri-Kansas City)*; Zhu Li (University of Missouri-Kansas City); Shuvra Bhattacharyya (University of Maryland); George York (US Air Force Academy)
15:00 – 15:15 Break
15:15 – 16:15 Oral Session 5 – Hand and Sign Language
Chair: Hazem Wannous
  Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks Maksym Ivashechkin (University of Surrey)*; Richard Bowden (University of Surrey); Oscar Mendez (University of Surrey)
  SynthSL: Expressive Humans for Sign Language Image Synthesis Jilliam M. Diaz Barros (German Research Center for Artificial Intelligence)*; Chen-Yu Wang (DFKI); Jameel Malik (DFKI); Abdalla Arafa (DFKI); Didier Stricker (DFKI)
  A Gloss-free Sign Language Production with Discrete Representation Eui Jun Hwang (KAIST)*; Huije Lee (Korea Advanced Institute of Science and Technology); Jong C. Park (KAIST)
  In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition Wiktor Mucha (Vienna University of Technology, Computer Vision Lab)*; Martin Kampel (Vienna University of Technology, Computer Vision Lab)
16:15 – 18:00 Poster session 2 + Coffee break
Posters from Oral sessions 3, 4 and 5
1 Uncalibrated Multi-view 3D Human Pose Estimation with Geometry Driven Attention Victor Galizzi (CEA)*; Bertrand Luvison (CEA LIST)
2 Geometry-Biased Transformer for Robust Multi-View 3D Human Pose Reconstruction Olivier Moliner (Lund University)*; Sangxia Huang (Sony Research); Kalle Åström (Lund University)
3 One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features Trung Thanh NGUYEN (Nagoya University)*; Yasutomo Kawanishi (RIKEN); Takahiro Komamizu (Nagoya University); Ichiro Ide (Nagoya University)
4 CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras James Y Tang (University of British Columbia, Department of Computer Science)*; Shashwat Suri (University of British Columbia); Daniel Abidemi Ajisafe (The University of British Columbia); Bastian Wandt (Linköping University); Helge Rhodin (UBC)
5 Unveiling Gender Effects in Gait Recognition using Conditional-Matched Bootstrap Analysis Azim Ibragimov (University of Florida)*; Mauricio Pamplona Segundo (University of South Florida); Sudeep Sarkar (University of South Florida, Tampa); Kevin W Bowyer (University of Notre Dame)
6 GaitPT: Skeletons Are All You Need For Gait Recognition Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest)
7 Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Action Recognition Wele Gedara Chaminda Bandara (Apple Inc)*; Vishal Patel (Johns Hopkins University)
8 ViewDiffGait: View Pyramid Diffusion for Gait Recognition Rijun Liao (University of Missouri-Kansas City)*; Zhu Li (University of Missouri-Kansas City); Shuvra Bhattacharyya (University of Maryland); George York (US Air Force Academy)
9 Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks Maksym Ivashechkin (University of Surrey)*; Richard Bowden (University of Surrey); Oscar Mendez (University of Surrey)
10 SynthSL: Expressive Humans for Sign Language Image Synthesis Jilliam M. Diaz Barros (German Research Center for Artificial Intelligence)*; Chen-Yu Wang (DFKI); Jameel Malik (DFKI); Abdalla Arafa (DFKI); Didier Stricker (DFKI)
11 A Gloss-free Sign Language Production with Discrete Representation Eui Jun Hwang (KAIST)*; Huije Lee (Korea Advanced Institute of Science and Technology); Jong C. Park (KAIST)
12 In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition Wiktor Mucha (Vienna University of Technology, Computer Vision Lab)*; Martin Kampel (Vienna University of Technology, Computer Vision Lab)
Posters Only
13 BEAVP: A Bidirectional Enhanced Adversarial Model for Video Prediction Peiyuan Zhu (Tongji University); Fengxia Han (Tongji University); Shengjie Zhao (Tongji University); Hao Deng (Tongji University)*
14 Skeleton-based Self-Supervised Feature Extraction for Improved Dynamic Hand Gesture Recognition Omar Ikne (IMT Nord Europe)*; Benjamin Allaert (IMT Nord Europe); Hazem Wannous (IMT Nord Europe, CRIStAL UMR 9189)
15 Human Action Recognition with Multi-Level Granularity and Pair-wise Hyper GCN Tamam Alsarhan (Khalifa University)*; Tamam Alsarhan (The University of Jordan); Ayoub Alsarhan (Hashemite University); Syed Sadaf Ali (Khalifa University); Iyyakutti Iyappan Ganapathi (Khalifa University); Naoufel Werghi (Khalifa University of Science and Technology)
16 MGRFormer: A Multimodal Transformer Approach for Surgical Gesture Recognition Kevin Feghoul (University of Lille)*; Deise S Maia (Université de Lille); Mehdi Elamrani (CHU Lille); Mohamed Daoudi (IMT Nord Europe); Ali Amad (University of Lille)
17 CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition Pierre Vuillecard (Idiap)*; Arya Farkhondeh (Idiap Research Institute, EPFL); Michael Villamizar (Idiap Research Institute); Jean-Marc ODOBEZ (IDIAP/EPFL, SWITZERLAND)
18 GestSpoof: Gesture Based Spatio-Temporal Representation Learning For Robust Fingerprint Presentation Attack Detection Bhavin Jawade (University at Buffalo)*; Shreeram Gudemaranahalli Subramanya (University at Buffalo); Atharv Dabhade (University at Buffalo, SUNY); Srirangaraj Setlur (University at Buffalo, SUNY); Venu Govindaraju (University at Buffalo, SUNY)
19 Spatio Temporal Sparse Graph Convolution Network for Hand Gesture Recognition Omar Ikne (IMT Nord Europe)*; Rim Slama (CESI LINEACT); Hichem Saoudi (IMT Nord Europe); Hazem Wannous (IMT Nord Europe, CRIStAL UMR 9189)
20 Crowd Detection via Point Localization with Diffusion Models Don Yasiru L Ranasinghe (Johns Hopkins University)*; Vishal Patel (Johns Hopkins University)
21 MIMIC-Pose: Implicit Membership Discrimination of Body Joints for Human Pose Estimation Ying Huang (Hangzhou Normal University)*; Shanfeng Hu (Northumbria University)
22 DPA-2D: Depth Propagation and Alignment with 2D Observations Guidance for Human Mesh Recovery Weihao You (Tomorrow Advancing Life)*; Pengcheng Wang (Tomorrow Advancing Life); Jinfeng Bai (Tomorrow Advance Life); zhilong ji (Tomorrow Advancing Life)
23 Evaluating Recent 2D Human Pose Estimators for 2D-3D Pose Lifting Soroush Mehraban (University of Toronto)*; Yiqian Qin (University of Toronto); Babak Taati (University Health Network)
24 The Paradox of Motion: Evidence for Spurious Correlations in Skeleton-based Gait Recognition Models Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest)
25 Improving 2D Human Pose Estimation in Unseen Camera Views with Synthetic Data Miroslav Purkrabek (Czech Technical University, Prague)*; Jiri Matas (Czech Technical University, Prague)
26 DualH: A Dual Hierarchical Model for Temporal Action Localization Zejian Zhang (Universitat de Barcelona)*; Cristina Palmero (Universitat de Barcelona); Sergio Escalera (Universitat de Barcelona)
27 HR-xNet: A Novel High-Resolution Network for Human Pose Estimation with Low Resource Consumption cun feng (Ningbo University); Rong Zhang (Ningbo University)*; Lijun Guo (Ningbo University)
28 Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition zhendong liu (Southeast University); Haifeng Xia (Southeast University); Tong Guo (Southeast University); Libo Sun (Southeast University); Ming Shao (University of Massachusetts Dartmouth); Siyu Xia (Southeast University, China)*
29 HM-Auth: Redefining User Authentication in Immersive Virtual World through Hand Movement Signatures Sindhu Reddy Kalathur Gopal (University of Wyoming)*; Paul S Gyreyiri (University of Wyoming); Diksha Shukla (University of Wyoming)
30 A Data-Driven Representation for Sign Language Production Harry Walsh (University of Surrey)*; Abolfazl Zargari Khuzani (Intel); Mariam Rahmani (Intel Corporation); Richard Bowden (University of Surrey)
31 Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder Mohamed I Lakhal (University of Surrey)*; Richard Bowden (University of Surrey)
32 Resource-Efficient Gesture Recognition using Low-Resolution Thermal Camera via Spiking Neural Networks and Sparse Segmentation Ali Safa (KU Leuven – IMEC)*; Wout Mommen (VUB – IMEC); Piet Wambacq (IMEC -VUB); Lars Keuninckx (imec)
33 The Seven Faces of Stress: Understanding Facial Activity Patterns during Cognitive Stress Carla Viegas (Carnegie Mellon University)*; Roy A Maxion (Carnegie Mellon University, USA); Alexander Hauptmann (Carnegie Mellon University); Joao Magalhaes (Universidade NOVA Lisboa)
34 Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets Ahmet Alp Kindiroglu (Huawei)*; Ozgur Kara (Georgia Institute of Technology); Oğulcan Özdemir (Bogazici University); Lale Akarun (Bogazici University)
35 Patch-based Privacy Attention for Weakly-supervised Privacy-Preserving Action Recognition Xiao Li (Sun Yat-sen University); Yukun Qiu (Sun Yat-sen University); Yi-Xing Peng (Sun Yat-sen University, China); WEI-SHI ZHENG (Sun Yat-sen University, China)*
36 Boosting Gesture Recognition with an Automatic Gesture Annotation Framework Junxiao Shen (University of Cambridge)*; Xuhai Xu (Meta Reality Lab Research); Ran Tan (Meta Reality Labs Research); Amy Karlson (Meta Reality Labs Research); Evan Strasnick (Meta Reality Labs Research)
37 Towards Better Communication: Refining Hand Pose Estimation in Low-Resolution Sign Language Videos Sümeyye M Taşyürek (Hacettepe University)*; Tuğçe Kızıltepe (Hacettepe University); Hacer Yalim Keles (Hacettepe University)
38 Quantifying Biometric Characteristics of Hand Gestures through Feature Space Probing and Identity-Level Cross-Gesture Disentanglement Aman Verma (Indian Institute of Technology Delhi); Gaurav Jaswal (Indian Institute of Technology Delhi); Seshan Srirangarajan (Indian Institute of Technology Delhi)*; Sumantra Dutta Roy (Indian Institute of Technology Delhi)
39 Hand Graph Topology Selection for Skeleton-based Sign Language Recognition Oğulcan Özdemir (Bogazici University)*; Inci M. Baytas (Bogazici University); Lale Akarun (Bogazici University)
40 Unconstrained Hand Recognition using Thermal Infrared Sensing of Dorsal Veins Wallace Lawson (Naval Research Laboratory)*; Grant Daneils (Naval Research Laboratory); Daniel Steinhurst (Nova Research); David Kidwell (Naval Research Laboratory)
DC Posters
  Integrating a hierarchical structure of situated human motion in Multi-task learning for professional gesture recognition Gavriela Senteri (ARMINES)
  Towards High Fidelity and Accurate Face Swapping Phyo Yee (IIT Ropar)
  Face-based Strategies for Evaluating Asymmetry and Speech Articulation in Patients with Craniofacial Anomalies Karen Rosero (University of Texas at Dallas)
Demo presentation
Expanding PyAFAR: A Novel Privacy-Preserving Infant AU Detector
Itir Onal Ertugrul (Utrecht University), Saurabh Hinduja (University of Pittsburgh), Maneesh Bilalpur (University of Pittsburgh), Daniel Messinger (University of Miami), Jeffrey Cohn (University of Pittsburgh)
19:00 – 23:00 Gala Dinner (Boat)
Thursday, 30 May 2024
8:00 – 9:00 Registration
9:00 – 10:00 Keynote 3
  Chair: Shaun Canavan
Speaker: Prof. Mohamed Daoudi
Title: Learning to Synthesize 3D Faces and Human Interactions
10:00 – 10:30 Coffee break
10:30 – 11:30 Oral Session 6 – Animation, Synthesis and Self-Supervision
Chair: Raghavendra Ramachandra
  Multi-View Consistent 3D GAN Inversion via Bidirectional Encoder Haozhan Wu (Institute of Computing Technology, Chinese Academy of Sciences)*; Hu Han (Institute of Computing Technology, Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences); Xilin Chen (Institute of Computing Technology, Chinese Academy of Sciences)
  Embedded Representation Learning Network for Animating Styled Video Portrait Tianyong Wang (Southeast University); Xiangyu Liang (Southeast University); wangguandong zheng (Southeast University); Dan Niu (Southeast University); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)*
  Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation Anton Pelykh (University of Surrey)*; Ozge Mercanoglu Sincan (University of Surrey); Richard Bowden (University of Surrey)
  RS-rPPG: Robust Self-Supervised Learning for rPPG Marko Radisa Savic (University of Oulu)*; Guoying Zhao (University of Oulu)
11:30 – 12:30 Poster Spotlights
  Posters from Poster Session 3
12:30 – 13:45 Lunch Break
13:45 – 15:30 Poster session 3
Posters from Oral sessions 6, 7 and 8
1 Multi-View Consistent 3D GAN Inversion via Bidirectional Encoder Haozhan Wu (Institute of Computing Technology, Chinese Academy of Sciences)*; Hu Han (Institute of Computing Technology, Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences); Xilin Chen (Institute of Computing Technology, Chinese Academy of Sciences)
2 Embedded Representation Learning Network for Animating Styled Video Portrait Tianyong Wang (Southeast University); Xiangyu Liang (Southeast University); wangguandong zheng (Southeast University); Dan Niu (Southeast University); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)*
3 Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation Anton Pelykh (University of Surrey)*; Ozge Mercanoglu Sincan (University of Surrey); Richard Bowden (University of Surrey)
4 RS-rPPG: Robust Self-Supervised Learning for rPPG Marko Radisa Savic (University of Oulu)*; Guoying Zhao (University of Oulu)
5 An Active-gaze Morphable Model for 3D Gaze Estimation Hao Sun (University of York); Nick E. Pears (University of York, UK)*; William Smith (University of York)
6 Occluded Person Retrieval with Hierarchical Feature Optimization Yang Zhao (La Trobe University)*; Pengcheng Zhang (Beihang University); Xiaohan Yu (Griffith University); Zhibin Liao (University of Adelaide); Johan Verjans (SAHMRI); Xiao Bai (Beihang University); Wei Xiang (La Trobe University)
7 High-resolution Image Enumeration for Low-resolution Face Recognition Can Chen (Kitware Inc.)*; Scott McCloskey (Kitware)
8 OpenThermalPose: An Open-Source Annotated Thermal Human Pose Dataset and Initial YOLOv8-Pose Baselines Askat Kuzdeuov (Nazarbayev University)*; Darya Taratynova (Nazarbayev University); Alim Tleuliyev (Nazarbayev University); Huseyin Atakan Varol (Nazarbayev University)
9 A Unified Model for Gaze Following and Social Gaze Prediction Anshul Gupta (Idiap Research Institute, EPFL)*; Samy Tafasca (Idiap Research Institute, EPFL); Naravich Chutisilp (EPFL); Jean-Marc ODOBEZ (IDIAP/EPFL, SWITZERLAND)
10 DrFER: Learning Disentangled Representations for 3D Facial Expression Recognition Hebeizi Li (Beihang University)*; Hongyu Yang (Beihang University); Di Huang (Beihang University, China)
11 ClipSwap: Towards High Fidelity Face Swapping via Attribute and CLIP-Informed Loss Phyo Thet Yee (IIT Ropar); Sudeepta Mishra (IIT Ropar); Abhinav Dhall (Flinders University)*
12 Multi-modal Human Behaviour Graph Representation Learning for Automatic Depression Assessment Haotian Shen (University of Cambridge); Siyang Song (University of Cambridge)*; Hatice Gunes (University of Cambridge)
Posters Only
13 Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention Gnana Praveen Rajasekhar (Computer Research Institute of Montreal)*; Jahangir Alam (Computer Research Institute of Montreal (CRIM), Montreal (Quebec) Canada)
14 VoxAtnNet: A 3D Point Clouds Convolutional Neural Network for Generalizable Face Presentation Attack Detection Raghavendra Ramachandra (NTNU, Norway)*; Narayan Vetrekar (Goa University); Sushma Krupa Venkatesh (Aiba); Savita Nageshker (Goa University); Jag Mohan Singh (Norwegian University of Science and Technology (NTNU) Gjøvik); Rajendra Gad (UoG, India)
15 EAT-Face: Emotion-Controllable Audio-Driven Talking Face Generation via Diffusion Model Haodi Wang (School of Computer Science and Engineering, Sun Yat-sen University); Xiaojun Jia (Nanyang Technological University); Xiaochun Cao (Sun Yat-sen University)*
16 Context-based Dataset for Analysis of Videos of Autistic Children Sk Rahatul Jannat (University of South Florida); Heather Agazzi (University of South Florida); Shaun Canavan (University of South Florida)*
17 Seeing and hearing what has not been said; A multimodal client behavior classifier in Motivational Interviewing with interpretable fusion Lucie Galland (ISIR)*; Catherine Pelachaud (CNRS, Sorbonne Université); Florian Pecune (Bordeaux University)
18 SignAvatar: Sign Language 3D Motion Reconstruction and Generation Lu Dong (University at Buffalo)*; Lipisha Chaudhary (University at Buffalo, SUNY); Fei Xu (University at Buffalo, SUNY); Xiao Wang (Syracuse University); Mason Lary (SUNY Buffalo); Ifeoma Nwogu (University at Buffalo, SUNY)
19 PortraitDAE: Line-Drawing Portraits Style Transfer from Photos via Diffusion Autoencoder with Meaningful Encoded Noise Yexiang Liu (Institute of Automation, Chinese Academy of Sciences); Jin Liu (ShanghaiTech University); Jie Cao (Institute of Automation, Chinese Academy of Sciences); Junxian Duan (National Laboratory of Pattern Recognition); Ran He (Institute of Automation, Chinese Academy of Sciences)*
20 FE-Adapter: Adapting Image-based Emotion Classifiers to Videos Shreyank N Gowda (University of Oxford)*; Boyan Gao (University of Oxford); David A Clifton (University of Oxford)
21 Latent Embedding Clustering for Occlusion Robust Head Pose Estimation José Carlos Celestino (Instituto Superior Técnico)*; Manuel Marques (Institute for Systems and Robotics (ISR/LARSyS), DEEC, Instituto Superior Tecnico, Portugal); Jacinto C. Nascimento (Instituto Superior Tecnico de Lisboa)
22 Pivotal Tuning Editing: Towards Disentangled Wrinkle Editing with GANs Neil Farmer (CentraleSupelec)*; Catherine SOLADIE (CentraleSupelec); Gabriel CAZORLA (Chanel); Renaud SEGUIER (CENTRALESUPELEC)
23 Data-Driven but Privacy-Conscious: Pedestrian Dataset De-identification via Full-Body Person Synthesis Maxim Maximov (TUM)*; Tim Meinhardt (TUM); Caner Hazirbas (Meta AI); Zoe Papakipos (Meta); Canton Cristian (Meta AI); Laura Leal-Taixé (NVIDIA)
24 CrossGaze: A Strong Method for 3D Gaze Estimation in the Wild Andy Eduard Catruna (University Politehnica of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest)
25 Survey of Automated Methods for Nonverbal Behavior Analysis in Parent-Child Interactions Berfu Karaca (Utrecht University)*; Albert Ali Salah (Utrecht University); Jaap Denissen (Utrecht University); Ronald Poppe (Utrecht University); Sonja de Zwarte (Utrecht University)
26 Naive Data Augmentation Might Be Toxic: Data-prior Guided Self-supervised Representation Learning for Micro-gesture Recognition Atif Shah (University of Oulu)*; Haoyu Chen (University of Oulu); Guoying Zhao (University of Oulu)
27 SMCTL: Subcarrier Masking Contrastive Transfer Learning For Human Gesture Recognition With Passive Wi-Fi Sensing Hojjat Salehinejad (Mayo Clinic)*; Radomir Djogo (University of Toronto); Navid Hasanzadeh (University of Toronto); Shahrokh Valaee (University of Toronto)
28 Semantic-Aware Detail Enhancement for Blind Face Restoration Huimin Zhao (Anhui University)*; Jie Cao (Institute of Automation, Chinese Academy of Sciences); Huaibo Huang (Institute of Automation, Chinese Academy of Sciences); Xiaoqiang Zhou (University of Science and Technology of China); Aihua Zheng (Anhui University); Ran He (Institute of Automation, Chinese Academy of Sciences)
30 Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models René Haas (IT University of Copenhagen); Inbar Huberman-Spiegelglas (Technion); Rotem Mulayoff (Technion); Stella Graßhof (IT University of Copenhagen)*; Sami S Brandt (IT University of Copenhagen); Tomer Michaeli (Technion)
31 Breaking Template Protection: Reconstruction of Face Images from Protected Facial Templates Hatef Otroshi Shahreza (Idiap Research Institute)*; Sebastien Marcel (Idiap Research Institute)
32 Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson’s Disease Severity in Walking Sequences Vida Adeli (University of Toronto)*; Soroush Mehraban (University of Toronto); Irene Ballester (TU Wien); Yasamin Zarghami (University of Toronto); Andrea Sabo (University of Toronto); Andrea Iaboni (Toronto Rehabilitation Institute); Babak Taati (University Health Network)
34 Visual Coherence Face Anonymization Algorithm Based on Dynamic Identity Perception Xuan Tan (Hangzhou Dianzi University); Shanqing Zhang (Hangzhou Dianzi University); Yixuan Ju (University of Yamanashi); Xiaoyang Mao (University of Yamanashi); Jiayi Xu (Hangzhou Dianzi University)*
35 PyraMoT: A Novel Framework for Enhanced Facial Thermal Landmarks Detection Kais Riani (University of Michigan)*; Salem Sharak (University of Michigan); Mohamed Abouelenien (University of Michigan)
36 Visual Saliency Guided Gaze Target Estimation with Limited Labels Cheng Peng (King’s College London)*; Oya Celiktutan (King’s College London)
37 Hyp-OC: Hyperbolic One Class Classifier for Face Anti-Spoofing Kartik Narayan (Johns Hopkins University)*; Vishal Patel (Johns Hopkins University)
38 Dynamic Cross Attention for Audio-Visual Person Verification Gnana Praveen Rajasekhar (Computer Research Institute of Montreal)*; Jahangir Alam (Computer Research Institute of Montreal (CRIM), Montreal (Quebec) Canada)
39 Enhancing Privacy in Face Analytics Using Fully Homomorphic Encryption Bharat Yalavarthi (University at Buffalo)*; Arjun Ramesh Kaushik (University at Buffalo, The State University of New York); Arun Ross (Michigan State University); Vishnu Boddeti (Michigan State University); Nalini Ratha (SUNY Buffalo)
40 CribNet: Enhancing Infant Safety in Cribs through Vision-based Hazard Detection Shaotong Zhu (Northeastern University); Amal Mathew (Northeastern University); Elaheh Hatamimajoumerd (Northeastern University); Michael Wan (Northeastern University); Briana Taylor (The Roux Institute at Northeastern University); Rajagopal Venkatesaramani (Northeastern University); Sarah Ostadabbas (Northeastern University)*
41 3D Face Morphing Attack Generation using Non-Rigid Registration Jag Mohan Singh (Norwegian University of Science and Technology (NTNU) Gjøvik)*; Raghavendra Ramachandra (NTNU, Norway)
42 BTVSL: A Novel Sentence-Level Annotated Dataset for Bangla Sign Language Translation Iftekhar E Mahbub Zeeon (Bangladesh University of Engineering and Technology); Mir Mahathir Mohammad (University of Utah); Muhammad Abdullah Adnan (University of California San Diego)*
15:30 – 16:30 Oral Session 7 – Best Reviewed Papers
Chair: Carlos Busso
  An Active-gaze Morphable Model for 3D Gaze Estimation Hao Sun (University of York); Nick E. Pears (University of York, UK)*; William Smith (University of York)
  Occluded Person Retrieval with Hierarchical Feature Optimization Yang Zhao (La Trobe University)*; Pengcheng Zhang (Beihang University); Xiaohan Yu (Griffith University); Zhibin Liao (University of Adelaide); Johan Verjans (SAHMRI); Xiao Bai (Beihang University); Wei Xiang (La Trobe University)
  High-resolution Image Enumeration for Low-resolution Face Recognition Can Chen (Kitware Inc.)*; Scott McCloskey (Kitware)
  OpenThermalPose: An Open-Source Annotated Thermal Human Pose Dataset and Initial YOLOv8-Pose Baselines Askat Kuzdeuov (Nazarbayev University)*; Darya Taratynova (Nazarbayev University); Alim Tleuliyev (Nazarbayev University); Huseyin Atakan Varol (Nazarbayev University)
16:30 – 17:00 Coffee break
17:00 – 18:00 Oral Session 8 – Best Reviewed Student Papers
Chair: Itir Onal Ertugrul
  A Unified Model for Gaze Following and Social Gaze Prediction Anshul Gupta (Idiap Research Institute, EPFL)*; Samy Tafasca (Idiap Research Institute, EPFL); Naravich Chutisilp (EPFL); Jean-Marc ODOBEZ (IDIAP/EPFL, SWITZERLAND)
  DrFER: Learning Disentangled Representations for 3D Facial Expression Recognition Hebeizi Li (Beihang University)*; Hongyu Yang (Beihang University); Di Huang (Beihang University, China)
  ClipSwap: Towards High Fidelity Face Swapping via Attribute and CLIP-Informed Loss Phyo Thet Yee (IIT Ropar); Sudeepta Mishra (IIT Ropar); Abhinav Dhall (Indian Institute of Technology Ropar)*
  Multi-modal Human Behaviour Graph Representation Learning for Automatic Depression Assessment Haotian Shen (University of Cambridge); Siyang Song (University of Cambridge)*; Hatice Gunes (University of Cambridge)
18:00 – 18:10 Closing session