Pre&Post-Workshops
Tuesday, 28 May 2024
Wednesday, 29 May 2024
Thursday, 30 May 2024
Pre&Post-Workshops
Morning: from 9am to 1:30pm (end time fixed)
Afternoon: from 2pm to 6pm (start time fixed) |
|
Monday, 27 May 2024 | |
---|---|
Room 1 | |
Morning: 8:30 – 11:30 | Doctoral Consortium (DC) |
Noon: 11:30 – 12:30 | IEEE Transactions Associate Editor Training for the Next Generation (AE) |
Noon: 12:30 – 14:00 | DC Lunch |
Afternoon: 14:00 – 19:00 | Advancements in Facial Expression Analysis and Synthesis: Past, Present, and Future (AFEAS) |
Room 2 | |
Morning workshop | Synthetic Data for Face and Gesture Analysis (SDA-FGA) |
Afternoon workshop | FG 2024 Competitions |
Room 3 | |
Morning tutorial | Bias Assessment, Explanation, and Mitigation in Deep Face Recognition (BIAS) |
Afternoon tutorial | Generation of Synthetic Data for Remote Verification System (SYNTH) |
Friday, 31 May 2024 | |
Room 1 | |
Morning workshop | Responsible Face Image Processing (REFIP) |
Afternoon workshop | Learning with Few or without Annotated Face, Body and Gesture Data (LFA) |
Room 2 | |
Morning workshop | Privacy-aware and Acceptable Video-based Assistive Technologies (PrivAAL) |
Afternoon workshop | Segmentation and Assessment of Continuous Video in Figure Skating Workshop & Challenge (SkatingVerse) |
Room 3 | |
Morning workshop | Applied Multimodal Affect Recognition (AMAR) |
Tuesday, 28 May 2024
Tuesday, 28 May 2024 | ||
---|---|---|
8:00 – 8:45 | Registration | |
8:45 – 9:00 | Opening session Chair: Vito Struc |
|
9:00 – 10:00 | Keynote 1 | |
Chair: Xilin Chen Speaker: Prof. Shiguang Shan Title: Gaze analysis and applicationst |
||
10:00 – 10:30 | Coffee break | |
10:30 – 11:30 | Ask Me Anything Session | |
Chair: Laszlo Jeni Speaker: Prof. Takeo Kanade |
||
11:30 – 12:30 | Poster Spotlights | |
Posters from Poster Session 1 | ||
12:30 – 14:00 | Lunch Break | |
14:00 – 15:00 | Oral Session 1 – Face biometrics Chair: Nuno Goncalves |
|
Designing Cross-Race Tests for Forensic Facial Examiners, Super-recognizers, and Face Recognition Algorithm | Géraldine Jeckeln (The University of Texas at Dallas)*; Selin Yavuzcan (The University of Texas at Dallas); Kate A. Marquis (The University of Texas at Dallas); Prajay S. Mehta (The University of Texas at Dallas); Amy N. Yates (National Institute of Standards and Technology); P Jonathon Phillips (NIST); Alice O’Toole (University of Texas at Dallas) | |
TetraLoss: Improving the Robustness of Face Recognition against Morphing Attacks | Mathias Ibsen (Hochschule Darmstadt)*; Lazaro Janier Gonzalez-Soler (Hochschule Darmstadt); Christian Rathgeb (Hochschule Darmstadt); Christoph Busch (Hochschule Darmstadt) | |
Hierarchical Generative Network for Face Morphing Attacks | Zuyuan He (SiChuan University); Zongyong Deng (Sichuan University); qiaoyun He (Sichuan University); Qijun Zhao (Sichuan University)* | |
Face Anti-spoofing via Interaction Learning with Face Image Quality Alignment | Yongluo Liu (Beijing University of Technology); Zun Li (Beijing University of Technology); Shuyi Li (Beijing University of Technology); Zhuming Wang (Beijing University of Technology); Lifang Wu (Beijing University of Technology)* | |
15:00 – 15:15 | Break | |
15:15 – 16:15 | Oral Session 2 – Facial Expressions Chair: Lijun Yin |
|
Multi-Scale Spatio-Temporal Graph Convolutional Network for Facial Expression Spotting | Yicheng Deng (Osaka University)*; Hideaki Hayashi (Osaka University); Hajime Nagahara (Osaka University) | |
epsilon-Mesh Attack: A Surface-based Adversarial Point Cloud Attack for Facial Expression Recognition | Batuhan Cengiz (Istanbul Technical University)*; Mert Gülşen (Istanbul Technical University); Yusuf Hüseyin Şahin (İTÜ); Gozde Unal (Istanbul Technical University) | |
Distilling Privileged Multimodal Information for Expression Recognition using Optimal Transport | Muhammad Haseeb Aslam (ETS)*; Muhammad Osama Zeeshan (École de technologie supérieure); Soufiane Belharbi (ÉTS Montreal); Marco Pedersoli (École de technologie supérieure); Alessandro Lameiras Koerich (École de technologie supérieure ); Simon Bacon (Concordia University); Eric Granger (ETS Montreal ) | |
CSTalk: Correlation Supervised Speech-driven 3D Emotional Facial Animation Generation | Xiangyu Liang (Southeast University); Wenlin Zhuang (Southeast University); Tianyong Wang (Southeast University); Guangxing Geng (Nanjing 8:8 Digital Technology Co., Ltd); Guangyue Geng (Nanjing 8:8 Digital Technology Co., Ltd); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)* | |
16:15 – 18:00 | Poster session 1 + Coffee break | |
Posters from Oral sessions 1 and 2 | ||
1 | Designing Cross-Race Tests for Forensic Facial Examiners, Super-recognizers, and Face Recognition Algorithm | Géraldine Jeckeln (The University of Texas at Dallas)*; Selin Yavuzcan (The University of Texas at Dallas); Kate A. Marquis (The University of Texas at Dallas); Prajay S. Mehta (The University of Texas at Dallas); Amy N. Yates (National Institute of Standards and Technology); P Jonathon Phillips (NIST); Alice O’Toole (University of Texas at Dallas) |
2 | TetraLoss: Improving the Robustness of Face Recognition against Morphing Attacks | Mathias Ibsen (Hochschule Darmstadt)*; Lazaro Janier Gonzalez-Soler (Hochschule Darmstadt); Christian Rathgeb (Hochschule Darmstadt); Christoph Busch (Hochschule Darmstadt) |
3 | Hierarchical Generative Network for Face Morphing Attacks | Zuyuan He (SiChuan University); Zongyong Deng (Sichuan University); qiaoyun He (Sichuan University); Qijun Zhao (Sichuan University)* |
4 | Face Anti-spoofing via Interaction Learning with Face Image Quality Alignment | Yongluo Liu (Beijing University of Technology); Zun Li (Beijing University of Technology); Shuyi Li (Beijing University of Technology); Zhuming Wang (Beijing University of Technology); Lifang Wu (Beijing University of Technology)* |
5 | Multi-Scale Spatio-Temporal Graph Convolutional Network for Facial Expression Spotting | Yicheng Deng (Osaka University)*; Hideaki Hayashi (Osaka University); Hajime Nagahara (Osaka University) |
6 | epsilon-Mesh Attack: A Surface-based Adversarial Point Cloud Attack for Facial Expression Recognition | Batuhan Cengiz (Istanbul Technical University)*; Mert Gülşen (Istanbul Technical University); Yusuf Hüseyin Şahin (İTÜ); Gozde Unal (Istanbul Technical University) |
7 | Distilling Privileged Multimodal Information for Expression Recognition using Optimal Transport | Muhammad Haseeb Aslam (ETS)*; Muhammad Osama Zeeshan (École de technologie supérieure); Soufiane Belharbi (ÉTS Montreal); Marco Pedersoli (École de technologie supérieure); Alessandro Lameiras Koerich (École de technologie supérieure ); Simon Bacon (Concordia University); Eric Granger (ETS Montreal ) |
8 | CSTalk: Correlation Supervised Speech-driven 3D Emotional Facial Animation Generation | Xiangyu Liang (Southeast University); Wenlin Zhuang (Southeast University); Tianyong Wang (Southeast University); Guangxing Geng (Nanjing 8:8 Digital Technology Co., Ltd); Guangyue Geng (Nanjing 8:8 Digital Technology Co., Ltd); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)* |
Posters Only | ||
9 | Efficient Verification-Based Face Identification | Barak Battash (Intel)*; Amit Rozner (Bar-Ilan University); Ofir Lindenbaum (Yale ); Lior Wolf (Tel Aviv University, Israel) |
10 | Dataset Infant Anonymization with Pose and Emotion Retention | Mason Lary (SUNY Buffalo)*; Matthew M Klawonn (US Air Force Research Laboratory); Daniel Messinger (University of Miami); Ifeoma Nwogu (University at Buffalo, SUNY) |
11 | Face the Needle: Predicting risk of fear and fainting during blood donation through video analysis | Judita Rudokaite (Tilburg University)*; Itir Onal Ertugrul (Utrecht University); Sharon Ong (Tilburg University); Mart Janssen (Sanquin); Elisabeth Huis in ‘t Veld (Tilburg University) |
12 | Intra-Person Camera Adversarial for Intra-Camera Supervised Person Re-identification | Ruochen tang (Southwest Jiaotong University)*; Xun Gong (Southwest Jiaotong University) |
13 | Adaptive Cross-architecture Mutual Knowledge Distillation | Jianyuan Ni (Texas State University)*; Hao Tang (ETH Zurich & CMU); Yuzhang Shang (Illinois Institute of Technology); Bin Duan (Illinois Institute of Technology); Yan Yan (Illinois Institute of Technology) |
14 | ASPECD: Adaptable Soft-Biometric Privacy-Enhancement Using Centroid Decoding for Face Verification | Peter Rot (Univerza v Ljubljani, Fakulteta za Elektrotehniko)*; Philipp Terhörst (Paderborn University); Peter Peer (University of Ljubljana); Vitomir Struc (University of Ljubljana) |
15 | Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces Recognition | Iurii Medvedev (University of Coimbra)*; Farhad Shadmand (University of Coimbra); Nuno Gonçalves (University of Coimbra) |
16 | Deepfake: Classifiers, Fairness, and Demographically Robust Algorithm | Akshay Agarwal (IISER Bhopal)*; Nalini Ratha (SUNY Buffalo) |
17 | PointFaceFormer: local and global attention based transformer for 3D point cloud face recognition | Ziqi Gao (shenzhen university); Qiufu Li (shenzhen university); Gui Wang (WKU); Linlin Shen (Shenzhen University)* |
18 | Subject-Based Domain Adaptation for Facial Expression Recognition | Muhammad Osama Zeeshan (École de technologie supérieure)*; Muhammad Haseeb Aslam (ETS); Soufiane Belharbi (ÉTS Montreal); Alessandro Lameiras Koerich (École de technologie supérieure ); Marco Pedersoli (École de technologie supérieure); Simon Bacon (Concordia University); Eric Granger (ETS Montreal ) |
19 | Efficient Detection of Disguised Faces using Photos/Sketches from Low-Quality Surveillance Footage | Nikhil Reddy Pottanigari (University of Montreal (MILA))*; Rithin Pullela (Texas A&M University); Abdul kalam azad Shaik (university of florida ); Rithik Reddy Katpally (San Jose State University) |
20 | Lip and speech synchronization using supervised contrastive learning and cross-modal attention | Munender Varshney (Hitachi Research and Development Center)*; Mayurakshi Mukherji (Hitachi India Pvt. Ltd.); Senthil raja G (Hitachi India Pvt. Ltd); Ananth Ganesh (Hitachi India); Kingshuk Banerjee (Hitachi India Pvt. Ltd) |
21 | If It’s Not Enough, Make It So: Reducing Authentic Data Demand in Face Recognition through Synthetic Faces | Andrea Atzori (University of Cagliari)*; Fadi Boutros (Fraunhofer IGD); Naser Damer (Fraunhofer Institute for Computer Graphics Research IGD and TU Darmstadt ); Gianni Fenu (University of Cagliari); Mirko Marras (University of Cagliari) |
22 | Data Augmentation Techniques for Enhanced Facial Landmarks Detection in Patients with Repaired Cleft Lip and Palate | Karen Rosero (University of Texas at Dallas)*; Ali N Salman (University of Texas at Dallas ); Berrak Sisman (University of Texas at Dallas ); Rami Hallac (University of Texas Southwestern Medical Center, Children’s Medical Center); Carlos Busso (University of Texas at Dallas) |
23 | Deep adaptative spectral zoom for improved remote heart rate estimation | Joaquim Comas Martínez (Universitat Pompeu Fabra)*; Adria Ruiz (Pompeu Fabra University); Federico Sukno (Pompeu Fabra University) |
24 | Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis | Guanyu Hu (Xi’an Jiaotong University ); Eleni Papadopoulou (NTUA); Dimitrios Kollias (Queen Mary University London)*; Paraskevi Tzouveli (NTUA); JIE WEI (Xi’an Jiaotong University); Xinyu Yang (Xi’an Jiaotong University) |
25 | ONOT: a High-Quality ICAO-compliant Synthetic Mugshot Dataset | Nicolò Di Domenico (University of Bologna); Guido Borghi (University of Bologna)*; Annalisa Franco (University of Bologna); Davide Maltoni (University of Bologna) |
26 | RFIS-FPI: Reversible Face Image Steganography Neural Network for Face Privacy Interactions | Yubo Huang (Southwest Jiaotong Unverisity)*; Anran Zhu (Southwest Jiaotong University); Cheng Zeng (Southwest Jiaotong University); Cong Hu (Southwest Jiaotong University); Xin Lai (Southwest Jiaotong University); Wenhao Feng (Southwest Jiaotong University); Fan Chen (Southwest Jiaotong University) |
27 | Unlocking the Black Box: Concept-Based Modeling for Interpretable Affective Computing Applications | Xinyu Li (University of Glasgow)*; Marwa Mahmoud (University of Glasgow) |
28 | Social-MAE: A Transformer-Based Multimodal Autoencoder for Face and Voice | Hugo Bohy (University of Mons)*; Kevin El Haddad (University of Mons/The Big Projects); Minh Tran (ICT, USC); Thierry Dutoit (University of Mons); Mohammad Soleymani (University of Southern California) |
29 | Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues | Soufiane Belharbi (ÉTS Montreal)*; Marco Pedersoli (École de technologie supérieure); Alessandro Lameiras Koerich (École de technologie supérieure ); Simon Bacon (Concordia University); Eric Granger (ETS Montreal ) |
30 | AerialFace: A Light Weight Framework for Unmanned Aerial Vehicle Face Recognition | zhiquan ou (Hohai University); Liang Yao (Hohai University)*; Ting Wu (Hohai University); Fan Liu (Hohai University) |
31 | QGFace: Quality-Guided Joint Training for Mixed Quality Face Recognition | Youzhe Song (East China Normal University)*; Feng Wang (East China Normal University) |
32 | EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition | Niki M Foteinopoulou (SnT, University of Luxembourg)*; Ioannis Patras (Queen Mary University of London) |
33 | In-Domain Inversion for Improved 3D Face Alignment on Asymmetrical Expressions | Jilliam M. Diaz Barros (German Research Center for Artificial Intelligence)*; Jason Rambach (DFKI); Pramod Murthy (DFKI); Didier Stricker (DFKI) |
34 | 3D Face Modeling via Weakly-supervised Disentanglement Network joint Identity-consistency Prior | Guohao Li (BUAA)*; Hongyu Yang (Beihang University); Di Huang (Beihang University, China); Yunhong Wang (State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China) |
35 | Expression-aware Masking and Progressive Decoupling for Cross-database Facial Expression Recognition | Tao Zhong (Shenzhen University); Xiaole Xian (Shenzhen University); Zihan Wang (Shenzhen University); Weicheng Xie (Shenzhen University)*; Linlin Shen (Shenzhen University) |
36 | Explainable Face Verification via Feature-Guided Gradient Backpropagation | Yuhang Lu (EPFL)*; Zewei Xu (EPFL); Touradj Ebrahimi (EPFL) |
Demo presentation | ||
Russian sign language learning simulator |
Maxim Novopoltsev (SberAI), Aleksandr Tulenkov (SberAI), Roman Akhidov (SberAI), Ruslan Murtazin (SberAI), Dmitriy Milevich (SberAI), Iuliia Zemtsova (SberAI) | |
18:00 | Welcome Reception |
Wednesday, 29 May 2024
Wednesday, 29 May 2024 | ||
---|---|---|
8:00 – 9:00 | Registration | |
9:00 – 10:00 | Keynote 2 | |
Chair: Lale Akarun Speaker: Prof. Beatrice de Gelder Title: Linking body movement analysis and brain activity |
||
10:00 – 10:30 | Coffee break | |
10:30 – 11:30 | Oral Session 3 – Human pose and motion Chair: Martin Kampel |
|
Uncalibrated Multi-view 3D Human Pose Estimation with Geometry Driven Attention | Victor Galizzi (CEA)*; Bertrand Luvison (CEA LIST) | |
Geometry-Biased Transformer for Robust Multi-View 3D Human Pose Reconstruction | Olivier Moliner (Lund University)*; Sangxia Huang (Sony Research); Kalle Åström (Lund University) | |
One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features | Trung Thanh NGUYEN (Nagoya Univeristy)*; Yasutomo Kawanishi (RIKEN); Takahiro Komamizu (Nagoya University); Ichiro Ide (Nagoya University) | |
CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras | James Y Tang (University of British Columbia, Department of Computer Science)*; Shashwat Suri (University of British Columbia); Daniel Abidemi Ajisafe (The University of British Columbia); Bastian Wandt (Linköping University); Helge Rhodin (UBC) | |
11:30 – 12:30 | Poster Spotlights | |
Posters from Poster Session 2 | ||
12:30 – 14:00 | Lunch Break | |
14:00 – 15:00 | Oral Session 4 – Gait and Action Chair: Yan Yan |
|
Unveiling Gender Effects in Gait Recognition using Conditional-Matched Bootstrap Analysis | Azim Ibragimov (University of Florida)*; Mauricio Pamplona Segundo (University of South Florida); Sudeep Sarkar (University of South Florida, Tampa); Kevin W Bowyer (University of Notre Dame) | |
GaitPT: Skeletons Are All You Need For Gait Recognition | Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest) | |
Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Action Recognition | Wele Gedara Chaminda Bandara (Apple Inc)*; Vishal Patel (Johns Hopkins University) | |
ViewDiffGait: View Pyramid Diffusion for Gait Recognition | Rijun Liao (University of Missouri-Kansas City)*; Zhu Li (University of Missouri-Kansas City); Shuvra Bhattacharyya (University of Maryland); George York (US Air Force Academy) | |
15:00 – 15:15 | Break | |
15:15 – 16:15 | Oral Session 5 – Hand and Sign Language Chair: Hazem Wannous |
|
Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks | Maksym Ivashechkin (University of Surrey)*; Richard Bowden (University of Surrey); Oscar Mendez (University of Surrey) | |
SynthSL: Expressive Humans for Sign Language Image Synthesis | Jilliam M. Diaz Barros (German Research Center for Artificial Intelligence)*; Chen-Yu Wang (DFKI); Jameel Malik (DFKI); Abdalla Arafa (DFKI); Didier Stricker (DFKI) | |
A Gloss-free Sign Language Production with Discrete Representation | Eui Jun Hwang (KAIST)*; Huije Lee (Korea Advanced Institute of Science and Technology); Jong C. Park (KAIST) | |
In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition | Wiktor Mucha (Vienna University of Technology, Computer Vision Lab)*; Martin Kampel (Vienna University of Technology, Computer Vision Lab) | |
16:15 – 18:00 | Poster session 2 + Coffee | |
Posters from Oral sessions 2, 3 and 4 | ||
1 | Uncalibrated Multi-view 3D Human Pose Estimation with Geometry Driven Attention | Victor Galizzi (CEA)*; Bertrand Luvison (CEA LIST) |
2 | Geometry-Biased Transformer for Robust Multi-View 3D Human Pose Reconstruction | Olivier Moliner (Lund University)*; Sangxia Huang (Sony Research); Kalle Åström (Lund University) |
3 | One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features | Trung Thanh NGUYEN (Nagoya Univeristy)*; Yasutomo Kawanishi (RIKEN); Takahiro Komamizu (Nagoya University); Ichiro Ide (Nagoya University) |
4 | CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras | James Y Tang (University of British Columbia, Department of Computer Science)*; Shashwat Suri (University of British Columbia); Daniel Abidemi Ajisafe (The University of British Columbia); Bastian Wandt (Linköping University); Helge Rhodin (UBC) |
5 | Unveiling Gender Effects in Gait Recognition using Conditional-Matched Bootstrap Analysis | Azim Ibragimov (University of Florida)*; Mauricio Pamplona Segundo (University of South Florida); Sudeep Sarkar (University of South Florida, Tampa); Kevin W Bowyer (University of Notre Dame) |
6 | GaitPT: Skeletons Are All You Need For Gait Recognition | Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest) |
7 | Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Action Recognition | Wele Gedara Chaminda Bandara (Apple Inc)*; Vishal Patel (Johns Hopkins University) |
8 | ViewDiffGait: View Pyramid Diffusion for Gait Recognition | Rijun Liao (University of Missouri-Kansas City)*; Zhu Li (university of missouri-kansas city); Shuvra Bhattacharyya (University of Maryland); George York (US Air Force Academy) |
9 | Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks | Maksym Ivashechkin (University of Surrey)*; Richard Bowden (University of Surrey); Oscar Mendez (University of Surrey) |
10 | SynthSL: Expressive Humans for Sign Language Image Synthesis | Jilliam M. Diaz Barros (German Research Center for Artificial Intelligence)*; Chen-Yu Wang (DFKI); Jameel Malik (DFKI); Abdalla Arafa (DFKI); Didier Stricker (DFKI) |
11 | A Gloss-free Sign Language Production with Discrete Representation | Eui Jun Hwang (KAIST)*; Huije Lee (Korea Advanced Institute of Science and Technology); Jong C. Park (KAIST) |
12 | In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition | Wiktor Mucha (Vienna University of Technology, Computer Vision Lab)*; Martin Kampel (Vienna University of Technology, Computer Vision Lab) |
Posters Only | ||
13 | BEAVP: A Bidirectional Enhanced Adversarial Model for Video Prediction | Peiyuan Zhu (Tongji University); Fengxia Han (Tongji University); Shengjie Zhao (Tongji University); Hao Deng (Tongji Universtiy)* |
14 | Skeleton-based Self-Supervised Feature Extraction for Improved Dynamic Hand Gesture Recognition | Omar Ikne (IMT Nord Europe)*; Benjamin Allaert (IMT Nord Europe); Hazem Wannous (IMT Nord Europe, CRIStAL UMR 9189) |
15 | Human Action Recognition with Multi-Level Granularity and Pair-wise Hyper GCN | Tamam Alsarhan (Khalifa University)*; Tamam Alsarhan (The university of Jordan); Ayoub Alsarhan (Hashemite university); Syed Sadaf Ali (Khalifa University); Iyyakutti Iyappan Ganapathi (Khalifa University); Naoufel Werghi (Khalifa University of Science and Technology) |
16 | MGRFormer: A Multimodal Transformer Approach for Surgical Gesture Recognition | Kevin Feghoul (University of Lille)*; Deise S Maia (Université de Lille); Mehdi Elamrani (CHU Lille); Mohamed Daoudi (IMT Nord Europe); Ali Amad (University of Lille) |
17 | CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition | Pierre Vuillecard (Idiap)*; Arya Farkhondeh (Idiap Research Institute, EPFL); Michael Villamizar (Idiap Research Institute); Jean-Marc ODOBEZ (IDIAP/EPFL, SWITZERLAND) |
18 | GestSpoof: Gesture Based Spatio-Temporal Representation Learning For Robust Fingerprint Presentation Attack Detection. | Bhavin Jawade (University at Buffalo)*; Shreeram Gudemaranahalli Subramanya (University at Buffalo); Atharv Dabhade (University at Buffalo, SUNY); Srirangaraj Setlur (University at Buffalo, SUNY); Venu Govindaraju (University at Buffalo, SUNY) |
19 | Spatio Temporal Sparse Graph Convolution Network for Hand Gesture Recognition | Omar Ikne (IMT Nord Europe)*; Rim Slama (CESI LINEACT); Hichem Saoudi (IMT Nord Europe); Hazem Wannous (IMT Nord Europe, CRIStAL UMR 9189) |
20 | Crowd Detection via Point Localization with Diffusion Models | Don Yasiru L Ranasinghe (Johns Hopkins University)*; Vishal Patel (Johns Hopkins University) |
21 | MIMIC-Pose: Implicit Membership Discrimination of Body Joints for Human Pose Estimation | Ying Huang (Hangzhou Normal University)*; Shanfeng Hu (Northumbria University) |
22 | DPA-2D: Depth Propagation and Alignment with 2D Observations Guidance for Human Mesh Recovery | Weihao You (Tomorrow Advancing Life)*; Pengcheng Wang (Tomorrow Advancing Life); Jinfeng Bai (Tomorrow Advance Life); zhilong ji (Tomorrow Advancing Life) |
23 | Evaluating Recent 2D Human Pose Estimators for 2D-3D Pose Lifting | Soroush Mehraban (University of Toronto)*; Yiqian Qin (University of Toronto); Babak Taati (University Health Network) |
24 | The Paradox of Motion: Evidence for Spurious Correlations in Skeleton-based Gait Recognition Models | Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest) |
25 | Improving 2D Human Pose Estimation in Unseen Camera Views with Synthetic Data | Miroslav Purkrabek (Czech Technical University, Prague)*; Jiri Matas (Czech Technical University, Prague) |
26 | DualH: A Dual Hierarchical Model for Temporal Action Localization | Zejian Zhang (Universitat de Barcelona)*; Cristina Palmero (Universitat de Barcelona); Sergio Escalera (Universitat de Barcelona) |
27 | HR-xNet: A Novel High-Resolution Network for Human Pose Estimation with Low Resource Consumption | cun feng (Ningbo University); Rong Zhang (Ningbo University)*; Lijun Guo (Ningbo University) |
28 | Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition | zhendong liu (Southeast University); Haifeng Xia (Southeast University); Tong Guo (Southeast University); Libo Sun (Southeast University); Ming Shao (University of Massachusetts Dartmouth); Siyu Xia (Southeast University, China)* |
29 | HM-Auth: Redefining User Authentication in Immersive Virtual World through Hand Movement Signatures | Sindhu Reddy Kalathur Gopal (University of Wyoming)*; Paul S Gyreyiri (University of Wyoming); Diksha Shukla (University of Wyoming) |
30 | A Data-Driven Representation for Sign Language Production | Harry Walsh (University of Surrey)*; Abolfazl Zargari Khuzani (Intel); Mariam Rahmani (Intel Corporation); Richard Bowden (University of Surrey) |
31 | Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder | Mohamed I Lakhal (University of Surrey)*; Richard Bowden (University of Surrey) |
32 | Resource-Efficient Gesture Recognition using Low-Resolution Thermal Camera via Spiking Neural Networks and Sparse Segmentation | Ali Safa (KU Leuven – IMEC)*; Wout Mommen (VUB – IMEC); Piet Wambacq (IMEC -VUB); Lars Keuninckx (imec) |
33 | The Seven Faces of Stress: Understanding Facial Activity Patterns during Cognitive Stress | Carla Viegas (Carnegie Mellon University)*; Roy A Maxion (Carnegie Mellon University, USA); Alexander Hauptmann (Carnegie Mellon University); Joao Magalhaes (Universidade NOVA Lisboa) |
34 | Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets | Ahmet Alp Kindiroglu (Huawei)*; Ozgur Kara (Georgia Institute of Technology); Oğulcan Özdemir (Bogazici University); Lale Akarun (Bogazici University) |
35 | Patch-based Privacy Attention for Weakly-supervised Privacy-Preserving Action Recognition | Xiao Li (Sun Yat-sen University); Yukun Qiu (Sun Yat-sen University); Yi-Xing Peng (Sun Yat-sen University, China); WEI-SHI ZHENG (Sun Yat-sen University, China)* |
36 | Boosting Gesture Recognition with an Automatic Gesture Annotation Framework | Junxiao Shen (University of Cambridge)*; Xuhai Xu (Meta Reality Lab Research); Ran Tan (Meta Reality Labs Research); Amy Karlson (Meta Reality Labs Research); Evan Strasnick (Meta Reality Labs Research) |
37 | Towards Better Communication: Refining Hand Pose Estimation in Low-Resolution Sign Language Videos | Sümeyye M Taşyürek (Hacettepe University)*; Tuğçe Kızıltepe (Hacettepe University); Hacer Yalim Keles (Hacettepe University) |
38 | Quantifying Biometric Characteristics of Hand Gestures through Feature Space Probing and Identity-Level Cross-Gesture Disentanglement | Aman Verma (Indian Institute of Technology Delhi); Gaurav Jaswal (Indian Institute of Technology Delhi); Seshan Srirangarajan (Indian Institute of Technology Delhi)*; Sumantra Dutta Roy (Indian Institute of Technology Delhi) |
39 | Hand Graph Topology Selection for Skeleton-based Sign Language Recognition | Oğulcan Özdemir (Bogazici University)*; Inci M. Baytas (Bogazici University); Lale Akarun (Bogazici University) |
40 | Unconstrained Hand Recognition using Thermal Infrared Sensing of Dorsal Veins | Wallace Lawson (Naval Research Laboratory)*; Grant Daneils (Naval Research Laboratory); Daniel Steinhurst (Nova Research); David Kidwell (Naval Research Laboratory) |
DC Posters | ||
Integrating a hierarchical structure of situated human motion in Multi-task learning for professional gesture recognition | Gavriela Senteri (ARMINES) | |
Towards High Fidelity and Accurate Face Swapping | Phyo Yee (IIT Ropar) | |
Face-based Strategies for Evaluating Asymmetry and Speech Articulation in Patients with Craniofacial Anomalies | Karen Rosero (University of Texas at Dallas) | |
Demo presentation | ||
Expanding PyAFAR: A Novel Privacy-Preserving Infant AU Detector |
Itir Onal Ertugrul (Utrecht University), Saurabh Hinduja (University of Pittsburgh), Maneesh Bilalpur (University of Pittsburgh), Daniel Messinger (University of Miami), Jeffrey Cohn (University of Pittsburgh). |
|
19:00-23:00 | Gala Dinner (Boat) |
Thursday, 30 May 2024
Thursday, 30 May 2024 | ||
---|---|---|
8:00 – 9:00 | Registration | |
9:00 – 10:00 | Keynote 3 | |
Chair: Shaun Canavan Speaker: Prof. Mohamed Daoudi Title: Learning to Synthesize 3D Faces and Human Interactions |
||
10:00 – 10:30 | Coffee break | |
10:30 – 11:30 | Oral Session 6 – Animation, Synthesis and Self-Supervision Chair: Raghavendra Ramachandra |
|
Multi-View Consistent 3D GAN Inversion via Bidirectional Encoder | Haozhan Wu (Institute of Computing Technology, Chinese Academy of Sciences)*; Hu Han (Institute of Computing Technology, Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences); Xilin Chen (Institute of Computing Technology, Chinese Academy of Sciences) | |
Embedded Representation Learning Network for Animating Styled Video Portrait | Tianyong Wang (Southeast University); Xiangyu Liang (Southeast University); wangguandong zheng (Southeast University); Dan Niu (Southeast University); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)* | |
Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation | Anton Pelykh (University of Surrey)*; Ozge Mercanoglu Sincan (University of Surrey); Richard Bowden (University of Surrey) | |
RS-rPPG: Robust Self-Supervised Learning for rPPG | Marko Radisa Savic (University of Oulu)*; Guoying Zhao (University of Oulu) | |
11:30 – 12:30 | Poster Spotlights | |
Posters from Poster Session 3 | ||
12:30 – 13:45 | Lunch Break | |
13:45 – 15:30 | Poster session 3 | |
Posters from Oral sessions 6, 7 and 8 | ||
1 | Multi-View Consistent 3D GAN Inversion via Bidirectional Encoder | Haozhan Wu (Institute of Computing Technology, Chinese Academy of Sciences)*; Hu Han (Institute of Computing Technology, Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences); Xilin Chen (Institute of Computing Technology, Chinese Academy of Sciences) |
2 | Embedded Representation Learning Network for Animating Styled Video Portrait | Tianyong Wang (Southeast University); Xiangyu Liang (Southeast University); wangguandong zheng (Southeast University); Dan Niu (Southeast University); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)* |
3 | Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation | Anton Pelykh (University of Surrey)*; Ozge Mercanoglu Sincan (University of Surrey); Richard Bowden (University of Surrey) |
4 | RS-rPPG: Robust Self-Supervised Learning for rPPG | Marko Radisa Savic (University of Oulu)*; Guoying Zhao (University of Oulu) |
5 | An Active-gaze Morphable Model for 3D Gaze Estimation | Hao Sun (University of York); Nick E. Pears (University of York, UK)*; William Smith (University of York) |
6 | Occluded Person Retrieval with Hierarchical Feature Optimization | Yang Zhao (La Trobe University)*; Pengcheng Zhang (Beihang University); Xiaohan Yu (Griffith University); Zhibin Liao (University of Adelaide); Johan Verjans (SAHMRI); Xiao Bai (Beihang University); Wei Xiang (La Trobe University) |
7 | High-resolution Image Enumeration for Low-resolution Face Recognition | Can Chen (Kitware Inc.)*; Scott McCloskey (Kitware) |
8 | OpenThermalPose: An Open-Source Annotated Thermal Human Pose Dataset and Initial YOLOv8-Pose Baselines | Askat Kuzdeuov (Nazarbayev University)*; Darya Taratynova (Nazarbayev University); Alim Tleuliyev (Nazarbayev University); Huseyin Atakan Varol (Nazarbayev University) |
9 | A Unified Model for Gaze Following and Social Gaze Prediction | Anshul Gupta (Idiap Research Institute, EPFL)*; Samy Tafasca (Idiap Research Institute, EPFL); Naravich Chutisilp (EPFL); Jean-Marc ODOBEZ (IDIAP/EPFL, SWITZERLAND) |
10 | DrFER: Learning Disentangled Representations for 3D Facial Expression Recognition | Hebeizi Li (Beihang University)*; Hongyu Yang (Beihang University); Di Huang (Beihang University, China) |
11 | ClipSwap: Towards High Fidelity Face Swapping via Attribute and CLIP-Informed Loss | Phyo Thet Yee (IIT Ropar); Sudeepta Mishra (IIT Ropar); Abhinav Dhall (Flinders University)* |
12 | Multi-modal Human Behaviour Graph Representation Learning for Automatic Depression Assessment | Haotian Shen (University of Cambridge ); Siyang Song (University of Cambridge)*; Hatice Gunes (University of Cambridge) |
Posters Only | ||
13 | Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention | Gnana Praveen Rajasekhar (Computer Research Institute of Montreal)*; Jahangir Alam (Computer Research Institute of Montreal (CRIM), Montreal (Quebec) Canada) |
14 | VoxAtnNet: A 3D Point Clouds Convolutional Neural Network for Generalizable Face Presentation Attack Detection | Raghavendra Ramachandra (NTNU, Norway)*; Narayan Vetrekar (Goa University); Sushma Krupa Venkatesh (Aiba); Savita Nageshker (Goa University); Jag Mohan Singh (Norwegian University of Science and Technology (NTNU) Gjøvik); Rajendra Gad (UoG, India) |
15 | EAT-Face: Emotion-Controllable Audio-Driven Talking Face Generation via Diffusion Model | Haodi Wang (School of Computer Science and Engineering, Sun Yat-sen University); Xiaojun Jia (Nanyang Technological University); Xiaochun Cao (Sun Yat-sen University)* |
16 | Context-based Dataset for Analysis of Videos of Autistic Children | Sk Rahatul Jannat (University of South Florida); Heather Agazzi (University of South Florida); Shaun Canavan (University of South Florida)* |
17 | Seeing and hearing what has not been said; A multimodal client behavior classifier in Motivational Interviewing with interpretable fusion | Lucie Galland (ISIR)*; Catherine Pelachaud (CNRS, Sorbonne Université); Florian Pecune (Bordeaux University) |
18 | SignAvatar: Sign Language 3D Motion Reconstruction and Generation | Lu Dong (University at Buffalo)*; Lipisha Chaudhary (University at Buffalo, SUNY); Fei Xu (University at Buffalo, SUNY); Xiao Wang (Syracuse University); Mason Lary (SUNY Buffalo); Ifeoma Nwogu (University at Buffalo, SUNY) |
19 | PortraitDAE: Line-Drawing Portraits Style Transfer from Photos via Diffusion Autoencoder with Meaningful Encoded Noise | Yexiang Liu (Institute of Automation,Chinese Academy of Sciences); Jin Liu (Shanghaitech University); Jie Cao (Institute of Automation, Chinese Academy of Sciences); Junxian Duan (National Laboratory of Pattern Recognition); Ran He (Institute of Automation, Chinese Academy of Sciences)* |
20 | FE-Adapter: Adapting Image-based Emotion Classifiers to Videos | Shreyank N Gowda (University of Oxford)*; Boyan Gao (University of Oxford); David A Clifton (University of Oxford) |
21 | Latent Embedding Clustering for Occlusion Robust Head Pose Estimation | José Carlos Celestino (Instituto Superior Técnico)*; Manuel Marques (Institute for Systems and Robotics (ISR/LARSyS), DEEC, Instituto Superior Tecnico, Portugal); Jacinto C. Nascimento (Instituto Superior Tecnico de Lisboa) |
22 | Pivotal Tuning Editing: Towards Disentangled Wrinkle Editing with GANs | Neil Farmer (CentraleSupelec)*; Catherine SOLADIE (CentraleSupelec); Gabriel CAZORLA (Chanel); Renaud SEGUIER (CENTRALESUPELEC) |
23 | Data-Driven but Privacy-Conscious: Pedestrian Dataset De-identification via Full-Body Person Synthesis | Maxim Maximov (TUM)*; Tim Meinhardt (TUM); Caner Hazirbas (Meta AI); Zoe Papakipos (Meta); Canton Cristian (Meta AI); Laura Leal-Taixé (NVIDIA) |
24 | CrossGaze: A Strong Method for 3D Gaze Estimation in the Wild | Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest) |
25 | Survey of Automated Methods for Nonverbal Behavior Analysis in Parent-Child Interactions | Berfu Karaca (Utrecht University)*; Albert Ali Salah (Utrecht University); Jaap Denissen (Utrecht University); Ronald Poppe (Utrecht University); Sonja de Zwarte (Utrecht University) |
26 | Naive Data Augmentation Might Be Toxic: Data-prior Guided Self-supervised Representation Learning for Micro-gesture Recognition | Atif Shah (University of Oulu)*; Haoyu Chen (University of Oulu); Guoying Zhao (University of Oulu) |
27 | SMCTL: Subcarrier Masking Contrastive Transfer Learning For Human Gesture Recognition With Passive Wi-Fi Sensing | Hojjat Salehinejad (Mayo Clinic)*; Radomir Djogo (University of Toronto); Navid Hasanzadeh (University of Toronto); Shahrokh Valaee (University of Toronto) |
28 | Semantic-Aware Detail Enhancement for Blind Face Restoration | Huimin Zhao (Anhui University)*; Jie Cao (Institute of Automation, Chinese Academy of Sciences); Huaibo Huang (Institute of Automation, Chinese Academy of Sciences); Xiaoqiang Zhou (University of Science and Technology of China); Aihua Zheng (Anhui University); Ran He (Institute of Automation, Chinese Academy of Sciences) |
30 | Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models | René Haas (IT University of Copehagen); Inbar Huberman-Spiegelglas (Technion); Rotem Mulayoff (Technion); Stella Graßhof (IT University of Copenhagen)*; Sami S Brandt (IT University of Copenhagen); Tomer Michaeli (Technion) |
31 | Breaking Template Protection: Reconstruction of Face Images from Protected Facial Templates | Hatef Otroshi Shahreza (Idiap Research Institute)*; Sebastien Marcel (Idiap Research Institute) |
32 | Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson’s Disease Severity in Walking Sequences | Vida Adeli (University of Toronto)*; Soroush Mehraban (University of Toronto); Irene Ballester (TU Wien); Yasamin Zarghami (University of Toronto); Andrea Sabo (University of Toronto); Andrea Iaboni (Toronto Rehabilitation Institute); Babak Taati (University Health Network) |
34 | Visual Coherence Face Anonymization Algorithm Based on Dynamic Identity Perception | Xuan Tan (Hangzhou Dianzi University); Shanqing Zhang (Hangzhou Dianzi University); Yixuan Ju (University of Yamanashi); Xiaoyang mao (University of Yamanashi); Jiayi Xu (Hangzhou Dianzi University)* |
35 | PyraMoT: A Novel Framework for Enhanced Facial Thermal Landmarks Detection | Kais Riani (University of Michigan)*; Salem Sharak (University Of Michigan); Mohamed Abouelenien (University of Michigan) |
36 | Visual Saliency Guided Gaze Target Estimation with Limited Labels | Cheng Peng (King’s College London)*; Oya Celiktutan (King’s College London) |
37 | Hyp-OC: Hyperbolic One Class Classifier for Face Anti-Spoofing | Kartik Narayan (Johns Hopkins University)*; Vishal Patel (Johns Hopkins University) |
38 | Dynamic Cross Attention for Audio-Visual Person Verification | Gnana Praveen Rajasekhar (Computer Research Institute of Montreal)*; Jahangir Alam (Computer Research Institute of Montreal (CRIM), Montreal (Quebec) Canada) |
39 | Enhancing Privacy in Face Analytics Using Fully Homomorphic Encryption | Bharat Yalavarthi (University at Buffalo)*; Arjun Ramesh Kaushik (University at Buffalo, The State University of New York); Arun Ross (Michigan State University); Vishnu Boddeti (Michigan State University); Nalini Ratha (SUNY Buffalo) |
40 | CribNet: Enhancing Infant Safety in Cribs through Vision-based Hazard Detection | Shaotong Zhu (Northeastern University); Amal Mathew (Northeastern University); Elaheh Hatamimajoumerd (Northeastern University); Michael Wan (Northeastern University); Briana Taylor (The Roux Institute at Northeastern University); Rajagopal Venkatesaramani (Northeastern University); Sarah Ostadabbas (Northeastern University)* |
41 | 3D Face Morphing Attack Generation using Non-Rigid Registration | Jag Mohan Singh (Norwegian University of Science and Technology (NTNU) Gjøvik)*; Raghavendra Ramachandra (NTNU, Norway) |
42 | BTVSL: A Novel Sentence-Level Annotated Dataset for Bangla Sign Language Translation | Iftekhar E Mahbub Zeeon (Bangladesh University of Engineering and Technology); Mir Mahathir Mohammad (University of Utah); Muhammad Abdullah Adnan (University of California San Diego)* |
15:30- 16:30 | Oral Session 7 – Best Reviewed Papers Chair: Carlos Busso |
|
An Active-gaze Morphable Model for 3D Gaze Estimation | Hao Sun (University of York); Nick E. Pears (University of York, UK)*; William Smith (University of York) | |
Occluded Person Retrieval with Hierarchical Feature Optimization | Yang Zhao (La Trobe University)*; Pengcheng Zhang (Beihang University); Xiaohan Yu (Griffith University); Zhibin Liao (University of Adelaide); Johan Verjans (SAHMRI); Xiao Bai (Beihang University); Wei Xiang (La Trobe University) | |
High-resolution Image Enumeration for Low-resolution Face Recognition | Can Chen (Kitware Inc.)*; Scott McCloskey (Kitware) | |
OpenThermalPose: An Open-Source Annotated Thermal Human Pose Dataset and Initial YOLOv8-Pose Baselines | Askat Kuzdeuov (Nazarbayev University)*; Darya Taratynova (Nazarbayev University); Alim Tleuliyev (Nazarbayev University); Huseyin Atakan Varol (Nazarbayev University) | |
16:30 – 17:00 | Coffee break | |
17:00 – 18:00 | Oral Session 8 – Best reviewed Student Papers Chair: Itir Onal Ertugrul |
|
A Unified Model for Gaze Following and Social Gaze Prediction | Anshul Gupta (Idiap Research Institute, EPFL)*; Samy Tafasca (Idiap Research Institute, EPFL); Naravich Chutisilp (EPFL); Jean-Marc ODOBEZ (IDIAP/EPFL, SWITZERLAND) | |
DrFER: Learning Disentangled Representations for 3D Facial Expression Recognition | Hebeizi Li (Beihang University)*; Hongyu Yang (Beihang University); Di Huang (Beihang University, China) | |
ClipSwap: Towards High Fidelity Face Swapping via Attribute and CLIP-Informed Loss | Phyo Thet Yee (IIT Ropar); Abhinav Dhall (Indian Institute of Technology Ropar)* | |
Multi-modal Human Behaviour Graph Representation Learning for Automatic Depression Assessment | Haotian Shen (University of Cambridge ); Siyang Song (University of Cambridge)*; Hatice Gunes (University of Cambridge) | |
18:00 – 18:10 | Closing session |