IEEE FG 2024 received a total of 299 submissions and following a rigorous reviewing procedure, 118 papers were selected for presentation. The overall acceptance rate was 39.4%.
List of Accepted Papers
- Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks; Maksym Ivashechkin (University of Surrey)*; Richard Bowden (University of Surrey); Oscar Mendez (University of Surrey)
- Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces Recognition; Iurii Medvedev (University of Coimbra)*; Farhad Shadmand (University of Coimbra); Nuno Gonçalves (University of Coimbra)
- CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras: James Y Tang (University of British Columbia, Department of Computer Science)*; Shashwat Suri (University of British Columbia); Daniel Abidemi Ajisafe (The University of British Columbia); Bastian Wandt (Linköping University); Helge Rhodin (UBC)
- RS-rPPG: Robust Self-Supervised Learning for rPPG; Marko Radisa Savic (University of Oulu)*; Guoying Zhao (University of Oulu)
- An Active-gaze 3D Morphable Model for Eyeball, Eye Region and Gaze Reconstruction; Hao Sun (University of York); Nick E. Pears (University of York, UK)*; William Smith (University of York)
- Multi-View Consistent 3D GAN Inversion via Bidirectional Encoder; Haozhan Wu (Institute of Computing Technology, Chinese Academy of Sciences)*; Hu Han (Institute of Computing Technology, Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences); Xilin Chen (Institute of Computing Technology, Chinese Academy of Sciences)
- Uncalibrated Multi-view 3D Human Pose Estimation with Geometry Driven Attention; Victor Galizzi (CEA)*; Bertrand Luvison (CEA LIST)
- Hierarchical Generative Network for Face Morphing Attacks; Zuyuan He (SiChuan University); Zongyong Deng (Sichuan University); qiaoyun He (Sichuan University); Qijun Zhao (Sichuan University)*
- Adaptive Cross-architecture Mutual Knowledge Distillation; Jianyuan Ni (Texas State University)*; Hao Tang (ETH Zurich); Yuzhang Shang (Illinois Institute of Technology); Bin Duan (Illinois Institute of Technology); Yan Yan (Illinois Institute of Technology)
- PortraitDAE: Line-Drawing Portraits Style Transfer from Photos via Diffusion Autoencoder with Meaningful Encoded Noise; Liu Yexiang (Institute of Automation,Chinese Academy of Sciences); Jin Liu (Shanghaitech University); Jie Cao (Institute of Automation, Chinese Academy of Sciences); Junxian Duan (National Laboratory of Pattern Recognition); Ran He (Institute of Automation, Chinese Academy of Sciences)*
- GestSpoof: Gesture Based Spatio-Temporal Representation Learning For Robust Fingerprint Presentation Attack Detection; Bhavin Jawade (University at Buffalo)*; Shreeram Gudemaranahalli Subramanya (University at Buffalo); Atharv Dabhade (University at Buffalo, SUNY); Srirangaraj Setlur (University at Buffalo, SUNY); Venu Govindaraju (University at Buffalo, SUNY)
- Intra-Person Camera Adversarial for Intra-Camera Supervised Person Re-identification; Ruochen Tang (Southwest Jiaotong University)*; Xun Gong (Southwest Jiaotong University)
- Unveiling Gender Effects in Gait Recognition using Conditional-Matched Bootstrap Analysis; Azim Ibragimov (University of Florida)*; Mauricio Pamplona Segundo (University of South Florida); Sudeep Sarkar (University of South Florida, Tampa); Kevin W Bowyer (University of Notre Dame)
- Efficient Verification-Based Face Identification; Barak Battash (Intel)*; Amit Rozner (Bar-Ilan University); Ofir Lindenbaum (Yale ); Lior Wolf (Tel Aviv University, Israel)
- Geometry-Biased Transformer for Robust Multi-View 3D Human Pose Reconstruction; Olivier Moliner (Lund University)*; Sangxia Huang (Sony Research); Kalle Åström (Lund University)
- A Unified Model for Gaze Following and Social Gaze Prediction Anshul Gupta (Idiap Research Institute, EPFL)*; Samy Tafasca (Idiap Research Institute, EPFL); Naravich Chutisilp (EPFL); Jean-Marc ODOBEZ (IDIAP/EPFL, SWITZERLAND)
- Occluded Person Retrieval with Hierarchical Feature Optimization; Yang Zhao (La Trobe University)*; Pengcheng Zhang (Beihang University); Xiaohan Yu (Griffith University); Zhibin Liao (University of Adelaide); Johan Verjans (SAHMRI); Xiao Bai (Beihang University); Wei Xiang (La Trobe University)
- SignAvatar: Sign Language 3D Motion Reconstruction and Generation; Lu Dong (University at Buffalo)*; Lipisha Chaudhary (University at Buffalo, SUNY); Fei Xu (University at Buffalo, SUNY); Xiao Wang (Syracuse University); Mason Lary (SUNY Buffalo); Ifeoma Nwogu (University at Buffalo, SUNY)
- Spatio Temporal Sparse Graph Convolution Network for Hand Gesture Recognition; Omar Ikne (IMT Nord Europe)*; Rim Slama (CESI LINEACT); Hichem Saoudi (IMT Nord Europe); Hazem Wannous (IMT Nord Europe, CRIStAL UMR 9189)
- Skeleton-based Self-Supervised Feature Extraction for Improved Dynamic Hand Gesture Recognition; Omar Ikne (IMT Nord Europe)*; Benjamin Allaert (IMT Nord Europe); Hazem Wannous (IMT Nord Europe, CRIStAL UMR 9189)
- PointFaceFormer: local and global attention based transformer for 3D point cloud face recognition; Ziqi Gao (shenzhen university); Qiufu Li (shenzhen university); Gui Wang (WKU); Linlin Shen (Shenzhen University)*
- Deepfake: Classifiers, Fairness, and Demographically Robust Algorithm; Akshay Agarwal (IISER Bhopal)*; Nalini Ratha (SUNY Buffalo)
- DrFER: Learning Disentangled Representations for 3D Facial Expression Recognition; Hebeizi Li (Beihang University)*; Hongyu Yang (Beihang University); Di Huang (Beihang University, China)
- If It’s Not Enough, Make It So: Reducing Authentic Data Demand in Face Recognition through Synthetic Faces; Andrea Atzori (University of Cagliari)*; Fadi Boutros (Fraunhofer IGD); Naser Damer (Fraunhofer Institute for Computer Graphics Research IGD and TU Darmstadt ); Gianni Fenu (University of Cagliari); Mirko Marras (University of Cagliari)
- Subject-Based Domain Adaptation for Facial Expression Recognition; Muhammad Osama Zeeshan (École de technologie supérieure)*; Muhammad Haseeb Aslam (ETS); Soufiane Belharbi (ÉTS Montreal); Alessandro Lameiras Koerich (École de technologie supérieure ); Marco Pedersoli (École de technologie supérieure); Simon Bacon (Concordia University); Eric Granger (ETS Montreal )
- Efficient Detection of Disguised Faces using Photos/Sketches from Low-Quality Surveillance Footage; Nikhil Reddy Pottanigari (University of Montreal (MILA))*; Rithin Pullela (Texas A&M University); Abdul kalam azad Shaik (university of florida ); Rithik Reddy Katpally (San Jose State University)
- Context-based Dataset for Analysis of Videos of Children with Risk for Autism Spectrum Disorder; Sk Rahatul Jannat (University of South Florida); Heather Agazzi (University of South Florida); Shaun Canavan (University of South Florida)*
- Seeing and hearing what has not been said; A multimodal client behavior classifier in Motivational Interviewing with interpretable fusion; Lucie Galland (ISIR)*; Catherine Pelachaud (CNRS, Sorbonne Université); Florian Pecune (Bordeaux University)
- Face the Needle: Predicting risk of fear and fainting during blood donation through video analysis; Judita Rudokaite (Tilburg University)*; Itir Onal Ertugrul (Utrecht University); Sharon Ong (Tilburg University); Mart Janssen (Sanquin); Elisabeth Huis in ‘t Veld (Tilburg University)
- VoxAtnNet: A 3D Point Clouds Convolutional Neural Network for Generalizable Face Presentation Attack Detection; Raghavendra Ramachandra (NTNU, Norway)*; Narayan Vetrekar (Goa University); Sushma Krupa Venkatesh (Aiba); Savita Nageshker (Goa University); Jag Mohan Singh (Norwegian University of Science and Technology (NTNU) Gjøvik); Rajendra Gad (UoG, India)
- Crowd Detection via Point Localization with Diffusion Models; Don Yasiru L Ranasinghe (Johns Hopkins University)*; Vishal Patel (Johns Hopkins University)
- EAT-Face: Emotion-Controllable Audio-Driven Talking Face Generation via Diffusion Model; Haodi Wang (School of Computer Science and Engineering, Sun Yat-sen University); Xiaojun Jia (Nanyang Technological University); Xiaochun Cao (Sun Yat-sen University)*
- Dataset Infant Anonymization with Pose and Emotion Retention; Mason Lary (SUNY Buffalo)*; Matthew M Klawonn (US Air Force Research Laboratory); Daniel Messinger (University of Miami); Ifeoma Nwogu (University at Buffalo, SUNY)
- Lip and speech synchronization using supervised contrastive learning and cross-modal attention; Munender Varshney (Hitachi Research and Development Center)*; Mayurakshi Mukherji (Hitachi India Pvt. Ltd.); Senthil raja G (Hitachi India Pvt. Ltd); Ananth Ganesh (Hitachi India); Kingshuk Banerjee (Hitachi India Pvt. Ltd)
- BEAVP: A Bidirectional Enhanced Adversarial Model for Video Prediction; Peiyuan Zhu (Tongji University); Fengxia Han (Tongji University); Shengjie Zhao (Tongji University); Hao Deng (Tongji Universtiy)*
- Embedded Representation Learning Network for Animating Styled Video Portrait; Tianyong Wang (Southeast University); Xiangyu Liang (Southeast University); wangguandong zheng (Southeast University); Dan Niu (Southeast University); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)*
- CSTalk: Correlation Supervised Speech-driven 3D Emotional Facial Animation Generation; Xiangyu Liang (Southeast University); Wenlin Zhuang (Southeast University); Tianyong Wang (Southeast University); Guangxing Geng (Nanjing 8:8 Digital Technology Co., Ltd); Guangyue Geng (Nanjing 8:8 Digital Technology Co., Ltd); Haifeng Xia (Southeast University); Siyu Xia (Southeast University, China)*
- ClipSwap: Towards High Fidelity Face Swapping via Attribute and CLIP-Informed Loss; Phyo Thet Yee (IIT Ropar); Abhinav Dhall (Indian Institute of Technology Ropar)*
- Designing Cross-Race Tests for Forensic Facial Examiners, Super-recognizers, and Face Recognition Algorithm; Géraldine Jeckeln (The University of Texas at Dallas)*; Selin Yavuzcan (The University of Texas at Dallas); Kate A. Marquis (The University of Texas at Dallas); Prajay S. Mehta (The University of Texas at Dallas); Amy N. Yates (National Institute of Standards and Technology); P Jonathon Phillips (NIST); Alice O’Toole (University of Texas at Dallas)
- Data Augmentation Techniques for Enhanced Facial Landmarks Detection in Patients with Repaired Cleft Lip and Palate; Karen Rosero (University of Texas at Dallas)*; Ali N Salman (University of Texas at Dallas ); Berrak Sisman (University of Texas at Dallas); Rami Hallac (University of Texas Southwestern Medical Center, Children’s Medical Center); Carlos Busso (University of Texas at Dallas)”
- Multi-modal Human Behaviour Graph Representation Learning for Automatic Depression Assessment; Haotian Shen (University of Cambridge ); Siyang Song (University of Cambridge)*; Hatice Gunes (University of Cambridge)
- Human Action Recognition with Multi-Level Granularity and Pair-wise Hyper GCN; Tamam Alsarhan (Khalifa University)*; Tamam Alsarhan (The university of Jordan); Ayoub Alsarhan (Hashemite university); Syed Sadaf Ali (Khalifa University); Iyyakutti Iyappan Ganapathi (Khalifa University); Naoufel Werghi (Khalifa University of Science and Technology)
- TetraLoss: Improving the Robustness of Face Recognition against Morphing Attacks; Mathias Ibsen (Hochschule Darmstadt)*; Lazaro Janier Gonzalez-Soler (Hochschule Darmstadt); Christian Rathgeb (Hochschule Darmstadt); Christoph Busch (Hochschule Darmstadt)
- High-resolution Image Enumeration for Low-resolution Face Recognition; Can Chen (Kitware Inc.)*; Scott McCloskey (Kitware)
- One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features; Trung Thanh NGUYEN (Nagoya Univeristy)*; Yasutomo Kawanishi (RIKEN); Takahiro Komamizu (Nagoya University); Ichiro Ide (Nagoya University)
- MGRFormer: A Multimodal Transformer Approach for Surgical Gesture Recognition; Kevin Feghoul (University of Lille)*; Deise S Maia (Université de Lille); Mehdi Elamrani (CHU Lille); Mohamed Daoudi (IMT Nord Europe); Ali Amad (University of Lille)
- Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention; Gnana Praveen Rajasekhar (Computer Research Institute of Montreal)*; Jahangir Alam (Computer Research Institute of Montreal (CRIM), Montreal (Quebec) Canada)
- epsilon-Mesh Attack: A Surface-based Adversarial Point Cloud Attack for Facial Expression Recognition; Mert Gülşen (Istanbul Technical University); Batuhan Cengiz (Istanbul Technical University)*; Yusuf Hüseyin Şahin (İTÜ); Gozde Unal (Istanbul Technical University)
- Multi-Scale Spatio-Temporal Graph Convolutional Network for Facial Expression Spotting ; Yicheng Deng (Osaka University)*; Hideaki Hayashi (Osaka University); Hajime Nagahara (Osaka University)
- CCDb-HG: Novel Annotations and Gaze-Aware Representations for Head Gesture Recognition; Pierre Vuillecard (Idiap)*; Arya Farkhondeh (Idiap Research Institute); Michael Villamizar (Idiap Research Institute); Jean-Marc ODOBEZ (IDIAP/EPFL, SWITZERLAND)
- GaitPT: Skeletons Are All You Need For Gait Recognition; Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest)
- QGFace: Quality-Guided Joint Training for Mixed Quality Face Recognition; Youzhe Song (East China Normal University)*; Feng Wang (East China Normal University)
- ONOT: a High-Quality ICAO-compliant Synthetic Mugshot Dataset; Nicolò Di Domenico (Alma Mater Studiorum – Università di Bologna); Guido Borghi (University of Bologna)*; Annalisa Franco (University of Bologna); Davide Maltoni (University of Bologna)
- CribNet: Enhancing Infant Safety in Cribs through Vision-based Hazard Detection; Shaotong Zhu (Northeastern University); Amal Mathew (Northeastern University); Elaheh Hatamimajoumerd (Northeastern University); Michael Wan (Northeastern University); Briana Taylor (The Roux Institute at Northeastern University); Rajagopal Venkatesaramani (Northeastern University); Sarah Ostadabbas (Northeastern University)*
- Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder; Mohamed I Lakhal (University of Surrey)*; Richard Bowden (University of Surrey)
- EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition; Niki M Foteinopoulou (Queen Mary University of London)*; Ioannis Patras (Queen Mary University of London)
- Deep adaptative spectral zoom for improved remote heart rate estimation; Joaquim Comas Martínez (Universitat Pompeu Fabra)*; Federico Sukno (Pompeu Fabra University); Adria Ruiz (Pompeu Fabra University)
- Resource-Efficient Gesture Recognition using Low-Resolution Thermal Camera via Spiking Neural Networks and Sparse Segmentation; Ali Safa (KU Leuven – IMEC)*; Wout Mommen (VUB – IMEC); Piet Wambacq (IMEC -VUB); Lars Keuninckx (imec)
- SynthSL: Expressive Humans for Sign Language Image Synthesis; Jilliam M. Diaz Barros (German Research Center for Artificial Intelligence)*; Chen-Yu Wang (DFKI); Jameel Malik (DFKI); Abdalla Arafa (DFKI); Didier Stricker (DFKI)
- In-Domain Inversion for Improved 3D Face Alignment on Asymmetrical Expressions; Jilliam M. Diaz Barros (German Research Center for Artificial Intelligence)*; Jason Rambach (DFKI); Pramod Murthy (DFKI); Didier Stricker (DFKI)
- Distilling Privileged Multimodal Information for Expression Recognition using Optimal Transport; Muhammad Haseeb Aslam (ETS)*; Muhammad Osama Zeeshan (École de technologie supérieure); Soufiane Belharbi (ÉTS Montreal); Marco Pedersoli (École de technologie supérieure); Alessandro Lameiras Koerich (École de technologie supérieure ); Simon Bacon (Concordia University); Eric Granger (ETS Montreal )
- MIMIC-Pose: Implicit Membership Discrimination of Body Joints for Human Pose Estimation; YING HUANG (Hangzhou Normal University)*; Shanfeng Hu (Northumbria University)
- 3D Face Modeling via Weakly-supervised Disentanglement Network joint Identity-consistency Prior; Guohao Li (BUAA)*; Hongyu Yang (Beihang University); Di Huang (Beihang University, China); Yunhong Wang (State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China)
- Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation; Anton Pelykh (University of Surrey)*; Ozge Mercanoglu Sincan (University of Surrey); Richard Bowden (University of Surrey)
- Face Anti-spoofing via Interaction Learning with Face Image Quality Alignment; Yongluo Liu (Beijing University of Technology); Zun Li (Beijing University of Technology); Shuyi Li (Beijing University of Technology); Zhuming Wang (Beijing University of Technology); Lifang Wu (Beijing University of Technology)*
- Rank and Sort Loss-Aware Label Assignment with Centroid Prior for Dense Object Detection; Shicheng Zu (Fordham University)*; Yucheng Jin (Fordham University)
- Latent Embedding Clustering for Occlusion Robust Head Pose Estimation; José Carlos Celestino (Instituto Superior Técnico)*; Manuel Marques (Institute for Systems and Robotics (ISR/LARSyS), DEEC, Instituto Superior Tecnico, Portugal); Jacinto C. Nascimento (Instituto Superior Tecnico de Lisboa)
- Pivotal Tuning Editing: Towards Disentangled Wrinkle Editing with GANs; Neil Farmer (CentraleSupelec)*; catherine SOLADIE (CentraleSupelec); Gabriel CAZORLA (Chanel); Renaud SEGUIER (CENTRALESUPELEC)
- Visual Saliency Guided Gaze Target Estimation with Limited Labels; Cheng Peng (King’s College London)*; Oya Celiktutan (King’s College London)
- Patch-based Privacy Attention for Weakly-supervised Privacy-Preserving Action Recognition; Xiao Li (Sun Yat-sen University); Yukun Qiu (Sun Yat-sen University); Yi-Xing Peng (Sun Yat-sen University, China); WEI-SHI ZHENG (Sun Yat-sen University, China)*
- A Data-Driven Representation for Sign Language Production; Harry Walsh (University of Surrey)*; Abolfazl Zargari Khuzani (Intel); Mariam Rahmani (Intel Corporation); Richard Bowden (University of Surrey)
- 3D Face Morphing Attack Generation using Non-Rigid Registration; Jag Mohan Singh (Norwegian University of Science and Technology (NTNU) Gjøvik)*; Raghavendra Ramachandra (NTNU, Norway)
- FE-Adapter: Adapting Image-based Emotion Classifiers to Videos; Shreyank N Gowda (University of Oxford)*; Boyan Gao (University of Oxford); David A Clifton (University of Oxford)
- ViewDiffGait: View Pyramid Diffusion for Gait Recognition; Rijun Liao (University of Missouri-Kansas City)*; Zhu Li (university of missouri-kansas city); Shuvra Bhattacharyya (University of Maryland); George York (US Air Force Academy)
- A Gloss-free Sign Language Production with Discrete Representation; Eui Jun Hwang (KAIST)*; Huije Lee (Korea Advanced Institute of Science and Technology); Jong C. Park (KAIST)
- DPA-2D: Depth Propagation and Alignment with 2D Observations Guidance for Human Mesh Recovery; Weihao You (Tomorrow Advancing Life)*; Pengcheng Wang (Tomorrow Advancing Life); Jinfeng Bai (Tomorrow Advance Life); zhilong ji (Tomorrow Advancing Life)
- Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis, Guanyu Hu (Xi’an Jiaotong University ); Eleni Papadopoulou (NTUA); Dimitrios Kollias (Queen Mary University London)*; Paraskevi Tzouveli (NTUA); JIE WEI (Xi’an Jiaotong University); Xinyu Yang (Xi’an Jiaotong University)
- Expression-aware Masking and Progressive Decoupling for Cross-database Facial Expression Recognition; Tao Zhong (Shenzhen University); Xiaole Xian (Shenzhen University); Zihan Wang (Shenzhen University); Weicheng Xie (Shenzhen University)*; Linlin Shen (Shenzhen University)
- HR-xNet: A Novel High-Resolution Network for Human Pose Estimation with Low Resource Consumption; Cun Feng (Ningbo University); Rong Zhang (Ningbo University)*; Lijun Guo (Ningbo University)
- SMCTL: Subcarrier Masking Contrastive Transfer Learning For Human Gesture Recognition With Passive Wi-Fi Sensing; Hojjat Salehinejad (Mayo Clinic)*; Radomir Djogo (University of Toronto); Navid Hasanzadeh (University of Toronto); Shahrokh Valaee (University of Toronto)
- In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition; Wiktor Mucha (Vienna University of Technology, Computer Vision Lab)*; Martin Kampel (Vienna University of Technology, Computer Vision Lab)
- Hyp-OC: Hyperbolic One Class Classifier for Face Anti-Spoofing; Kartik Narayan (Johns Hopkins University)*; Vishal Patel (Johns Hopkins University)
- The Paradox of Motion: Evidence for Spurious Correlations in Skeleton-based Gait Recognition Models; Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest)
- Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition; Zhendong Liu (Southeast University); Haifeng Xia (Southeast University); Tong Guo (Southeast University); Libo Sun (Southeast University); Ming Shao (University of Massachusetts Dartmouth); Siyu Xia (Southeast University, China)*
- Data-Driven but Privacy-Conscious: Pedestrian Dataset De-identification via Full-Body Person Synthesis; Maxim Maximov (TUM)*; Tim Meinhardt (TUM); Caner Hazirbas (Meta AI); Zoe Papakipos (Meta); Canton Cristian (Meta AI); Laura Leal-Taixé (NVIDIA)
- Attention Prompt Tuning: A Parameter-Efficient Adaptation of Pre-Trained Models for Action Recognition; Wele Gedara Chaminda Bandara (Apple Inc)*; Vishal Patel (Johns Hopkins University)
- Semantic-Aware Detail Enhancement for Blind Face Restoration; Huimin Zhao (Anhui University)*; Jie Cao (Institute of Automation, Chinese Academy of Sciences); Huaibo Huang (Institute of Automation, Chinese Academy of Sciences); Xiaoqiang Zhou (University of Science and Technology of China); Aihua Zheng (Anhui University); Ran He (Institute of Automation, Chinese Academy of Sciences)
- RFIS-FPI: Reversible Face Image Steganography Neural Network for Face Privacy Interactions; Yubo Huang (Southwest Jiaotong Unverisity)*; Anran Zhu (Southwest Jiaotong University); Cheng Zeng (Southwest Jiaotong University); Cong Hu (Southwest Jiaotong University); Xin Lai (Southwest Jiaotong University); Wenhao Feng (Southwest Jiaotong University); Fan Chen (Southwest Jiaotong University)
- BTVSL: A Novel Sentence-Level Annotated Dataset for Bangla Sign Language Translation; Iftekhar E Mahbub Zeeon (Bangladesh University of Engineering and Technology); Mir Mahathir Mohammad (University of Utah); Muhammad Abdullah Adnan (University of California San Diego)*
- OpenThermalPose: An Open-Source Annotated Thermal Human Pose Dataset and Initial YOLOv8-Pose Baselines; Askat Kuzdeuov (Nazarbayev University)*; Darya Taratynova (Nazarbayev University); Alim Tleuliyev (Nazarbayev University); Huseyin Atakan Varol (Nazarbayev University)
- Hand Graph Topology Selection for Skeleton-based Sign Language Recognition; Oğulcan Özdemir (Bogazici University)*; Inci M. Baytas (Bogazici University); Lale Akarun (Bogazici University)
- Boosting Gesture Recognition with An Automatic Gesture Annotation Framework; Junxiao Shen (University of Cambridge)*; Xuhai Xu (Meta Reality Lab Research); Ran Tan (Meta Reality Labs Research); Amy Karlson (Meta Reality Labs Research); Evan Strasnick (Meta Reality Labs Research)
- Unlocking the Black Box: Concept-Based Modeling for Interpretable Affective Computing Applications; Xinyu Li (University of Glasgow)*; Marwa Mahmoud (University of Glasgow)
- Unconstrained Hand Recognition using Thermal Infrared Sensing of Dorsal Veins Wallace Lawson (Naval Research Laboratory)*; Grant Daneils (Naval Research Laboratory); Daniel Steinhurst (Nova Research); David Kidwell (Naval Research Laboratory)
- Explainable Face Verification via Feature-Guided Gradient Backpropagation; Yuhang Lu (EPFL)*; Zewei Xu (EPFL); Touradj Ebrahimi (EPFL)
- Improving 2D Human Pose Estimation in Unseen Camera Views with Synthetic Data; Miroslav Purkrabek (Czech Technical University, Prague)*; Jiri Matas (Czech Technical University, Prague)
- A Study on End-to-End Face Analysis: How to Cope with Challenges; Doganay Demir (TRT); İlknur Durgar Elkahlout (TRT)*
- Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models; René Haas (IT University of Copehagen); Inbar Huberman-Spiegelglas (Technion); Rotem Mulayoff (Technion); Stella Graßhof (IT University of Copenhagen)*; Sami S Brandt (IT University of Copenhagen); Tomer Michaeli (Technion)
- Social-MAE: A Transformer-Based Multimodal Autoencoder for Face and Voice; Hugo Bohy (University of Mons)*; Kevin El Haddad (University of Mons/The Big Projects); Minh Tran (University of Southern California); Thierry Dutoit (University of Mons); Mohammad Soleymani (University of Southern California)
- Breaking Template Protection: Reconstruction of Face Images from Protected Facial Templates; Hatef Otroshi Shahreza (Idiap Research Institute)*; Sebastien Marcel (Idiap Research Institute)
- Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets; Ahmet Alp Kindiroglu (Huawei)*; Ozgur Kara (Georgia Institute of Technology); Oğulcan Özdemir (Bogazici University); Lale Akarun (Bogazici University)
- Evaluating Recent 2D Human Pose Estimators for 2D-3D Pose Lifting; Soroush Mehraban (University of Toronto)*; Yiqian Qin (University of Toronto); Babak Taati (University Health Network)
- Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson’s Disease Severity in Walking Sequences; Vida Adeli (University of Toronto)*; Soroush Mehraban (University of Toronto); Irene Ballester (TU Wien); Yasamin Zarghami (University of Toronto); Andrea Sabo (University of Toronto); Andrea Iaboni (Toronto Rehabilitation Institute); Babak Taati (University Health Network)
- CrossGaze: A Strong Method for 3D Gaze Estimation in the Wild; Andy Eduard Catruna (University Politehnica Of Bucharest)*; Adrian Cosma (University Politehnica of Bucharest); Emilian Radoi (Politehnica University of Bucharest)
- Survey of Automated Methods for Nonverbal Behavior Analysis in Parent-Child Interactions; Berfu Karaca (Utrecht University)*; Sonja de Zwarte (Utrecht University); Ronald Poppe (Utrecht University); Albert Ali Salah (Utrecht University); Jaap Denissen (Utrecht University)
- Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues; Soufiane Belharbi (ÉTS Montreal)*; Marco Pedersoli (École de technologie supérieure); Alessandro Lameiras Koerich (École de technologie supérieure ); Simon Bacon (Concordia University); Eric Granger (ETS Montreal )
- The Seven Faces of Stress: Understanding Facial Activity Patterns during Cognitive Stress; Carla Viegas (Carnegie Mellon University)*; Roy A Maxion (“””Carnegie Mellon University, USA”””); Alexander Hauptmann (Carnegie Mellon University); Joao Magalhaes (Universidade NOVA Lisboa)
- DualH: A Dual Hierarchical Model for Temporal Action Localization; Zejian Zhang (Universitat de Barcelona)*; Cristina Palmero (Universitat de Barcelona); Sergio Escalera (Universitat de Barcelona)
- Visual Coherence Face Anonymization Algorithm Based on Dynamic Identity Perception; Xuan Tan (Hangzhou Dianzi University); Shanqing Zhang (Hangzhou Dianzi University); Yixuan Ju (University of Yamanashi); Xiaoyang mao (University of Yamanashi); Jiayi Xu (Hangzhou Dianzi University)*
- ASPECD: Adaptable Soft-Biometric Privacy-Enhancement Using Centroid Decoding for Face Verification; Peter Rot (Univerza v Ljubljani, Fakulteta za Elektrotehniko)*; Philipp Terhörst (Paderborn University); Peter Peer (University of Ljubljana); Vitomir Struc (University of Ljubljana)
- Dynamic Cross Attention for Audio-Visual Person Verification; Gnana Praveen Rajasekhar (Computer Research Institute of Montreal)*; Jahangir Alam (Computer Research Institute of Montreal (CRIM), Montreal (Quebec) Canada)
- AerialFace: A Light Weight Framework for Unmanned Aerial Vehicle Face Recognition; Zhiquan Ou (Hohai University); Liang Yao (Hohai University)*; Ting Wu (Hohai University); Fan Liu (Hohai University)
- Towards Better Communication: Refining Hand Pose Estimation in Low-Resolution Sign Language Videos; Sümeyye M Taşyürek (Hacettepe University)*; Tuğçe Kızıltepe (Hacettepe University); Hacer Yalim Keles (Hacettepe University)
- Improving Template Protection in Face Analytics; Bharat Yalavarthi (University at Buffalo)*; Arjun Ramesh Kaushik (University at Buffalo, The State University of New York); Arun Ross (Michigan State University); Vishnu Boddeti (Michigan State University); Nalini Ratha (SUNY Buffalo)
- PyraMoT: A Novel Framework for Enhanced Facial Thermal Landmarks Detection; Kais Riani (University of Michigan)*; Salem Sharak (University Of Michigan); Mohamed Abouelenien (University of Michigan)
- HM-Auth: Redefining User Authentication in Immersive Virtual World through Hand Movement Signatures; Sindhu Reddy Kalathur Gopal (University of Wyoming)*; Paul Gyreyiri (University of Wyoming); Diksha Shukla (University of Wyoming)
- Quantifying Biometric Characteristics of Hand Gestures through Feature Space Probing and Identity-Level Cross-Gesture Disentanglemen; Aman Verma (Indian Institute of Technology Delhi); Gaurav Jaswal (Indian Institute of Technology Delhi); Seshan Srirangarajan (Indian Institute of Technology Delhi)*; Sumantra Dutta Roy (Indian Institute of Technology Delhi)
- Naive Data Augmentation Might Be Toxic: Data-prior Guided Self-supervised Representation Learning for Micro-gesture Recognition; Atif Shah (University of Oulu)*; Haoyu Chen (University of Oulu); Guoying Zhao (University of Oulu)