EIFER
Electromyography-Informed Facial Expression Reconstruction For Physiological-Based Synthesis and Analysis

¹Computer Vision Group, Friedrich Schiller University Jena  ²Jena University Hospital
Jena, Germany
🎉 CVPR 2025 🎉

Bridging the gap between mimics and muscles: Our method EIFER utilizes neural unpaired image-to-image translation to decouple facial geometry and appearance for muscle-activity-based expression synthesis and electrode-free facial electromyography.

Abstract

The relationship between muscle activity and resulting facial expressions is crucial for various fields, including psychology, medicine, and entertainment. The synchronous recording of facial mimicry and muscular activity via surface electromyography (sEMG) provides a unique window into these complex dynamics. Unfortunately, existing methods for facial analysis cannot handle electrode occlusion, rendering them ineffective. Even with occlusion-free reference images of the same person, variations in expression intensity and execution are unmatchable. Our electromyography-informed facial expression reconstruction (EIFER) approach is a novel method to restore faces under sEMG occlusion faithfully in an adversarial manner. We decouple facial geometry and visual appearance (e.g., skin texture, lighting, electrodes) by combining a 3D Morphable Model (3DMM) with neural unpaired image-to-image translation via reference recordings. Then, EIFER learns a bidirectional mapping between 3DMM expression parameters and muscle activity, establishing correspondence between the two domains. We validate the effectiveness of our approach through experiments on a dataset of synchronized sEMG recordings and facial mimicry, demonstrating faithful geometry and appearance reconstruction. Further, we show how to synthesize expressions based on muscle activity and how observed expressions can predict dynamic muscle activity. Consequently, EIFER introduces a new paradigm for facial electromyography, which could be extended to other forms of multi-modal face recordings.

Main Insights

Method

EIFER Method

EIFER (Electromyography-Informed Facial Expression Reconstruction) introduces a novel approach for facial expression analysis that addresses the challenges posed by surface electromyography (sEMG) electrode occlusion. Our method is grounded in the principle of decoupling facial geometry and appearance, achieved through a neural unpaired image-to-image translation framework (see the figure above). EIFER leverages 3D Morphable Models (3DMMs), specifically FLAME, to provide a parametric representation of facial shape and expression. We see the 3DMM expression space, linked to muscle activity, as the missing piece between computer animation and physiologically grounded facial expression analysis.
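As a rough illustration of why a 3DMM helps with decoupling: a linear morphable model represents a face mesh as a template deformed by separate shape (identity) and expression bases, so identity stays fixed across a recording while expression varies per frame. The sketch below is a minimal stand-in, not the actual FLAME implementation; the bases are random placeholders and only the vertex count matches FLAME:

```python
import numpy as np

rng = np.random.default_rng(0)

N_VERTS = 5023              # FLAME's mesh has 5023 vertices; the bases below are random stand-ins
N_SHAPE, N_EXPR = 100, 50   # illustrative numbers of shape / expression components

template = rng.normal(size=(N_VERTS, 3))                # mean face
shape_basis = rng.normal(size=(N_VERTS * 3, N_SHAPE))   # identity directions
expr_basis = rng.normal(size=(N_VERTS * 3, N_EXPR))     # expression directions

def reconstruct(beta, psi):
    """Linear 3DMM: template plus shape offset plus expression offset."""
    offset = shape_basis @ beta + expr_basis @ psi
    return template + offset.reshape(N_VERTS, 3)

# Identity (beta) is estimated once and held fixed; expression (psi)
# changes per frame -- the geometry EIFER reconstructs under occlusion.
beta = rng.normal(size=N_SHAPE) * 0.01
neutral = reconstruct(beta, np.zeros(N_EXPR))
expressive = reconstruct(beta, rng.normal(size=N_EXPR) * 0.01)
```

Appearance factors such as skin texture, lighting, and the electrodes themselves are deliberately left out of this parametric geometry and handled by the image-to-image translation instead.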

For monocular 3D face reconstruction, EIFER employs neural differentiable rendering and pre-trained SMIRK encoder networks. These encoders, consisting of sub-encoders based on MobileNetV3 backbones, estimate 3DMM parameters (shape, expression, pose) from the sEMG-electrode-occluded input images. The core of EIFER's occlusion handling lies in a CycleGAN-like adversarial architecture for unpaired image-to-image translation. This architecture comprises generator and discriminator networks trained adversarially on unpaired sets of sEMG-occluded and occlusion-free (reference) facial images. Crucially, EIFER learns a bidirectional mapping between the 3DMM expression parameter space and measured sEMG muscle activity through multi-layer perceptrons (MLPs). This bidirectional mapping enables both the synthesis of facial expressions from muscle activity and the prediction of muscle activity from observed facial expressions, effectively achieving electrode-free facial electromyography.
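The bidirectional expression-muscle mapping can be pictured as two small MLPs, one per direction (Exp2EMG and EMG2Exp). The sketch below is only a conceptual stand-in: the layer sizes, sEMG channel count, and random weights are illustrative assumptions, not the trained EIFER networks:

```python
import numpy as np

rng = np.random.default_rng(42)

N_EXPR, N_EMG, HIDDEN = 50, 16, 128   # assumed: expression dim, sEMG channels, hidden width

def make_mlp(d_in, d_hidden, d_out):
    """Two-layer MLP with ReLU; random weights stand in for trained ones."""
    W1 = rng.normal(scale=0.1, size=(d_in, d_hidden))
    W2 = rng.normal(scale=0.1, size=(d_hidden, d_out))
    return lambda x: np.maximum(x @ W1, 0.0) @ W2

exp2emg = make_mlp(N_EXPR, HIDDEN, N_EMG)   # expression params -> muscle activity
emg2exp = make_mlp(N_EMG, HIDDEN, N_EXPR)   # muscle activity -> expression params

psi = rng.normal(size=N_EXPR)     # 3DMM expression vector for one frame
emg_pred = exp2emg(psi)           # "electrode-free EMG": predicted muscle activity
psi_synth = emg2exp(emg_pred)     # expression synthesis from muscle activity
```

Running both directions in sequence, as above, is what establishes the correspondence between the two domains: observed expressions yield muscle activity, and muscle activity yields animatable expression parameters.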

Key Advantages of EIFER:

  • Reconstructs Facial Expressions Under Occlusion: EIFER uniquely addresses the challenge of sensor occlusion, accurately reconstructing facial expressions even when electrodes or other visual obstructions are present.
  • Combines Video and Muscle Activity Data: By leveraging both video and muscle activity (sEMG) data, EIFER provides a more comprehensive, physiologically grounded, and fully data-driven understanding of facial expressions.
  • Enables Electrode-Free Facial Electromyography: EIFER can predict muscle activity from facial expressions alone, paving the way for non-invasive, electrode-free facial electromyography in the future.
  • Provides Accurate 3D Facial Geometry and Appearance: EIFER not only reconstructs a visually realistic face but also captures the underlying 3D geometry, providing richer data for analysis and synthesis.
  • Utilizes Advanced AI Techniques: Built upon state-of-the-art 3D Morphable Models and unpaired image-to-image translation, EIFER represents a significant advancement in facial analysis technology.
  • Opens Doors for Multi-Modal Facial Analysis: EIFER facilitates the integration of various data streams for a more holistic understanding of facial expressions, with potential applications in medicine, psychology, human-computer interaction, and animation.

Related Links

EIFER combines many ideas from recent works in computer vision, graphics, and machine learning. Here are some related works that you might find interesting:

  • FLAME introduced a 3D Morphable Model that captures facial shape and expression variations.
  • SMIRK replaced the traditional appearance model with a neural renderer and decoupled the facial encoding into three sub-encoders.
  • CycleGAN and MC-CycleGAN are unpaired image-to-image translation methods that have inspired our adversarial training.
  • DECA, EMOCAv2, SMIRK, Deep3DFace, and FOCUS are all state-of-the-art methods for monocular 3D face reconstruction and analysis. We provide Exp2EMG and EMG2Exp models for these methods.

Acknowledgments

This publication is a joint effort of the Computer Vision Group at Friedrich Schiller University Jena and the Department of Otorhinolaryngology at Jena University Hospital. The project Bridging the gap: Mimics and Muscles was funded by the German Science Foundation under grant numbers DFG DE-735/15-1 and DFG GU-463/12-1. We would like to thank all participants who volunteered for the data collection and the reviewers for their valuable feedback.

We thank Niklas Penzel, Gideon Stein, Sai Vemuri, Sven Sickert, and Maha Shadaydeh for their manuscript feedback and advice throughout the project. Additionally, we would like to thank Nadiya Müller, Vanessa Tretzsch, Martin Heinrich, Anna-Maria Kuttenreich, Christian Dobel, Gerd Fabian Volk, Roland Graßme, Paul Funk, and Richard Schneider for their support. Furthermore, we thank Bernhard Egger for highlighting the potential of our data for facial expression synthesis.

BibTeX (arXiv Preprint)

If you find our work useful, use our models, build on our dataset, or use parts of our code, please cite our work:
@article{buchner2025electromyography,
      title={Electromyography-Informed Facial Expression Reconstruction for Physiological-Based Synthesis and Analysis},
      author={B{\"u}chner, Tim and Anders, Christoph and Guntinas-Lichius, Orlando and Denzler, Joachim},
      journal={arXiv preprint arXiv:2503.09556},
      year={2025}
}