RigAnyFace: Scaling Neural Facial Mesh Auto-Rigging with Unlabeled Data

¹Penn State University, ²Roblox
Equal contribution. *Work partially completed during an internship at Roblox
NeurIPS 2025

TL;DR: We propose a scalable neural auto-rigging framework for facial meshes of diverse topologies with multiple disconnected components.

RigAnyFace teaser

We present RigAnyFace (RAF), an auto-rigging framework that supports facial meshes of diverse topologies with multiple disconnected components such as eyeballs. These meshes are drawn from diverse sources and cover both humanoid and non-humanoid heads. Given only a neutral facial mesh and explicitly controllable FACS parameters specifying activated action units, RAF accurately deforms the input mesh into corresponding FACS poses, creating an expressive blendshape rig.

Abstract

In this paper, we present RigAnyFace (RAF), a scalable neural auto-rigging framework for facial meshes of diverse topologies, including those with multiple disconnected components. RAF deforms a static neutral facial mesh into industry-standard FACS poses to form an expressive blendshape rig. Deformations are predicted by a triangulation-agnostic surface learning network, augmented with a tailored architecture design to condition on FACS parameters and efficiently process disconnected components. For training, we curate a dataset of facial meshes, a subset of which is meticulously rigged by professional artists to serve as accurate 3D ground truth for deformation supervision. Because manual rigging is costly, this subset is limited in size, constraining the generalization ability of models trained exclusively on it. To address this, we design a 2D supervision strategy for unlabeled neutral meshes without rigs. This strategy increases data diversity and enables scaled training, thereby enhancing the generalization ability of models trained on the augmented data. Extensive experiments demonstrate that RAF rigs meshes of diverse topologies, not only on our artist-crafted assets but also on in-the-wild samples, outperforming prior work in accuracy and generalizability. Moreover, our method advances beyond prior work by supporting multiple disconnected components, such as eyeballs, enabling more detailed expression animation.

Applications

RigAnyFace enables various downstream applications:

  • User-Controlled Animation: Artists can directly edit FACS parameters to pose meshes (see the blendshape sketch after this list).
  • Video-to-Mesh Retargeting: Transfer facial expressions from videos to 3D meshes.
  • Animating Generated Meshes: Automatically rig meshes from text-to-3D models.
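To make the blendshape interface concrete, the sketch below evaluates a rig in the standard linear way: the posed mesh is the neutral mesh plus a FACS-weighted sum of per-action-unit displacement fields. This is a minimal illustration, not RAF's actual API; the array names, the action-unit count, and the JAW_OPEN index are all assumptions.

    # Minimal sketch of user-controlled animation with a blendshape rig.
    # Shapes and names are illustrative; RAF's real interface may differ.
    import numpy as np

    NUM_AUS = 52   # assumed number of FACS action units in the rig
    JAW_OPEN = 17  # hypothetical index of the jaw-open action unit

    def pose_mesh(neutral_verts, au_deltas, facs_weights):
        """Linear blendshape evaluation: V(w) = V_neutral + sum_i w_i * D_i.

        neutral_verts: (V, 3) neutral vertex positions.
        au_deltas:     (NUM_AUS, V, 3) per-AU displacement fields from the rig.
        facs_weights:  (NUM_AUS,) user-edited FACS activations in [0, 1].
        """
        offset = np.tensordot(facs_weights, au_deltas, axes=1)  # (V, 3)
        return neutral_verts + offset

    # Example: open the jaw at 80% intensity, all other AUs neutral.
    neutral_verts = np.zeros((1000, 3))       # placeholder mesh
    au_deltas = np.zeros((NUM_AUS, 1000, 3))  # placeholder rig
    facs = np.zeros(NUM_AUS)
    facs[JAW_OPEN] = 0.8
    posed = pose_mesh(neutral_verts, au_deltas, facs)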

Data Collection

Data Collection Pipeline

We collect a diverse set of artist-crafted facial meshes for model training and evaluation. (a) (i) The dataset includes meshes with multiple disconnected components (e.g., eyeballs) and diverse facial shapes. (ii) A subset of neutral head meshes is annotated with blendshape rigs by professional artists. (iii) To expand the dataset, we apply a head interpolation strategy based on standardized UV layouts. (b) For the remaining unrigged samples, we generate 2D supervision. Given a posed image rendered from a rigged head and a neutral image from an unrigged head, a 2D animation model transfers the expression while preserving identity. A flow estimation model then predicts pixel offsets between the neutral and synthesized posed images as 2D displacements.
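Read as pseudocode, the 2D labeling step in (b) can be summarized as below. The sketch assumes generic render, animate_2d (e.g., a MegActor-style portrait animator), and estimate_flow (an off-the-shelf optical-flow model) callables; all of these names are placeholders rather than real library APIs.

    # Sketch of 2D supervision generation for an unrigged neutral head.
    # `render`, `animate_2d`, and `estimate_flow` are assumed callables
    # standing in for a renderer, a 2D portrait animation model, and an
    # optical-flow estimator; none of these are actual RAF APIs.

    def make_2d_labels(unrigged_neutral, rigged_head, facs_params,
                       render, animate_2d, estimate_flow):
        # 1) Render a posed "driver" image from an artist-rigged head.
        driver_img = render(rigged_head.pose(facs_params))
        # 2) Render the unrigged head in its neutral state.
        neutral_img = render(unrigged_neutral)
        # 3) Transfer the driver's expression onto the neutral identity,
        #    preserving that identity in the synthesized posed image.
        posed_img = animate_2d(source=neutral_img, driver=driver_img)
        # 4) Dense pixel offsets between the neutral and posed images
        #    serve as 2D displacement labels for the deformation network.
        flow_2d = estimate_flow(neutral_img, posed_img)  # (H, W, 2)
        return flow_2d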

Method Overview

Method Architecture

(a) Given a neutral facial mesh, our deformation model predicts the 3D displacements needed to deform the mesh into different expressions based on the input FACS vector. During training, 2D supervision is applied to both rigged and unrigged heads, while 3D supervision is applied exclusively to rigged heads. (b) We modify the original DiffusionNet block to accept the FACS vector as an additional conditioning input (left). Additionally, we design a global encoder that processes the vertex positions and normals of the neutral facial mesh to capture holistic information across disconnected components (right).
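A rough sketch of how the two supervision signals could combine in a training step is shown below, assuming the model predicts per-vertex 3D displacements from the neutral mesh and a FACS vector. The project and sample_flow helpers, the L1 losses, and the batch fields are hypothetical stand-ins, not the authors' actual code.

    # Sketch of mixed 2D/3D supervision for one training sample.
    # `project` maps per-vertex 3D offsets to image-plane 2D offsets via
    # the render camera; `sample_flow` reads the precomputed 2D flow at
    # each vertex's pixel location. Both helpers are hypothetical.
    import torch.nn.functional as F

    def training_loss(model, sample, project, sample_flow):
        # Predict per-vertex 3D displacements conditioned on the FACS vector.
        pred_3d = model(sample.neutral_verts, sample.faces, sample.facs)  # (V, 3)
        # 2D supervision applies to both rigged and unrigged heads.
        pred_2d = project(pred_3d, sample.camera)                    # (V, 2)
        target_2d = sample_flow(sample.flow, sample.vertex_pixels)   # (V, 2)
        loss = F.l1_loss(pred_2d, target_2d)
        # 3D supervision is available only for artist-rigged heads.
        if sample.has_rig:
            loss = loss + F.l1_loss(pred_3d, sample.gt_displacements)
        return loss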

Results

Artist-Crafted Meshes

Our method achieves high-quality rigging results on diverse facial meshes, including humanoid and non-humanoid characters with multiple disconnected components.

Artist-Crafted Results Gallery

Qualitative results on our artist-crafted unrigged heads.

Baseline Comparison Gallery

Comparison with Baseline Methods. A reference mesh and corresponding points are provided for Deformation Transfer.

In-the-Wild Meshes

In-the-Wild Results Gallery

RigAnyFace generalizes well to in-the-wild facial meshes from ICT FaceKit, Objaverse, and CGTrader, compared with the prior art NFR.

BibTeX

@inproceedings{ma2025riganyface,
  title     = {RigAnyFace: Scaling Neural Facial Mesh Auto-Rigging with Unlabeled Data},
  author    = {Ma, Wenchao and Kneubuehler, Dario and Chu, Maurice and Sachs, Ian and Jiang, Haomiao and Huang, Sharon X.},
  booktitle = {39th Conference on Neural Information Processing Systems (NeurIPS)},
  year      = {2025}
}

Acknowledgement

We thank Hsueh-Ti Derek Liu, Chrystiano Araújo, and Jinseok Bae for proofreading the draft and providing helpful comments, and Jihyun Yoon for curating the dataset. We also thank the authors of DiffusionNet, MegActor, and NFR for releasing their code.