HARDER: 3D human avatar reconstruction with distillation and explicit representation
Journal
Computers and Graphics
Journal Volume
134
Start Page
104512
ISSN
0097-8493
Date Issued
2026-02
Author(s)
Abstract
3D human avatar reconstruction has become a popular research field in recent years. Although many studies have shown remarkable results, most existing methods either impose overly strict data requirements, such as depth information or multi-view images, or suffer from significant performance drops in specific areas. To address these challenges, we propose HARDER. We combine the Score Distillation Sampling (SDS) technique with the designed modules, Feature-Specific Image Captioning (FSIC) and RADR (Region-Aware Differentiable Rendering), allowing the Latent Diffusion Model (LDM) to guide the reconstruction process, especially in unseen regions. Furthermore, we have developed various training strategies, including personalized LDM, delayed SDS, focused SDS, and multi-pose SDS, to make the training process more efficient.Our avatars use an explicit representation that is compatible with modern computer graphics pipelines. Also, the entire reconstruction and real-time animation process can be completed on a single consumer-grade GPU, making this application more accessible.
Subjects
3D human reconstruction
Avatar
Latent diffusion models
Score distillation sampling
Publisher
Elsevier Ltd
Type
journal article
