RecDreamer: Consistent Text-to-3D Generation via Uniform Score Distillation

Chenxi Zheng1 Yihong Lin1 Bangzhen Liu1 Xuemiao Xu1† Yongwei Nie1† Shengfeng He2
1South China University of Technology 2Singapore Management University
Corresponding Authors.

Abstract

Prevailing text-to-3D generation methods based on score distillation often suffer from significant geometric inconsistencies, manifesting as repeated patterns across different poses of 3D assets—an issue commonly referred to as the Multi-Janus problem. Although recent work has attempted to address this by improving control over pose prompts or adjusting the approximation distribution, the underlying prior distribution remains biased toward a canonical pose, leading to skewed guidance.

To overcome this challenge of inconsistency due to an imbalanced prior distribution, we have investigated techniques for modifying a prescribed distribution, enabling reconstruction of its density to ensure compliance with specific marginal constraints. By modifying the original data distribution through the proposed auxiliary function, we ensure that the marginal distribution of the pose adheres to a uniform distribution, thereby eliminating biases from prior knowledge. We integrate the rectified data distribution into existing score distillation algorithms, and term the process uniform score distillation.

To efficiently compute the posterior distribution required for the auxiliary function, we introduce a training-free classifier capable of estimating pose categories in a plug-and-play fashion. Additionally, we employ various approximation techniques for terms of noisy states, significantly enhancing system performance. Our proposed solution, called RecDreamer, demonstrates a marked ability to mitigate the Multi-Janus problem, as confirmed by our experiments.

Qualitative Results

Qualitative Comparison

"A DSLR photo of a beagle in a detective's outfit."

SDS

Perpneg

Debiased

VSD

USD

"Samurai koala bear."

SDS

Perpneg

Debiased

VSD

USD

"DSLR Camera, photography, dslr, camera, noobie, box-modeling, maya."

SDS

Perpneg

Debiased

VSD

USD

"A portrait of Groot, head, HDR, photorealistic, 8K."

SDS

Perpneg

Debiased

VSD

USD

"A DSLR photo of a chimpanzee dressed like Napoleon Bonaparte."

SDS

Perpneg

Debiased

VSD

USD

"A kangaroo wearing boxing gloves."

SDS

Perpneg

Debiased

VSD

USD