Abstract: This paper presents a novel methodology for generating synthetic images that adhere accurately to provided semantic segmentation maps using the Stable Diffusion model with the ControlNet ...
task-specific conditions in an end-to-end way, and the learning is robust even when the training dataset is small ( 50k). Moreover, training a ControlNet is as fast as fine-tuning a diffusion model, ...