https://github.com/yk7333/d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
diffusion-models
human-feedback
reinforcement-learning
Added: over 1 year ago - Last Synced: 11 months ago - Created: November 23, 2023
