UnifiedReward Edit Models
Collection
11 items • Updated • 1
UnifiedReward-Edit-qwen3vl-4b is a unified reward model for both Text-to-Image and Image-to-Image generation!! For image editing reward task, our models support:
Pairwise Rank — directly judge which of two edited images is better.
Pairwise Score — assign a separate score to each image in a pair.
Pointwise Score — rate a single image on two axes: instruction-following and overall image quality.
🚀 The image editing reward inference code is available at UnifiedReward-Edit/ directory, while T2I inference code is unchanged from previous models. The editing training data is preprocessed from EditScore, EditReward, and Pico-Nano-Banana. We sincerely appreciate all contributors!!
For further details, please refer to the following resources:
@article{unifiedreward,
title={Unified reward model for multimodal understanding and generation},
author={Wang, Yibin and Zang, Yuhang and Li, Hao and Jin, Cheng and Wang, Jiaqi},
journal={arXiv preprint arXiv:2503.05236},
year={2025}
}