Preference-Guided Prompt Optimization for Text-to-Image Generation

2026-02-13Human-Computer Interaction

Human-Computer Interaction
AI summary

The authors created APPO, a new method to help people guide generative models like those that make images. Instead of rewriting prompts many times, users just give simple yes-or-no feedback on results. APPO uses this feedback to quickly improve the prompts by balancing learning from answers and trying new ideas. Tests showed APPO works faster and is easier for users than doing prompt edits manually. This approach could make working with AI to create things smoother and more user-friendly.

generative modelsprompt optimizationuser feedbackbinary preferenceshuman-AI collaborationimage generationexploration-exploitationcognitive loadprompt refinementcontent creation
Authors
Zhipeng Li, Yi-Chi Liao, Christian Holz
Abstract
Generative models are increasingly powerful, yet users struggle to guide them through prompts. The generative process is difficult to control and unpredictable, and user instructions may be ambiguous or under-specified. Prior prompt refinement tools heavily rely on human effort, while prompt optimization methods focus on numerical functions and are not designed for human-centered generative tasks, where feedback is better expressed as binary preferences and demands convergence within few iterations. We present APPO, a preference-guided prompt optimization algorithm. Instead of iterating prompts, users only provide binary preferential feedback. APPO adaptively balances its strategies between exploiting user feedback and exploring new directions, yielding effective and efficient optimization. We evaluate APPO on image generation, and the results show APPO enables achieving satisfactory outcomes in fewer iterations with lower cognitive load than manual prompt editing. We anticipate APPO will advance human-AI collaboration in generative tasks by leveraging user preferences to guide complex content creation.