Research
UXBench: Benchmarking User Experience in AI Assistants
UXBench is introduced as a novel user-centric benchmark designed to evaluate AI assistant user experience (UX) through real user feedback signals. It includes three tasks—UX Judge, UX Eval, and UX Recovery—comprising 7,400 test instances derived from over 70,000 interaction logs of a major Chinese AI assistant, covering 8 scenarios and 83 domains. This benchmark provides insights into model performance regarding user experience, highlighting that user feedback prediction can be effectively learned and emphasizes the need for tailored UX optimization in AI assistant development.
uxbenchmarkuserexperienceai