Research · arXiv cs.AI · 10 d ago

A Unified Definition of Hallucination: It's The World Model, Stupid!

The paper proposes a unified definition of hallucination in language models: inaccurate internal world modeling that is observable to users, for example when the model's output contradicts established facts. This framing makes evaluation clearer by identifying the reference world model and distinguishing hallucinations from other types of errors. The paper also introduces HalluWorld, a benchmark designed to test model hallucinations against specified reference world models, which is relevant to practitioners working to improve LLM reliability.

hallucination · llm · definition
