Agents
Advancing DialNav through Automatic Embodied Dialog Augmentation
The article presents advancements in the DialNav framework for embodied agents, introducing the RAINbow dataset with 238K episodes to enhance dialog training, addressing the previous limitation of only 2K episodes. It details the implementation of Dual-Strategy Training and a localization model that incorporates Vision-and-Language Navigation (VLN) knowledge, leading to significant performance improvements with success rates of 58.24% on Val Seen (+89%) and 29.05% on Val Unseen (+100%). This work is crucial for practitioners as it provides a scalable solution for training dialog systems in navigation tasks, enhancing both safety and effectiveness in embodied AI applications.
dialogembodied-agents