ai-digest.dev
last updated 2 h ago
AgentsarXiv cs.AI 15 d ago

VOiLA: Vectorized Online Planning with Learned Diffusion Model for POMDP Agents

The VOiLA framework introduces a method for online planning in POMDPs using learned diffusion models to create task-agnostic POMDP models. It employs conditional diffusion models for transition and observation sampling, which are distilled into compact feedforward generators to facilitate GPU-accelerated planning with the Vectorized Online POMDP Planner (VOPP). Experimental results show that VOiLA reduces sampling costs significantly and outperforms the Recurrent Soft Actor Critic in benchmark tests while demonstrating effective real-world task execution with simulated data.

POMDPplanningreinforcement learningrelevance 0.00 · engagement 0.00
Read at source ↗← all news
VOiLA: Vectorized Online Planning with Learned Diffusion Model for POMDP Agents — AI News Digest