AgentsarXiv cs.AI — 7 d ago

ARROW: Augmented Replay for RObust World models

ARROW (Augmented Replay for RObust World models) is a new model-based continual reinforcement learning algorithm that enhances DreamerV3 with a memory-efficient, distribution-matching replay buffer. It features two complementary buffers: a short-term buffer for recent experiences and a long-term buffer for task diversity through intelligent sampling. In evaluations on Atari and Procgen CoinRun tasks, ARROW significantly reduces catastrophic forgetting compared to existing model-free and model-based baselines while maintaining forward transfer, underscoring the effectiveness of bio-inspired approaches in continual RL.

reinforcement learningcontinual learningworld modelsrelevance 0.00 · engagement 0.00

Read at source ↗← all news