TrainingarXiv cs.AI — 10 d ago

Training and Evaluating Diffusion Policies with Long Context Lengths

This work presents a detailed evaluation of imitation learning policies with varying context lengths, demonstrating that longer context can improve performance on tasks requiring memory without the brittleness previously suggested. Using a UNet architecture with cross-attention for denoising, the study shows that single-task policies can achieve high success rates even with naive context length scaling. Additionally, a novel training algorithm is introduced that enables joint training across multiple context lengths, thereby reducing sample complexity for long-context learning, which is significant for practitioners aiming to enhance robotic manipulation capabilities.

imitationlearningcontextroboticsrelevance 0.00 · engagement 0.00

Read at source ↗← all news