TrainingarXiv cs.AI — 21 h ago

Geometrically Averaged Hard Target Updates for Linear Q-Learning

The paper introduces the $\lambda$-target update mechanism for linear Q-learning, which employs geometrically averaged hard target updates to enhance stability in Q-learning with function approximation. The method allows for a continuum of target update strategies, ranging from one-period updates to projected Q-value iteration, and is analyzed using a switching-system model. This advancement is significant for practitioners as it provides a more flexible and potentially more stable approach to target updates in reinforcement learning, particularly in environments where linear function approximation is utilized.

q-learningreinforcement learningtarget updatesrelevance 0.00 · engagement 0.00

Read at source ↗← all news