Online TD(1) Meets Offline Monte Carlo Jan 1, 2009· Y. Yu , Y. Zhang , C. Szepesvári · 0 min read Cite Type Book section Publication Multidisciplinary Symposium on Reinforcement Learning Last updated on Jan 1, 2009 Workshop ← A General Projection Property for Distribution Families Jan 1, 2009