Reinforcement Learning | Hacker News


pmalynin on Oct 24, 2017 | on: Dynamic Progamming: First Principles

Reinforcement Learning

gugagore on Oct 25, 2017

I think really just "Value Iteration" (which isn't just used in RL). Reinforcement Learning itself is a problem setting and there are solutions in RL that don't use dynamic programming (for example, policy gradient methods).

pmalynin on Oct 25, 2017

Well of course :) but it’s a cute method to solve certain MDPs
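
For anyone following along, here is a minimal sketch of value iteration as dynamic programming on a small MDP. The two-state MDP, the action names "stay" and "go", the rewards, and the discount factor are all made up for illustration; nothing here comes from the linked article.

```python
# Hypothetical toy MDP: P[state][action] is a list of
# (probability, next_state, reward) tuples. All numbers are invented.
P = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(1.0, 1, 1.0)]},
    1: {"stay": [(1.0, 1, 2.0)], "go": [(1.0, 0, 0.0)]},
}
GAMMA = 0.9  # assumed discount factor


def q_value(P, V, s, a, gamma):
    """Expected return of taking action a in state s, then following V."""
    return sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])


def value_iteration(P, gamma, tol=1e-10):
    """Apply the Bellman optimality backup until the values stop changing."""
    V = {s: 0.0 for s in P}
    while True:
        new_V = {s: max(q_value(P, V, s, a, gamma) for a in P[s]) for s in P}
        if max(abs(new_V[s] - V[s]) for s in P) < tol:
            return new_V
        V = new_V


def greedy_policy(P, V, gamma):
    """Pick, in each state, the action with the highest one-step lookahead."""
    return {s: max(P[s], key=lambda a: q_value(P, V, s, a, gamma)) for s in P}


V = value_iteration(P, GAMMA)
policy = greedy_policy(P, V, GAMMA)
print(V, policy)
```

Note that this needs the full transition model P up front, which is why it counts as planning by dynamic programming; policy gradient methods, by contrast, learn from sampled experience without it.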
