Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Reinforcement Learning


I think really just "Value Iteration" (which isn't just used in RL). Reinforcement Learning itself is a problem setting and there are solutions in RL that don't use dynamic programming (for example, policy gradient methods).


Well of course :) but it’s a cute method to solve certain MDPs




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: