So RL is true AI. AlphaGo did make moves inconceivable to the best human mind o...

bitL · on Nov 23, 2019

The learning from the game itself was curve fitting. The "Deep" in Deep Reinforcement Learning usually means that some difficult function is replaced by a deep neural network that approximates optimal values (for moves). The network is trained on gameplay samples, usually in the sense of rewards/punishments for reaching certain states; in games these could rank e.g. good/bad moves, winning states, losing states, etc.
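
To make the "values fitted to gameplay samples" idea concrete, here is a minimal sketch. Everything in it is an assumption for illustration: the toy game (a fair random walk that wins at position N and loses at 0), the names `play`, `fit_line`, and `value`, and the use of a straight line fitted by least squares where a real system would use a deep network. It is not how AlphaGo works, only the same idea at a very small scale.

```python
import random

N = 6  # positions 0..N; reaching N is a win (reward 1), reaching 0 is a loss (reward 0)

def play(rng, start):
    """Toy 'game': a fair random walk from `start` until it hits 0 or N; returns the reward."""
    s = start
    while 0 < s < N:
        s += rng.choice((-1, 1))
    return 1.0 if s == N else 0.0

def fit_line(xs, ys):
    """Ordinary least squares for y ~ w * x + b (the 'curve' being fitted)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    w = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    return w, my - w * mx

# Gameplay samples: (starting position, final reward) pairs from random play.
rng = random.Random(0)
starts = [rng.randint(1, N - 1) for _ in range(5000)]
rewards = [play(rng, s) for s in starts]

# The "learning" step is nothing more than fitting a curve to those samples.
w, b = fit_line(starts, rewards)

def value(s):
    """Fitted estimate of the chance of winning from position s."""
    return w * s + b

for s in range(1, N):
    print(s, round(value(s), 2), "exact:", round(s / N, 2))
```

For this particular game the exact win probability from position s is s / N, so a line is enough to represent it. In Go the corresponding function is far too complicated for a line, which is where the deep network comes in, but the training signal is still rewards observed at the end of sampled games.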

streetcat1 · on Nov 23, 2019

Right. But the curve itself was invented by the machine.

mkl · on Nov 24, 2019

I think the curve is defined by the rules of the game, and the machine learned some details of it that humans hadn't figured out yet.

arkano · on Nov 23, 2019

It's curve-fitting with a few extra steps. You can do a lot with curve-fitting though.