Hacker Read

thisisdave | karma 193 | avg karma 2.51 · 2016-03-16 23:59:12+00:00

My understanding is that this is based on the policy network output. If I understand correctly, the policy network is designed to estimate probabilities accurately, although I don't know how accurate it is for low-probability moves like this one.
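
To make that concrete, here is a rough sketch of how a policy network's raw scores typically become move probabilities via a softmax. The scores and the four-move setup are made up for illustration and are not taken from the actual system. It only shows where a very small probability for one move would come from, not whether that number is trustworthy.

```python
import math

def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical scores for four candidate moves (assumed values);
# the last move is strongly disfavored by the network.
logits = [4.0, 3.5, 2.0, -5.0]
probs = softmax(logits)

# Probability the network assigns to the disfavored move.
rare_move_prob = probs[-1]
print(probs, rare_move_prob)
```

Checking whether a tiny probability like that is accurate would mean comparing predicted probabilities against how often such moves are actually played, and that is hard in the tail because there are so few examples.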