How can I verify policy-iteration Q-tables on a 2x4 gridworld with absorbing states and the tie-break order up > down > right > left?
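
One possible way to check such a table is sketched below. The question does not give the rewards, discount, transition model, or positions of the absorbing states, so everything here is an assumption made for illustration: deterministic moves, bumping into a wall leaves the agent in place, absorbing states at `(0, 3)` and `(1, 3)` with entry rewards `+1` and `-1`, a step reward of `-0.04`, and `GAMMA = 0.9`. The helper names (`step`, `greedy`, `evaluate`, `policy_iteration`, `verify`) are hypothetical too. The sketch runs policy iteration on Q-values, then checks three things: every entry satisfies the Bellman optimality equation, absorbing states have all-zero rows, and the stored policy equals the greedy action under the stated tie-break.

```python
ROWS, COLS = 2, 4
ACTIONS = ["up", "down", "right", "left"]  # list order doubles as tie-break priority
MOVES = {"up": (-1, 0), "down": (1, 0), "right": (0, 1), "left": (0, -1)}
TERMINAL_REWARD = {(0, 3): 1.0, (1, 3): -1.0}  # ASSUMED absorbing states and entry rewards
STEP_REWARD = -0.04                            # ASSUMED reward for any other move
GAMMA = 0.9                                    # ASSUMED discount factor
STATES = [(r, c) for r in range(ROWS) for c in range(COLS)]


def step(state, action):
    """Deterministic transition (assumed); returns (next_state, reward)."""
    if state in TERMINAL_REWARD:
        return state, 0.0  # absorbing: self-loop with zero reward
    dr, dc = MOVES[action]
    r, c = state[0] + dr, state[1] + dc
    nxt = (r, c) if 0 <= r < ROWS and 0 <= c < COLS else state  # walls block
    return nxt, TERMINAL_REWARD.get(nxt, STEP_REWARD)


def greedy(q_row, tol=1e-9):
    """First action in ACTIONS order whose value is within tol of the maximum."""
    best = max(q_row.values())
    for action in ACTIONS:
        if best - q_row[action] <= tol:
            return action


def evaluate(policy, tol=1e-12):
    """Iterative policy evaluation on Q-values for a fixed deterministic policy."""
    q = {s: {a: 0.0 for a in ACTIONS} for s in STATES}
    while True:
        delta = 0.0
        for s in STATES:
            for a in ACTIONS:
                nxt, reward = step(s, a)
                new = reward + GAMMA * q[nxt][policy[nxt]]
                delta = max(delta, abs(new - q[s][a]))
                q[s][a] = new
        if delta < tol:
            return q


def policy_iteration(max_rounds=100):
    """Alternate evaluation and greedy improvement until the policy is stable."""
    policy = {s: ACTIONS[0] for s in STATES}
    for _ in range(max_rounds):
        q = evaluate(policy)
        improved = {s: greedy(q[s]) for s in STATES}
        if improved == policy:
            return q, policy
        policy = improved
    raise RuntimeError("policy did not stabilise")


def verify(q, policy, tol=1e-9):
    """Return a list of failed checks; an empty list means the table passed."""
    problems = []
    for s in STATES:
        for a in ACTIONS:
            nxt, reward = step(s, a)
            target = reward + GAMMA * max(q[nxt].values())
            if abs(q[s][a] - target) > tol:
                problems.append(f"Bellman mismatch at {s}, {a}")
        if s in TERMINAL_REWARD and any(abs(v) > tol for v in q[s].values()):
            problems.append(f"absorbing state {s} has non-zero Q")
        if policy[s] != greedy(q[s]):
            problems.append(f"policy at {s} ignores the tie-break order")
    return problems


q, policy = policy_iteration()
print(verify(q, policy))
for r in range(ROWS):
    print([policy[(r, c)] for c in range(COLS)])
```

To check a hand-computed table instead, build `q` and `policy` as dictionaries of the same shape and pass them to `verify`. Under these assumed numbers, state `(1, 1)` has equal values for `up` and `right`, so it is a convenient cell for confirming that the tie-break is applied in the intended order.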