Q-Learning Model Tables
Open Game View
Policy Matrix
Rows are states. Columns follow the exported action order.
Q Matrix
Tied maxima are highlighted in each row.