Q-Learning Model Tables

Policy Matrix

Rows are states. Columns follow the exported action order.
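
A minimal sketch of this layout, assuming the table is loaded as a NumPy array and the action order is available as a list. The names ACTIONS, policy, and policy_entry, along with every value shown, are hypothetical and not taken from the export.

```python
import numpy as np

# Hypothetical action order; the real order comes from the export.
ACTIONS = ["left", "right", "stay"]

# Hypothetical policy matrix: one row per state, one column per action.
policy = np.array([
    [0.0, 1.0, 0.0],
    [0.5, 0.5, 0.0],
])

def policy_entry(policy, state, action):
    """Look up the entry for a state (row index) and a named action (column)."""
    return float(policy[state, ACTIONS.index(action)])
```

Looking up columns by name through the action list, rather than by a hard-coded index, keeps the lookup correct if the exported action order changes.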

Q Matrix

In each row, entries tied for the maximum Q-value are highlighted.
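
One way to find the cells to highlight is sketched below, assuming the Q matrix is a NumPy array with states as rows. The function name tied_maxima_mask, the tolerance atol, the ties_only flag, and the sample values are assumptions for illustration. Whether the export also marks a row's unique maximum is not stated, so the sketch supports both readings.

```python
import numpy as np

def tied_maxima_mask(q, atol=1e-9, ties_only=False):
    """Return a boolean mask of entries equal to their row maximum.

    With ties_only=True, rows with a single maximum are left unmarked.
    """
    q = np.asarray(q, dtype=float)
    row_max = q.max(axis=1, keepdims=True)
    # Compare with an absolute tolerance so float noise does not hide ties.
    mask = np.isclose(q, row_max, rtol=0.0, atol=atol)
    if ties_only:
        # Keep only rows where more than one entry reaches the maximum.
        mask &= mask.sum(axis=1, keepdims=True) > 1
    return mask

# Hypothetical Q matrix: row 0 has a tie, row 1 has a unique maximum.
q = np.array([
    [1.0, 2.0, 2.0],
    [0.5, 0.1, 0.3],
])
mask = tied_maxima_mask(q)
ties = tied_maxima_mask(q, ties_only=True)
```

A tie matters for a greedy policy because argmax would silently pick the first tied column, so the mask makes that ambiguity visible.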