textbook rl1
Properties
authors: Richard S. Sutton, Andrew G. Barto
year: 2018

Planning and Learning with Tabular Methods
8.1 Models and Planning
8.2 Dyna: Integrated Planning, Acting, and Learning