TY  - BOOK
AU  - Sutton, Richard S.
AU  - Barto, Andrew G.
TI  - Reinforcement Learning
SN  - 9780262039246
U1  - 006.3 SUT
PY  - 2018///
CY  - USA
PB  - MIT
KW  - Tabular Solution Method
KW  - Monte Carlo Method
ER  - 