MARC View

000			00622nam a22001577a 4500
008			231016b \|\|\|\|\|\|\|\| \|\|\|\| 00\| 0 eng d
020			_a978-0-262-03924-6
082			_223 _a006.31 _bSUT
100			_aSutton, Richard
245			_aReinforcement Learning : _bAn Introduction / _cRichard S. Sutton & Andrew G. Barto _hEnglish
250			_a2nd ed
260			_aLondon: _bThe MIT Press, _c2020.
300			_aviii, 526p.: _bhard bound _c18x 24cm.
505			_a1.Introduction 2.Multi-Armed Bandits 3.Finite Markov Decision Processes 4.Dynamic programming 5.Monte Carlo Methods ........................
942			_2ddc _cREF
999			_c8393 _d8393