Learning to predict by the methods of temporal differences. Richard S. Sutton, 1988

Learning to predict by the methods of temporal differences
Journal article