| No. | Year | Publication |
|---|---|---|
| 2 | 2009 | Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora: Fast gradient-descent methods for temporal-difference learning with linear function approximation. ICML 2009: 125 |
| 1 | 2008 | Richard S. Sutton, Csaba Szepesvári, Hamid Reza Maei: A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation. NIPS 2008: 1609-1616 |

| No. | Coauthor | Publications |
|---|---|---|
| 1 | Shalabh Bhatnagar | [2] |
| 2 | Doina Precup | [2] |
| 3 | David Silver | [2] |
| 4 | Richard S. Sutton | [1] [2] |
| 5 | Csaba Szepesvári | [1] [2] |
| 6 | Eric Wiewiora | [2] |