dblp.uni-trier.dewww.uni-trier.de

Csaba Szepesvári Vis

List of publications from the DBLP Bibliography Server - FAQ
Coauthor Index - Ask others: ACM DL/Guide - CiteSeerX - CSB - MetaPress - Google - Bing - Yahoo

*2009
54EEBarnabás Póczos, Yasin Abbasi-Yadkori, Csaba Szepesvári, Russell Greiner, Nathan R. Sturtevant: Learning when to stop thinking and do something! ICML 2009: 104
53EERichard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora: Fast gradient-descent methods for temporal-difference learning with linear function approximation. ICML 2009: 125
52EEJean-Yves Audibert, Peter Auer, Alessandro Lazaric, Rémi Munos, Daniil Ryabko, Csaba Szepesvári: Workshop summary: On-line learning with limited feedback. ICML 2009: 168
51EEAlireza Farhangfar, Russell Greiner, Csaba Szepesvári: Learning to segment from a few well-selected training images. ICML 2009: 39
50EEJean-Yves Audibert, Rémi Munos, Csaba Szepesvári: Exploration-exploitation tradeoff using variance estimates in multi-armed bandits. Theor. Comput. Sci. 410(19): 1876-1902 (2009)
2008
49EEAndrás Antos, Varun Grover, Csaba Szepesvári: Active Learning in Multi-armed Bandits. ALT 2008: 287-302
48EEGábor Bartók, Csaba Szepesvári, Sandra Zilles: Active Learning of Group-Structured Environments. ALT 2008: 329-343
47EEAmir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor: Regularized Fitted Q-Iteration: Application to Planning. EWRL 2008: 55-68
46EEVolodymyr Mnih, Csaba Szepesvári, Jean-Yves Audibert: Empirical Bernstein stopping. ICML 2008: 672-679
45EERichard S. Sutton, Csaba Szepesvári, Hamid Reza Maei: A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation. NIPS 2008: 1609-1616
44EESébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári: Online Optimization in X-Armed Bandits. NIPS 2008: 201-208
43EEAmir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor: Regularized Policy Iteration. NIPS 2008: 441-448
42EEAlejandro Isaza, Csaba Szepesvári, Vadim Bulitko, Russell Greiner: Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstraction. UAI 2008: 306-314
41EERichard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael H. Bowling: Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping. UAI 2008: 528-536
40EEAndrás Antos, Csaba Szepesvári, Rémi Munos: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path. Machine Learning 71(1): 89-129 (2008)
2007
39EEJean-Yves Audibert, Rémi Munos, Csaba Szepesvári: Tuning Bandit Algorithms in Stochastic Environments. ALT 2007: 150-165
38EEPeter Auer, Ronald Ortner, Csaba Szepesvári: Improved Rates for the Stochastic Continuum-Armed Bandit Problem. COLT 2007: 454-468
37EEAmir Massoud Farahmand, Csaba Szepesvári, Jean-Yves Audibert: Manifold-adaptive dimension estimation. ICML 2007: 265-272
36EEIstván Bíró, Zoltán Szamonek, Csaba Szepesvári: Sequence Prediction Exploiting Similary Information. IJCAI 2007: 1576-1581
35EEAndrás György, Levente Kocsis, Ivett Szabó, Csaba Szepesvári: Continuous Time Associative Bandit Problems. IJCAI 2007: 830-835
34EEAndrás Antos, Rémi Munos, Csaba Szepesvári: Fitted Q-iteration in continuous action-space MDPs. NIPS 2007
2006
33EELevente Kocsis, Csaba Szepesvári, Mark H. M. Winands: RSPSA: Enhanced Parameter Optimization in Games. ACG 2006: 39-56
32EEAndrás Antos, Csaba Szepesvári, Rémi Munos: Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path. COLT 2006: 574-588
31EELevente Kocsis, Csaba Szepesvári: Bandit Based Monte-Carlo Planning. ECML 2006: 282-293
30EEPéter Torma, Csaba Szepesvári: Local Importance Sampling: A Novel Technique to Enhance Particle Filtering. Journal of Multimedia 1(1): 32-43 (2006)
29EELevente Kocsis, Csaba Szepesvári: Universal parameter optimisation in games based on SPSA. Machine Learning 63(3): 249-286 (2006)
2005
28EEZoltán Szamonek, Csaba Szepesvári: X-mHMM: An Efficient Algorithm for Training Mixtures of HMMs When the Number of Mixtures Is Unknown. ICDM 2005: 434-441
27EECsaba Szepesvári, Rémi Munos: Finite time bounds for sampling based fitted value iteration. ICML 2005: 880-887
2004
26 Csaba Szepesvári: Shortest Path Discovery Problems: A Framework, Algorithms and Experimental Results. AAAI 2004: 550-555
25 Csaba Szepesvári, András Kocsor, Kornél Kovács: Kernel Machine Based Feature Extraction Algorithms for Regression Problems. ECAI 2004: 1091-1092
24EEPéter Torma, Csaba Szepesvári: Enhancing Particle Filters Using Local Likelihood Sampling. ECCV (1) 2004: 16-27
23EEAndrás Kocsor, Kornél Kovács, Csaba Szepesvári: Margin Maximizing Discriminant Analysis. ECML 2004: 227-238
22EECsaba Szepesvári, William D. Smart: Interpolation-based Q-learning. ICML 2004
2001
21EECsaba Szepesvári: Efficient approximate planning in continuous space Markovian Decision Problems. AI Commun. 14(3): 163-176 (2001)
20EEAndrás Lörincz, György Hévízi, Csaba Szepesvári: Ockham's Razor Modeling of the Matrisome Channels of the Basal Ganglia Thalamocortical Loops. Int. J. Neural Syst. 11(2): 125-143 (2001)
2000
19EEGyörgy Balogh, Ervin Dobler, Tamás Gröbler, Béla Smodics, Csaba Szepesvári: FlexVoice: A Parametric Approach to High-Quality Speech Synthesis. TSD 2000: 189-194
18EEZsolt Kalmár, Csaba Szepesvári, András Lörincz: Modular Reinforcement Learning: A Case Study in a Robot Domain. Acta Cybern. 14(3): 507-522 (2000)
17 Satinder P. Singh, Tommi Jaakkola, Michael L. Littman, Csaba Szepesvári: Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms. Machine Learning 38(3): 287-308 (2000)
1999
16 Csaba Szepesvári, Michael L. Littman: A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms. Neural Computation 11(8): 2017-2060 (1999)
15EEZsolt Kalmár, Zsolt Marczell, Csaba Szepesvári, András Lörincz: Parallel and robust skeletonization built on self-organizing elements. Neural Networks 12(1): 163-173 (1999)
14 János Murvai, Kristian Vlahovicek, Endre Barta, Csaba Szepesvári, Cristina Acatrinei, Sándor Pongor: The SBASE protein domain library, release 6.0: a collection of annotated protein sequence segments. Nucleic Acids Research 27(1): 257-259 (1999)
1998
13 Zoltán Gábor, Zsolt Kalmár, Csaba Szepesvári: Multi-criteria Reinforcement Learning. ICML 1998: 197-205
12EECsaba Szepesvári: Non-Markovian Policies in Sequential Decision Problems. Acta Cybern. 13(3): 305-318 (1998)
11 Zsolt Kalmár, Csaba Szepesvári, András Lörincz: Module-Based Reinforcement Learning: Experiments with a Real Robot. Auton. Robots 5(3-4): 273-295 (1998)
10 Zsolt Kalmár, Csaba Szepesvári, András Lörincz: Module-Based Reinforcement Learning: Experiments with a Real Robot. Machine Learning 31(1-3): 55-85 (1998)
1997
9EECsaba Szepesvári: Learning and Exploitation Do Not Conflict Under Minimax Optimality. ECML 1997: 242-249
8EEZsolt Kalmár, Csaba Szepesvári, András Lörincz: Module Based Reinforcement Learning: An Application to a Real Robot. EWLR 1997: 29-45
7 Csaba Szepesvári: The Asymptotic Convergence-Rate of Q-learning. NIPS 1997
6EECsaba Szepesvári, Szabolcs Cimmer, András Lörincz: Neurocontroller using dynamic state feedback for compensatory control. Neural Networks 10(9): 1691-1708 (1997)
1996
5 Csaba Szepesvári, András Lörincz: Inverse Dynamics Controllers for Robust Control: Consequences for Neurocontrollers. ICANN 1996: 791-796
4 Michael L. Littman, Csaba Szepesvári: A Generalized Reinforcement-Learning Model: Convergence and Applications. ICML 1996: 310-318
3EETibor Fomin, Tamás Rozgonyi, Csaba Szepesvári, András Lörincz: Self-Organizing Multi-Resolution Grid for Motion Planning and Control. Int. J. Neural Syst. 7(6): 757- (1996)
2EECsaba Szepesvári, András Lörincz: Approximate geometry representations and sensory fusion. Neurocomputing 12(2-3): 267-287 (1996)
1994
1EECsaba Szepesvári, László Balázs, András Lörincz: Topology Learning Solved by Extended Objects: A Neural Network Model. Neural Computation 6(3): 441-458 (1994)

Coauthor Index

1Yasin Abbasi-Yadkori [54]
2Cristina Acatrinei [14]
3András Antos [32] [34] [40] [49]
4Jean-Yves Audibert [37] [39] [46] [50] [52]
5Peter Auer [38] [52]
6László Balázs [1]
7György Balogh [19]
8Endre Barta [14]
9Gábor Bartók [48]
10Shalabh Bhatnagar [53]
11István Bíró [36]
12Michael H. Bowling [41]
13Sébastien Bubeck [44]
14Vadim Bulitko [42]
15Szabolcs Cimmer [6]
16Ervin Dobler [19]
17Amir Massoud Farahmand [37] [43] [47]
18Alireza Farhangfar [51]
19Tibor Fomin [3]
20Zoltán Gábor [13]
21Alborz Geramifard [41]
22Mohammad Ghavamzadeh [43] [47]
23Russell Greiner [42] [51] [54]
24Tamás Gröbler [19]
25Varun Grover [49]
26András György [35]
27György Hévízi [20]
28Alejandro Isaza [42]
29Tommi Jaakkola [17]
30Zsolt Kalmár [8] [10] [11] [13] [15] [18]
31Levente Kocsis [29] [31] [33] [35]
32András Kocsor [23] [25]
33Kornél Kovács [23] [25]
34Alessandro Lazaric [52]
35Michael L. Littman [4] [16] [17]
36András Lörincz [1] [2] [3] [5] [6] [8] [10] [11] [15] [18] [20]
37Hamid Reza Maei [45] [53]
38Shie Mannor [43] [47]
39Zsolt Marczell [15]
40Volodymyr Mnih [46]
41Rémi Munos [27] [32] [34] [39] [40] [44] [50] [52]
42János Murvai [14]
43Ronald Ortner [38]
44Barnabás Póczos [54]
45Sándor Pongor [14]
46Doina Precup [53]
47Tamás Rozgonyi [3]
48Daniil Ryabko [52]
49David Silver [53]
50Satinder P. Singh [17]
51William D. Smart [22]
52Béla Smodics [19]
53Gilles Stoltz [44]
54Nathan R. Sturtevant [54]
55Richard S. Sutton [41] [45] [53]
56Ivett Szabó [35]
57Zoltán Szamonek [28] [36]
58Péter Torma [24] [30]
59Kristian Vlahovicek [14]
60Eric Wiewiora [53]
61Mark H. M. Winands [33]
62Sandra Zilles [48]

Colors in the list of coauthors

Copyright © Tue Nov 3 08:52:44 2009 by Michael Ley (ley@uni-trier.de)