The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for MDPs with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1994-2000 (18) 2001-2002 (23) 2003-2004 (28) 2005 (20) 2006 (43) 2007 (39) 2008 (29) 2009 (24) 2010 (17) 2011 (28) 2012 (39) 2013 (45) 2014 (39) 2015 (36) 2016 (25) 2017 (32) 2018 (37) 2019 (41) 2020 (94) 2021 (113) 2022 (99) 2023 (120) 2024 (28)
Publication types (Num. hits)
article(420) data(1) inproceedings(592) phdthesis(4)
Venues (Conferences, Journals, ...)
CoRR(327) NeurIPS(51) ICML(50) AAAI(47) AAMAS(44) UAI(35) IJCAI(28) AISTATS(22) CDC(20) ICAPS(19) NIPS(18) ALT(11) ICLR(11) J. Artif. Intell. Res.(9) CONCUR(8) ACC(7) More (+10 of total 197)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 112 occurrences of 78 keywords

Results
Found 1017 publication records. Showing 1017 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
109Luca de Alfaro, Krishnendu Chatterjee, Marco Faella, Axel Legay Qualitative Logics and Equivalences for Probabilistic Systems. Search on Bibsonomy QEST The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
83Sooraj Bhat, David L. Roberts 0001, Mark J. Nelson, Charles L. Isbell Jr., Michael Mateas A globally optimal algorithm for TTD-MDPs. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Markov decision processes, convex optimization, interactive entertainment
83Kee-Eung Kim, Thomas L. Dean Solving Factored MDPs with Large Action Space Using Algebraic Decision Diagrams. Search on Bibsonomy PRICAI The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
82Krishnendu Chatterjee Markov Decision Processes with Multiple Long-Run Average Objectives. Search on Bibsonomy FSTTCS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
69Dmitri A. Dolgov, Edmund H. Durfee Symmetric approximate linear programming for factored MDPs with application to constrained problems. Search on Bibsonomy Ann. Math. Artif. Intell. The full citation details ... 2006 DBLP  DOI  BibTeX  RDF Mathematics Subject Classifications (2000) 60J22, 62C99, 90C90
68Hugo Gimbert, Wieslaw Zielonka Limits of Multi-Discounted Markov Decision Processes. Search on Bibsonomy LICS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
56Eric V. Denardo, Eugene A. Feinberg, Uriel G. Rothblum On occupation measures for total-reward MDPs. Search on Bibsonomy CDC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
56Dmitri A. Dolgov, Michael R. James 0001, Michael E. Samples Combinatorial resource scheduling for multiagent MDPs. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF task and resource allocation in agent systems, multiagent planning
52Jianhui Wu 0006, Edmund H. Durfee Automated resource-driven mission phasing techniques for constrained agents. Search on Bibsonomy AAMAS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF abstract MDPs, constrained MDPs, mission phasing, mixed integer programming
42Thomas Gabel, Martin A. Riedmiller Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets. Search on Bibsonomy EWRL The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
42Dmitri A. Dolgov, Edmund H. Durfee Resource allocation among agents with preferences induced by factored MDPs. Search on Bibsonomy AAMAS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF (multi-)agent planning, task and resource allocation in agent systems
42Alberto Reyes, Pablo H. Ibargüengoytia, Luis Enrique Sucar Power Plant Operator Assistant: An Industrial Application of Factored MDPs. Search on Bibsonomy MICAI The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
41Mark Kroon, Shimon Whiteson Automatic Feature Selection for Model-Based Reinforcement Learning in Factored MDPs. Search on Bibsonomy ICMLA The full citation details ... 2009 DBLP  DOI  BibTeX  RDF factored MDPs, feature selection, Reinforcement learning
41Juan Frausto Solís, Elizabeth Santiago D., Jaime Mora-Vargas Cosine Policy Iteration for Solving Infinite-Horizon Markov Decision Processes. Search on Bibsonomy MICAI The full citation details ... 2009 DBLP  DOI  BibTeX  RDF cosine simplex method, Markov decision processes, hybrid method, policy iteration
41Tiffany Barnes, John C. Stamper Toward Automatic Hint Generation for Logic Proof Tutoring Using Historical Student Data. Search on Bibsonomy Intelligent Tutoring Systems The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
41Carlos Diuk, Andre Cohen, Michael L. Littman An object-oriented representation for efficient reinforcement learning. Search on Bibsonomy ICML The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
41Stefan J. Witwicki, Edmund H. Durfee Commitment-driven distributed joint policy search. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF coordination, negotiation, agent modeling
41Janusz Marecki, Milind Tambe On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF decentralized Markov decision process, locally optimal solution, multi-agent systems, temporal constraints
41Alberto Reyes, Luis Enrique Sucar, Eduardo F. Morales 0001, Pablo H. Ibargüengoytia Solving Hybrid Markov Decision Processes. Search on Bibsonomy MICAI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
41Shulin Cui, Jigui Sun, Minghao Yin, Shuai Lu 0001 Solving Uncertain Markov Decision Problems: An Interval-Based Method. Search on Bibsonomy ICNC (2) The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
41Aurélie Beynier, Abdel-Illah Mouaddib A polynomial algorithm for decentralized Markov decision processes with temporal constraints. Search on Bibsonomy AAMAS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF multi-agent systems, uncertainty, planning, Markov decision processes
41Raphen Becker, Shlomo Zilberstein, Victor R. Lesser Decentralized Markov Decision Processes with Event-Driven Interactions. Search on Bibsonomy AAMAS The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
41Xi-Ren Cao From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning. Search on Bibsonomy Discret. Event Dyn. Syst. The full citation details ... 2003 DBLP  DOI  BibTeX  RDF gradient-based policy iteration, perturbation realization, TD(), Q-learning, Poisson equations, Potentials
41Raphen Becker, Shlomo Zilberstein, Victor R. Lesser, Claudia V. Goldman Transition-independent decentralized markov decision processes. Search on Bibsonomy AAMAS The full citation details ... 2003 DBLP  DOI  BibTeX  RDF decentralized MDP, decision-theoretic planning
40Calin Ciufudean, Otilia Ciufudean, Constantin Filote New Models for Immune Mechanism Diagnosis. Search on Bibsonomy MDA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Markov Decision Processes (MDPs), Immune mechanisms diagnosis, Petri nets
40Xi-Ren Cao Basic Ideas for Event-Based Optimization of Markov Systems. Search on Bibsonomy Discret. Event Dyn. Syst. The full citation details ... 2005 DBLP  DOI  BibTeX  RDF Markov decision processes (MDPs), performance potentials, policy gradients, aggregation, perturbation analysis, POMDPs, policy iteration
30Gellért Weisz, András György 0001, Csaba Szepesvári Online RL in Linearly qπ-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
30Runyu Zhang, Yang Hu, Na Li 0002 Regularized Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
30Gellért Weisz, András György 0001, Csaba Szepesvári Online RL in Linearly qπ-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
30Eugene A. Feinberg, Jefferson Huang Reduction of total-cost and average-cost MDPs with weakly continuous transition probabilities to discounted MDPs. Search on Bibsonomy Oper. Res. Lett. The full citation details ... 2018 DBLP  DOI  BibTeX  RDF
30Kim Bauters, Weiru Liu, Lluís Godo Anytime Algorithms for Solving Possibilistic MDPs and Hybrid MDPs. Search on Bibsonomy FoIKS The full citation details ... 2016 DBLP  DOI  BibTeX  RDF
30Richard S. Sutton, Doina Precup, Satinder Singh 0001 Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning. Search on Bibsonomy Artif. Intell. The full citation details ... 1999 DBLP  DOI  BibTeX  RDF
28Moser Silva Fagundes, Roberto Centeno, Holger Billhardt, Sascha Ossowski Designing Organized Multiagent Systems through MDPs. Search on Bibsonomy MATES The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
28Feng Wu 0001, Xiaoping Chen Solving Large-Scale and Sparse-Reward DEC-POMDPs with Correlation-MDPs. Search on Bibsonomy RoboCup The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
28Song Zhiwei, Chen Xiaoping States evolution in Theta(lambda)-learning based on logical MDPs with negation. Search on Bibsonomy SMC The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
28Sarah Osentoski, Sridhar Mahadevan Learning state-action basis functions for hierarchical MDPs. Search on Bibsonomy ICML The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
28Jianhui Wu 0006, Edmund H. Durfee Mixed-integer linear programming for transition-independent decentralized MDPs. Search on Bibsonomy AAMAS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF transition-independent decentralized MDP, mixed integer linear programming, MDP, piecewise linear approximation
28Gerardo I. Simari, Simon Parsons On the relationship between MDPs and the BDI architecture. Search on Bibsonomy AAMAS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF policy, markov decision process, intention
28Dmitri A. Dolgov, Edmund H. Durfee Computationally-efficient combinatorial auctions for resource allocation in weakly-coupled MDPs. Search on Bibsonomy AAMAS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF distributed implementation, generalized Vickrey auctions, markov decision processes, combinatorial auctions
28David I. Ferguson, Anthony Stentz Focussed Propagation of MDPs for Path Planning. Search on Bibsonomy ICTAI The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
27Lihong Li 0001, Michael L. Littman, Christopher R. Mansley Online exploration in least-squares policy iteration. Search on Bibsonomy AAMAS (2) The full citation details ... 2009 DBLP  BibTeX  RDF PAC-MDP, least-squares policy iteration (LSPI), reinforcement learning, Markov decision processes, exploration
27Yanjie Li, Baoqun Yin, Hongsheng Xi Partially Observable Markov Decision Processes and Performance Sensitivity Analysis. Search on Bibsonomy IEEE Trans. Syst. Man Cybern. Part B The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
27Pritam Roy, David Parker 0001, Gethin Norman, Luca de Alfaro Symbolic Magnifying Lens Abstraction in Markov Decision Processes. Search on Bibsonomy QEST The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
27Ronald Ortner Pseudometrics for State Aggregation in Average Reward Markov Decision Processes. Search on Bibsonomy ALT The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richard Maclin Building Relational World Models for Reinforcement Learning. Search on Bibsonomy ILP The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepalli Multi-task reinforcement learning: a hierarchical Bayesian approach. Search on Bibsonomy ICML The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27Jeffrey Johns, Sridhar Mahadevan Constructing basis functions from directed graphs for value function approximation. Search on Bibsonomy ICML The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27Jennifer Boger, Jesse Hoey, Pascal Poupart, Craig Boutilier, Geoff R. Fernie, Alex Mihailidis A Planning System Based on Markov Decision Processes to Guide People With Dementia Through Activities of Daily Living. Search on Bibsonomy IEEE Trans. Inf. Technol. Biomed. The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Haibo Zhao, Prashant Doshi A Hierarchical Framework for Composing Nested Web Processes. Search on Bibsonomy ICSOC The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Marta Z. Kwiatkowska, Gethin Norman, David Parker 0001 Game-based Abstraction for Markov Decision Processes. Search on Bibsonomy QEST The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Nicole Immorlica, Kamal Jain, Mohammad Mahdian Game-Theoretic Aspects of Designing Hyperlink Structures. Search on Bibsonomy WINE The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Krishnendu Chatterjee, Rupak Majumdar, Thomas A. Henzinger Markov Decision Processes with Multiple Objectives. Search on Bibsonomy STACS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Kristian Kersting, Luc De Raedt Logical Markov Decision Programs and the Convergence of Logical TD(lambda). Search on Bibsonomy ILP The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
27Dmitri A. Dolgov, Edmund H. Durfee Graphical Models in Local, Asymmetric Multi-Agent Markov Decision Processes. Search on Bibsonomy AAMAS The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
27Fletcher Lu, Dale Schuurmans Model-Based Least-Squares Policy Evaluation. Search on Bibsonomy AI The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
27Mohammad Ghavamzadeh, Sridhar Mahadevan A multiagent reinforcement learning algorithm by dynamically merging markov decision processes. Search on Bibsonomy AAMAS The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
26Jianhui Wu 0006, Edmund H. Durfee Sequential resource allocation in multiagent systems with uncertainties. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF constrained MDPs, mission phasing, sequential resource allocation, mixed integer linear programming
26Thomas A. Wagner, Anita Raja, Victor R. Lesser Modeling Uncertainty and its Implications to Sophisticated Control in Tæms Agents. Search on Bibsonomy Auton. Agents Multi Agent Syst. The full citation details ... 2006 DBLP  DOI  BibTeX  RDF Agent scheduling, Contingency analysis, Uncertainty, Intelligent agents, Control, MDPs
26Jiaying Shen, Victor R. Lesser, Norman Carver Minimizing communication cost in a distributed Bayesian network using a decentralized MDP. Search on Bibsonomy AAMAS The full citation details ... 2003 DBLP  DOI  BibTeX  RDF decentralized MDPs, Bayesian networks, action selection, decision-theoretic planning, coordination of multiple agents
15Sivaramakrishnan Ramani, Archis Ghate A Family of \(\boldsymbol{s}\)-Rectangular Robust MDPs: Relative Conservativeness, Asymptotic Analyses, and Finite-Sample Properties. Search on Bibsonomy SIAM J. Optim. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Long-Fei Li, Peng Zhao 0006, Zhi-Hua Zhou Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Toshinori Kitamura, Tadashi Kozuno, Masahiro Kato, Yuki Ichihara, Soichiro Nishimori, Akiyoshi Sannai, Sho Sonoda, Wataru Kumagai, Yutaka Matsuo A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Berk Bozkurt, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang 0002 Model approximation in MDPs with unbounded per-step cost. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Adrian Müller, Pragnya Alatur, Volkan Cevher, Giorgia Ramponi, Niao He Truly No-Regret Learning in Constrained MDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Michael Gimelfarb, Ayal Taitler, Scott Sanner Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Junze Deng, Yuan Cheng, Shaofeng Zou, Yingbin Liang Sample Complexity Characterization for Linear Contextual MDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Davide Maran, Alberto Maria Metelli, Matteo Papini, Marcello Restelli No-Regret Reinforcement Learning in Smooth MDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi 0001, Nicola Gatti 0001 Learning Adversarial MDPs with Stochastic Hard Constraints. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Kazuki Watanabe 0003, Marck van der Vegt, Ichiro Hasuo, Jurriaan Rot, Sebastian Junges Pareto Curves for Compositionally Model Checking String Diagrams of MDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Matthew Zurek, Yudong Chen Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Hei Yi Mak, Flint Xiaofeng Fan, Luca A. Lanzendörfer, Cheston Tan, Wei Tsang Ooi, Roger Wattenhofer CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Menno van Zutphen, Giannis Delimpaltadakis, Maurice Heemels, Duarte Antunes Predictable Interval MDPs through Entropy Regularization. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Kihyuk Hong, Ambuj Tewari A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Low-Rank MDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Qinbo Bai, Washim Uddin Mondal, Vaneet Aggarwal Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Omer Ben-Porat, Yishay Mansour, Michal Moshkovitz, Boaz Taitler Principal-Agent Reward Shaping in MDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Prashansa Panda, Shalabh Bhatnagar Critic-Actor for Average Reward MDPs with Function Approximation: A Finite-Time Analysis. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Ian A. Kash, Lev Reyzin, Zishun Yu Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPs. Search on Bibsonomy ALT The full citation details ... 2024 DBLP  BibTeX  RDF
15Yulong Gao, Karl Henrik Johansson, Alessandro Abate CTL Model Checking of MDPs over Distribution Spaces: Algorithms and Sampling-based Computations. Search on Bibsonomy HSCC The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Mateo Perez, Fabio Somenzi, Ashutosh Trivedi 0001 A PAC Learning Algorithm for LTL and Omega-Regular Objectives in MDPs. Search on Bibsonomy AAAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Long-Fei Li, Peng Zhao 0006, Zhi-Hua Zhou Dynamic Regret of Adversarial MDPs with Unknown Transition and Linear Function Approximation. Search on Bibsonomy AAAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Omer Ben-Porat, Yishay Mansour, Michal Moshkovitz, Boaz Taitler Principal-Agent Reward Shaping in MDPs. Search on Bibsonomy AAAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Uri Gadot, Esther Derman, Navdeep Kumar, Maxence Mohamed Elfatihi, Kfir Levy, Shie Mannor Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization. Search on Bibsonomy AAAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Kazuki Watanabe 0003, Marck van der Vegt, Ichiro Hasuo, Jurriaan Rot, Sebastian Junges Pareto Curves for Compositionally Model Checking String Diagrams of MDPs. Search on Bibsonomy TACAS (2) The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Junze Deng, Yuan Cheng, Shaofeng Zou, Yingbin Liang Sample Complexity Characterization for Linear Contextual MDPs. Search on Bibsonomy AISTATS The full citation details ... 2024 DBLP  BibTeX  RDF
15Long-Fei Li, Peng Zhao, Zhi-Hua Zhou Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition. Search on Bibsonomy AISTATS The full citation details ... 2024 DBLP  BibTeX  RDF
15Miruna Oprescu, Andrew Bennett, Nathan Kallus Low-rank MDPs with Continuous Action Spaces. Search on Bibsonomy AISTATS The full citation details ... 2024 DBLP  BibTeX  RDF
15Germano Gabbianelli, Gergely Neu, Matteo Papini, Nneka Okolo Offline Primal-Dual Reinforcement Learning for Linear MDPs. Search on Bibsonomy AISTATS The full citation details ... 2024 DBLP  BibTeX  RDF
15Uday Kumar M, Veeraruna Kavitha, Sanjay P. Bhat, Nandyala Hemachandra Optimal Markov Policies for Finite-Horizon Constrained MDPs With Combined Additive and Multiplicative Utilities. Search on Bibsonomy IEEE Control. Syst. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Qinbo Bai, Vaneet Aggarwal, Ather Gattami Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints. Search on Bibsonomy J. Mach. Learn. Res. The full citation details ... 2023 DBLP  BibTeX  RDF
15Ali Devran Kara, Naci Saldi, Serdar Yüksel Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity. Search on Bibsonomy J. Mach. Learn. Res. The full citation details ... 2023 DBLP  BibTeX  RDF
15Shaorong Xie, Zhenyu Zhang 0013, Hang Yu 0006, Xiangfeng Luo Recurrent prediction model for partially observable MDPs. Search on Bibsonomy Inf. Sci. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Kosuke Sakamoto, Yasuharu Kunii A MDPs-Based Dynamic Path Planning in Unknown Environments for Hopping Locomotion. Search on Bibsonomy IEEE Access The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Ravi N. Haksar, Mac Schwager Constrained Control of Large Graph-Based MDPs Under Measurement Uncertainty. Search on Bibsonomy IEEE Trans. Autom. Control. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Frantisek Blahoudek, Petr Novotný 0001, Melkior Ornik, Pranay Thangeda, Ufuk Topcu Efficient Strategy Synthesis for MDPs With Resource Constraints. Search on Bibsonomy IEEE Trans. Autom. Control. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15S. Akshay 0001, Krishnendu Chatterjee, Tobias Meggendorfer, Dorde Zikelic MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Junkai Zhang, Weitong Zhang, Quanquan Gu Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Matthew Zurek, Yudong Chen Span-Based Optimal Sample Complexity for Average Reward MDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Kasper Engelen, Guillermo A. Pérez 0001, Shrisha Rao 0002 Graph-Based Reductions for Parametric and Weighted MDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Zakaria Mhammedi, Adam Block, Dylan J. Foster, Alexander Rakhlin Efficient Model-Free Exploration in Low-Rank MDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh 0001, Tom Zahavy ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
Displaying result #1 - #100 of 1017 (100 per page; Change: )
Pages: [1][2][3][4][5][6][7][8][9][10][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license