The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for PPO with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1988-2018 (15) 2019-2020 (17) 2021-2022 (25) 2023 (41) 2024 (8)
Publication types (Num. hits)
article(57) data(1) inproceedings(47) phdthesis(1)
Venues (Conferences, Journals, ...)
CoRR(29) IEEE Internet Things J.(4) AAAI(3) GLOBECOM(3) ICPADS(3) Sensors(3) AAMAS(2) ICLR(2) IEEE Access(2) IWCMC(2) ACC(1) Appl. Intell.(1) Appl. Soft Comput.(1) Bioinform.(1) CCEAI(1) CoG(1) More (+10 of total 61)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 5 occurrences of 5 keywords

Results
Found 106 publication records. Showing 106 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
76Su Xia, Hongyi Wu A CDMA-based approach for highly efficient medium access control in mesh wireless networks. Search on Bibsonomy WOWMOM The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
51Thomas Bousonville, Filippo Focacci, Claude Le Pape, Wim Nuijten, Frederic Paulin, Jean-Francois Puget, Anna Robert, Alireza Sadeghin Integration of Rules and Optimization in Plant PowerOps. Search on Bibsonomy CPAIOR The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
51H. J. Pu, M. Müller, E. Abdalla, L. Abdelatif, E. Mokhtar Bakr, Hassan A. Nour Eldin Parallel computation of the inertia matrix of a tree type robot using one directional recursion of Newton-Euler formulation. Search on Bibsonomy J. Intell. Robotic Syst. The full citation details ... 1996 DBLP  DOI  BibTeX  RDF parallel computation, Robotics, robot dynamics
51Jozsef A. Toth Specification of an Object to Object Protocol in Abstract Syntax Notation One (ASN.1). Search on Bibsonomy IEA/AIE (Vol. 2) The full citation details ... 1990 DBLP  DOI  BibTeX  RDF
46Yongheng Liang, Hejun Wu, Haitao Wang ASM-PPO: Asynchronous and Scalable Multi-Agent PPO for Cooperative Charging. (PDF / PS) Search on Bibsonomy AAMAS The full citation details ... 2022 DBLP  BibTeX  RDF
37Junrong Liang, Jiang Zheng, Xin Zhao Distribution of Antioxidatases in Cell of Diatom Nitzschia Closterium and Response to Different Environmental Silicon Concentrations. Search on Bibsonomy ESIAT (1) The full citation details ... 2009 DBLP  DOI  BibTeX  RDF diatom Nitzschia closterium, plasma membrane, antioxidatase, PPO, POD, environmental silicon concentration, CAT, SOD
25Anna Stein SASHA: The Automatic Generation of Rule-based Diagnostic Expert Systems. Search on Bibsonomy IEA/AIE (Vol. 1) The full citation details ... 1988 DBLP  DOI  BibTeX  RDF
23Yue Guan, Sai Zou, Haixia Peng, Wei Ni 0001, Yanglong Sun, Hongfeng Gao Cooperative UAV Trajectory Design for Disaster Area Emergency Communications: A Multiagent PPO Method. Search on Bibsonomy IEEE Internet Things J. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Chanyuan Meng, Ke Xiong 0001, Wei Chen 0002, Bo Gao, Pingyi Fan, Khaled Ben Letaief Sum-Rate Maximization in STAR-RIS-Assisted RSMA Networks: A PPO-Based Algorithm. Search on Bibsonomy IEEE Internet Things J. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Haonan An, Lin Wang 0023 Robust Topology Generation of Internet of Things Based on PPO Algorithm Using Discrete Action Space. Search on Bibsonomy IEEE Trans. Ind. Informatics The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Shengyi Huang, Michael Noukhovitch, Arian Hosseini, Kashif Rasul, Weixun Wang, Lewis Tunstall The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Junyang Zhang, Cristian Emanuel Ocampo Rivera, Kyle Tyni, Steven Nguyen A PPO-based DRL Auto-Tuning Nonlinear PID Drone Controller for Robust Autonomous Flights. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Yuchen Liu, Ka Lok Man, Gangmin Li, Terry R. Payne, Yong Yue 0001 Evaluating and Selecting Deep Reinforcement Learning Models for OptimalDynamic Pricing: A Systematic Comparison of PPO, DDPG, and SAC. Search on Bibsonomy CCEAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Jakob J. Hollenstein, Georg Martius, Justus H. Piater Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling. Search on Bibsonomy AAAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Nai-Chieh Huang, Ping-Chun Hsieh, Kuo-Hao Ho, I-Chen Wu PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping. Search on Bibsonomy AAAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
23Kang Liu, Wei Quan 0001, Nan Cheng, Wen Wu 0003, Ziheng Xu, Liang Guo 0003, Deyun Gao, Hongke Zhang Reliable PPO-Based Concurrent Multipath Transfer for Time-Sensitive Applications. Search on Bibsonomy IEEE Trans. Veh. Technol. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Yikun Zhao, Fanqin Zhou, Huaide Liu, Lei Feng 0001, Wenjing Li 0001 PPO-based deployment and phase control for movable intelligent reflecting surface. Search on Bibsonomy J. Cloud Comput. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Sanjna Siboo, Anushka Bhattacharyya, Rashmi Naveen Raj, S. H. Ashwin An Empirical Study of DDPG and PPO-Based Reinforcement Learning Algorithms for Autonomous Driving. Search on Bibsonomy IEEE Access The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Gyeong Ho Lee, Hyunseo Park, Jae Won Jang, Jaeseob Han, Jun Kyun Choi PPO-Based Autonomous Transmission Period Control System in IoT Edge Computing. Search on Bibsonomy IEEE Internet Things J. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Haijun Zhang 0001, Minghui Jiang 0006, Xiangnan Liu, Xiangming Wen, Ning Wang 0004, Keping Long PPO-Based PDACB Traffic Control Scheme for Massive IoV Communications. Search on Bibsonomy IEEE Trans. Intell. Transp. Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Zhiling Jiang, Yining Chen, Ke Wang, Bowei Yang, Guanghua Song A Graph-Based PPO Approach in Multi-UAV Navigation for Communication Coverage. Search on Bibsonomy Int. J. Comput. Commun. Control The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Shuxin Yang, Xiaoyang Chang, Guixiang Zhu, Jie Cao 0001, Weiping Qin, Youquan Wang, Zhendong Wang GAA-PPO: A novel graph adversarial attack method by incorporating proximal policy optimization. Search on Bibsonomy Neurocomputing The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Ruichen Zhang, Ke Xiong 0001, Yang Lu 0008, Pingyi Fan, Derrick Wing Kwan Ng, Khaled B. Letaief Energy Efficiency Maximization in RIS-Assisted SWIPT Networks With RSMA: A PPO-Based Approach. Search on Bibsonomy IEEE J. Sel. Areas Commun. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Xiaoxue Yu, Rongpeng Li, Fei Wang, Chenghui Peng, Chengchao Liang, Zhifeng Zhao, Honggang Zhang 0001 Communication-Efficient Cooperative Multi-Agent PPO via Regulated Segment Mixture in Internet of Vehicles. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Michael Santacroce, Yadong Lu, Han Yu, Yuanzhi Li, Yelong Shen Efficient RLHF: Reducing the Memory Usage of PPO. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Chengcheng Han 0004, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li 0067, Ming Gao, Baoyuan Wang DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Jiacheng Liu 0010, Andrew Cohen, Ramakanth Pasunuru, Yejin Choi 0001, Hannaneh Hajishirzi, Asli Celikyilmaz Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Jakob J. Hollenstein, Georg Martius, Justus H. Piater Colored Noise in PPO: Improved Exploration and Performance Through Correlated Action Sampling. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan 0001, Tao Gui, Qi Zhang 0001, Xipeng Qiu, Xuanjing Huang 0001 Secrets of RLHF in Large Language Models Part I: PPO. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin 0003, Hongyuan Zha, Xiangfeng Wang Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Chenwei Xu, Jerry Yao-Chieh Hu, Aakaash Narayanan, Mattson Thieme, Vladimir Nagaslaev, Mark Austin, Jeremy Arnold, Jose Berlioz, Pierrick Hanlet, Aisha Ibrahim, Dennis Nicklaus, Jovan Mitrevski, Jason Michael St. John, Gauri Pradhan, Andrea Saewert, Kiyomi Seiya, Brian Schupbach, Randy Thurman-Keup, Nhan Tran, Rui Shi, Seda Ogrenci, Alexis Maya-Isabelle Shuping, Kyle J. Hazelwood, Han Liu Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Mandan Naresh, Paresh Saxena, Manik Gupta PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Nai-Chieh Huang, Ping-Chun Hsieh, Kuo-Hao Ho, I-Chen Wu PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Niloofar Gholipour, Marcos Dias de Assunção, Pranav Agarwal, Julien Gascon-Samson, Rajkumar Buyya TPTO: A Transformer-PPO based Task Offloading Solution for Edge Computing Environments. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Guanlin Wu, Wenqi Fang, Ji Wang 0002, Pin Ge, Jiang Cao, Yang Ping, Peng Gou Dyna-PPO reinforcement learning with Gaussian process for the continuous action decision-making in autonomous driving. Search on Bibsonomy Appl. Intell. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Bingxu Zhao, Hongbin Dong, Yingjie Wang 0002, Tingwei Pan PPO-TA: Adaptive task allocation via Proximal Policy Optimization for spatio-temporal crowdsourcing. Search on Bibsonomy Knowl. Based Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Qianhao Xiao, Li Jiang, Manman Wang, Xin Zhang An Improved Distributed Sampling PPO Algorithm Based on Beta Policy for Continuous Global Path Planning Scheme. Search on Bibsonomy Sensors The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Xiao Wang, Zhaohui Yang, Xueqian Bai, Mingjiang Ji, Hao Li, Dechao Ran A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader-Follower Tracking Problem. Search on Bibsonomy Sensors The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Chengqing Liang, Lei Liu 0008, Chen Liu Multi-UAV autonomous collision avoidance based on PPO-GIC algorithm with CNN-LSTM fusion network. Search on Bibsonomy Neural Networks The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Li Li, Wei Li 0106, Jun Wang 0005, Xiaonan Chen, Qihang Peng, Wei Huang 0021 UAV Trajectory Optimization for Spectrum Cartography: A PPO Approach. Search on Bibsonomy IEEE Commun. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Tao Jing, Zha Liu, Minghao Zhu, Xuehan Li, Bo Gao, Qinghe Gao, Yan Huo P-DRR: PPO-Based Efficient Dynamic Resource Reallocation Scheme in Industrial Internet of Things. Search on Bibsonomy VTC Fall The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Mandan Naresh, Paresh Saxena, Manik Gupta PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming. Search on Bibsonomy IWCMC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Wenwu Zhu 0007, Xin Chen 0018, Libo Jiao, Geyong Min, Wang Li Cost-Efficient 6G Space-Air-Ground Integrated Mobile Edge Computing for Smart City: A PPO-Based Offloading Decision and Resource Allocation Algorithm. Search on Bibsonomy HPCC/DSS/SmartCity/DependSys The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Christian T. Coletti, Kyle A. Williams, Hannah C. Lehman, Zahi M. Kakish, Daniel Whitten, Julie Parish Effectiveness of Warm-Start PPO for Guidance with Highly Constrained Nonlinear Fixed-Wing Dynamics. Search on Bibsonomy ACC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Tianyi Lin, Jun Du, Haijun Zhang 0001, Arumugam Nallanathan, Jun Wang PPO-Based Energy-Efficient Power Control and Spectrum Allocation in In-Vehicle HetNets. Search on Bibsonomy GLOBECOM The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Xiaoxue Yu, Rongpeng Li, Fei Wang, Chenghui Peng, Chengchao Liang, Zhifeng Zhao, Honggang Zhang 0001 Communication-Efficient Cooperative Multi-Agent PPO via Regulated Segment Mixture in Internet of Vehicles. Search on Bibsonomy GLOBECOM The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Xing Chen, Dongcui Diao, Hechang Chen, Hengshuai Yao, Haiyin Piao, Zhixiao Sun, Zhiwei Yang 0005, Randy Goebel, Bei Jiang, Yi Chang 0001 The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure. Search on Bibsonomy AAAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Yaqin Li, Zhicai Zhang, Fang Fu, Yan Wang A PPO-Based Dynamic Asynchronous Semi-Decentralized Federated Edge Learning. Search on Bibsonomy ICPADS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Niloofar Gholipour, Marcos Dias de Assunção, Pranav Agarwal, Julien Gascon-Samson, Rajkumar Buyya TPTO: A Transformer-PPO based Task Offloading Solution for Edge Computing Environments. Search on Bibsonomy ICPADS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Wei Zhao 0023, Runhu Zhong, Cheng Wu, Xinwei Xu Delay and Battery Degradation Optimization based on PPO for Task Offloading in RSU-assisted IoV. Search on Bibsonomy ICPADS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Haokun Zhang Inverse-Huber Loss Based PPO algorithm. Search on Bibsonomy RICAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Chengcheng Han 0004, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li 0067, Ming Gao, Baoyuan Wang DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models. Search on Bibsonomy EMNLP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Donghe Li, Chunlin Hu, Qingyu Yang, Shitao Chen Multi Actor-Critic PPO: A Novel Reinforcement Learning Method for Intelligent Task and Charging Scheduling in Electric Freight Vehicles Management. Search on Bibsonomy ITSC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Fang Li, Xueyan Wang, Yining Liu, Li Luo Penetration Test Path Discovery Based on NHSC-PPO. Search on Bibsonomy EITCE The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Lian Liu, Dongpu Sun Research on Multi-agent PPO Reinforcement Learning Algorithm based on Knowledge Graph. Search on Bibsonomy DSA The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Mingfei Sun, Sam Devlin, Jacob Beck, Katja Hofmann, Shimon Whiteson Trust Region Bounds for Decentralized PPO Under Non-stationarity. Search on Bibsonomy AAMAS The full citation details ... 2023 DBLP  BibTeX  RDF
23Guohao Zhu, Zhou Shen, Laiyuan Liu, Sicong Zhao, Fangzheng Ji, Zixia Ju, Jialong Sun AUV Dynamic Obstacle Avoidance Method Based on Improved PPO Algorithm. Search on Bibsonomy IEEE Access The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Mingfei Sun, Sam Devlin, Katja Hofmann, Shimon Whiteson Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  BibTeX  RDF
23Mingfei Sun, Vitaly Kurin, Guoqing Liu, Sam Devlin, Tao Qin 0001, Katja Hofmann, Shimon Whiteson You May Not Need Ratio Clipping in PPO. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  BibTeX  RDF
23Jin Fang, Jiacheng Weng, Yi Xiang, Xinwen Zhang Imitate then Transcend: Multi-Agent Optimal Execution with Dual-Window Denoise PPO. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Shengyi Huang, Anssi Kanervisto, Antonin Raffin, Weixun Wang, Santiago Ontañón, Rousslan Fernand Julien Dossa A2C is a special case of PPO. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Qisheng Zhang, Zhen Guo, Audun Jøsang, Lance M. Kaplan, Feng Chen 0001, Dong Hyun Jeong, Jin-Hee Cho PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Behnam Mohammad Hasani Zade, Najme Mansouri PPO: a new nature-inspired metaheuristic algorithm based on predation for optimization. Search on Bibsonomy Soft Comput. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Sérgio F. Chevtchenko, Eduardo J. Barbosa, Marcelo Cabral Cavalcanti, Gustavo Medeiros de Souza Azevedo, Teresa Bernarda Ludermir Combining PPO and incremental conductance for MPPT under dynamic shading and temperature. Search on Bibsonomy Appl. Soft Comput. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Wei Guan, Zhewen Cui, Xianku Zhang Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm. Search on Bibsonomy Sensors The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Khalil Chikhaoui, Hakim Ghazzai, Yehia Massoud PPO-based Reinforcement Learning for UAV Navigation in Urban Environments. Search on Bibsonomy MWSCAS The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Prasanna Kumar Kukkamalla, Veli-Matti Uski, Olli Kuismanen, Hannu Kärkkäinen, Karan Menon Data Analytics Capability Roadmap for PPO Business Models in Equipment Manufacturing Companies. Search on Bibsonomy PLM The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Kang Liu, Wei Quan 0001, Nan Cheng, Ziheng Xu, Jun Deng, Deyun Gao PPO-based Reliable Concurrent Transmission Control for Telemedicine Real-time Services. Search on Bibsonomy ICC The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Xiangyu Li, Hangyue Liu, Chaojie Li, Guo Chen 0002, Shiping Wen 0001 PPO-based Pricing Method for Shared Energy Storage System. Search on Bibsonomy ISGT Asia The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Abdulrahman Nahhas, Andrey Kharitonov, Klaus Turowski Deep Reinforcement Learning Techniques For Solving Hybrid Flow Shop Scheduling Problems: Proximal Policy Optimization (PPO) and Asynchronous Advantage Actor-Critic (A3C). Search on Bibsonomy HICSS The full citation details ... 2022 DBLP  BibTeX  RDF
23Kun Du, Xianzhong Xie, Zhaoyuan Shi, Min Li Joint Time and Power Control of Energy Harvesting CRN Based on PPO. Search on Bibsonomy WTS The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Chao Yu 0005, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre M. Bayen, Yi Wu 0013 The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games. Search on Bibsonomy NeurIPS The full citation details ... 2022 DBLP  BibTeX  RDF
23Bilal Kabas Autonomous UAV Navigation via Deep Reinforcement Learning Using PPO. Search on Bibsonomy SIU The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
23Markus Holzleitner, Lukas Gruber, José Antonio Arjona-Medina, Johannes Brandstetter, Sepp Hochreiter Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER. Search on Bibsonomy Trans. Large Scale Data Knowl. Centered Syst. The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Mingwu Zhang, Yu Chen 0056, Zhe Xia, Jiangyi Du, Willy Susilo PPO-DFK: A Privacy-Preserving Optimization of Distributed Fractional Knapsack With Application in Secure Footballer Configurations. Search on Bibsonomy IEEE Syst. J. The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Hsuan-Yu Yao, Ping-Chun Hsieh, Kuo-Hao Ho, Kai-Chun Hu, Liang-Chun Ouyang, I-Chen Wu Hinge Policy Optimization: Rethinking Policy Improvement and Reinterpreting PPO. Search on Bibsonomy CoRR The full citation details ... 2021 DBLP  BibTeX  RDF
23Xingxing Liang, Yang Ma, Yanghe Feng, Zhong Liu PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay. Search on Bibsonomy CoRR The full citation details ... 2021 DBLP  BibTeX  RDF
23Yunxiao Guo, Han Long, Xiaojun Duan, Kaiyuan Feng, Maochu Li, Xiaying Ma CIM-PPO: Proximal Policy Optimization with Liu-Correntropy Induced Metric. Search on Bibsonomy CoRR The full citation details ... 2021 DBLP  BibTeX  RDF
23Jiahao Shen, Tao Zhang 0063, Bingchi Zhang, Weixiao Ji, Xiaohui Kuang, Changqiao Xu PPO-RM: Proximal Policy Optimization Based Route Mutation for Multimedia Services. Search on Bibsonomy IWCMC The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Haijun Zhang 0001, Xiangnan Liu, Keping Long, H. Vincent Poor Primal Dual PPO Learning Resource Allocation in Indoor IRS-Aided Networks. Search on Bibsonomy GLOBECOM The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
23Mingwu Zhang, Yu Chen 0056, Willy Susilo PPO-CPQ: A Privacy-Preserving Optimization of Clinical Pathway Query for E-Healthcare Systems. Search on Bibsonomy IEEE Internet Things J. The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Markus Holzleitner, Lukas Gruber, Jose A. Arjona-Medina, Johannes Brandstetter, Sepp Hochreiter Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
23Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
23Mario S. Holubar, Marco A. Wiering Continuous-action Reinforcement Learning for Playing Racing Games: Comparing SPG to PPO. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
23Ju-Seung Byun, Byungmoon Kim, Huamin Wang Proximal Policy Gradient: PPO with Policy Gradient. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
23Cheng-Yen Tang, Chien-Hung Liu, Woei-Kae Chen, Shingchern D. You Implementing action mask in proximal policy optimization (PPO) algorithm. Search on Bibsonomy ICT Express The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, Jaakko Lehtinen PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation. Search on Bibsonomy MLSP The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Wlodzimierz Funika, Pawel Koperek, Jacek Kitowski Management of Heterogeneous Cloud Resources with Use of the PPO. Search on Bibsonomy Euro-Par Workshops The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Moksh Jain, S. Sowmya Kamath Improving Convergence in IRGAN with PPO. Search on Bibsonomy COMAD/CODS The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
23Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry Implementation Matters in Deep RL: A Case Study on PPO and TRPO. Search on Bibsonomy ICLR The full citation details ... 2020 DBLP  BibTeX  RDF
23Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames. Search on Bibsonomy ICLR The full citation details ... 2020 DBLP  BibTeX  RDF
23Lianjiang Li, Yunrong Yang, Bingna Li Combine PPO with NES to Improve Exploration. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
23Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra Decentralized Distributed PPO: Solving PointGoal Navigation. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
23Joe Booth PPO Dash: Improving Generalization in Deep Reinforcement Learning. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
23Mingwu Zhang PPO-DFK. Search on Bibsonomy 2019   DOI  RDF
23Jia-Chi Chen, Tao-Hsing Chang Modified PPO-RND Method for Solving Sparse Reward Problem in ViZDoom. Search on Bibsonomy CoG The full citation details ... 2019 DBLP  DOI  BibTeX  RDF
23An Guo, Lianghua Song, Xiong Chen Learning Similar Tasks Based On PPO By Transferring Trajectory. Search on Bibsonomy ICNSC The full citation details ... 2019 DBLP  DOI  BibTeX  RDF
23Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, Jaakko Lehtinen PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation. Search on Bibsonomy CoRR The full citation details ... 2018 DBLP  BibTeX  RDF
23Haitham Al-Jabri, Takafumi Matsumaru Proposing Camera Calibration Method Using PPO (Proximal Policy Optimization) for Improving Camera Pose Estimations. Search on Bibsonomy ROBIO The full citation details ... 2018 DBLP  DOI  BibTeX  RDF
23S. Raghavendra, S. J. Aditya Rao, Vadlapudi Kumar, C. K. Ramesh Multiple ligand simultaneous docking (MLSD): A novel approach to study the effect of inhibitors on substrate binding to PPO. Search on Bibsonomy Comput. Biol. Chem. The full citation details ... 2015 DBLP  DOI  BibTeX  RDF
Displaying result #1 - #100 of 106 (100 per page; Change: )
Pages: [1][2][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license