Results
Found 106 publication records. Showing 106 according to the selection in the facets.
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
76 | Su Xia, Hongyi Wu |
A CDMA-based approach for highly efficient medium access control in mesh wireless networks. |
WOWMOM |
2009 |
DBLP DOI BibTeX RDF |
|
51 | Thomas Bousonville, Filippo Focacci, Claude Le Pape, Wim Nuijten, Frederic Paulin, Jean-Francois Puget, Anna Robert, Alireza Sadeghin |
Integration of Rules and Optimization in Plant PowerOps. |
CPAIOR |
2005 |
DBLP DOI BibTeX RDF |
|
51 | H. J. Pu, M. Müller, E. Abdalla, L. Abdelatif, E. Mokhtar Bakr, Hassan A. Nour Eldin |
Parallel computation of the inertia matrix of a tree type robot using one directional recursion of Newton-Euler formulation. |
J. Intell. Robotic Syst. |
1996 |
DBLP DOI BibTeX RDF |
parallel computation, Robotics, robot dynamics |
51 | Jozsef A. Toth |
Specification of an Object to Object Protocol in Abstract Syntax Notation One (ASN.1). |
IEA/AIE (Vol. 2) |
1990 |
DBLP DOI BibTeX RDF |
|
46 | Yongheng Liang, Hejun Wu, Haitao Wang |
ASM-PPO: Asynchronous and Scalable Multi-Agent PPO for Cooperative Charging. |
AAMAS |
2022 |
DBLP BibTeX RDF |
|
37 | Junrong Liang, Jiang Zheng, Xin Zhao |
Distribution of Antioxidatases in Cell of Diatom Nitzschia Closterium and Response to Different Environmental Silicon Concentrations. |
ESIAT (1) |
2009 |
DBLP DOI BibTeX RDF |
diatom Nitzschia closterium, plasma membrane, antioxidatase, PPO, POD, environmental silicon concentration, CAT, SOD |
25 | Anna Stein |
SASHA: The Automatic Generation of Rule-based Diagnostic Expert Systems. |
IEA/AIE (Vol. 1) |
1988 |
DBLP DOI BibTeX RDF |
|
23 | Yue Guan, Sai Zou, Haixia Peng, Wei Ni 0001, Yanglong Sun, Hongfeng Gao |
Cooperative UAV Trajectory Design for Disaster Area Emergency Communications: A Multiagent PPO Method. |
IEEE Internet Things J. |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Chanyuan Meng, Ke Xiong 0001, Wei Chen 0002, Bo Gao, Pingyi Fan, Khaled Ben Letaief |
Sum-Rate Maximization in STAR-RIS-Assisted RSMA Networks: A PPO-Based Algorithm. |
IEEE Internet Things J. |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Haonan An, Lin Wang 0023 |
Robust Topology Generation of Internet of Things Based on PPO Algorithm Using Discrete Action Space. |
IEEE Trans. Ind. Informatics |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Shengyi Huang, Michael Noukhovitch, Arian Hosseini, Kashif Rasul, Weixun Wang, Lewis Tunstall |
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Junyang Zhang, Cristian Emanuel Ocampo Rivera, Kyle Tyni, Steven Nguyen |
A PPO-based DRL Auto-Tuning Nonlinear PID Drone Controller for Robust Autonomous Flights. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Yuchen Liu, Ka Lok Man, Gangmin Li, Terry R. Payne, Yong Yue 0001 |
Evaluating and Selecting Deep Reinforcement Learning Models for Optimal Dynamic Pricing: A Systematic Comparison of PPO, DDPG, and SAC. |
CCEAI |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Jakob J. Hollenstein, Georg Martius, Justus H. Piater |
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling. |
AAAI |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Nai-Chieh Huang, Ping-Chun Hsieh, Kuo-Hao Ho, I-Chen Wu |
PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping. |
AAAI |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Kang Liu, Wei Quan 0001, Nan Cheng, Wen Wu 0003, Ziheng Xu, Liang Guo 0003, Deyun Gao, Hongke Zhang |
Reliable PPO-Based Concurrent Multipath Transfer for Time-Sensitive Applications. |
IEEE Trans. Veh. Technol. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Yikun Zhao, Fanqin Zhou, Huaide Liu, Lei Feng 0001, Wenjing Li 0001 |
PPO-based deployment and phase control for movable intelligent reflecting surface. |
J. Cloud Comput. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Sanjna Siboo, Anushka Bhattacharyya, Rashmi Naveen Raj, S. H. Ashwin |
An Empirical Study of DDPG and PPO-Based Reinforcement Learning Algorithms for Autonomous Driving. |
IEEE Access |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Gyeong Ho Lee, Hyunseo Park, Jae Won Jang, Jaeseob Han, Jun Kyun Choi |
PPO-Based Autonomous Transmission Period Control System in IoT Edge Computing. |
IEEE Internet Things J. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Haijun Zhang 0001, Minghui Jiang 0006, Xiangnan Liu, Xiangming Wen, Ning Wang 0004, Keping Long |
PPO-Based PDACB Traffic Control Scheme for Massive IoV Communications. |
IEEE Trans. Intell. Transp. Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Zhiling Jiang, Yining Chen, Ke Wang, Bowei Yang, Guanghua Song |
A Graph-Based PPO Approach in Multi-UAV Navigation for Communication Coverage. |
Int. J. Comput. Commun. Control |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Shuxin Yang, Xiaoyang Chang, Guixiang Zhu, Jie Cao 0001, Weiping Qin, Youquan Wang, Zhendong Wang |
GAA-PPO: A novel graph adversarial attack method by incorporating proximal policy optimization. |
Neurocomputing |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Ruichen Zhang, Ke Xiong 0001, Yang Lu 0008, Pingyi Fan, Derrick Wing Kwan Ng, Khaled B. Letaief |
Energy Efficiency Maximization in RIS-Assisted SWIPT Networks With RSMA: A PPO-Based Approach. |
IEEE J. Sel. Areas Commun. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Xiaoxue Yu, Rongpeng Li, Fei Wang, Chenghui Peng, Chengchao Liang, Zhifeng Zhao, Honggang Zhang 0001 |
Communication-Efficient Cooperative Multi-Agent PPO via Regulated Segment Mixture in Internet of Vehicles. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Michael Santacroce, Yadong Lu, Han Yu, Yuanzhi Li, Yelong Shen |
Efficient RLHF: Reducing the Memory Usage of PPO. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Chengcheng Han 0004, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li 0067, Ming Gao, Baoyuan Wang |
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Jiacheng Liu 0010, Andrew Cohen, Ramakanth Pasunuru, Yejin Choi 0001, Hannaneh Hajishirzi, Asli Celikyilmaz |
Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Jakob J. Hollenstein, Georg Martius, Justus H. Piater |
Colored Noise in PPO: Improved Exploration and Performance Through Correlated Action Sampling. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan 0001, Tao Gui, Qi Zhang 0001, Xipeng Qiu, Xuanjing Huang 0001 |
Secrets of RLHF in Large Language Models Part I: PPO. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin 0003, Hongyuan Zha, Xiangfeng Wang |
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Chenwei Xu, Jerry Yao-Chieh Hu, Aakaash Narayanan, Mattson Thieme, Vladimir Nagaslaev, Mark Austin, Jeremy Arnold, Jose Berlioz, Pierrick Hanlet, Aisha Ibrahim, Dennis Nicklaus, Jovan Mitrevski, Jason Michael St. John, Gauri Pradhan, Andrea Saewert, Kiyomi Seiya, Brian Schupbach, Randy Thurman-Keup, Nhan Tran, Rui Shi, Seda Ogrenci, Alexis Maya-Isabelle Shuping, Kyle J. Hazelwood, Han Liu |
Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Mandan Naresh, Paresh Saxena, Manik Gupta |
PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Nai-Chieh Huang, Ping-Chun Hsieh, Kuo-Hao Ho, I-Chen Wu |
PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Niloofar Gholipour, Marcos Dias de Assunção, Pranav Agarwal, Julien Gascon-Samson, Rajkumar Buyya |
TPTO: A Transformer-PPO based Task Offloading Solution for Edge Computing Environments. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Guanlin Wu, Wenqi Fang, Ji Wang 0002, Pin Ge, Jiang Cao, Yang Ping, Peng Gou |
Dyna-PPO reinforcement learning with Gaussian process for the continuous action decision-making in autonomous driving. |
Appl. Intell. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Bingxu Zhao, Hongbin Dong, Yingjie Wang 0002, Tingwei Pan |
PPO-TA: Adaptive task allocation via Proximal Policy Optimization for spatio-temporal crowdsourcing. |
Knowl. Based Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Qianhao Xiao, Li Jiang, Manman Wang, Xin Zhang |
An Improved Distributed Sampling PPO Algorithm Based on Beta Policy for Continuous Global Path Planning Scheme. |
Sensors |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Xiao Wang, Zhaohui Yang, Xueqian Bai, Mingjiang Ji, Hao Li, Dechao Ran |
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader-Follower Tracking Problem. |
Sensors |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Chengqing Liang, Lei Liu 0008, Chen Liu |
Multi-UAV autonomous collision avoidance based on PPO-GIC algorithm with CNN-LSTM fusion network. |
Neural Networks |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Li Li, Wei Li 0106, Jun Wang 0005, Xiaonan Chen, Qihang Peng, Wei Huang 0021 |
UAV Trajectory Optimization for Spectrum Cartography: A PPO Approach. |
IEEE Commun. Lett. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Tao Jing, Zha Liu, Minghao Zhu, Xuehan Li, Bo Gao, Qinghe Gao, Yan Huo |
P-DRR: PPO-Based Efficient Dynamic Resource Reallocation Scheme in Industrial Internet of Things. |
VTC Fall |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Mandan Naresh, Paresh Saxena, Manik Gupta |
PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming. |
IWCMC |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Wenwu Zhu 0007, Xin Chen 0018, Libo Jiao, Geyong Min, Wang Li |
Cost-Efficient 6G Space-Air-Ground Integrated Mobile Edge Computing for Smart City: A PPO-Based Offloading Decision and Resource Allocation Algorithm. |
HPCC/DSS/SmartCity/DependSys |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Christian T. Coletti, Kyle A. Williams, Hannah C. Lehman, Zahi M. Kakish, Daniel Whitten, Julie Parish |
Effectiveness of Warm-Start PPO for Guidance with Highly Constrained Nonlinear Fixed-Wing Dynamics. |
ACC |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Tianyi Lin, Jun Du, Haijun Zhang 0001, Arumugam Nallanathan, Jun Wang |
PPO-Based Energy-Efficient Power Control and Spectrum Allocation in In-Vehicle HetNets. |
GLOBECOM |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Xiaoxue Yu, Rongpeng Li, Fei Wang, Chenghui Peng, Chengchao Liang, Zhifeng Zhao, Honggang Zhang 0001 |
Communication-Efficient Cooperative Multi-Agent PPO via Regulated Segment Mixture in Internet of Vehicles. |
GLOBECOM |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Xing Chen, Dongcui Diao, Hechang Chen, Hengshuai Yao, Haiyin Piao, Zhixiao Sun, Zhiwei Yang 0005, Randy Goebel, Bei Jiang, Yi Chang 0001 |
The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure. |
AAAI |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Yaqin Li, Zhicai Zhang, Fang Fu, Yan Wang |
A PPO-Based Dynamic Asynchronous Semi-Decentralized Federated Edge Learning. |
ICPADS |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Niloofar Gholipour, Marcos Dias de Assunção, Pranav Agarwal, Julien Gascon-Samson, Rajkumar Buyya |
TPTO: A Transformer-PPO based Task Offloading Solution for Edge Computing Environments. |
ICPADS |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Wei Zhao 0023, Runhu Zhong, Cheng Wu, Xinwei Xu |
Delay and Battery Degradation Optimization based on PPO for Task Offloading in RSU-assisted IoV. |
ICPADS |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Haokun Zhang |
Inverse-Huber Loss Based PPO algorithm. |
RICAI |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Chengcheng Han 0004, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li 0067, Ming Gao, Baoyuan Wang |
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models. |
EMNLP |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Donghe Li, Chunlin Hu, Qingyu Yang, Shitao Chen |
Multi Actor-Critic PPO: A Novel Reinforcement Learning Method for Intelligent Task and Charging Scheduling in Electric Freight Vehicles Management. |
ITSC |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Fang Li, Xueyan Wang, Yining Liu, Li Luo |
Penetration Test Path Discovery Based on NHSC-PPO. |
EITCE |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Lian Liu, Dongpu Sun |
Research on Multi-agent PPO Reinforcement Learning Algorithm based on Knowledge Graph. |
DSA |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Mingfei Sun, Sam Devlin, Jacob Beck, Katja Hofmann, Shimon Whiteson |
Trust Region Bounds for Decentralized PPO Under Non-stationarity. |
AAMAS |
2023 |
DBLP BibTeX RDF |
|
23 | Guohao Zhu, Zhou Shen, Laiyuan Liu, Sicong Zhao, Fangzheng Ji, Zixia Ju, Jialong Sun |
AUV Dynamic Obstacle Avoidance Method Based on Improved PPO Algorithm. |
IEEE Access |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Mingfei Sun, Sam Devlin, Katja Hofmann, Shimon Whiteson |
Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO. |
CoRR |
2022 |
DBLP BibTeX RDF |
|
23 | Mingfei Sun, Vitaly Kurin, Guoqing Liu, Sam Devlin, Tao Qin 0001, Katja Hofmann, Shimon Whiteson |
You May Not Need Ratio Clipping in PPO. |
CoRR |
2022 |
DBLP BibTeX RDF |
|
23 | Jin Fang, Jiacheng Weng, Yi Xiang, Xinwen Zhang |
Imitate then Transcend: Multi-Agent Optimal Execution with Dual-Window Denoise PPO. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Shengyi Huang, Anssi Kanervisto, Antonin Raffin, Weixun Wang, Santiago Ontañón, Rousslan Fernand Julien Dossa |
A2C is a special case of PPO. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Qisheng Zhang, Zhen Guo, Audun Jøsang, Lance M. Kaplan, Feng Chen 0001, Dong Hyun Jeong, Jin-Hee Cho |
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Behnam Mohammad Hasani Zade, Najme Mansouri |
PPO: a new nature-inspired metaheuristic algorithm based on predation for optimization. |
Soft Comput. |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Sérgio F. Chevtchenko, Eduardo J. Barbosa, Marcelo Cabral Cavalcanti, Gustavo Medeiros de Souza Azevedo, Teresa Bernarda Ludermir |
Combining PPO and incremental conductance for MPPT under dynamic shading and temperature. |
Appl. Soft Comput. |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Wei Guan, Zhewen Cui, Xianku Zhang |
Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm. |
Sensors |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Khalil Chikhaoui, Hakim Ghazzai, Yehia Massoud |
PPO-based Reinforcement Learning for UAV Navigation in Urban Environments. |
MWSCAS |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Prasanna Kumar Kukkamalla, Veli-Matti Uski, Olli Kuismanen, Hannu Kärkkäinen, Karan Menon |
Data Analytics Capability Roadmap for PPO Business Models in Equipment Manufacturing Companies. |
PLM |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Kang Liu, Wei Quan 0001, Nan Cheng, Ziheng Xu, Jun Deng, Deyun Gao |
PPO-based Reliable Concurrent Transmission Control for Telemedicine Real-time Services. |
ICC |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Xiangyu Li, Hangyue Liu, Chaojie Li, Guo Chen 0002, Shiping Wen 0001 |
PPO-based Pricing Method for Shared Energy Storage System. |
ISGT Asia |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Abdulrahman Nahhas, Andrey Kharitonov, Klaus Turowski |
Deep Reinforcement Learning Techniques For Solving Hybrid Flow Shop Scheduling Problems: Proximal Policy Optimization (PPO) and Asynchronous Advantage Actor-Critic (A3C). |
HICSS |
2022 |
DBLP BibTeX RDF |
|
23 | Kun Du, Xianzhong Xie, Zhaoyuan Shi, Min Li |
Joint Time and Power Control of Energy Harvesting CRN Based on PPO. |
WTS |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Chao Yu 0005, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre M. Bayen, Yi Wu 0013 |
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games. |
NeurIPS |
2022 |
DBLP BibTeX RDF |
|
23 | Bilal Kabas |
Autonomous UAV Navigation via Deep Reinforcement Learning Using PPO. |
SIU |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Markus Holzleitner, Lukas Gruber, José Antonio Arjona-Medina, Johannes Brandstetter, Sepp Hochreiter |
Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER. |
Trans. Large Scale Data Knowl. Centered Syst. |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Mingwu Zhang, Yu Chen 0056, Zhe Xia, Jiangyi Du, Willy Susilo |
PPO-DFK: A Privacy-Preserving Optimization of Distributed Fractional Knapsack With Application in Secure Footballer Configurations. |
IEEE Syst. J. |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Hsuan-Yu Yao, Ping-Chun Hsieh, Kuo-Hao Ho, Kai-Chun Hu, Liang-Chun Ouyang, I-Chen Wu |
Hinge Policy Optimization: Rethinking Policy Improvement and Reinterpreting PPO. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
23 | Xingxing Liang, Yang Ma, Yanghe Feng, Zhong Liu |
PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
23 | Yunxiao Guo, Han Long, Xiaojun Duan, Kaiyuan Feng, Maochu Li, Xiaying Ma |
CIM-PPO: Proximal Policy Optimization with Liu-Correntropy Induced Metric. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
23 | Jiahao Shen, Tao Zhang 0063, Bingchi Zhang, Weixiao Ji, Xiaohui Kuang, Changqiao Xu |
PPO-RM: Proximal Policy Optimization Based Route Mutation for Multimedia Services. |
IWCMC |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Haijun Zhang 0001, Xiangnan Liu, Keping Long, H. Vincent Poor |
Primal Dual PPO Learning Resource Allocation in Indoor IRS-Aided Networks. |
GLOBECOM |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Mingwu Zhang, Yu Chen 0056, Willy Susilo |
PPO-CPQ: A Privacy-Preserving Optimization of Clinical Pathway Query for E-Healthcare Systems. |
IEEE Internet Things J. |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Markus Holzleitner, Lukas Gruber, Jose A. Arjona-Medina, Johannes Brandstetter, Sepp Hochreiter |
Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
23 | Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry |
Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
23 | Mario S. Holubar, Marco A. Wiering |
Continuous-action Reinforcement Learning for Playing Racing Games: Comparing SPG to PPO. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
23 | Ju-Seung Byun, Byungmoon Kim, Huamin Wang |
Proximal Policy Gradient: PPO with Policy Gradient. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
23 | Cheng-Yen Tang, Chien-Hung Liu, Woei-Kae Chen, Shingchern D. You |
Implementing action mask in proximal policy optimization (PPO) algorithm. |
ICT Express |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, Jaakko Lehtinen |
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation. |
MLSP |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Wlodzimierz Funika, Pawel Koperek, Jacek Kitowski |
Management of Heterogeneous Cloud Resources with Use of the PPO. |
Euro-Par Workshops |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Moksh Jain, S. Sowmya Kamath |
Improving Convergence in IRGAN with PPO. |
COMAD/CODS |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry |
Implementation Matters in Deep RL: A Case Study on PPO and TRPO. |
ICLR |
2020 |
DBLP BibTeX RDF |
|
23 | Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra |
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames. |
ICLR |
2020 |
DBLP BibTeX RDF |
|
23 | Lianjiang Li, Yunrong Yang, Bingna Li |
Combine PPO with NES to Improve Exploration. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
23 | Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra |
Decentralized Distributed PPO: Solving PointGoal Navigation. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
23 | Joe Booth |
PPO Dash: Improving Generalization in Deep Reinforcement Learning. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
23 | Mingwu Zhang |
PPO-DFK. |
|
2019 |
DOI RDF |
|
23 | Jia-Chi Chen, Tao-Hsing Chang |
Modified PPO-RND Method for Solving Sparse Reward Problem in ViZDoom. |
CoG |
2019 |
DBLP DOI BibTeX RDF |
|
23 | An Guo, Lianghua Song, Xiong Chen |
Learning Similar Tasks Based On PPO By Transferring Trajectory. |
ICNSC |
2019 |
DBLP DOI BibTeX RDF |
|
23 | Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, Jaakko Lehtinen |
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation. |
CoRR |
2018 |
DBLP BibTeX RDF |
|
23 | Haitham Al-Jabri, Takafumi Matsumaru |
Proposing Camera Calibration Method Using PPO (Proximal Policy Optimization) for Improving Camera Pose Estimations. |
ROBIO |
2018 |
DBLP DOI BibTeX RDF |
|
23 | S. Raghavendra, S. J. Aditya Rao, Vadlapudi Kumar, C. K. Ramesh |
Multiple ligand simultaneous docking (MLSD): A novel approach to study the effect of inhibitors on substrate binding to PPO. |
Comput. Biol. Chem. |
2015 |
DBLP DOI BibTeX RDF |
|
Displaying results #1 - #100 of 106 (100 per page).