junhyukoh/deep-reinforcement-learning-papers: A list of recent papers regarding ...

Open-source project name:

junhyukoh/deep-reinforcement-learning-papers

Open-source project URL:

https://github.com/junhyukoh/deep-reinforcement-learning-papers

Open-source project description:

Deep Reinforcement Learning Papers

A list of recent papers regarding deep reinforcement learning.
The papers are organized under manually defined bookmarks.
Within each bookmark they are sorted in reverse chronological order, so the most recent papers appear first.
Any suggestions and pull requests are welcome.

Bookmarks

All Papers
Value
Policy
Discrete Control
Continuous Control
Text Domain
Visual Domain
Robotics
Games
Monte-Carlo Tree Search
Inverse Reinforcement Learning
Multi-Task and Transfer Learning
Improving Exploration
Multi-Agent
Hierarchical Learning

All Papers

Model-Free Episodic Control, C. Blundell et al., arXiv, 2016.
Safe and Efficient Off-Policy Reinforcement Learning, R. Munos et al., arXiv, 2016.
Deep Successor Reinforcement Learning, T. D. Kulkarni et al., arXiv, 2016.
Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al., arXiv, 2016.
Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al., arXiv, 2016.
Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al., ICML, 2016.
Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al., IJCAI Deep RL Workshop, 2016.
Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al., arXiv, 2016.
Benchmarking Deep Reinforcement Learning for Continuous Control, Y. Duan et al., ICML, 2016.
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al., arXiv, 2016.
Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al., arXiv, 2016.
Continuous Deep Q-Learning with Model-based Acceleration, S. Gu et al., ICML, 2016.
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al., arXiv, 2016.
Deep Exploration via Bootstrapped DQN, I. Osband et al., arXiv, 2016.
Value Iteration Networks, A. Tamar et al., arXiv, 2016.
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al., arXiv, 2016.
Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al., arXiv, 2016.
Mastering the game of Go with deep neural networks and tree search, D. Silver et al., Nature, 2016.
Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al., AAAI, 2016.
Memory-based control with recurrent neural networks, N. Heess et al., NIPS Workshop, 2015.
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al., NIPS Workshop, 2015.
Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al., arXiv, 2015.
Strategic Dialogue Management via Deep Reinforcement Learning, H. Cuayáhuitl et al., NIPS Workshop, 2015.
MazeBase: A Sandbox for Learning from Games, S. Sukhbaatar et al., arXiv, 2016.
Learning Simple Algorithms from Examples, W. Zaremba et al., arXiv, 2015.
Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al., arXiv, 2015.
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al., ICLR, 2016.
Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al., ICLR, 2016.
Policy Distillation, A. A. Rusu et al., ICLR, 2016.
Prioritized Experience Replay, T. Schaul et al., ICLR, 2016.
Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al., arXiv, 2015.
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., ICLR, 2016.
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv, 2015.
Generating Text with Deep Reinforcement Learning, H. Guo, arXiv, 2015.
ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al., arXiv, 2015.
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv, 2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al., arXiv, 2015.
Recurrent Reinforcement Learning: A Hybrid Approach, X. Li et al., arXiv, 2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al., ICLR, 2016.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP, 2015.
Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai, arXiv, 2015.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS, 2015.
Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al., NIPS, 2015.
Learning Deep Neural Network Policies with Continuous Memory States, M. Zhang et al., arXiv, 2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv, 2015.
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, H. Mei et al., arXiv, 2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv, 2015.
Maximum Entropy Deep Inverse Reinforcement Learning, M. Wulfmeier et al., arXiv, 2015.
High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al., ICLR, 2016.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
DeepMPC: Learning Deep Latent Features for Model Predictive Control, I. Lenz et al., RSS, 2015.
Universal Value Function Approximators, T. Schaul et al., ICML, 2015.
Deterministic Policy Gradient Algorithms, D. Silver et al., ICML, 2014.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al., ICML Workshop, 2015.
Trust Region Policy Optimization, J. Schulman et al., ICML, 2015.
Human-level control through deep reinforcement learning, V. Mnih et al., Nature, 2015.
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS Workshop, 2013.

Value

Model-Free Episodic Control, C. Blundell et al., arXiv, 2016.
Safe and Efficient Off-Policy Reinforcement Learning, R. Munos et al., arXiv, 2016.
Deep Successor Reinforcement Learning, T. D. Kulkarni et al., arXiv, 2016.
Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al., arXiv, 2016.
Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al., ICML, 2016.
Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al., IJCAI Deep RL Workshop, 2016.
Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al., arXiv, 2016.
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al., arXiv, 2016.
Continuous Deep Q-Learning with Model-based Acceleration, S. Gu et al., ICML, 2016.
Deep Exploration via Bootstrapped DQN, I. Osband et al., arXiv, 2016.
Value Iteration Networks, A. Tamar et al., arXiv, 2016.
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al., arXiv, 2016.
Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al., arXiv, 2016.
Mastering the game of Go with deep neural networks and tree search, D. Silver et al., Nature, 2016.
Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al., AAAI, 2016.
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al., NIPS Workshop, 2015.
Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al., arXiv, 2015.
Strategic Dialogue Management via Deep Reinforcement Learning, H. Cuayáhuitl et al., NIPS Workshop, 2015.
Learning Simple Algorithms from Examples, W. Zaremba et al., arXiv, 2015.
Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al., arXiv, 2015.
Prioritized Experience Replay, T. Schaul et al., ICLR, 2016.
Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al., arXiv, 2015.
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., ICLR, 2016.
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv, 2015.
Generating Text with Deep Reinforcement Learning, H. Guo, arXiv, 2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al., arXiv, 2015.
Recurrent Reinforcement Learning: A Hybrid Approach, X. Li et al., arXiv, 2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al., ICLR, 2016.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP, 2015.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS, 2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv, 2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv, 2015.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al., ICML Workshop, 2015.
Human-level control through deep reinforcement learning, V. Mnih et al., Nature, 2015.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS Workshop, 2013.

Policy

Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al., arXiv, 2016.
Benchmarking Deep Reinforcement Learning for Continuous Control, Y. Duan et al., ICML, 2016.
Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al., arXiv, 2016.
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al., arXiv, 2016.
Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al., arXiv, 2016.
Mastering the game of Go with deep neural networks and tree search, D. Silver et al., Nature, 2016.
Memory-based control with recurrent neural networks, N. Heess et al., NIPS Workshop, 2015.
MazeBase: A Sandbox for Learning from Games, S. Sukhbaatar et al., arXiv, 2016.
ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al., arXiv, 2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al., ICLR, 2016.
Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al., NIPS, 2015.
High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al., ICLR, 2016.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
Deterministic Policy Gradient Algorithms, D. Silver et al., ICML, 2014.
Trust Region Policy Optimization, J. Schulman et al., ICML, 2015.

Discrete Control

Model-Free Episodic Control, C. Blundell et al., arXiv, 2016.
Safe and Efficient Off-Policy Reinforcement Learning, R. Munos et al., arXiv, 2016.
Deep Successor Reinforcement Learning, T. D. Kulkarni et al., arXiv, 2016.
Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al., arXiv, 2016.
Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al., ICML, 2016.
Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al., IJCAI Deep RL Workshop, 2016.
Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al., arXiv, 2016.
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al., arXiv, 2016.
Deep Exploration via Bootstrapped DQN, I. Osband et al., arXiv, 2016.
Value Iteration Networks, A. Tamar et al., arXiv, 2016.
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al., arXiv, 2016.
Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al., arXiv, 2016.
Mastering the game of Go with deep neural networks and tree search, D. Silver et al., Nature, 2016.
Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al., AAAI, 2016.
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al., NIPS Workshop, 2015.
Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al., arXiv, 2015.
Strategic Dialogue Management via Deep Reinforcement Learning, H. Cuayáhuitl et al., NIPS Workshop, 2015.
Learning Simple Algorithms from Examples, W. Zaremba et al., arXiv, 2015.
Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al., arXiv, 2015.
Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al., ICLR, 2016.
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al., ICLR, 2016.
Policy Distillation, A. A. Rusu et al., ICLR, 2016.
Prioritized Experience Replay, T. Schaul et al., ICLR, 2016.
Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al., arXiv, 2015.
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., ICLR, 2016.
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv, 2015.
Generating Text with Deep Reinforcement Learning, H. Guo, arXiv, 2015.
ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al., arXiv, 2015.
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv, 2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al., arXiv, 2015.
Recurrent Reinforcement Learning: A Hybrid Approach, X. Li et al., arXiv, 2015.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP, 2015.
Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai, arXiv, 2015.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS, 2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv, 2015.
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, H. Mei et al., arXiv, 2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv, 2015.
Universal Value Function Approximators, T. Schaul et al., ICML, 2015.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al., ICML Workshop, 2015.
Human-level control through deep reinforcement learning, V. Mnih et al., Nature, 2015.
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS Workshop, 2013.

Continuous Control

Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al., arXiv, 2016.
Benchmarking Deep Reinforcement Learning for Continuous Control, Y. Duan et al., ICML, 2016.
Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al., arXiv, 2016.
Continuous Deep Q-Learning with Model-based Acceleration, S. Gu et al., ICML, 2016.
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al., arXiv, 2016.
Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al., arXiv, 2016.
Memory-based control with recurrent neural networks, N. Heess et al., NIPS Workshop, 2015.
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv, 2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al., ICLR, 2016.
Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al., NIPS, 2015.
Learning Deep Neural Network Policies with Continuous Memory States, M. Zhang et al., arXiv, 2015.
High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al., ICLR, 2016.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
DeepMPC: Learning Deep Latent Features for Model Predictive Control, I. Lenz et al., RSS, 2015.
Deterministic Policy Gradient Algorithms, D. Silver et al., ICML, 2014.
Trust Region Policy Optimization, J. Schulman et al., ICML, 2015.

Text Domain

Strategic Dialogue Management via Deep Reinforcement Learning, H. Cuayáhuitl et al., NIPS Workshop, 2015.
MazeBase: A Sandbox for Learning from Games, S. Sukhbaatar et al., arXiv, 2016.
Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al., arXiv, 2015.
Generating Text with Deep Reinforcement Learning, H. Guo, arXiv, 2015.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP, 2015.
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, H. Mei et al., arXiv, 2015.

Visual Domain

Model-Free Episodic Control, C. Blundell et al., arXiv, 2016.
Deep Successor Reinforcement Learning, T. D. Kulkarni et al., arXiv, 2016.
Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al., arXiv, 2016.
Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al., ICML, 2016.
Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al., IJCAI Deep RL Workshop, 2016.
Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al., arXiv, 2016.
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al., arXiv, 2016.
Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al., arXiv, 2016.
Deep Exploration via Bootstrapped DQN, I. Osband et al., arXiv, 2016.
Value Iteration Networks, A. Tamar et al., arXiv, 2016.
Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al., arXiv, 2016.
Mastering the game of Go with deep neural networks and tree search, D. Silver et al., Nature, 2016.
Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al., AAAI, 2016.
Memory-based control with recurrent neural networks, N. Heess et al., NIPS Workshop, 2015.
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al., NIPS Workshop, 2015.
Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al., arXiv, 2015.
Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al., arXiv, 2015.
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al., ICLR, 2016.
Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al., ICLR, 2016.
Policy Distillation, A. A. Rusu et al., ICLR, 2016.
Prioritized Experience Replay, T. Schaul et al., ICLR, 2016.
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., ICLR, 2016.
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv, 2015.
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv, 2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al., arXiv, 2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al., ICLR, 2016.
Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai, arXiv, 2015.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS, 2015.
Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al., NIPS, 2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv, 2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv, 2015.
High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al., ICLR, 2016.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
Universal Value Function Approximators, T. Schaul et al., ICML, 2015.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al., ICML Workshop, 2015.
Trust Region Policy Optimization, J. Schulman et al., ICML, 2015.
Human-level control through deep reinforcement learning, V. Mnih et al., Nature, 2015.
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS Workshop, 2013.

Robotics

Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al., arXiv, 2016.
Benchmarking Deep Reinforcement Learning for Continuous Control, Y. Duan et al., ICML, 2016.
Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al., arXiv, 2016.
Continuous Deep Q-Learning with Model-based Acceleration, S. Gu et al., ICML, 2016.
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al., arXiv, 2016.
Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al., arXiv, 2016.
Memory-based control with recurrent neural networks, N. Heess et al., NIPS Workshop, 2015.
Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al., arXiv, 2015.
Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al., NIPS, 2015.
Learning Deep Neural Network Policies with Continuous Memory States, M. Zhang et al., arXiv, 2015.
High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al., ICLR, 2016.
End-to-End Training of Deep Visuomotor Policies, S. Levine et al., arXiv, 2015.
DeepMPC: Learning Deep Latent Features for Model Predictive Control, I. Lenz et al., RSS, 2015.
Trust Region Policy Optimization, J. Schulman et al., ICML, 2015.

Games

Model-Free Episodic Control, C. Blundell et al., arXiv, 2016.
Safe and Efficient Off-Policy Reinforcement Learning, R. Munos et al., arXiv, 2016.
Deep Successor Reinforcement Learning, T. D. Kulkarni et al., arXiv, 2016.
Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al., arXiv, 2016.
Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al., ICML, 2016.
Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al., IJCAI Deep RL Workshop, 2016.
Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al., arXiv, 2016.
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al., arXiv, 2016.
Deep Exploration via Bootstrapped DQN, I. Osband et al., arXiv, 2016.
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al., arXiv, 2016.
Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al., arXiv, 2016.
Mastering the game of Go with deep neural networks and tree search, D. Silver et al., Nature, 2016.
Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al., AAAI, 2016.
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al., NIPS Workshop, 2015.
Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al., arXiv, 2015.
MazeBase: A Sandbox for Learning from Games, S. Sukhbaatar et al., arXiv, 2016.
Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al., arXiv, 2015.
Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al., ICLR, 2016.
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al., ICLR, 2016.
Policy Distillation, A. A. Rusu et al., ICLR, 2016.
Prioritized Experience Replay, T. Schaul et al., ICLR, 2016.
Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al., arXiv, 2015.
Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al., ICLR, 2016.
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende, arXiv, 2015.
Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al., arXiv, 2015.
Continuous control with deep reinforcement learning, T. P. Lillicrap et al., ICLR, 2016.
Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al., EMNLP, 2015.
Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai, arXiv, 2015.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS, 2015.
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone, arXiv, 2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv, 2015.
Universal Value Function Approximators, T. Schaul et al., ICML, 2015.
Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al., ICML Workshop, 2015.
Trust Region Policy Optimization, J. Schulman et al., ICML, 2015.
Human-level control through deep reinforcement learning, V. Mnih et al., Nature, 2015.
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.
Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS Workshop, 2013.

Monte-Carlo Tree Search

Mastering the game of Go with deep neural networks and tree search, D. Silver et al., Nature, 2016.
Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al., ICLR, 2016.
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al., NIPS, 2014.

Inverse Reinforcement Learning

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al., arXiv, 2016.
Maximum Entropy Deep Inverse Reinforcement Learning, M. Wulfmeier et al., arXiv, 2015.

Multi-Task and Transfer Learning

Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al., ICLR, 2016.
Policy Distillation, A. A. Rusu et al., ICLR, 2016.
ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al., arXiv, 2015.
Universal Value Function Approximators, T. Schaul et al., ICML, 2015.

Improving Exploration

Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al., arXiv, 2016.
Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al., arXiv, 2016.
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al., arXiv, 2016.
Deep Exploration via Bootstrapped DQN, I. Osband et al., arXiv, 2016.
Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al., NIPS, 2015.
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al., arXiv, 2015.

Multi-Agent

Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al., arXiv, 2016.
Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al., arXiv, 2015.

Hierarchical Learning

Deep Successor Reinforcement Learning, T. D. Kulkarni et al., arXiv, 2016.
Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al., arXiv, 2016.
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al., arXiv, 2016.
