Deep Reinforcement Learning Papers

西方失敗9527

A list of recent papers regarding deep reinforcement learning.

The papers are organized based on manually-defined bookmarks.

Within each bookmark, papers are sorted by date, most recent first.

Any suggestions and pull requests are welcome.

Discrete Control

Continuous Control

Monte-Carlo Tree Search

Inverse Reinforcement Learning

Improving Exploration

Multi-Task and Transfer Learning

Hierarchical Learning

All Papers

Model-Free Episodic Control, C. Blundell et al.,arXiv, 2016.

Safe and Efficient Off-Policy Reinforcement Learning, R. Munos et al.,arXiv, 2016.

Deep Successor Reinforcement Learning, T. D. Kulkarni et al.,arXiv, 2016.

Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al.,arXiv, 2016.

Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al.,arXiv, 2016.

Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al.,ICML, 2016.

Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al.,IJCAI Deep RL Workshop, 2016.

Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al.,arXiv, 2016.

Benchmarking Deep Reinforcement Learning for Continuous Control, Y. Duan et al.,ICML, 2016.

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al.,arXiv, 2016.

Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al.,arXiv, 2016.

Continuous Deep Q-Learning with Model-based Acceleration, S. Gu et al.,ICML, 2016.

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al.,arXiv, 2016.

Deep Exploration via Bootstrapped DQN, I. Osband et al.,arXiv, 2016.

Value Iteration Networks, A. Tamar et al.,arXiv, 2016.

Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al.,arXiv, 2016.

Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al.,arXiv, 2016.

Mastering the game of Go with deep neural networks and tree search, D. Silver et al.,Nature, 2016.

Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al.,AAAI, 2016.

Memory-based control with recurrent neural networks, N. Heess et al.,NIPS Workshop, 2015.

How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al.,NIPS Workshop, 2015.

Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al.,arXiv, 2015.

Strategic Dialogue Management via Deep Reinforcement Learning, H. Cuayáhuitl et al.,NIPS Workshop, 2015.

MazeBase: A Sandbox for Learning from Games, S. Sukhbaatar et al.,arXiv, 2016.

Learning Simple Algorithms from Examples, W. Zaremba et al.,arXiv, 2015.

Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al.,arXiv, 2015.

Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al.,ICLR, 2016.

Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al.,ICLR, 2016.

Policy Distillation, A. A. Rusu et al.,ICLR, 2016.

Prioritized Experience Replay, T. Schaul et al.,ICLR, 2016.

Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al.,arXiv, 2015.

Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al.,ICLR, 2016.

Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al.,arXiv, 2015.

Generating Text with Deep Reinforcement Learning, H. Guo,arXiv, 2015.

ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al.,arXiv, 2015.

Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende,arXiv, 2015.

Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv, 2015.

Recurrent Reinforcement Learning: A Hybrid Approach, X. Li et al.,arXiv, 2015.

Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,ICLR, 2016.

Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al.,EMNLP, 2015.

Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai,arXiv, 2015.

Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al.,NIPS, 2015.

Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al.,NIPS, 2015.

Learning Deep Neural Network Policies with Continuous Memory States, M. Zhang et al.,arXiv, 2015.

Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone,arXiv, 2015.

Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, H. Mei et al.,arXiv, 2015.

Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al.,arXiv, 2015.

Maximum Entropy Deep Inverse Reinforcement Learning, M. Wulfmeier et al.,arXiv, 2015.

High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al.,ICLR, 2016.

End-to-End Training of Deep Visuomotor Policies, S. Levine et al.,arXiv, 2015.

DeepMPC: Learning Deep Latent Features for Model Predictive Control, I. Lenz et al.,RSS, 2015.

Universal Value Function Approximators, T. Schaul et al.,ICML, 2015.

Deterministic Policy Gradient Algorithms, D. Silver et al.,ICML, 2014.

Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML Workshop, 2015.

Trust Region Policy Optimization, J. Schulman et al.,ICML, 2015.

Human-level control through deep reinforcement learning, V. Mnih et al.,Nature, 2015.

Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al.,NIPS, 2014.

Playing Atari with Deep Reinforcement Learning, V. Mnih et al.,NIPS Workshop, 2013.

Value

Model-Free Episodic Control, C. Blundell et al.,arXiv, 2016.

Safe and Efficient Off-Policy Reinforcement Learning, R. Munos et al.,arXiv, 2016.

Deep Successor Reinforcement Learning, T. D. Kulkarni et al.,arXiv, 2016.

Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al.,arXiv, 2016.

Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al.,ICML, 2016.

Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al.,IJCAI Deep RL Workshop, 2016.

Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al.,arXiv, 2016.

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al.,arXiv, 2016.

Continuous Deep Q-Learning with Model-based Acceleration, S. Gu et al.,ICML, 2016.

Deep Exploration via Bootstrapped DQN, I. Osband et al.,arXiv, 2016.

Value Iteration Networks, A. Tamar et al.,arXiv, 2016.

Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al.,arXiv, 2016.

Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al.,arXiv, 2016.

Mastering the game of Go with deep neural networks and tree search, D. Silver et al.,Nature, 2016.

Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al.,AAAI, 2016.

How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al.,NIPS Workshop, 2015.

Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al.,arXiv, 2015.

Strategic Dialogue Management via Deep Reinforcement Learning, H. Cuayáhuitl et al.,NIPS Workshop, 2015.

Learning Simple Algorithms from Examples, W. Zaremba et al.,arXiv, 2015.

Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al.,arXiv, 2015.

Prioritized Experience Replay, T. Schaul et al.,ICLR, 2016.

Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al.,arXiv, 2015.

Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al.,ICLR, 2016.

Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al.,arXiv, 2015.

Generating Text with Deep Reinforcement Learning, H. Guo,arXiv, 2015.

Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv, 2015.

Recurrent Reinforcement Learning: A Hybrid Approach, X. Li et al.,arXiv, 2015.

Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,ICLR, 2016.

Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al.,EMNLP, 2015.

Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al.,NIPS, 2015.

Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone,arXiv, 2015.

Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al.,arXiv, 2015.

Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML Workshop, 2015.

Human-level control through deep reinforcement learning, V. Mnih et al.,Nature, 2015.

Playing Atari with Deep Reinforcement Learning, V. Mnih et al.,NIPS Workshop, 2013.

Policy

Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al.,arXiv, 2016.

Benchmarking Deep Reinforcement Learning for Continuous Control, Y. Duan et al.,ICML, 2016.

Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al.,arXiv, 2016.

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al.,arXiv, 2016.

Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al.,arXiv, 2016.

Mastering the game of Go with deep neural networks and tree search, D. Silver et al.,Nature, 2016.

Memory-based control with recurrent neural networks, N. Heess et al.,NIPS Workshop, 2015.

MazeBase: A Sandbox for Learning from Games, S. Sukhbaatar et al.,arXiv, 2016.

ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al.,arXiv, 2015.

Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,ICLR, 2016.

Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al.,NIPS, 2015.

High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al.,ICLR, 2016.

End-to-End Training of Deep Visuomotor Policies, S. Levine et al.,arXiv, 2015.

Deterministic Policy Gradient Algorithms, D. Silver et al.,ICML, 2014.

Trust Region Policy Optimization, J. Schulman et al.,ICML, 2015.

Discrete Control

Model-Free Episodic Control, C. Blundell et al.,arXiv, 2016.

Safe and Efficient Off-Policy Reinforcement Learning, R. Munos et al.,arXiv, 2016.

Deep Successor Reinforcement Learning, T. D. Kulkarni et al.,arXiv, 2016.

Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al.,arXiv, 2016.

Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al.,ICML, 2016.

Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al.,IJCAI Deep RL Workshop, 2016.

Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al.,arXiv, 2016.

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al.,arXiv, 2016.

Deep Exploration via Bootstrapped DQN, I. Osband et al.,arXiv, 2016.

Value Iteration Networks, A. Tamar et al.,arXiv, 2016.

Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al.,arXiv, 2016.

Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al.,arXiv, 2016.

Mastering the game of Go with deep neural networks and tree search, D. Silver et al.,Nature, 2016.

Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al.,AAAI, 2016.

How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al.,NIPS Workshop, 2015.

Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al.,arXiv, 2015.

Strategic Dialogue Management via Deep Reinforcement Learning, H. Cuayáhuitl et al.,NIPS Workshop, 2015.

Learning Simple Algorithms from Examples, W. Zaremba et al.,arXiv, 2015.

Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al.,arXiv, 2015.

Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al.,ICLR, 2016.

Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al.,ICLR, 2016.

Policy Distillation, A. A. Rusu et al.,ICLR, 2016.

Prioritized Experience Replay, T. Schaul et al.,ICLR, 2016.

Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al.,arXiv, 2015.

Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al.,ICLR, 2016.

Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al.,arXiv, 2015.

Generating Text with Deep Reinforcement Learning, H. Guo,arXiv, 2015.

ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al.,arXiv, 2015.

Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende,arXiv, 2015.

Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv, 2015.

Recurrent Reinforcement Learning: A Hybrid Approach, X. Li et al.,arXiv, 2015.

Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al.,EMNLP, 2015.

Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai,arXiv, 2015.

Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al.,NIPS, 2015.

Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone,arXiv, 2015.

Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, H. Mei et al.,arXiv, 2015.

Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al.,arXiv, 2015.

Universal Value Function Approximators, T. Schaul et al.,ICML, 2015.

Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML Workshop, 2015.

Human-level control through deep reinforcement learning, V. Mnih et al.,Nature, 2015.

Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al.,NIPS, 2014.

Playing Atari with Deep Reinforcement Learning, V. Mnih et al.,NIPS Workshop, 2013.

Continuous Control

Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al.,arXiv, 2016.

Benchmarking Deep Reinforcement Learning for Continuous Control, Y. Duan et al.,ICML, 2016.

Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al.,arXiv, 2016.

Continuous Deep Q-Learning with Model-based Acceleration, S. Gu et al.,ICML, 2016.

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al.,arXiv, 2016.

Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al.,arXiv, 2016.

Memory-based control with recurrent neural networks, N. Heess et al.,NIPS Workshop, 2015.

Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende,arXiv, 2015.

Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,ICLR, 2016.

Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al.,NIPS, 2015.

Learning Deep Neural Network Policies with Continuous Memory States, M. Zhang et al.,arXiv, 2015.

High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al.,ICLR, 2016.

End-to-End Training of Deep Visuomotor Policies, S. Levine et al.,arXiv, 2015.

DeepMPC: Learning Deep Latent Features for Model Predictive Control, I. Lenz et al.,RSS, 2015.

Deterministic Policy Gradient Algorithms, D. Silver et al.,ICML, 2014.

Trust Region Policy Optimization, J. Schulman et al.,ICML, 2015.

Text Domain

Strategic Dialogue Management via Deep Reinforcement Learning, H. Cuayáhuitl et al.,NIPS Workshop, 2015.

MazeBase: A Sandbox for Learning from Games, S. Sukhbaatar et al.,arXiv, 2016.

Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al.,arXiv, 2015.

Generating Text with Deep Reinforcement Learning, H. Guo,arXiv, 2015.

Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al.,EMNLP, 2015.

Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, H. Mei et al.,arXiv, 2015.

Visual Domain

Model-Free Episodic Control, C. Blundell et al.,arXiv, 2016.

Deep Successor Reinforcement Learning, T. D. Kulkarni et al.,arXiv, 2016.

Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al.,arXiv, 2016.

Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al.,ICML, 2016.

Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al.,IJCAI Deep RL Workshop, 2016.

Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al.,arXiv, 2016.

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al.,arXiv, 2016.

Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al.,arXiv, 2016.

Deep Exploration via Bootstrapped DQN, I. Osband et al.,arXiv, 2016.

Value Iteration Networks, A. Tamar et al.,arXiv, 2016.

Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al.,arXiv, 2016.

Mastering the game of Go with deep neural networks and tree search, D. Silver et al.,Nature, 2016.

Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al.,AAAI, 2016.

Memory-based control with recurrent neural networks, N. Heess et al.,NIPS Workshop, 2015.

How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al.,NIPS Workshop, 2015.

Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al.,arXiv, 2015.

Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al.,arXiv, 2015.

Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al.,ICLR, 2016.

Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al.,ICLR, 2016.

Policy Distillation, A. A. Rusu et al.,ICLR, 2016.

Prioritized Experience Replay, T. Schaul et al.,ICLR, 2016.

Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al.,ICLR, 2016.

Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al.,arXiv, 2015.

Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende,arXiv, 2015.

Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv, 2015.

Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,ICLR, 2016.

Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai,arXiv, 2015.

Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al.,NIPS, 2015.

Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al.,NIPS, 2015.

Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone,arXiv, 2015.

Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al.,arXiv, 2015.

High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al.,ICLR, 2016.

End-to-End Training of Deep Visuomotor Policies, S. Levine et al.,arXiv, 2015.

Universal Value Function Approximators, T. Schaul et al.,ICML, 2015.

Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML Workshop, 2015.

Trust Region Policy Optimization, J. Schulman et al.,ICML, 2015.

Human-level control through deep reinforcement learning, V. Mnih et al.,Nature, 2015.

Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al.,NIPS, 2014.

Playing Atari with Deep Reinforcement Learning, V. Mnih et al.,NIPS Workshop, 2013.

Robotics

Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al.,arXiv, 2016.

Benchmarking Deep Reinforcement Learning for Continuous Control, Y. Duan et al.,ICML, 2016.

Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection, S. Levine et al.,arXiv, 2016.

Continuous Deep Q-Learning with Model-based Acceleration, S. Gu et al.,ICML, 2016.

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al.,arXiv, 2016.

Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al.,arXiv, 2016.

Memory-based control with recurrent neural networks, N. Heess et al.,NIPS Workshop, 2015.

Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control, F. Zhang et al.,arXiv, 2015.

Learning Continuous Control Policies by Stochastic Value Gradients, N. Heess et al.,NIPS, 2015.

Learning Deep Neural Network Policies with Continuous Memory States, M. Zhang et al.,arXiv, 2015.

High-Dimensional Continuous Control Using Generalized Advantage Estimation, J. Schulman et al.,ICLR, 2016.

End-to-End Training of Deep Visuomotor Policies, S. Levine et al.,arXiv, 2015.

DeepMPC: Learning Deep Latent Features for Model Predictive Control, I. Lenz et al.,RSS, 2015.

Trust Region Policy Optimization, J. Schulman et al.,ICML, 2015.

Games

Model-Free Episodic Control, C. Blundell et al.,arXiv, 2016.

Safe and Efficient Off-Policy Reinforcement Learning, R. Munos et al.,arXiv, 2016.

Deep Successor Reinforcement Learning, T. D. Kulkarni et al.,arXiv, 2016.

Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al.,arXiv, 2016.

Control of Memory, Active Perception, and Action in Minecraft, J. Oh et al.,ICML, 2016.

Dynamic Frame skip Deep Q Network, A. S. Lakshminarayanan et al.,IJCAI Deep RL Workshop, 2016.

Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al.,arXiv, 2016.

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al.,arXiv, 2016.

Deep Exploration via Bootstrapped DQN, I. Osband et al.,arXiv, 2016.

Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al.,arXiv, 2016.

Asynchronous Methods for Deep Reinforcement Learning, V. Mnih et al.,arXiv, 2016.

Mastering the game of Go with deep neural networks and tree search, D. Silver et al.,Nature, 2016.

Increasing the Action Gap: New Operators for Reinforcement Learning, M. G. Bellemare et al.,AAAI, 2016.

How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies, V. François-Lavet et al.,NIPS Workshop, 2015.

Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al.,arXiv, 2015.

MazeBase: A Sandbox for Learning from Games, S. Sukhbaatar et al.,arXiv, 2016.

Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al.,arXiv, 2015.

Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al.,ICLR, 2016.

Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al.,ICLR, 2016.

Policy Distillation, A. A. Rusu et al.,ICLR, 2016.

Prioritized Experience Replay, T. Schaul et al.,ICLR, 2016.

Deep Reinforcement Learning with an Action Space Defined by Natural Language, J. He et al.,arXiv, 2015.

Deep Reinforcement Learning in Parameterized Action Space, M. Hausknecht et al.,ICLR, 2016.

Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, S. Mohamed and D. J. Rezende,arXiv, 2015.

Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al.,arXiv, 2015.

Continuous control with deep reinforcement learning, T. P. Lillicrap et al.,ICLR, 2016.

Language Understanding for Text-based Games Using Deep Reinforcement Learning, K. Narasimhan et al.,EMNLP, 2015.

Giraffe: Using Deep Reinforcement Learning to Play Chess, M. Lai,arXiv, 2015.

Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al.,NIPS, 2015.

Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht and P. Stone,arXiv, 2015.

Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al.,arXiv, 2015.

Universal Value Function Approximators, T. Schaul et al.,ICML, 2015.

Massively Parallel Methods for Deep Reinforcement Learning, A. Nair et al.,ICML Workshop, 2015.

Trust Region Policy Optimization, J. Schulman et al.,ICML, 2015.

Human-level control through deep reinforcement learning, V. Mnih et al.,Nature, 2015.

Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al.,NIPS, 2014.

Playing Atari with Deep Reinforcement Learning, V. Mnih et al.,NIPS Workshop, 2013.

Monte-Carlo Tree Search

Mastering the game of Go with deep neural networks and tree search, D. Silver et al.,Nature, 2016.

Better Computer Go Player with Neural Network and Long-term Prediction, Y. Tian et al.,ICLR, 2016.

Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, X. Guo et al.,NIPS, 2014.

Inverse Reinforcement Learning

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization, C. Finn et al.,arXiv, 2016.

Maximum Entropy Deep Inverse Reinforcement Learning, M. Wulfmeier et al.,arXiv, 2015.

Multi-Task and Transfer Learning

Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning, E. Parisotto et al.,ICLR, 2016.

Policy Distillation, A. A. Rusu et al.,ICLR, 2016.

ADAAPT: A Deep Architecture for Adaptive Policy Transfer from Multiple Sources, J. Rajendran et al.,arXiv, 2015.

Universal Value Function Approximators, T. Schaul et al.,ICML, 2015.

Improving Exploration

Unifying Count-Based Exploration and Intrinsic Motivation, M. G. Bellemare et al.,arXiv, 2016.

Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks, R. Houthooft et al.,arXiv, 2016.

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al.,arXiv, 2016.

Deep Exploration via Bootstrapped DQN, I. Osband et al.,arXiv, 2016.

Action-Conditional Video Prediction using Deep Networks in Atari Games, J. Oh et al.,NIPS, 2015.

Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models, B. C. Stadie et al.,arXiv, 2015.

Multi-Agent

Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks, J. N. Foerster et al.,arXiv, 2016.

Multiagent Cooperation and Competition with Deep Reinforcement Learning, A. Tampuu et al.,arXiv, 2015.

Hierarchical Learning

Deep Successor Reinforcement Learning, T. D. Kulkarni et al.,arXiv, 2016.

Hierarchical Reinforcement Learning using Spatio-Temporal Abstractions and Deep Neural Networks, R. Krishnamurthy et al.,arXiv, 2016.

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, T. D. Kulkarni et al.,arXiv, 2016.

Last edited: 2017.12.08 13:44:18
