花费 18 ms
[强化学习论文笔记(7)]:DPG

Deterministic Policy Gradient Algorithms 论文地址 DPG 笔记 出发点 首先最开始提出的policy gradient 算法是 stochastic ...

Sat Jan 04 03:45:00 CST 2020 0 1239
[强化学习论文笔记(4)]:DuelingDQN

Dueling Network Architectures for Deep Reinforcement Learning 论文地址 DuelingDQN 笔记 基本思路就是\(Q(s,a ...

Wed Jan 01 03:27:00 CST 2020 0 957
[强化学习论文笔记(3)]:DRQN

Deep Recurrent Q-Learning for Partially Observable MDPs 论文地址 DRQN 笔记 DQN 每一个decision time 需要该时刻前 ...

Wed Jan 01 01:09:00 CST 2020 0 856
[强化学习论文笔记(1)]:DQN

Playing Atari with Deep Reinforcement Learning 论文地址 DQN 笔记 这篇文章就是DQN,DRL领域非常重要的一篇文章,也是David Silv ...

Tue Dec 31 06:50:00 CST 2019 0 719
[强化学习论文笔记(2)]:DoubleDQN

Deep Reinforcement Learning with Double Q-learning 论文地址: Double-DQN Double Q-learning 笔记 在传统强化学习 ...

Tue Dec 31 21:19:00 CST 2019 0 229

 
粤ICP备18138465号  © 2018-2025 CODEPRJ.COM