fix typo in dqn

14864868 · Yuxin Wu · d381a5d8 · 14864868 · 14864868
Commit 14864868 authored Feb 13, 2017 by Yuxin Wu
Show whitespace changes
Inline Side-by-side

Showing with 3 additions and 3 deletions

examples/DeepQNetwork/DQN.py examples/DeepQNetwork/DQN.py +1 -1

examples/DeepQNetwork/README.md examples/DeepQNetwork/README.md +2 -2

No files found.
--- a/examples/DeepQNetwork/DQN.py
+++ b/examples/DeepQNetwork/DQN.py
@@ -153,7 +153,7 @@ class Model(ModelDesc):
        lr = symbf.get_scalar_var('learning_rate', 1e-3, summary=True)
        opt = tf.train.AdamOptimizer(lr, epsilon=1e-3)
        return optimizer.apply_grad_processors(
-            opt, [gradproc.GlobalNormalClip(10), gradproc.SummaryGradient()])
+            opt, [gradproc.GlobalNormClip(10), gradproc.SummaryGradient()])
 def get_config():

--- a/examples/DeepQNetwork/README.md
+++ b/examples/DeepQNetwork/README.md
@@ -19,12 +19,12 @@ Claimed performance in the paper can be reproduced, on several games I've tested
 ![DQN](curve-breakout.png)
-DQN typically took 2 days of training to reach a score of 400 on breakout game.
+DQN typically took 2 days of training to reach a score of 400 on breakout game (same as the paper).
 My Batch-A3C implementation only took <2 hours.
 Both were trained on one GPU with an extra GPU for simulation.
 The x-axis is the number of iterations, not wall time.
-D-DQN is faster at the beginning but will converge to 12 batches/s (768 frames/s) due of exploration annealing.
+Double-DQN is faster at the beginning but will converge to 12 batches/s (768 frames/s) due of exploration annealing.
 ## How to use