Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
S
seminar-breakout
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Analytics
Analytics
CI / CD
Repository
Value Stream
Wiki
Wiki
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Shashank Suhas
seminar-breakout
Commits
14864868
Commit
14864868
authored
Feb 13, 2017
by
Yuxin Wu
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
fix typo in dqn
parent
d381a5d8
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
3 additions
and
3 deletions
+3
-3
examples/DeepQNetwork/DQN.py
examples/DeepQNetwork/DQN.py
+1
-1
examples/DeepQNetwork/README.md
examples/DeepQNetwork/README.md
+2
-2
No files found.
examples/DeepQNetwork/DQN.py
View file @
14864868
...
...
@@ -153,7 +153,7 @@ class Model(ModelDesc):
lr
=
symbf
.
get_scalar_var
(
'learning_rate'
,
1e-3
,
summary
=
True
)
opt
=
tf
.
train
.
AdamOptimizer
(
lr
,
epsilon
=
1e-3
)
return
optimizer
.
apply_grad_processors
(
opt
,
[
gradproc
.
GlobalNorm
al
Clip
(
10
),
gradproc
.
SummaryGradient
()])
opt
,
[
gradproc
.
GlobalNormClip
(
10
),
gradproc
.
SummaryGradient
()])
def
get_config
():
...
...
examples/DeepQNetwork/README.md
View file @
14864868
...
...
@@ -19,12 +19,12 @@ Claimed performance in the paper can be reproduced, on several games I've tested

DQN typically took 2 days of training to reach a score of 400 on breakout game.
DQN typically took 2 days of training to reach a score of 400 on breakout game
(same as the paper)
.
My Batch-A3C implementation only took <2 hours.
Both were trained on one GPU with an extra GPU for simulation.
The x-axis is the number of iterations, not wall time.
D-DQN is faster at the beginning but will converge to 12 batches/s (768 frames/s) due of exploration annealing.
D
ouble
-DQN is faster at the beginning but will converge to 12 batches/s (768 frames/s) due of exploration annealing.
## How to use
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment