Shashank Suhas / seminar-breakout
Commit f9bca2c4
Authored Jul 31, 2016 by Yuxin Wu

update readme

Parent: e42378ca

Showing 3 changed files with 8 additions and 4 deletions (+8 -4)
examples/Atari2600/README.md (+4 -2)
scripts/plot-point.py (+1 -1)
tensorpack/RL/simulator.py (+3 -1)

examples/Atari2600/README.md
@@ -16,9 +16,11 @@ Claimed performance in the paper can be reproduced, on several games I've tested

DQN typically took 2 days of training to reach a score of 400 on breakout game.
My Batch-A3C implementation only took <2 hours (one for training and one for simulation).
My Batch-A3C implementation only took <2 hours.
Both were trained on one GPU with an extra GPU for simulation.
This is probably the fastest RL trainer you'd find.
<!--
-This is probably the fastest RL trainer you'd find.
-->
The x-axis is the number of iterations, not wall time.
Iteration speed on Tesla M40 is about 9.7it/s for B-A3C.
...
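
Because the x-axis counts iterations rather than wall time, reading a training duration off the curve requires the iteration speed. The following is a minimal sketch of that arithmetic, assuming the quoted 9.7 it/s stays constant for the whole run (real throughput will vary). The function names are illustrative and are not part of tensorpack.

```python
SECONDS_PER_HOUR = 3600.0

# Speed quoted in the README for B-A3C on a Tesla M40.
# Assumption: it is treated as constant over the whole run.
DEFAULT_ITS_PER_SEC = 9.7


def iterations_to_hours(iterations, its_per_sec=DEFAULT_ITS_PER_SEC):
    """Convert an iteration count to wall-clock hours at a fixed speed."""
    if its_per_sec <= 0:
        raise ValueError("its_per_sec must be positive")
    return iterations / its_per_sec / SECONDS_PER_HOUR


def hours_to_iterations(hours, its_per_sec=DEFAULT_ITS_PER_SEC):
    """Convert wall-clock hours to an iteration count at a fixed speed."""
    if its_per_sec <= 0:
        raise ValueError("its_per_sec must be positive")
    return hours * SECONDS_PER_HOUR * its_per_sec


if __name__ == "__main__":
    # How many iterations fit in a two-hour run at the quoted speed?
    print(round(hours_to_iterations(2.0)))
```

Under that constant-speed assumption, a two-hour run corresponds to roughly 70k iterations, so a point further right than that on the x-axis would lie beyond the two-hour mark.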
scripts/plot-point.py
@@ -2,7 +2,7 @@
# -*- coding: utf-8 -*-
"""
A general curve plotter used to create curves like:
A general curve plotter to create curves such as:
https://github.com/ppwwyyxx/tensorpack/tree/master/examples/ResNet
A simplest example:
...
tensorpack/RL/simulator.py
@@ -27,7 +27,9 @@ except ImportError:
class SimulatorProcess(multiprocessing.Process):
    """ A process that simulates a player """
    """
    A process that simulates a player and communicates to master to get the next action
    """
    __metaclass__ = ABCMeta
    def __init__(self, idx, pipe_c2s, pipe_s2c):
...
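
The new docstring describes a request/reply loop: the simulator sends its current state to the master and waits for the next action. The sketch below illustrates that loop only; it is not the tensorpack implementation. To stay short and easy to run it swaps in threads and `queue.Queue` objects for the separate processes and pipes in the diff. It reads `c2s` as client-to-server and `s2c` as server-to-client, which is an assumption, and `SimulatorWorker`, `ParityPlayer`, `serve`, and `run_demo` are all hypothetical names.

```python
import queue
import threading
from abc import ABC, abstractmethod


class SimulatorWorker(threading.Thread, ABC):
    """Hypothetical simulator: steps a player with actions from a master."""

    def __init__(self, idx, c2s, s2c, num_steps):
        super().__init__(daemon=True)
        self.idx = idx
        self.c2s = c2s  # shared queue of (idx, state) requests to the master
        self.s2c = s2c  # this worker's own queue of actions from the master
        self.num_steps = num_steps
        self.total_reward = 0

    @abstractmethod
    def initial_state(self):
        """Return the first state of the player."""

    @abstractmethod
    def step(self, state, action):
        """Apply the action and return (next_state, reward)."""

    def run(self):
        state = self.initial_state()
        for _ in range(self.num_steps):
            self.c2s.put((self.idx, state))  # ask the master what to do
            action = self.s2c.get()          # block until it answers
            state, reward = self.step(state, action)
            self.total_reward += reward
        self.c2s.put((self.idx, None))       # None signals "finished"


class ParityPlayer(SimulatorWorker):
    """Toy player: reward 1 when the action equals the state's parity."""

    def initial_state(self):
        return 0

    def step(self, state, action):
        return state + 1, int(action == state % 2)


def serve(c2s, s2c_by_idx, policy):
    """Master loop: answer each state with an action until all workers finish."""
    remaining = len(s2c_by_idx)
    while remaining:
        idx, state = c2s.get()
        if state is None:
            remaining -= 1
        else:
            s2c_by_idx[idx].put(policy(state))


def run_demo(num_workers=2, num_steps=5):
    c2s = queue.Queue()
    s2c_by_idx = {i: queue.Queue() for i in range(num_workers)}
    workers = [ParityPlayer(i, c2s, s2c_by_idx[i], num_steps)
               for i in range(num_workers)]
    for w in workers:
        w.start()
    serve(c2s, s2c_by_idx, policy=lambda state: state % 2)
    for w in workers:
        w.join()
    return [w.total_reward for w in workers]


if __name__ == "__main__":
    print(run_demo())
```

The abstract base class mirrors the `ABCMeta` declaration in the diff: the communication loop lives in the base class, and a subclass only supplies the player-specific behaviour.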