Commit f9bca2c4 authored by Yuxin Wu

update readme

parent e42378ca
@@ -16,9 +16,11 @@ Claimed performance in the paper can be reproduced, on several games I've tested
 ![DQN](curve-breakout.png)
 DQN typically took 2 days of training to reach a score of 400 on breakout game.
-My Batch-A3C implementation only took <2 hours (one for training and one for simulation).
+My Batch-A3C implementation only took <2 hours.
 Both were trained on one GPU with an extra GPU for simulation.
-This is probably the fastest RL trainer you'd find.
+<!--
+-This is probably the fastest RL trainer you'd find.
+-->
 The x-axis is the number of iterations, not wall time.
 Iteration speed on Tesla M40 is about 9.7it/s for B-A3C.
......
@@ -2,7 +2,7 @@
 # -*- coding: utf-8 -*-
 """
-A general curve plotter used to create curves like:
+A general curve plotter to create curves such as:
 https://github.com/ppwwyyxx/tensorpack/tree/master/examples/ResNet
 A simplest example:
......
@@ -27,7 +27,9 @@ except ImportError:
 class SimulatorProcess(multiprocessing.Process):
-    """ A process that simulates a player """
+    """
+    A process that simulates a player and communicates to master to get the next action
+    """
     __metaclass__ = ABCMeta
     def __init__(self, idx, pipe_c2s, pipe_s2c):
......
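The new docstring describes a simulator that asks a master for each next action. The diff does not show how `pipe_c2s` and `pipe_s2c` are used, so the following is only a minimal sketch of that request/reply pattern, not the project's implementation. It assumes a single `multiprocessing.Pipe` connection in place of the two pipes, uses Python 3 metaclass syntax, and every name in it (`SketchSimulator`, `CounterPlayer`, `CounterSimulator`, `serve`, `n_steps`) is hypothetical.

```python
import multiprocessing
from abc import ABCMeta, abstractmethod


class SketchSimulator(multiprocessing.Process, metaclass=ABCMeta):
    """Hypothetical simulator: sends its state to a master, waits for an action."""

    def __init__(self, idx, conn, n_steps):
        super().__init__()
        self.idx = idx          # identifies this simulator to the master
        self.conn = conn        # assumption: one duplex pipe end, not two pipes
        self.n_steps = n_steps  # assumption: fixed episode length for the sketch

    @abstractmethod
    def _build_player(self):
        """Return an object with current_state() and action(act)."""

    def run(self):
        player = self._build_player()
        for _ in range(self.n_steps):
            # Client-to-server message: who is asking, and the current state.
            self.conn.send((self.idx, player.current_state()))
            # Server-to-client reply: the action chosen by the master.
            player.action(self.conn.recv())
        self.conn.send(None)  # tell the master this simulator is done


class CounterPlayer:
    """Toy environment: the state is the running sum of all actions taken."""

    def __init__(self):
        self.total = 0

    def current_state(self):
        return self.total

    def action(self, act):
        self.total += act


class CounterSimulator(SketchSimulator):
    def _build_player(self):
        return CounterPlayer()


def serve(conn, policy):
    """Master loop: answer every reported state with policy(state)."""
    seen = []
    while True:
        msg = conn.recv()
        if msg is None:
            return seen
        _idx, state = msg
        seen.append(state)
        conn.send(policy(state))
```

To try it, create the two ends with `multiprocessing.Pipe()`, pass one end to `CounterSimulator`, call `start()` on it, and run `serve` on the other end in the parent process. Keeping the policy in the master is what lets a single process batch action selection for many simulators.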