Commit c0afb545 authored by Yuxin Wu's avatar Yuxin Wu

[DQN] Use ALE from atari_py

parent a4df1634
...@@ -17,8 +17,9 @@ used a modified version where each batch contains transitions from different sim ...@@ -17,8 +17,9 @@ used a modified version where each batch contains transitions from different sim
## Usage: ## Usage:
Install dependencies by `pip install 'gym[atari]'`.
### With ALE (paper's setting): ### With ALE (paper's setting):
Install [ALE](https://github.com/mgbellemare/Arcade-Learning-Environment) and `gym[atari]`.
Download an [atari rom](https://github.com/openai/atari-py/tree/gdb/atari_py/atari_roms), e.g.: Download an [atari rom](https://github.com/openai/atari-py/tree/gdb/atari_py/atari_roms), e.g.:
``` ```
...@@ -53,9 +54,10 @@ Claimed performance in the paper can be reproduced, on several games I've tested ...@@ -53,9 +54,10 @@ Claimed performance in the paper can be reproduced, on several games I've tested
![DQN](curve-breakout.png) ![DQN](curve-breakout.png)
| Environment | Avg Score | Download | | Environment | Avg Score | Download |
|:-------------|:---------:|:------------------------------------------------------------------------------------:| |:--------------|:---------:|:-------------------------------------------------------------------------------------:|
| breakout.bin | 465 | [:arrow_down:](http://models.tensorpack.com/DeepQNetwork/DoubleDQN-breakout.bin.npz) | | breakout.bin | 465 | [:arrow_down:](http://models.tensorpack.com/DeepQNetwork/DoubleDQN-breakout.bin.npz) |
| seaquest.bin | 8686 | [:arrow_down:](http://models.tensorpack.com/DeepQNetwork/DoubleDQN-seaquest.bin.npz) | | seaquest.bin | 8686 | [:arrow_down:](http://models.tensorpack.com/DeepQNetwork/DoubleDQN-seaquest.bin.npz) |
| ms_pacman.bin | 3323 | [:arrow_down:](http://models.tensorpack.com/DeepQNetwork/DoubleDQN-ms_pacman.bin.npz) |
## Speed ## Speed
On one GTX 1080Ti, On one GTX 1080Ti,
......
...@@ -8,7 +8,7 @@ import threading ...@@ -8,7 +8,7 @@ import threading
import cv2 import cv2
import gym import gym
import six import six
from ale_python_interface import ALEInterface from atari_py.ale_python_interface import ALEInterface
from gym import spaces from gym import spaces
from gym.envs.atari.atari_env import ACTION_MEANING from gym.envs.atari.atari_env import ACTION_MEANING
from six.moves import range from six.moves import range
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment