Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
S
seminar-breakout
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Analytics
Analytics
CI / CD
Repository
Value Stream
Wiki
Wiki
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Shashank Suhas
seminar-breakout
Commits
2cace49f
Commit
2cace49f
authored
Mar 28, 2018
by
Yuxin Wu
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
fix #715
parent
d4799335
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
4 additions
and
3 deletions
+4
-3
examples/A3C-Gym/README.md
examples/A3C-Gym/README.md
+1
-1
examples/A3C-Gym/train-atari.py
examples/A3C-Gym/train-atari.py
+3
-2
No files found.
examples/A3C-Gym/README.md
View file @
2cace49f
...
@@ -23,7 +23,7 @@ because the bottleneck in this implementation is not computation but simulation.
...
@@ -23,7 +23,7 @@ because the bottleneck in this implementation is not computation but simulation.
Some practicical notes:
Some practicical notes:
1.
Prefer Python 3.
1.
Prefer Python 3
; Windows not supported
.
2.
Occasionally, processes may not get terminated completely. It is suggested to use
`systemd-run`
to run any
2.
Occasionally, processes may not get terminated completely. It is suggested to use
`systemd-run`
to run any
multiprocess Python program to get a cgroup dedicated for the task.
multiprocess Python program to get a cgroup dedicated for the task.
3.
Training with a significant slower speed (e.g. on CPU) will result in very bad score, probably because of the slightly off-policy implementation.
3.
Training with a significant slower speed (e.g. on CPU) will result in very bad score, probably because of the slightly off-policy implementation.
...
...
examples/A3C-Gym/train-atari.py
View file @
2cace49f
...
@@ -231,8 +231,9 @@ def train():
...
@@ -231,8 +231,9 @@ def train():
# setup simulator processes
# setup simulator processes
name_base
=
str
(
uuid
.
uuid1
())[:
6
]
name_base
=
str
(
uuid
.
uuid1
())[:
6
]
namec2s
=
'ipc://@sim-c2s-{}'
.
format
(
name_base
)
prefix
=
'@'
if
sys
.
platform
.
startswith
(
'linux'
)
else
''
names2c
=
'ipc://@sim-s2c-{}'
.
format
(
name_base
)
namec2s
=
'ipc://{}sim-c2s-{}'
.
format
(
prefix
,
name_base
)
names2c
=
'ipc://{}sim-s2c-{}'
.
format
(
prefix
,
name_base
)
procs
=
[
MySimulatorWorker
(
k
,
namec2s
,
names2c
)
for
k
in
range
(
SIMULATOR_PROC
)]
procs
=
[
MySimulatorWorker
(
k
,
namec2s
,
names2c
)
for
k
in
range
(
SIMULATOR_PROC
)]
ensure_proc_terminate
(
procs
)
ensure_proc_terminate
(
procs
)
start_proc_mask_signal
(
procs
)
start_proc_mask_signal
(
procs
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment