Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
S
seminar-breakout
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Analytics
Analytics
CI / CD
Repository
Value Stream
Wiki
Wiki
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Shashank Suhas
seminar-breakout
Commits
d381a5d8
Commit
d381a5d8
authored
Feb 13, 2017
by
Yuxin Wu
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
bug fix
parent
77c8bde9
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
3 additions
and
2 deletions
+3
-2
examples/DeepQNetwork/README.md
examples/DeepQNetwork/README.md
+1
-2
tensorpack/tfutils/optimizer.py
tensorpack/tfutils/optimizer.py
+2
-0
No files found.
examples/DeepQNetwork/README.md
View file @
d381a5d8
...
@@ -24,8 +24,7 @@ My Batch-A3C implementation only took <2 hours.
...
@@ -24,8 +24,7 @@ My Batch-A3C implementation only took <2 hours.
Both were trained on one GPU with an extra GPU for simulation.
Both were trained on one GPU with an extra GPU for simulation.
The x-axis is the number of iterations, not wall time.
The x-axis is the number of iterations, not wall time.
Iteration speed on Tesla M40 is about 9.7it/s for B-A3C.
D-DQN is faster at the beginning but will converge to 12 batches/s (768 frames/s) due of exploration annealing.
D-DQN is faster at the beginning but will converge to 12it/s due of exploration annealing.
## How to use
## How to use
...
...
tensorpack/tfutils/optimizer.py
View file @
d381a5d8
...
@@ -4,6 +4,7 @@
...
@@ -4,6 +4,7 @@
# Author: Yuxin Wu <ppwwyyxxc@gmail.com>
# Author: Yuxin Wu <ppwwyyxxc@gmail.com>
import
tensorflow
as
tf
import
tensorflow
as
tf
from
contextlib
import
contextmanager
from
.gradproc
import
apply_grad_processors
as
apply_gradproc
from
.gradproc
import
apply_grad_processors
as
apply_gradproc
__all__
=
[
'apply_grad_processors'
,
'ProxyOptimizer'
,
__all__
=
[
'apply_grad_processors'
,
'ProxyOptimizer'
,
...
@@ -86,6 +87,7 @@ class PostProcessVariablesOptimizer(ProxyOptimizer):
...
@@ -86,6 +87,7 @@ class PostProcessVariablesOptimizer(ProxyOptimizer):
update_op
=
tf
.
group
(
update_op
,
*
ops
,
name
=
name
)
update_op
=
tf
.
group
(
update_op
,
*
ops
,
name
=
name
)
return
update_op
return
update_op
@
contextmanager
def
_maybe_colocate
(
self
,
var
):
def
_maybe_colocate
(
self
,
var
):
G
=
tf
.
get_default_graph
()
G
=
tf
.
get_default_graph
()
if
self
.
_colocate
:
if
self
.
_colocate
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment