Shashank Suhas / seminar-breakout
Commit 1525e800, authored Nov 11, 2017 by Yuxin Wu
[FasterRCNN] separate `crop_and_resize` function
Parent: 42c9b8a7
Showing 1 changed file with 36 additions and 14 deletions
examples/FasterRCNN/model.py (+36, -14)
@@ -296,25 +296,28 @@ def sample_fast_rcnn_targets(boxes, gt_boxes, gt_labels):
 
 
 @under_name_scope()
-def roi_align(featuremap, boxes, output_shape):
+def crop_and_resize(image, boxes, size):
     """
+    Better-aligned version of tf.image.crop_and_resize,
+    following our definition of floating point boxes.
+
     Args:
-        featuremap: 1xCxHxW
-        boxes: Nx4 floatbox
-        output_shape: int
+        image: 1CHW
+        boxes: nx4, x1y1x2y2
+        size (int):
 
     Returns:
-        NxCxoHxoW
+        n,C,size,size
     """
     @under_name_scope()
     def transform_fpcoor_for_tf(boxes, image_shape, crop_shape):
         """
-        The way crop_and_resize works (with normalized box):
+        The way tf.image.crop_and_resize works (with normalized box):
         Initial point (the value of output[0]): x0_box * (W_img - 1)
         Spacing: w_box * (W_img - 1) / (W_crop - 1)
         Use the above grid to bilinear sample.
 
-        However, what I want is (with fpcoor box):
+        However, what we want is (with fpcoor box):
         Spacing: w_box / W_crop
         Initial point: x0_box + spacing/2 - 0.5
         (-0.5 because bilinear sample assumes floating point coordinate (0.0, 0.0) is the same as pixel value (0, 0))
...
...
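The docstring above states two sampling grids: the one `tf.image.crop_and_resize` uses for a normalized box, and the one wanted for a floating-point box. The body of `transform_fpcoor_for_tf` is elided in this diff, so the following is only a minimal 1-D sketch derived from the docstring's formulas, not the commit's code. The names `normalized_interval` and `sample_positions` are hypothetical, and the sketch assumes `crop_len > 1` and `img_len > 1` so that no division by zero occurs.

```python
def normalized_interval(x0, x1, img_len, crop_len):
    """Convert a floating-point interval [x0, x1] into a normalized (start, end)
    pair, chosen so that the grid described in the docstring lands on bin centers."""
    spacing = (x1 - x0) / crop_len                     # desired spacing: w_box / W_crop
    n0 = (x0 + spacing / 2 - 0.5) / (img_len - 1)      # desired initial point, normalized
    n_len = spacing * (crop_len - 1) / (img_len - 1)   # normalized extent of the grid
    return n0, n0 + n_len


def sample_positions(n0, n1, img_len, crop_len):
    """Pixel positions sampled for a normalized interval, per the docstring:
    start at n0 * (L - 1), step by (n1 - n0) * (L - 1) / (crop_len - 1)."""
    start = n0 * (img_len - 1)
    step = (n1 - n0) * (img_len - 1) / (crop_len - 1)
    return [start + i * step for i in range(crop_len)]


# Interval [2, 6] split into 4 bins of width 1: bin centers are 2.5, 3.5, 4.5, 5.5
# in floating-point coordinates, i.e. pixel positions 2, 3, 4, 5 after the -0.5 shift.
n0, n1 = normalized_interval(2.0, 6.0, img_len=10, crop_len=4)
positions = sample_positions(n0, n1, img_len=10, crop_len=4)
print(positions)
```

Up to floating-point rounding, the printed positions are the pixel indices of the four bin centers, which is the alignment the docstring asks for.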
@@ -337,15 +340,34 @@ def roi_align(featuremap, boxes, output_shape):
 
         return tf.concat([ny0, nx0, ny0 + nh, nx0 + nw], axis=1)
 
+    image_shape = tf.shape(image)[2:]
+    boxes = transform_fpcoor_for_tf(boxes, image_shape, [size, size])
+    image = tf.transpose(image, [0, 2, 3, 1])   # 1hwc
+    ret = tf.image.crop_and_resize(
+        image, boxes,
+        tf.zeros([tf.shape(boxes)[0]], dtype=tf.int32),
+        crop_size=[size, size])
+    ret = tf.transpose(ret, [0, 3, 1, 2])   # ncss
+    return ret
+
+
+@under_name_scope()
+def roi_align(featuremap, boxes, output_shape):
+    """
+    Args:
+        featuremap: 1xCxHxW
+        boxes: Nx4 floatbox
+        output_shape: int
+
+    Returns:
+        NxCxoHxoW
+    """
-    image_shape = tf.shape(featuremap)[2:]
-    featuremap = tf.transpose(featuremap, [0, 2, 3, 1])    # to nhwc
-    # sample 4 locations per roi bin
-    boxes = transform_fpcoor_for_tf(boxes, image_shape, [output_shape * 2, output_shape * 2])
     boxes = tf.stop_gradient(boxes)  # TODO
 
-    ret = tf.image.crop_and_resize(featuremap, boxes, tf.zeros([tf.shape(boxes)[0]], dtype=tf.int32),
-                                   crop_size=[output_shape * 2, output_shape * 2])
-    ret = tf.transpose(ret, [0, 3, 1, 2])
+    # sample 4 locations per roi bin
+    ret = crop_and_resize(
+        featuremap, boxes,
+        output_shape * 2)
     ret = tf.nn.avg_pool(ret, [1, 1, 2, 2], [1, 1, 2, 2], padding='SAME', data_format='NCHW')
     return ret
 
...
...
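After this change, `roi_align` asks `crop_and_resize` for `output_shape * 2` samples per side and then average-pools with a 2x2 window and stride 2, so each output bin is the mean of 4 samples. The sketch below illustrates only that pooling step on a single channel, in plain Python rather than TensorFlow. The helper `avg_pool_2x2` and the toy `samples` grid are hypothetical and stand in for one channel of one cropped box.

```python
def avg_pool_2x2(grid):
    """Average non-overlapping 2x2 blocks of a 2-D list with even height and width."""
    height, width = len(grid), len(grid[0])
    return [
        [
            (grid[r][c] + grid[r][c + 1] + grid[r + 1][c] + grid[r + 1][c + 1]) / 4.0
            for c in range(0, width, 2)
        ]
        for r in range(0, height, 2)
    ]


output_shape = 2
side = output_shape * 2  # 4 samples per side, i.e. 4 samples per output bin
# Toy sample values 0..15 laid out row by row.
samples = [[float(r * side + c) for c in range(side)] for r in range(side)]

pooled = avg_pool_2x2(samples)
print(pooled)  # [[2.5, 4.5], [10.5, 12.5]]
```

The pooled grid is `output_shape` by `output_shape`, matching the `NxCxoHxoW` shape the `roi_align` docstring promises once the box and channel dimensions are added back.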