[FasterRCNN] using the wrong schedule for the published result

860efcf0 · Yuxin Wu · 17261566 · 860efcf0 · 860efcf0
Commit 860efcf0 authored Nov 16, 2017 by Yuxin Wu
Show whitespace changes
Inline Side-by-side

Showing with 3 additions and 3 deletions

examples/FasterRCNN/README.md examples/FasterRCNN/README.md +1 -1

examples/FasterRCNN/train.py examples/FasterRCNN/train.py +2 -2

No files found.
--- a/examples/FasterRCNN/README.md
+++ b/examples/FasterRCNN/README.md
@@ -47,7 +47,7 @@ To evaluate the performance (pretrained models can be downloaded in [model zoo](
 Mean Average Precision @IoU=0.50:0.95:

 + trainval35k/minival, FASTRCNN_BATCH=256: 34.2. Takes 49h on 8 TitanX.
-+ trainval35k/minival, FASTRCNN_BATCH=64: 32.7. Takes 25h on 8 TitanX.
+ trainval35k/minival, FASTRCNN_BATCH=64: 32.7. Takes 31h on 8 TitanX.

 The hyperparameters are not carefully tuned. You can probably get better performance by e.g. training longer.


--- a/examples/FasterRCNN/train.py
+++ b/examples/FasterRCNN/train.py
@@ -320,12 +320,12 @@ if __name__ == '__main__':
                    'learning_rate',
                    [(warmup_epoch * factor, 1e-2),
                     (150000 * factor // stepnum, 1e-3),
-                     (210000 * factor // stepnum, 1e-4)]),
+                     (230000 * factor // stepnum, 1e-4)]),
                EvalCallback(),
                GPUUtilizationTracker(),
            ],
            steps_per_epoch=stepnum,
-            max_epoch=230000 * factor // stepnum,
+            max_epoch=280000 * factor // stepnum,
            session_init=get_model_loader(args.load) if args.load else None,
        )
        trainer = SyncMultiGPUTrainerReplicated(get_nr_gpu())