Commit ef8d4e49 authored by Yuxin Wu

update trainer doc (#353, #359)

parent f85c3003
@@ -38,7 +38,7 @@ Existing multi-GPU trainers include the logic of data-parallel training.
 You can enable them by just one line, and all the necessary logic to achieve the best performance was baked into the trainers already.
 The trainers can reach the same performance as the [official tensorflow benchmark](https://github.com/tensorflow/benchmarks).
-Please note that, in data-parallel training, all towers (all replicates of the model) will take
+Please note that in data-parallel training, in each iteration all towers (all replicates of the model) will take
 tensors from the InputSource (instead of taking one for all and split). So the total batch size
 would be multiplied by the number of GPUs.
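
To make the reworded sentence concrete, here is a small sketch of the resulting batch-size arithmetic (plain Python, no tensorpack API assumed; the numbers are only illustrative):

```python
# Sketch: effective batch size in data-parallel training.
# Each tower (model replicate) takes its own batch from the InputSource
# every iteration, instead of one batch being split across GPUs.
per_gpu_batch = 32   # batch size yielded by the dataflow / InputSource
num_gpus = 4         # number of towers used by a multi-GPU trainer
total_batch = per_gpu_batch * num_gpus
print("samples consumed per iteration:", total_batch)  # 128

# To keep an effective batch size of 32 on 4 GPUs, the dataflow would
# instead have to yield batches of 32 // num_gpus == 8.
```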