Commit 9b62b218 authored by Yuxin Wu

update docs

parent 881c4ee6
@@ -54,6 +54,7 @@ extensions = [
 'sphinx.ext.autodoc',
 'sphinx.ext.todo',
 'sphinx.ext.napoleon',
+#'sphinx.ext.autosectionlabel',
 # 'sphinx.ext.coverage',
 'sphinx.ext.mathjax',
 'sphinx.ext.intersphinx',
...
@@ -8,7 +8,7 @@ might not be correct.
 .. toctree::
   :maxdepth: 2
-  user/tutorials
+  tutorial/index
   casestudies/index
   modules/index
...
-# Callbacks
+# Callback
 Apart from the actual training iterations that minimize the cost,
 you almost surely would like to do something else during training.
...
# Efficient Data Loading
This tutorial gives an overview of how to efficiently load data in tensorpack, using the ImageNet
dataset as an example.
Note that the actual performance depends not only on the disk, but also on
memory (for caching) and CPU (for data processing), so the solution in this tutorial is
not necessarily the best for every scenario.
### Use TensorFlow queues
In general, ``feed_dict`` is slow and should never appear in your critical loop,
i.e., you should avoid loops like this:
```python
while True:
X, y = get_some_data()
minimize_op.run(feed_dict={'X': X, 'y': y})
```
However, when you need to load data from the Python side, this is the only interface available in frameworks such as Keras and tflearn.
You should use something like this instead:
```python
# Thread 1:
while True:
X, y = get_some_data()
enqueue.run(feed_dict={'X': X, 'y': y}) # feed data to a TensorFlow queue
# Thread 2:
while True:
minimize_op.run() # minimize_op was built from dequeued tensors
```
This is already handled automatically by tensorpack trainers (unless you use the demo ``SimpleTrainer``);
see [Trainer](trainer.md) for details.
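To make the two-thread pattern above concrete, here is a minimal sketch in plain TensorFlow 1.x. It is not tensorpack's actual implementation; `get_some_data()` and `build_model()` are hypothetical helpers, and the queue capacity and shapes are illustrative assumptions:

```python
import threading
import tensorflow as tf

# Placeholders are only used on the enqueue side, never in the training loop.
X_ph = tf.placeholder(tf.float32, [None, 224, 224, 3])
y_ph = tf.placeholder(tf.int32, [None])

# A FIFO queue buffers preprocessed batches between the data thread and the training thread.
queue = tf.FIFOQueue(capacity=50, dtypes=[tf.float32, tf.int32])
enqueue_op = queue.enqueue([X_ph, y_ph])

# The training graph is built from the *dequeued* tensors, so running the
# train op needs no feed_dict.
X, y = queue.dequeue()
logits = build_model(X)                       # hypothetical model-building function
cost = tf.losses.sparse_softmax_cross_entropy(labels=y, logits=logits)
minimize_op = tf.train.AdamOptimizer(1e-3).minimize(cost)

def feed_queue(sess):
    while True:
        X_batch, y_batch = get_some_data()    # hypothetical Python data source
        sess.run(enqueue_op, feed_dict={X_ph: X_batch, y_ph: y_batch})

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    threading.Thread(target=feed_queue, args=(sess,), daemon=True).start()
    while True:
        sess.run(minimize_op)                 # no feed_dict in the critical loop
```

The point is only that the expensive ``feed_dict`` happens off the critical path; tensorpack's queue-based trainers wire this up for you.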
TensorFlow also provides a staging interface which may further improve the speed; this is tracked in
[issue #140](https://github.com/ppwwyyxx/tensorpack/issues/140).
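As a rough sketch only (not the mechanism discussed in that issue), staging with TensorFlow's ``tf.contrib.staging.StagingArea`` might look like this, where `X`, `y` and `minimize_op` refer to the queue example above:

```python
# The StagingArea keeps the next batch ready (e.g. already transferred to the
# GPU) while the current batch is being consumed by the training step.
stage = tf.contrib.staging.StagingArea(dtypes=[tf.float32, tf.int32])
stage_op = stage.put([X, y])
X_staged, y_staged = stage.get()   # build the model on these tensors instead

# Training loop (after one warm-up run of `stage_op`):
#   sess.run([minimize_op, stage_op])   # train on batch i while staging batch i+1
```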
### Figure out your bottleneck
@@ -5,10 +5,11 @@ Tutorials
 Test.
 .. toctree::
-  :maxdepth: 2
+  :maxdepth: 1
   glance
   dataflow
-  models
+  efficient-data
+  model
   trainer
-  callbacks
+  callback
-# Trainers
+# Trainer
 Training is basically **running something again and again**.
 Tensorpack base trainer implements the logic of *running the iteration*,
@@ -12,15 +12,14 @@ therefore you can use these trainers as long as you set `self.cost` in `ModelDesc`,
 as done in most examples.
 Most existing trainers were implemented with a TensorFlow queue to prefetch and buffer
-training data, which is significantly faster than
-a naive `sess.run(..., feed_dict={...})`.
+training data, which is faster than a naive `sess.run(..., feed_dict={...})`.
 There are also multi-GPU trainers which include the logic of data-parallel multi-GPU training,
 with either synchronous update or asynchronous update. You can enable multi-GPU training
 by just changing one line.
 To use trainers, pass a `TrainConfig` to configure them:
-````python
+```python
 config = TrainConfig(
     dataflow=my_dataflow,
     optimizer=tf.train.AdamOptimizer(0.01),
@@ -36,7 +35,7 @@ config = TrainConfig(
 # start multi-GPU training with synchronous update:
 SyncMultiGPUTrainer(config).train()
-````
+```
 Trainers just run some iterations, so there is no limit on where the data comes from
 or what to do in an iteration.
...
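For illustration, switching between single-GPU and multi-GPU training with the same `config` might look like this; the trainer class names are taken from tensorpack of roughly this vintage and should be treated as assumptions:

```python
# Single-GPU training with the queue-based input pipeline:
QueueInputTrainer(config).train()

# Data-parallel multi-GPU training with synchronous updates --
# only the trainer class changes:
SyncMultiGPUTrainer(config).train()

# Or with asynchronous gradient updates:
AsyncMultiGPUTrainer(config).train()
```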