Commit 72c97317 authored by Patrick Wieschollek, committed by Yuxin Wu

a writeup for starting with TensorPack (#99)

parent 4a30d18d
Case Studies
--------------------

.. toctree::
   :maxdepth: 1

   colorize
@@ -7,6 +7,7 @@ tensorpack is in early development.
    :maxdepth: 2

    user/tutorials
+   casestudies/index
    modules/index

 Indices and tables
## A High-Level Glance
The following guide introduces some core concepts of TensorPack. In contrast to several other libraries, TensorPack consists of several modules that let you build complex deep learning algorithms and train models with high accuracy and high speed.
### DataFlow
To train neural network architectures on extensive training data, this library provides a dataflow mechanism. It consists of several readers, mappings (e.g. image augmentations) and efficient prefetching.
The following code reads images from an LMDB database produced in the fashion of Caffe and applies several modifications, such as resizing them to 225×225 and converting them to gray-scale images.
````python
import numpy as np
from tensorpack import *  # CaffeLMDB, AugmentImageComponent, MapData, BatchData, PrefetchData

ds = CaffeLMDB('/path/to/caffe/lmdb', shuffle=False)         # read datapoints from a Caffe-style LMDB
ds = AugmentImageComponent(ds, [imgaug.Resize((225, 225))])  # resize every image to 225x225
ds = MapData(ds, lambda dp: [np.dot(dp[0], [0.299, 0.587, 0.114])])  # RGB -> gray-scale
ds = BatchData(ds, 128)      # group datapoints into batches of 128
ds = PrefetchData(ds, 3, 2)  # prefetch batches in separate processes
````
In addition, the input data is gathered in batches of 128 entries and prefetched in separate processes to avoid slow-downs caused by Python's GIL.
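A DataFlow can also be consumed on its own, e.g. to inspect datapoints or to measure input throughput before training. A minimal sketch, assuming the generator-style `get_data()` and `reset_state()` interface:
````python
ds.reset_state()          # initialize once before use, e.g. to start the prefetch processes
for dp in ds.get_data():  # each datapoint is a list of components
    images = dp[0]        # here: one batch of gray-scale images
    print(images.shape)   # e.g. (128, 225, 225)
    break
````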
### Layers and Architectures
The library also contains several pre-implemented neural network modules and layers:
- Convolution, Deconvolution
- FullyConnected
- non-linearities such as ReLU, LeakyReLU, tanh and sigmoid
- pooling operations
- regularization operations
- batch normalization
We also support tfSlim out-of-the-box. A LeNet architecture for MNIST would look like:
````python
# argscope gives Conv2D default arguments, keeping the chain below short
with argscope(Conv2D, kernel_shape=3, nl=tf.nn.relu, out_channel=32):
    logits = (LinearWrap(image)  # the starting brace is only for line-breaking
              .Conv2D('conv0')
              .MaxPooling('pool0', 2)
              .Conv2D('conv1', padding='SAME')
              .Conv2D('conv2')
              .MaxPooling('pool1', 2)
              .Conv2D('conv3')
              .FullyConnected('fc0', 512, nl=tf.nn.relu)
              .Dropout('dropout', 0.5)
              .FullyConnected('fc1', out_dim=10, nl=tf.identity)())
````
You should build your model within a subclass of `ModelDesc`.
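For orientation, here is a minimal sketch of such a subclass, assuming the `_get_input_vars`/`_build_graph` interface (exact method signatures may differ between versions):
````python
import tensorflow as tf
from tensorpack import *

class Model(ModelDesc):
    def _get_input_vars(self):
        # declare the tensors the dataflow will feed
        return [InputVar(tf.float32, (None, 28, 28), 'input'),
                InputVar(tf.int32, (None,), 'label')]

    def _build_graph(self, input_vars, is_training):
        image, label = input_vars
        image = tf.expand_dims(image, 3)  # add a channel dimension
        # a single layer only for illustration; a real model would use
        # a chain like the LinearWrap example above
        logits = FullyConnected('fc', image, out_dim=10, nl=tf.identity)
        cost = tf.nn.sparse_softmax_cross_entropy_with_logits(logits, label)
        self.cost = tf.reduce_mean(cost, name='cost')  # the trainer minimizes self.cost
````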
### Training
On top of TensorFlow's optimizers, this library provides several training protocols, including efficient multi-GPU setups. There is support for training on a single GPU, on one machine with multiple GPUs (synchronously or asynchronously), as well as for Generative Adversarial Networks and reinforcement learning.
You only need to configure your training protocol, like:
````python
config = TrainConfig(
    dataflow=my_dataflow,
    optimizer=tf.train.AdamOptimizer(lr),      # lr: your learning rate
    callbacks=Callbacks([ModelSaver(), ...]),  # see the Callbacks section below
    model=Model())
# start training
SimpleTrainer(config).train()
````
Switching between single-GPU and multi-GPU training is as easy as replacing the last line with:
````python
# start multi-GPU training
SyncMultiGPUTrainer(config).train()
````
### Callbacks
Callbacks add the flexibility to execute custom code during training. They are triggered on several events, such as after each step or at the end of a training epoch.
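As a sketch (assuming the `Callback` base class with underscore-prefixed hooks and an `epoch_num` attribute; names may vary between versions), a custom callback can be as small as:
````python
class EpochLogger(Callback):
    """Toy callback: runs custom code at the end of every epoch."""
    def _trigger_epoch(self):
        # anything can happen here: logging, dumping predictions,
        # adjusting hyper-parameters, ...
        print('finished epoch {}'.format(self.epoch_num))
````
Such a callback would then be registered in the `callbacks` list of the `TrainConfig` shown above, next to `ModelSaver()`.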