Commit 758ae94a authored by Yuxin Wu

mention imagenet synsets.

parent 4768b265
......@@ -87,7 +87,7 @@ On the other hand, DataFlow is:
used for training. No need for intermediate format when you don't.
1. **Flexible**: Since it is in pure Python, you can use any data format.
When you need, you can still easily serialize your dataflow to a single-file
format with
[a few lines of code](../../modules/dataflow.html#tensorpack.dataflow.LMDBSerializer).
......@@ -152,6 +152,7 @@ or when you need to filter your data on the fly.
1. You do batch training
1. You use a constant batch size
1. Indices are sufficient to determine which samples to batch together
1. Multiprocessing is a better parallelization strategy than multithreading.
None of these are necessarily true.
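
As a rough illustration of why the second and third assumptions can fail, here is a minimal sketch in plain Python (no tensorpack or PyTorch API is used). It groups variable-length samples under a size budget, so the batch size varies and batch membership depends on sample content rather than on indices. The names `token_budget_batches`, `sample_stream`, and `max_tokens` are hypothetical and chosen only for this example.

```python
def token_budget_batches(samples, max_tokens):
    """Yield batches whose total length stays within max_tokens.

    Hypothetical helper: batch boundaries depend on the content of
    each sample, so indices alone cannot determine the batches.
    """
    batch, used = [], 0
    for sample in samples:
        # Close the current batch if adding this sample would exceed the budget.
        if batch and used + len(sample) > max_tokens:
            yield batch
            batch, used = [], 0
        batch.append(sample)
        used += len(sample)
    if batch:
        yield batch


def sample_stream():
    """Stand-in for an iterator-style data source with variable-length samples."""
    for n in [3, 5, 2, 7, 1, 4]:
        yield list(range(n))


batch_sizes = [len(b) for b in token_budget_batches(sample_stream(), max_tokens=8)]
print(batch_sizes)
```

The batches here come out with different sizes, which an interface built around a fixed batch size and index-based sampling would not express directly.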
......@@ -170,7 +171,7 @@ On the other hand, DataFlow:
```eval_rst
.. note:: An iterator interface is more generic than ``__getitem__``?
DataFlow's iterator interface can perfectly simulate the behavior of indexing interface like this:
.. code-block:: python
......
......@@ -179,6 +179,7 @@ class ILSVRC12Files(RNGDataFlow):
class ILSVRC12(ILSVRC12Files):
"""
Produces uint8 ILSVRC12 images of shape [h, w, 3(BGR)], and a label between [0, 999].
The label map follows the synsets.txt file in http://dl.caffe.berkeleyvision.org/caffe_ilsvrc12.tar.gz.
"""
def __init__(self, dir, name, meta_dir=None,
shuffle=None, dir_structure=None):
......