You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+19-8Lines changed: 19 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,12 +1,14 @@
1
1
# SSD: Single Shot MultiBox Object Detector, in PyTorch
2
2
A [PyTorch](http://pytorch.org/) implementation of [Single Shot MultiBox Detector](http://arxiv.org/abs/1512.02325) from the 2016 paper by Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang, and Alexander C. Berg. The official and original Caffe code can be found [here](https://github.com/weiliu89/caffe/tree/ssd).
3
3
4
+
***UPDATE:*** We have just added support for MS COCO! Check it out [below](#coco).
5
+
4
6
## Authors
5
7
6
8
*[**Max de Groot**](https://github.com/amdegroot)
7
9
*[**Ellis Brown**](http://github.com/ellisbrown)
8
10
9
-
***Note:*** Unfortunately, this is just a hobby of ours and not a full-time job, so we'll do our best to keep things up to date, but no guarantees. That being said, thanks to everyone for your continued help and feedback as it is really appreciated. We will try to address everything as soon as possible.
11
+
***Note:*** Unfortunately, this is just a hobby for us and not a full-time job, so we'll do our best to keep things up to date, but no guarantees. That being said, thanks to everyone for your continued help and feedback as it is really appreciated. We will try to address everything as soon as possible.
@@ -30,7 +32,7 @@ A [PyTorch](http://pytorch.org/) implementation of [Single Shot MultiBox Detecto
30
32
- Install [PyTorch](http://pytorch.org/) by selecting your environment on the website and running the appropriate command.
31
33
- Clone this repository.
32
34
* Note: We currently only support Python 3+.
33
-
- Then download the dataset by following the [instructions](#download-voc2007-trainval--test) below.
35
+
- Then download the dataset by following the [instructions](#datasets) below.
34
36
- We now support [Visdom](https://github.com/facebookresearch/visdom) for real-time loss visualization during training!
35
37
* To use Visdom in the browser:
36
38
```Shell
@@ -40,22 +42,32 @@ A [PyTorch](http://pytorch.org/) implementation of [Single Shot MultiBox Detecto
40
42
python -m visdom.server
41
43
```
42
44
* Then (during training) navigate to http://localhost:8097/ (see the Train section below for training details).
43
-
- Note: For training, we currently only support [VOC](http://host.robots.ox.ac.uk/pascal/VOC/), but are adding [COCO](http://mscoco.org/) and hopefully [ImageNet](http://www.image-net.org/) soon.
45
+
- Note: For training, we currently support [VOC](http://host.robots.ox.ac.uk/pascal/VOC/) and [COCO](http://mscoco.org/), and aim to add [ImageNet](http://www.image-net.org/) support soon.
44
46
- UPDATE: We have switched from PIL Image support to cv2. The plan is to create a branch that uses PIL as well.
45
47
46
48
## Datasets
47
-
To make things easy, we provide a simple VOC dataset loader that inherits `torch.utils.data.Dataset` making it fully compatible with the `torchvision.datasets`[API](http://pytorch.org/docs/torchvision/datasets.html).
49
+
To make things easy, we provide bash scripts to handle the dataset downloads and setup for you. We also provide simple dataset loaders that inherit `torch.utils.data.Dataset`, making them fully compatible with the `torchvision.datasets`[API](http://pytorch.org/docs/torchvision/datasets.html).
50
+
51
+
52
+
### COCO
53
+
Microsoft COCO: Common Objects in Context
54
+
55
+
##### Download COCO 2014
56
+
```Shell
57
+
# specify a directory for dataset to be downloaded into, else default is ~/data/
58
+
sh data/scripts/COCO2014.sh
59
+
```
48
60
49
61
### VOC Dataset
50
-
##### Download VOC2007 trainval & test
62
+
PASCAL VOC: Visual Object Classes
51
63
64
+
##### Download VOC2007 trainval & test
52
65
```Shell
53
66
# specify a directory for dataset to be downloaded into, else default is ~/data/
54
67
sh data/scripts/VOC2007.sh # <directory>
55
68
```
56
69
57
70
##### Download VOC2012 trainval
58
-
59
71
```Shell
60
72
# specify a directory for dataset to be downloaded into, else default is ~/data/
61
73
sh data/scripts/VOC2012.sh # <directory>
@@ -153,11 +165,10 @@ jupyter notebook
153
165
- Running `python -m demo.live` opens the webcam and begins detecting!
154
166
155
167
## TODO
156
-
We have accumulated the following to-do list, which you can expect to be done in the very near future
168
+
We have accumulated the following to-do list, which we hope to complete in the near future
157
169
- Still to come:
158
170
* Train SSD300 with batch norm
159
171
* Add support for SSD512 training and testing
160
-
* Add support for COCO dataset
161
172
* Create a functional model definition for Sergey Zagoruyko's [functional-zoo](https://github.com/szagoruyko/functional-zoo)
0 commit comments