-
Notifications
You must be signed in to change notification settings - Fork 3
Expand file tree
/
Copy pathREADME
More file actions
90 lines (56 loc) · 1.91 KB
/
README
File metadata and controls
90 lines (56 loc) · 1.91 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
# fMoW: Functional Map of the World
This code was developed by Jianmin Sun.
## Dependencies
The following libraries were used for training/testing the deep learning models:
Mxnet 0.11.0
Keras 2.0.8
Tensorflow 1.3.0
fMoW/baseline from(https://github.com/fMoW/baseline)
## Dataset
The following directory structure was used for training/testing:
```
fmow_dataset/
train/
airport/
airport_0/
airport_0_0_rgb.jpg
airport_0_0_rgb.json
...
airport_0_5_rgb.jpg
airport_0_5_rgb.json
...
...
zoo/
zoo_0/
zoo_0_0_rgb.jpg
zoo_0_0_rgb.json
...
zoo_0_8_rgb.jpg
zoo_0_8_rgb.json
...
test/
0000000/
0000000_0_rgb.jpg
0000000_0_rgb.json
...
0000000_5_rgb.jpg
0000000_5_rgb.json
...
```
## Results Format
This code will output txt files in the format required by Topcoder, where each line contains comma-separated values of the bounding box ID and a string containing the category.
## Running the Code
To first prepare the dataset for training and testing, prepare the RGB-only version of the dataset in ./data:
```
docker build -t fmow .
nvidia-docker run -v ./data/:/data -it fmow
```
for training (whole processes will take over 100 hours on g3.16xlarge)
baseline each epoch takes about 7 hours, total 6x7 hours needed
three mxnet models with eight epoch and one with six epoch
mxnet model each epoch takes about 2.2 hours, total 2.2x8x4 hours needed
bash train.sh /data/train
bash test.sh /data/train /data/test /work/out.txt model
Our best performing model is the CNN with meta data approach, which sums predictions over each temporal view and then takes an argmax.
## License
The license is Apache 2.0. See LICENSE.