The implementation of Bag of SIFT representation and linear SVM classifier can be divided into following parts.
In this part, for each of the 1500 training images, I set binsize = 8 and step = 8 for the vl_dsift function and randomly selected 200 features. This gave me a total of 300000 features. For the kmeans function I used vocab_size = 400.
In this part, I used two different steps values( 4 and 8 ) in the vl_dsift function for each test images. Then I created and normalized histograms of all 400 vocabulary. For step = 4, the highest acuracy I got is 71.3%, and for step = 8, the highest accuracy I got is 63.5%.
In this part, I trained 15 binary, 1-vs-all SVMs. And for each test case, I evaluated it on all 15 classifiers and the classifier which is most confidently positive(using the formula W'*X + B ) "wins". For this part, i tried different LAMBDA values. And the accuary for different LAMBDA values are shown in the following table.
LAMBDA | accuracy |
0.01 | 0.360 |
0.001 | 0.475 |
0.0001 | 0.541 |
0.00001 | 0.662 |
0.000001 | 0.713 |
My final tuned configurations for each parameter are show in the following table.
parameter | values |
bin size (buinding vocab) | 8 |
vocab_size | 400 |
sampled features | 300000 |
step | 8 |
bin size (bag of SIFT) | 8 |
LAMBDA | 0.000001 |
With those configurations, the best accuracy is
Category name | Accuracy | Sample training images | Sample true positives | False positives with true label | False negatives with wrong predicted label | ||||
---|---|---|---|---|---|---|---|---|---|
Kitchen | 0.410 | ![]() |
![]() |
![]() |
![]() |
![]() LivingRoom |
![]() InsideCity |
![]() LivingRoom |
![]() Store |
Store | 0.550 | ![]() |
![]() |
![]() |
![]() |
![]() Kitchen |
![]() Highway |
![]() Forest |
![]() LivingRoom |
Bedroom | 0.350 | ![]() |
![]() |
![]() |
![]() |
![]() LivingRoom |
![]() TallBuilding |
![]() LivingRoom |
![]() LivingRoom |
LivingRoom | 0.340 | ![]() |
![]() |
![]() |
![]() |
![]() Kitchen |
![]() Office |
![]() Kitchen |
![]() Kitchen |
Office | 0.870 | ![]() |
![]() |
![]() |
![]() |
![]() LivingRoom |
![]() InsideCity |
![]() Kitchen |
![]() LivingRoom |
Industrial | 0.230 | ![]() |
![]() |
![]() |
![]() |
![]() Store |
![]() TallBuilding |
![]() Highway |
![]() InsideCity |
Suburb | 0.910 | ![]() |
![]() |
![]() |
![]() |
![]() Industrial |
![]() Bedroom |
![]() Street |
![]() LivingRoom |
InsideCity | 0.370 | ![]() |
![]() |
![]() |
![]() |
![]() Street |
![]() Store |
![]() Kitchen |
![]() LivingRoom |
TallBuilding | 0.410 | ![]() |
![]() |
![]() |
![]() |
![]() InsideCity |
![]() Bedroom |
![]() Store |
![]() Store |
Street | 0.650 | ![]() |
![]() |
![]() |
![]() |
![]() InsideCity |
![]() LivingRoom |
![]() Store |
![]() InsideCity |
Highway | 0.780 | ![]() |
![]() |
![]() |
![]() |
![]() Coast |
![]() OpenCountry |
![]() Forest |
![]() Street |
OpenCountry | 0.620 | ![]() |
![]() |
![]() |
![]() |
![]() Industrial |
![]() Coast |
![]() Bedroom |
![]() Coast |
Coast | 0.670 | ![]() |
![]() |
![]() |
![]() |
![]() OpenCountry |
![]() OpenCountry |
![]() Office |
![]() OpenCountry |
Mountain | 0.690 | ![]() |
![]() |
![]() |
![]() |
![]() OpenCountry |
![]() OpenCountry |
![]() Forest |
![]() Forest |
Forest | 0.950 | ![]() |
![]() |
![]() |
![]() |
![]() Industrial |
![]() Bedroom |
![]() Mountain |
![]() Mountain |
Category name | Accuracy | Sample training images | Sample true positives | False positives with true label | False negatives with wrong predicted label |
Category name | Accuracy | Sample training images | Sample true positives | False positives with true label | False negatives with wrong predicted label | ||||
---|---|---|---|---|---|---|---|---|---|
Kitchen | 0.570 | ![]() |
![]() |
![]() |
![]() |
![]() Bedroom |
![]() Bedroom |
![]() Store |
![]() Industrial |
Store | 0.580 | ![]() |
![]() |
![]() |
![]() |
![]() Kitchen |
![]() Highway |
![]() InsideCity |
![]() Suburb |
Bedroom | 0.500 | ![]() |
![]() |
![]() |
![]() |
![]() Kitchen |
![]() Kitchen |
![]() Office |
![]() Industrial |
LivingRoom | 0.420 | ![]() |
![]() |
![]() |
![]() |
![]() Kitchen |
![]() Industrial |
![]() Bedroom |
![]() Store |
Office | 0.890 | ![]() |
![]() |
![]() |
![]() |
![]() LivingRoom |
![]() InsideCity |
![]() Kitchen |
![]() Kitchen |
Industrial | 0.680 | ![]() |
![]() |
![]() |
![]() |
![]() Kitchen |
![]() InsideCity |
![]() InsideCity |
![]() Highway |
Suburb | 0.990 | ![]() |
![]() |
![]() |
![]() |
![]() Mountain |
![]() Coast |
![]() InsideCity |
|
InsideCity | 0.700 | ![]() |
![]() |
![]() |
![]() |
![]() TallBuilding |
![]() Street |
![]() LivingRoom |
![]() Store |
TallBuilding | 0.770 | ![]() |
![]() |
![]() |
![]() |
![]() Industrial |
![]() InsideCity |
![]() Industrial |
![]() Coast |
Street | 0.650 | ![]() |
![]() |
![]() |
![]() |
![]() Forest |
![]() InsideCity |
![]() InsideCity |
![]() InsideCity |
Highway | 0.820 | ![]() |
![]() |
![]() |
![]() |
![]() Industrial |
![]() InsideCity |
![]() Coast |
![]() LivingRoom |
OpenCountry | 0.530 | ![]() |
![]() |
![]() |
![]() |
![]() Coast |
![]() Bedroom |
![]() Coast |
![]() Coast |
Coast | 0.790 | ![]() |
![]() |
![]() |
![]() |
![]() TallBuilding |
![]() OpenCountry |
![]() OpenCountry |
![]() Highway |
Mountain | 0.870 | ![]() |
![]() |
![]() |
![]() |
![]() OpenCountry |
![]() TallBuilding |
![]() Forest |
![]() Forest |
Forest | 0.930 | ![]() |
![]() |
![]() |
![]() |
![]() OpenCountry |
![]() Highway |
![]() OpenCountry |
![]() Mountain |
Category name | Accuracy | Sample training images | Sample true positives | False positives with true label | False negatives with wrong predicted label |