Features were extracted using a bag of SIFT representation. First we built a vocabulary by taking a subset of sift features of a sampling from the training images, then did k means samplings on this list of features to make K (I used 400 to balance time and accuracy) centroids of features. However, the actual features for each image are a histogram of samplings of sift features for that image binned into the nearest neighbor vocabulary word (centroid). Each test image was classified using a K nearest neighbors classifier for the bag of sifts features. Accuracy was optimized with K=4.
| Category name | Accuracy | Sample training images | Sample true positives | False positives with true label | False negatives with wrong predicted label | ||||
|---|---|---|---|---|---|---|---|---|---|
| Kitchen | 0.390 | Industrial |
Industrial |
Bedroom |
Office |
||||
| Store | 0.510 | TallBuilding |
TallBuilding |
LivingRoom |
Industrial |
||||
| Bedroom | 0.320 | OpenCountry |
Kitchen |
LivingRoom |
Office |
||||
| LivingRoom | 0.340 | Bedroom |
Store |
Office |
Office |
||||
| Office | 0.820 | Kitchen |
Industrial |
Kitchen |
Kitchen |
||||
| Industrial | 0.410 | TallBuilding |
Store |
Kitchen |
Store |
||||
| Suburb | 0.930 | OpenCountry |
Mountain |
LivingRoom |
Bedroom |
||||
| InsideCity | 0.450 | Street |
Suburb |
Industrial |
Street |
||||
| TallBuilding | 0.440 | InsideCity |
LivingRoom |
OpenCountry |
Street |
||||
| Street | 0.560 | Bedroom |
InsideCity |
Mountain |
Suburb |
||||
| Highway | 0.800 | Coast |
OpenCountry |
Street |
Suburb |
||||
| OpenCountry | 0.510 | Street |
Bedroom |
Highway |
Forest |
||||
| Coast | 0.560 | Highway |
OpenCountry |
Highway |
OpenCountry |
||||
| Mountain | 0.520 | OpenCountry |
Coast |
Forest |
Suburb |
||||
| Forest | 0.940 | Industrial |
OpenCountry |
Suburb |
Suburb |
||||
| Category name | Accuracy | Sample training images | Sample true positives | False positives with true label | False negatives with wrong predicted label | ||||