Image classification notes

label

 

an image is represented as one large 3-dimensional array of numbers

total numbers : 248*400*3 (wide*height*rgb(3 channels))

each number ranges from 0~255

 

Target: be invariant to the cross product of all these variations, while simultaneously retaining sensitivity to the inter-class variations.

take an array of pixels that represents a single image and assign a label to it

pipeline: input, train, evaluation

 

L1 distance  

Image classification notes

 

 

L2 distance  

Image classification notes

 

Evaluate on the test set only a single time, at the very end. (very carefully)

 

KNN k-nearest neighbor

instead of finding the single closest image in the training set, we will find the top k closest images, and have them vote on the label of the test image. In particular, when k = 1, we recover the Nearest Neighbor classifier. Intuitively, higher values of k have a smoothing effect that makes the classifier more resistant to outliers:

 Image classification notes

 

上一篇:Networkx study notes(0)


下一篇:Notes-VBA-区域选择