Learning a discriminative histogram representation for histogram intersection kernel (HIK)

This thesis studies a method for improving image categorization based on the bag-of-visual-words (BOV) model. The BOV model represents images as histograms of visual words that describe only appearance and ignore spatial structure; these histograms are then classified by an algorithm such as latent Dirichlet allocation (LDA), probabilistic latent semantic analysis (pLSA), or support vector machines (SVM). SVMs with kernel functions are known to be state-of-the-art classifiers for the BOV model. The recently introduced histogram intersection kernel (HIK) is known to be relatively fast and to perform better on histogram features than other popular kernels. HIK-based classifiers such as the intersection kernel SVM (IKSVM) are now widely used for image recognition and classification tasks. The performance of BOV-based image categorization depends mainly on the histogram representation and the classification algorithm; this thesis focuses on the former. Its objective is to learn a discriminative histogram representation for the widely used HIK. The histogram feature space is modeled as a multivariate Gaussian distribution, and we propose a learning criterion that obtains a discriminative histogram representation by increasing the inter-class distances of histogram features while decreasing their intra-class distances. The learning criterion is formulated as a linear programming (LP) problem, which can be solved with a conventional LP solver. The original formulation is successful for binary or ternary classification problems; however, it often fails to find a feasible solution and cannot complete the classification process for large databases. We therefore map the histogram feature space into a higher-dimensional space by incorporating the generalized HIK, which yields an exponential formulation. We show that the...
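As a rough illustration of the kernel the abstract refers to, the sketch below computes the histogram intersection kernel as the sum of bin-wise minima of two histograms. The generalized variant shown here, which raises each bin to a power `beta` before taking the minimum, is the commonly cited form and is only an assumption about what the thesis uses; the function names, the `beta` parameter, and the toy four-bin histograms are hypothetical and do not come from the thesis.

```python
from typing import Callable, List, Sequence


def histogram_intersection(x: Sequence[float], y: Sequence[float]) -> float:
    """HIK: sum over bins of the smaller of the two bin values."""
    if len(x) != len(y):
        raise ValueError("histograms must have the same number of bins")
    return sum(min(a, b) for a, b in zip(x, y))


def generalized_histogram_intersection(
    x: Sequence[float], y: Sequence[float], beta: float = 1.0
) -> float:
    """Assumed generalized form: sum over bins of min(|x_i|**beta, |y_i|**beta).

    With beta = 1 and non-negative bins this reduces to the plain HIK.
    """
    if len(x) != len(y):
        raise ValueError("histograms must have the same number of bins")
    return sum(min(abs(a) ** beta, abs(b) ** beta) for a, b in zip(x, y))


def gram_matrix(
    histograms: Sequence[Sequence[float]],
    kernel: Callable[[Sequence[float], Sequence[float]], float] = histogram_intersection,
) -> List[List[float]]:
    """Pairwise kernel values, in the shape a precomputed-kernel SVM would consume."""
    return [[kernel(p, q) for q in histograms] for p in histograms]


# Hypothetical L1-normalized visual-word histograms for two images.
h1 = [0.5, 0.25, 0.25, 0.0]
h2 = [0.25, 0.25, 0.0, 0.5]

k_plain = histogram_intersection(h1, h2)
k_general = generalized_histogram_intersection(h1, h2, beta=0.5)
gram = gram_matrix([h1, h2])
```

A Gram matrix like `gram` is what a kernel SVM with a precomputed kernel would take as input; how the thesis learns the histogram representation that feeds this kernel is not shown here.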
Yoo, Chang-Dong (유창동)
Korea Advanced Institute of Science and Technology (KAIST) : Department of Electrical and Electronic Engineering
Issue Date
467869/325007  / 020093235

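Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST) : Department of Electrical and Electronic Engineering, 2011.2, [ vi, 42 p. ]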


machine learning; computer vision; bag of words model; object recognition; image categorization
