DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Park, Sung-Soo | - |
dc.contributor.advisor | 박성수 | - |
dc.contributor.author | Hwang, Kyoung-Mi | - |
dc.contributor.author | 황경미 | - |
dc.date.accessioned | 2013-09-13 | - |
dc.date.available | 2013-09-13 | - |
dc.date.issued | 2013 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=513575&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/182536 | - |
dc.description | 학위논문(박사) - 한국과학기술원 : 산업및시스템공학과, 2013.2, [ vii, 84 p. ] | - |
dc.description.abstract | Data mining techniques extract useful information from large databases. The techniques can be categorized as being either descriptive or predictive. In this thesis, we focus on classification, the predictive data mining used for discrete target variables, and variable selection for classification. We propose classification algorithms for multi-class classification problems, and variable selection algorithms for binary classification and multi-class classification using signomial function. Specifically, this research contributes to the field of classification and variable selection by: 1. Constructing a multi-class classifier directly by solving a single optimization problem to be capable of capturing the correlations among classes; 2. Obtaining classifiers which are sparse and can be explicitly described in original space, which facilitates interpretation; 3. Determining a subset of variables that is desirable for predicting the output, considering nonlinear interactions of variables; 4. Performing variable selection for multi-class classification by treating multiple classes jointly to select a small common subset of variables. First, we propose two multi-class classification methods using signomial function. Each of them directly constructs a multi-class classifier by solving a single optimization problem. Since the number of possible signomial terms is huge, we propose a column generation method that iteratively generates good signomial terms. The both methods obtain better or comparable classification accuracies and give more sparse classifiers than the existing methods. Next, we propose two embedded variable selection methods using signomial function. We attempt to select, among a set of the input variables, those that lead to the best performance of the classifier. One method repeatedly removes variables based on backward selection, and the other method directly select a set of the variables by so... | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | multi-class variable selection | - |
dc.subject | variable selection | - |
dc.subject | multi-class classification | - |
dc.subject | signomial function | - |
dc.subject | sparse classifier | - |
dc.subject | 다중 분류 | - |
dc.subject | 변수 선택 | - |
dc.subject | 다중 분류 변수 선택 | - |
dc.subject | signomial 함수 | - |
dc.subject | 희박 분류기 | - |
dc.subject | 열생성 기법 | - |
dc.subject | column generation algorithm | - |
dc.title | Classification and variable selection algorithms using signomial function | - |
dc.title.alternative | Signomial 함수를 이용한 분류와 변수 선택 해법 | - |
dc.type | Thesis(Ph.D) | - |
dc.identifier.CNRN | 513575/325007 | - |
dc.description.department | 한국과학기술원 : 산업및시스템공학과, | - |
dc.identifier.uid | 020075205 | - |
dc.contributor.localauthor | Park, Sung-Soo | - |
dc.contributor.localauthor | 박성수 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.