(A) comparison of decision support tools = 의사결정 지원 도구들의 비교

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 325
  • Download : 0
Data mining (Berry and Linoff, 1997; Han and Kamber, 2001) is the process of uncovering previously unknown patterns and relationships in a large database using sophisticated statistical analysis and modeling techniques. The regression model, the decision tree and the neural network are the representative models for predictive modeling. These models have different characteristics and each has advantages and disadvantages. The regression model has several advantages including the ease of interpretation and the capability of representing the linear structure very well. But the regression model has disadvantages which are that this model assumes the linearity between input variables and the target variable and the independence of the input variables. The decision tree has several advantages including the ease of interpretation, the ability to model complex input/target associations and the ability automatically handle missing values without imputation. But the decision tree is less appropriate to predict the value of a continuous variable and the small perturbations in a train data set can sometimes have large effects on the structure of the tree. The neural network has several advantages including the versatility for approaching problems, the capability of producing good results in complicated domains and the capability of handling both continuous variables and categorical variables. But a drawback of the neural network is difficulty of interpretation of the model structure. We investigate important properties of each model through analyzing with real data. If any input variable containing missing values has a much effect on predicting a target variable, then the decision tree performs much better than the other two models. We examine the three models for credit scoring to illustrate this property. A drawback of the neural network is difficulty of interpretation. A reasonable effort for structure interpretation is using an approximation model for the model via the...
Advisors
Kim, Sung-Horesearcher김성호researcher
Description
한국과학기술원 : 응용수학전공,
Publisher
한국과학기술원
Issue Date
2002
Identifier
173586/325007 / 020003423
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 응용수학전공, 2002.2, [ [ii], 34 p. ]

Keywords

neural network; decision tree; regression model; data mining; assessment of the models; 모델평가; 신경망분석; 의사결정나무; 회귀분석; 데이터 마이닝

URI
http://hdl.handle.net/10203/42047
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=173586&flag=dissertation
Appears in Collection
MA-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0