R Language Project Assignment

  • Published 2017-06-26 in Hebei

R Language Project Assignment: Data Set Description

Data Set Information:

This research aimed at the case of customers' default payments in Taiwan and compares the predictive accuracy of probability of default among six data mining methods. From the perspective of risk management, the result of predictive accuracy of the estimated probability of default will be more valuable than the binary result of classification (credible or not credible clients). Because the real probability of default is unknown, this study presented the novel "Sorting Smoothing Method" to estimate the real probability of default.

With the real probability of default as the response variable (Y), and the predictive probability of default as the independent variable (X), the simple linear regression result (Y = A + BX) shows that the forecasting model produced by artificial neural network has the highest coefficient of determination; its regression intercept (A) is close to zero, and its regression coefficient (B) is close to one. Therefore, among the six data mining techniques, artificial neural network is the only one that can accurately estimate the real probability of default.

Attribute Information:

This research employed a binary variable, default payment (Yes = 1, No = 0), {Note: this indicates that the last variable is y} as the response variable. This study reviewed the literature and used the following 23 variables as explanatory variables:

  • X1: Amount of the given credit (NT dollar): it includes both the individual consumer credit and his/her family (supplementary) credit.
  • X2: Gender (1 = male; 2 = female).
  • X3: Education (1 = graduate school; 2 = university; 3 = high school; 4 = others).
  • X4: Marital status (1 = married; 2 = single; 3 = others).
  • X5: Age (year).
  • X6 - X11: History of past payment. We tracked the past monthly payment records (from April to September, 2005) as follows: X6 = the repayment status in September, 2005; X7 = the repayment status in August, 2005; . . .; X11 = the repayment status in April, 2005. The measurement scale for the repayment status is: -1
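
The regression check described under Data Set Information (fit Y = A + BX, then see whether A is near zero, B near one, and the coefficient of determination high) can be sketched as follows. This is only a minimal illustration in Python, not the study's own code: the helper name fit_line and the two sample lists are hypothetical, and the numbers are made up for demonstration rather than taken from the data set.

```python
def fit_line(x, y):
    """Ordinary least squares fit of y = a + b*x; returns (a, b, r_squared)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squares and cross-products around the means.
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b = sxy / sxx              # regression coefficient (B)
    a = mean_y - b * mean_x    # regression intercept (A)
    # Coefficient of determination: 1 - residual SS / total SS.
    ss_res = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return a, b, 1.0 - ss_res / ss_tot


# Hypothetical values for illustration only (not from the study).
predicted = [0.05, 0.10, 0.20, 0.40, 0.60, 0.80]  # X: predicted probability
actual = [0.06, 0.09, 0.22, 0.38, 0.61, 0.79]     # Y: "real" probability

a, b, r2 = fit_line(predicted, actual)
print(f"A = {a:.3f}, B = {b:.3f}, R^2 = {r2:.3f}")
```

A model whose predicted probabilities track the real ones closely would give A close to 0, B close to 1, and R^2 close to 1, which is the criterion the description uses to compare the six methods.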
