LOG IN⠴ݱâ

  • ȸ¿ø´ÔÀÇ ¾ÆÀ̵ð¿Í Æнº¿öµå¸¦ ÀÔ·ÂÇØ ÁÖ¼¼¿ä.
  • ȸ¿øÀÌ ¾Æ´Ï½Ã¸é ¾Æ·¡ [ȸ¿ø°¡ÀÔ]À» ´­·¯ ȸ¿ø°¡ÀÔÀ» ÇØÁֽñ⠹ٶø´Ï´Ù.

¾ÆÀ̵ð ÀúÀå

   

¾ÆÀ̵ð Áߺ¹°Ë»ç⠴ݱâ

HONGGIDONG ˼
»ç¿ë °¡´ÉÇÑ È¸¿ø ¾ÆÀ̵ð ÀÔ´Ï´Ù.

E-mail Áߺ¹È®ÀÎ⠴ݱâ

honggildong@naver.com ˼
»ç¿ë °¡´ÉÇÑ E-mail ÁÖ¼Ò ÀÔ´Ï´Ù.

¿ìÆí¹øÈ£ °Ë»ö⠴ݱâ

°Ë»ö

SEARCH⠴ݱâ

ºñ¹Ð¹øÈ£ ã±â

¾ÆÀ̵ð

¼º¸í

E-mail

ÇмúÀÚ·á °Ë»ö

±â°èÇнÀ ±â¹Ý ±â¾÷½Å¿ëÁ¤º¸ ºÐ¼®À» ÅëÇÑ Ã¤¹«ºÒÀÌÇà ¿¹Ãø

  • ¼Û¹ÎÂù NICEµð¾Øºñ °úÀå
  • ·ùµÎÁø ¼º±Õ°ü´ëÇб³ °æÁ¦Çаú ±³¼ö
º» ¿¬±¸´Â ½Å¿ëÁ¤º¸ Ç¥º»DB ¿ø°ÝºÐ¼®½Ã½ºÅÛ¿¡¼­ Á¦°øÇÏ´Â ±â¾÷½Å¿ëÁ¤º¸¸¦ ºÐ¼®ÇÏ¿© ±â¾÷ÀÇ Ã¤¹«ºÒÀÌÇàÀ» ¿¹ÃøÇÑ´Ù. »ç¾÷ÀÚ ±¸ºÐ¿¡ µû¶ó ºÐ¼®´ë»óÀ» ³ª´©°í Ç¥º»DB¿¡¼­ Á¦°øÇÏ´Â ±â¾÷½Å¿ëÁ¤º¸ÀÇ È°¿ë¿¡ µû¸¥ ½ÇÇèÀ» ±¸¼ºÇÑ´Ù. ¶ÇÇÑ, ´Ù¾çÇÑ ±â°èÇнÀ ±â¹ýÀ» ¸ð¼ö ÃßÁ¤ ¹æ½Ä¿¡ µû¶ó ¸ð¼öÀû ¹æ¹ý·Ð, ºñ¸ð¼öÀû ¹æ¹ý·Ð, Áظð¼öÀû ¹æ¹ý·ÐÀ¸·Î ±¸ºÐÇÏ¿© ¿¹Ãø¼º°ú¸¦ ºñ±³ÇÑ´Ù. Ç¥º»DB ¿ø½Ãµ¥ÀÌÅ͸¦ È°¿ëÇÑ ºÐ¼®º¸´Ù ´ëÃâ ¹× ¿¬Ã¼ Á¾·ù¿¡ µû¶ó °¡°øÇÑ ÀڷḦ È°¿ëÇÏ´Â °æ¿ì °¢ ±â°èÇнÀ ¸ðÇüº° ¼º´É°³¼±ÀÌ °üÃøµÇ¾úÀ¸³ª, ±â¾÷ Â÷ÁÖÀÇ Æ¯¼ºÁ¤º¸¿Í ±â¼ú½Å¿ëÆò°¡ Á¤º¸ÀÇ È°¿ëÀº ¸ðÇüº° ¼º´É°³¼±¿¡ ±â¿©ÇÏÁö ¸øÇÏ¿´´Ù. ¸ðµç ¼¼±×¸ÕÆ®¿¡¼­ Áظð¼öÀû ¹æ¹ý·Ð¿¡ ÇØ´çÇÏ´Â ½ÉÃþ½Å°æ¸Á ¸ðÇü¿¡ ´ëÇØ ¼º´ÉÀÌ °¡Àå ¿ì¼öÇÑ °ÍÀ¸·Î È®ÀεǾúÀ¸¸ç, Æ®¸®°è¿­ÀÌ ¾Æ´Ñ ºñ¸ð¼öÀû ¹æ¹ý·ÐÀÇ °æ¿ì ÀçÇöÀ²ÀÌ ³·°Ô °üÃøµÇ¾î 乫ºÒÀÌÇà ¿¹Ãø ¹®Á¦¿¡ ÀûÇÕÇÏÁö ¾Ê¾Ò´Ù. ±âÁ¸ ½Ç¹«¿¡¼­ »ç¿ëµÇ´Â ¸ð¼öÀû ¹æ¹ý·ÐÀ» È°¿ëÇÑ °æ¿ìº¸´Ù Áظð¼öÀû ¹æ¹ý·ÐÀ» È°¿ëÇÒ °æ¿ì ºÐ·ù ¼º´ÉÀÌ Çâ»óµÊÀ» È®ÀÎÇÏ¿´´Ù. º» ¿¬±¸´Â ½ÇÁ¦ ±â¾÷½Å¿ëÁ¤º¸¿¡ ´ëÇØ ±¸¼ºµÈ Ç¥º»DB¸¦ È°¿ëÇÏ¿© ±â¾÷ºÎ½Ç ¿¹ÃøÀ» ½ÃµµÇÑ ÃÖÃÊÀÇ ¿¬±¸À̸ç, ±â¾÷½Å¿ëÁ¤º¸¸¦ È°¿ëÇÏ´Â ¿©½Å ±ÝÀ¶±â°ü°ú ½Å¿ëÁ¤º¸»çÀÇ ÀÚ·á È°¿ë ¹× ¸ðÇü ±¸Ãà¿¡ ´ëÇÑ ¹æÇ⼺À» Á¦½ÃÇÑ´Ù.
±â°èÇнÀ,ºÎ½ÇÀ§Çè,¿©½Å°Å·¡Á¤º¸,乫ºÒÀÌÇà ¿¹Ãø,Ç¥º»DB¿ø°ÝºÐ¼®½Ã½ºÅÛ

Predicting Loan Delinquency by Analyzing Sample DB with Machine Learning

  • Minchan Song
  • Doojin Ryu
This paper investigates the ability to predict corporate default rates using loan-sample data from the Korea Credit Information Service's financial big data open system (CreDB). The corporate loan from financial institution increases financial institution's credit exposure. Because measurement of the impact on the credit risk in the financial institution is used in determining the pricing model and structure of loan products, it is an essential factor for the financial institution that affects its profit structure. In terms of risk management, predicting delinquency using loan data is necessary for 5,000 Korean financial institutions. In several studies, bankruptcy forecasting was conducted on listed companies that disclosed financial and stock price information. However, this study increases the practical utility by extending the analysis target to individual entrepreneurs and small and medium-sized enterprises(SMEs). In addition, this study presents representative big data analysis results by utilizing loan, delinquency, and technology credit information of approximately 1.1 million corporations, which is 20% of almost 5.6 million domestic sole proprietors and non-listed corporations. For loan data, it includes ten monthly loan type codes and eleven overdue reason codes. Prediction targets are separated by individual and corporate entrepreneurs. Also, analyses are divided by use of the processed dataset. For efficient analysis, the data dimension was reduced by changing the table structure through nested iterative operations while expanding the variable composition from a table consisting of N rows to one column. To reflect the characteristics of the data as much as possible, exploratory data analysis and feature-engineering were performed to process the data. Also, classification models are classified by four groups using a parametric method that nine models train for classification. Group 1 consists of Logistic Regression and Linear discriminant analysis based on the parametric method, group 2 consists of several algorithms that calculate the distance for model learning. In addition, group 3 consists of tree-based algorithms, which are also non-parametric methods. Group 4 consists of the semi-parametric method, which is deep neural network. However, out of the total 438,697 corporations, 810 defaulted, accounting for only 0.2% of the forecast, so the target distribution is severely imbalanced. For this reason, before model fitting, under sampling of imbalanced data was performed. The bias of the sampled training and validation data is minimized by performing. K-fold cross validation as much as the level of K=5. Finally, the analysis result suggests a significant effect on classification performance when the processed data is used. However, this study suggests no significant effect on performance when loan owner's characteristics are included. Moreover, tech-credit rating (TCB) information gives any meaningful effect regarding the type of corporation. Also, classification with Deep Neural Network (DNN), which is based on the Semi-parametric method, makes the best performance of binary classification. Non-parametric and Non-tree based models are not appropriate methods for analyzing loan data. In the case of the DNN based on the semi-parametric methodology, the highest classification performance was confirmed for all analyses and entrepreneurs' classifications performed in this study. The neural network used in this study consists of 14 hidden layers. According to the neural network baseline design, the sigmoid function was applied to the activation function's initial value, the relu function was applied to the hidden layer, and optimization was performed through the Adam optimizer. In particular, the analysis of credit transaction information based on credit information of all financial institutions in Korea was conducted, and there is a possibility for alleviating information asymmetry of individual credit institutions regarding risk management targets. In addition, in the case of parametric methodologies used in classical studies and most used in practice, the average classification performance for major segments was inferior to that of semi-parametric methodologies. Furthermore, the difference between these performances is up to 16 percent. This paper suggests the direction of using loan-sample data. It is foundational research for financial institutions that are using loan data for credit risk management. It is necessary to expand research focusing on semi-parametric methodologies about corporate credit information analysis.
Corporate Loan Data,Distress Risk,Machine Learning,Predicting Loan Delinquency,Sample DB Remote Analysis System