LOG IN⠴ݱâ

  • ȸ¿ø´ÔÀÇ ¾ÆÀ̵ð¿Í Æнº¿öµå¸¦ ÀÔ·ÂÇØ ÁÖ¼¼¿ä.
  • ȸ¿øÀÌ ¾Æ´Ï½Ã¸é ¾Æ·¡ [ȸ¿ø°¡ÀÔ]À» ´­·¯ ȸ¿ø°¡ÀÔÀ» ÇØÁֽñ⠹ٶø´Ï´Ù.

¾ÆÀ̵ð ÀúÀå

   

¾ÆÀ̵ð Áߺ¹°Ë»ç⠴ݱâ

HONGGIDONG ˼
»ç¿ë °¡´ÉÇÑ È¸¿ø ¾ÆÀ̵ð ÀÔ´Ï´Ù.

E-mail Áߺ¹È®ÀÎ⠴ݱâ

honggildong@naver.com ˼
»ç¿ë °¡´ÉÇÑ E-mail ÁÖ¼Ò ÀÔ´Ï´Ù.

¿ìÆí¹øÈ£ °Ë»ö⠴ݱâ

°Ë»ö

SEARCH⠴ݱâ

ºñ¹Ð¹øÈ£ ã±â

¾ÆÀ̵ð

¼º¸í

E-mail

Archive

Machine Learning Approach for Predicting U.S. ETFs¡¯ Tracking Errors ? Implications on U.S. Invested Fund

  • Jin-Hyung Cho Research Fellow (Ph.D.), Kakao
  • Gun-Hee Lee College of Economics and Finance, Hanyang University
  • Won-Eung Lee College of Economics and Finance, Hanyang University
  • Bong-Jun Kim Viterbi School of Engineering, University of Southern California
In recent decades, machine learning (ML) algorithms has gained wide popularity in the finance literature. The goal of this research is to exploit machine learning techniques in order to analyze the effect of exchange-traded fund (ETF) illiquidity on tracking errors. We demonstrate the superior performance of the machine learning models ? Random Forest and Gradient Boosting Decision Tree, in particular - over traditional linear models in predicting U.S. ETF¡¯s tracking errors. Moreover, our variable importance analysis suggests that the features such as underlying assets based on U.S. assets (Invested in US Asset) and expense ratio (Expense Ratio), are two key factors in the determination of predicting the tracking errors on the ETF illiquidity. Finally, we further conduct SHAP (Shapley Additive exPlanations) technique in order to observe the impact of a particular variable(feature) on the difference between the considered- and average predictions of our machine learning models. Our results indicate that the most relevant variable is Invested in US Asset, which is in align with the previous importance analysis.

  • Jin-Hyung Cho
  • Gun-Hee Lee
  • Won-Eung Lee
  • Bong-Jun Kim
In recent decades, machine learning (ML) algorithms has gained wide popularity in the finance literature. The goal of this research is to exploit machine learning techniques in order to analyze the effect of exchange-traded fund (ETF) illiquidity on tracking errors. We demonstrate the superior performance of the machine learning models ? Random Forest and Gradient Boosting Decision Tree, in particular - over traditional linear models in predicting U.S. ETF¡¯s tracking errors. Moreover, our variable importance analysis suggests that the features such as underlying assets based on U.S. assets (Invested in US Asset) and expense ratio (Expense Ratio), are two key factors in the determination of predicting the tracking errors on the ETF illiquidity. Finally, we further conduct SHAP (Shapley Additive exPlanations) technique in order to observe the impact of a particular variable(feature) on the difference between the considered- and average predictions of our machine learning models. Our results indicate that the most relevant variable is Invested in US Asset, which is in align with the previous importance analysis.
ETF,Tracking Error,Machine Learning,SHAP