To read this content please select one of the options below:

Financial distress prediction based on ensemble feature selection and improved stacking algorithm

Chong Wu (School of Management, Harbin Institute of Technology, Harbin, China)
Xiaofang Chen (School of Management, Harbin Institute of Technology, Harbin, China)
Yongjie Jiang (School of Management, Harbin Institute of Technology, Harbin, China)

Kybernetes

ISSN: 0368-492X

Article publication date: 26 February 2024

120

Abstract

Purpose

While the Chinese securities market is booming, the phenomenon of listed companies falling into financial distress is also emerging, which affects the operation and development of enterprises and also jeopardizes the interests of investors. Therefore, it is important to understand how to accurately and reasonably predict the financial distress of enterprises.

Design/methodology/approach

In the present study, ensemble feature selection (EFS) and improved stacking were used for financial distress prediction (FDP). Mutual information, analysis of variance (ANOVA), random forest (RF), genetic algorithms, and recursive feature elimination (RFE) were chosen for EFS to select features. Since there may be missing information when feeding the results of the base learner directly into the meta-learner, the features with high importance were fed into the meta-learner together. A screening layer was added to select the meta-learner with better performance. Finally, Optima hyperparameters were used for parameter tuning by the learners.

Findings

An empirical study was conducted with a sample of A-share listed companies in China. The F1-score of the model constructed using the features screened by EFS reached 84.55%, representing an improvement of 4.37% compared to the original features. To verify the effectiveness of improved stacking, benchmark model comparison experiments were conducted. Compared to the original stacking model, the accuracy of the improved stacking model was improved by 0.44%, and the F1-score was improved by 0.51%. In addition, the improved stacking model had the highest area under the curve (AUC) value (0.905) among all the compared models.

Originality/value

Compared to previous models, the proposed FDP model has better performance, thus bridging the research gap of feature selection. The present study provides new ideas for stacking improvement research and a reference for subsequent research in this field.

Keywords

Acknowledgements

This work was supported by the Philosophy and Social Science Research Planning Project of Heilongjiang Province (No. 20JYB039) and the National Natural Science Foundation of China (No. 72131005 and 72121001).

Citation

Wu, C., Chen, X. and Jiang, Y. (2024), "Financial distress prediction based on ensemble feature selection and improved stacking algorithm", Kybernetes, Vol. ahead-of-print No. ahead-of-print. https://doi.org/10.1108/K-08-2023-1428

Publisher

:

Emerald Publishing Limited

Copyright © 2024, Emerald Publishing Limited

Related articles