Bankruptcy prediction models are one of the most reliable and useful quantitative tool in financial decision-making performed by financial institutions. This dissertation will review, firstly, the vast literature in this field, distinguishing and analyzing the main theoretical characteristic of statistical and machine learning models. Secondly, by retrieving European Union data through the Orbis database, this research implements and compares eight model performances at one, two and three years prior the bankruptcy by coding in the Python language through the Jupyter Notebook application. The empirical part tries to be as realistic as possible: i) without any ex-post matching procedure that could fictitiously increase the performance of the models and ii) using vectors of only three explanatory variables taken for three consecutive years to constraint and prevent overfitting of the models. The main innovations that this research brings in the bankruptcy prediction field are the SMOTE and Tomek Link approach used to solve the imbalanced dataset problem and the hyperparameter tuning made through a randomized and grid search for each model to find the best parameters for the data sample.
Bankruptcy prediction models are one of the most reliable and useful quantitative tool in financial decision-making performed by financial institutions. This dissertation will review, firstly, the vast literature in this field, distinguishing and analyzing the main theoretical characteristic of statistical and machine learning models. Secondly, by retrieving European Union data through the Orbis database, this research implements and compares eight model performances at one, two and three years prior the bankruptcy by coding in the Python language through the Jupyter Notebook application. The empirical part tries to be as realistic as possible: i) without any ex-post matching procedure that could fictitiously increase the performance of the models and ii) using vectors of only three explanatory variables taken for three consecutive years to constraint and prevent overfitting of the models. The main innovations that this research brings in the bankruptcy prediction field are the SMOTE and Tomek Link approach used to solve the imbalanced dataset problem and the hyperparameter tuning made through a randomized and grid search for each model to find the best parameters for the data sample.
Bankruptcy prediction models: A comparative analysis
GREGGIO, NICOLÒ
2021/2022
Abstract
Bankruptcy prediction models are one of the most reliable and useful quantitative tool in financial decision-making performed by financial institutions. This dissertation will review, firstly, the vast literature in this field, distinguishing and analyzing the main theoretical characteristic of statistical and machine learning models. Secondly, by retrieving European Union data through the Orbis database, this research implements and compares eight model performances at one, two and three years prior the bankruptcy by coding in the Python language through the Jupyter Notebook application. The empirical part tries to be as realistic as possible: i) without any ex-post matching procedure that could fictitiously increase the performance of the models and ii) using vectors of only three explanatory variables taken for three consecutive years to constraint and prevent overfitting of the models. The main innovations that this research brings in the bankruptcy prediction field are the SMOTE and Tomek Link approach used to solve the imbalanced dataset problem and the hyperparameter tuning made through a randomized and grid search for each model to find the best parameters for the data sample.File | Dimensione | Formato | |
---|---|---|---|
Greggio_Nicolò.pdf
accesso aperto
Dimensione
5.97 MB
Formato
Adobe PDF
|
5.97 MB | Adobe PDF | Visualizza/Apri |
The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License
https://hdl.handle.net/20.500.12608/31451