DGHNL: A new deep genetic hierarchical network of learners for prediction of credit scoring

Pławiak, P; Abdar, Moloud; Pławiak, J; Makarenkov, V; Acharya, U R

abdar-dghnlanewdeep-2020.pdf (1.21 MB)

DGHNL: A new deep genetic hierarchical network of learners for prediction of credit scoring

journal contribution

posted on 2020-04-01, 00:00 authored by P Pławiak, Moloud Abdar, J Pławiak, V Makarenkov, U R Acharya

© 2019 Credit scoring (CS) is an effective and crucial approach used for risk management in banks and other financial institutions. It provides appropriate guidance on granting loans and reduces risks in the financial area. Hence, companies and banks are trying to use novel automated solutions to deal with CS challenge to protect their own finances and customers. Nowadays, different machine learning (ML) and data mining (DM) algorithms have been used to improve various aspects of CS prediction. In this paper, we introduce a novel methodology, named Deep Genetic Hierarchical Network of Learners (DGHNL). The proposed methodology comprises different types of learners, including Support Vector Machines (SVM), k-Nearest Neighbors (kNN), Probabilistic Neural Networks (PNN), and fuzzy systems. The Statlog German (1000 instances) credit approval dataset available in the UCI machine learning repository is used to test the effectiveness of our model in the CS domain. Our DGHNL model encompasses five kinds of learners, two kinds of data normalization procedures, two extraction of features methods, three kinds of kernel functions, and three kinds of parameter optimizations. Furthermore, the model applies deep learning, ensemble learning, supervised training, layered learning, genetic selection of features (attributes), genetic optimization of learners parameters, and novel genetic layered training (selection of learners) approaches used along with the cross-validation (CV) training-testing method (stratified 10-fold). The novelty of our approach relies on a proper flow and fusion of information (DGHNL structure and its optimization). We show that the proposed DGHNL model with a 29-layer structure is capable to achieve the prediction accuracy of 94.60% (54 errors per 1000 classifications) for the Statlog German credit approval data. It is the best prediction performance for this well-known credit scoring dataset, compared to the existing work in the field.

History

Journal

Information Sciences

Volume

516

Pagination

401 - 418

Publisher

Elsevier

Location

Amsterdam, The Netherlands

Publisher DOI

https://doi.org/10.1016/j.ins.2019.12.045

Link to full text

https://doi.org/10.1016/j.ins.2019.12.045

ISSN

0020-0255

Language

eng

Publication classification

C1 Refereed article in a scholarly journal

Usage metrics

Keywords

Credit scoring Machine learning Data mining Ensemble learning Deep learning Genetic algorithm Feature extraction and selection Science & Technology Technology Computer Science, Information Systems Computer Science CLASSIFICATION

Licence

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

DGHNL: A new deep genetic hierarchical network of learners for prediction of credit scoring

History

Journal

Volume

Pagination

Publisher

Location

Publisher DOI

Link to full text

ISSN

Language

Publication classification

Usage metrics

Categories

Keywords

Licence

Exports