Small-Scale Data Classification Based on Deep Forest
© Springer Nature Switzerland AG 2019. Developing effective and efficient small-scale data classification methods is very challenging in the digital age. Recent researches have shown that deep forest achieves a considerable increase in classification accuracy compared with general methods, especially when the training set is small. However, the standard deep forest may experience over-fitting and feature vanishing in dealing with small sample size. In this paper, we tackle this problem by proposing a skip connection deep forest (SForest), which can be viewed as a modification of the standard deep forest model. It leverages multi-class-grained scanning method to train multiple binary forest from different training sub-dataset of classes to encourage the diversity of ensemble and solve the class-imbalance problem. To expand the diversity of each layer in cascade forest, five different classifiers are employed. Meanwhile, the fitting quality of each classifiers is taken into consideration in representation learning. In addition, we propose a skip connection strategy to augment the feature vector, and use Gradient Boosting Decision Tree (GBDT) as the final classifier to improve the overall performance. Experiments demonstrated the proposed model achieved superior performance than the-state-of-the-art deep forest methods with almost the same parameter.
History
Pagination
428-439Location
Athens, GreeceStart date
2019-08-28End date
2019-08-30ISSN
0302-9743eISSN
1611-3349ISBN-13
9783030295509Language
engPublication classification
E1 Full written paper - refereedEditor/Contributor(s)
Douligeris C, Karagiannis D, Apostolou DTitle of proceedings
KSEM 2019 : Knowledge science, engineering and management : 12th international conference, KSEM 2019, Athens, Greece, August 28-30, 2019 : proceedingsEvent
Knowledge Science, Engineering and Management. Conference (2019 : 12th : Athens, Greece)Publisher
SpringerPlace of publication
Cham, SwitzerlandSeries
Lecture Notes in Computer Science; 11775Usage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC