•  Home
  • Library
  • DRO home
Submit research Contact DRO

DRO

Openly accessible

An evolving ensemble model of multi-stream convolutional neural networks for human action recognition in still images

Slade, S, Zhang, L, Yu, Y and Lim, Chee Peng 2022, An evolving ensemble model of multi-stream convolutional neural networks for human action recognition in still images, Neural Computing and Applications, vol. 34, pp. 9205-9231, doi: 10.1007/s00521-022-06947-6.

Attached Files
Name Description MIMEType Size Downloads

Title An evolving ensemble model of multi-stream convolutional neural networks for human action recognition in still images
Author(s) Slade, S
Zhang, L
Yu, Y
Lim, Chee PengORCID iD for Lim, Chee Peng orcid.org/0000-0003-4191-9083
Journal name Neural Computing and Applications
Volume number 34
Start page 9205
End page 9231
Total pages 27
Publisher Springer
Place of publication Berlin, Germany
Publication date 2022
ISSN 0941-0643
1433-3058
Keyword(s) ALGORITHM
Computer Science
Computer Science, Artificial Intelligence
CONTEXT
Convolutional neural network
Ensemble model
FEATURES
Human action recognition
Hyper-parameter optimisation
Object detection and classification
PSO
SCENE
Science & Technology
Technology
Summary AbstractStill image human action recognition (HAR) is a challenging problem owing to limited sources of information and large intra-class and small inter-class variations which requires highly discriminative features. Transfer learning offers the necessary capabilities in producing such features by preserving prior knowledge while learning new representations. However, optimally identifying dynamic numbers of re-trainable layers in the transfer learning process poses a challenge. In this study, we aim to automate the process of optimal configuration identification. Specifically, we propose a novel particle swarm optimisation (PSO) variant, denoted as EnvPSO, for optimal hyper-parameter selection in the transfer learning process with respect to HAR tasks with still images. It incorporates Gaussian fitness surface prediction and exponential search coefficients to overcome stagnation. It optimises the learning rate, batch size, and number of re-trained layers of a pre-trained convolutional neural network (CNN). To overcome bias of single optimised networks, an ensemble model with three optimised CNN streams is introduced. The first and second streams employ raw images and segmentation masks yielded by mask R-CNN as inputs, while the third stream fuses a pair of networks with raw image and saliency maps as inputs, respectively. The final prediction results are obtained by computing the average of class predictions from all three streams. By leveraging differences between learned representations within optimised streams, our ensemble model outperforms counterparts devised by PSO and other state-of-the-art methods for HAR. In addition, evaluated using diverse artificial landscape functions, EnvPSO performs better than other search methods with statistically significant difference in performance.
Language eng
DOI 10.1007/s00521-022-06947-6
Indigenous content off
Field of Research 0801 Artificial Intelligence and Image Processing
0906 Electrical and Electronic Engineering
1702 Cognitive Sciences
HERDC Research category C1 Refereed article in a scholarly journal
Free to Read? Yes
Persistent URL http://hdl.handle.net/10536/DRO/DU:30162324

Document type: Journal Article
Collections: Institute for Intelligent Systems Research and Innovation (IISRI)
Open Access Collection
Related Links
Link Description
Link to full text (Open Access)
Go to link with your DU access privileges
 
Connect to Elements publication management system
Go to link with your DU access privileges
 
Connect to link resolver
 
Unless expressly stated otherwise, the copyright for items in DRO is owned by the author, with all rights reserved.

Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO. If you believe that your rights have been infringed by this repository, please contact drosupport@deakin.edu.au.

Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 0 times in TR Web of Science
Scopus Citation Count Cited 0 times in Scopus Google Scholar Search Google Scholar
Access Statistics: 14 Abstract Views, 0 File Downloads  -  Detailed Statistics
Created: Wed, 09 Feb 2022, 07:06:33 EST

Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO. If you believe that your rights have been infringed by this repository, please contact drosupport@deakin.edu.au.