Deakin University
Browse

File(s) under permanent embargo

Early intent prediction of vulnerable road users from visual attributes using multi-task learning network

Version 2 2024-06-04, 02:20
Version 1 2018-07-29, 14:09
conference contribution
posted on 2024-06-04, 02:20 authored by K Saleh, M Hossny, S Nahavandi
© 2017 IEEE. In this paper we are presenting a novel approach for the problem of vulnerable road users (VRUs) attribute prediction which play such critical role for the intent prediction models of VRUs. We formulated the problem as a multi-task learning (MTL) image classification problem and we utilized a convolution neural network (ConvNet) based technique to exploit the commonality between two of the most important attributes of VRUs for intent prediction models (i.e, head orientation and body posture). We achieved classification accuracy scores of 83% and 76% for the body posture and head orientation attributes respectively. We compared the performance of our proposed solution against individual single task learning ConvNet models for each attribute and achieved significant overall accuracy over the two attribute classification tasks. Furthermore, we compared our proposed MTL-ConvNet model against other MTL approaches and achieved more than 18% AP score improvement in the classification of body posture attribute.

History

Pagination

3367-3372

Location

Banff, Canada

Start date

2017-10-05

End date

2017-10-08

ISBN-13

9781538616451

Language

eng

Publication classification

E1 Full written paper - refereed

Copyright notice

2017, IEEE

Title of proceedings

SMC 2017: Proceedings of IEEE International Conference on Systems, Man, and Cybernetics

Event

Systems, Man, and Cybernetics. International Conference (2017 : Banff, Canada)

Publisher

IEEE

Place of publication

Piscataway, N.J.