File(s) under permanent embargo
Early intent prediction of vulnerable road users from visual attributes using multi-task learning network
conference contribution
posted on 2017-11-27, 00:00 authored by K Saleh, Mohammed Hossny, Saeid Nahavandi© 2017 IEEE. In this paper we are presenting a novel approach for the problem of vulnerable road users (VRUs) attribute prediction which play such critical role for the intent prediction models of VRUs. We formulated the problem as a multi-task learning (MTL) image classification problem and we utilized a convolution neural network (ConvNet) based technique to exploit the commonality between two of the most important attributes of VRUs for intent prediction models (i.e, head orientation and body posture). We achieved classification accuracy scores of 83% and 76% for the body posture and head orientation attributes respectively. We compared the performance of our proposed solution against individual single task learning ConvNet models for each attribute and achieved significant overall accuracy over the two attribute classification tasks. Furthermore, we compared our proposed MTL-ConvNet model against other MTL approaches and achieved more than 18% AP score improvement in the classification of body posture attribute.