Deakin University
Browse

Compact and Low-Complexity Binary Feature Descriptor and Fisher Vectors for Video Analytics

Version 2 2024-06-06, 01:32
Version 1 2019-09-19, 08:16
journal contribution
posted on 2024-06-06, 01:32 authored by R Leyva, V Sanchez, Chang-Tsun LiChang-Tsun Li
© 1992-2012 IEEE. In this paper, we propose a compact and low-complexity binary feature descriptor for video analytics. Our binary descriptor encodes the motion information of a spatio-temporal support region into a low-dimensional binary string. The descriptor is based on a binning strategy and a construction that binarizes separately the horizontal and vertical motion components of the spatio-temporal support region. We pair our descriptor with a novel Fisher Vector (FV) scheme for binary data to project a set of binary features into a fixed length vector in order to evaluate the similarity between feature sets. We test the effectiveness of our binary feature descriptor with FVs for action recognition, which is one of the most challenging tasks in computer vision, as well as gait recognition and animal behavior clustering. Several experiments on the KTH, UCF50, UCF101, CASIA-B, and TIGdog datasets show that the proposed binary feature descriptor outperforms the state-of-the-art feature descriptors in terms of computational time and memory and storage requirements. When paired with FVs, the proposed feature descriptor attains a very competitive performance, outperforming several state-of-the-art feature descriptors and some methods based on convolutional neural networks.

History

Journal

IEEE Transactions on Image Processing

Volume

28

Pagination

6169-6184

Location

United States

ISSN

1057-7149

eISSN

1941-0042

Language

English

Publication classification

C1 Refereed article in a scholarly journal

Copyright notice

2019, IEEE

Issue

12

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC