Inspired by binary descriptors (e.g., LBP, ALOHA, FREAK, BRISK), we propose the 3D Binary Pair Differences (3DBPD) video descriptor for action recognition. By comparing several spatio-temporal sub-regions around interest points, our descriptor produces a feature vector whose dimensionality is up to 30% smaller than that of existing state-of-the-art descriptors. We demonstrate the effectiveness of the 3DBPD descriptor for action recognition using an SVM classifier and a simple Bag of Video Words (BOV) representation generated with k-means. The proposed descriptor achieves recognition rates that are highly competitive with those of other state-of-the-art descriptors, together with outstanding performance in terms of speed. Additionally, the 3DBPD descriptor requires a smaller codebook than existing BOV-based descriptors.