Two-view methods have been well developed to identify human actions. However, in a case where the corresponding imaged points cannot induce distinguished measures, the performance of the methods deteriorates. For this reason, we propose a new view-invariant measure for human action recognition by enforcing tri-view constraints in this paper. This new measurement method can be tolerant to different rates of human actions and the anthropometric proportions. We apply our approach to video synchronization by imposing both the similarity ratio and the consistency in the trifocal tensor over entire video sequences. By testing on both synthetic and real data, our method has achieved higher tolerance to noise levels, as well as higher identification accuracy than the traditional two-view method. Experimental results demonstrate that our approach can identify human pose transitions, in spite of dynamic time-lines, different viewpoints and unknown camera parameters.
Wang, Y., Lin, W., Huang, X., & Lin, X. (2012). Human Action Recognition from Video Sequences by Enforcing Tri-view Constraints. Computer Journal, 55(9), 1030-1040. https://doi.org/10.1093/comjnl/bxr115