Abstract
In this work, a schematic model for human activity recognition based on multiple cues is introduced. In the beginning, a sequence of temporal silhouettes of the moving human body parts are extracted from a video clip (i.e., an action snippet). Next, each action snippet is temporally split into several time-slices represented by fuzzy intervals. As shape features, a variety of descriptors both boundary-based (Fourier descriptors, Curvature features) and region-based (Moments, Moment-based features) are then extracted from the silhouettes at each time-slice. Finally, an NB (Naïve Bayes) classifier is learned in the feature space for activity classification. The performance of the method was evaluated on the KTH dataset and the obtained results are quite encouraging and show that an accuracy on par with or exceeding that of existing methods is achievable. Further the simplicity and computational efficiency of the features employed allow the method to achieve real-time performance, and thus it can provide latency guarantees to real-time applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Chakraborty, B., Bagdanov, A.D., Gonzàlez, J.: Towards real-time human action recognition. In: Araujo, H., Mendonça, A.M., Pinho, A.J., Torres, M.I. (eds.) IbPRIA 2009. LNCS, vol. 5524, pp. 425–432. Springer, Heidelberg (2009)
Jiang, Y.G., Ngo, C.W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: Proc. ACM Int. Conf. Image and Video Retrieval, vol. 8, pp. 494–501 (2007)
Little, L., Boyd, J.E.: Recognizing people by their gait: The shape of motion. International Journal of Computer Vision 1, 1–32 (1998)
Cutler, R., Davis, L.S.: Robust real-time periodic motion detection, analysis, and applications. IEEE Transactions on PAMI 22, 781–796 (2000)
Thuran, C., Hlaváč, V.: Pose primitive based human action recognition in videos or still images. In: IEEE Computer Society Conference on CVPR (2008)
Lu, W.L., Okuma, K., Little, J.J.: Tracking and recognizing actions of multiple hockey players using the boosted particle filter. Image and Vision Computing 27, 189–205 (2009)
Sadek, S., Al-Hamadi, A., Michaelis, B., Sayed, U.: Toward robust action retrieval in video. In: Proc. of BMVC 2010 (2010)
Laptev, I., Pérez, P.: Retrieving actions in movies. In: Proc. ICCV, pp. 1–8 (2007)
Dollàr, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: Proc. IEEE Workshop on VS-PETS, pp. 65–72 (2005)
Liu, J., Shah, M.: Learning human actions via information maximization. In: IEEE Int. Conference on Computer Vision and. Pattern Recognition (2008)
Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Transactions on PAMI 23, 257–267 (2001)
Shechtman, E., Irani, M.: Space-time behavior based correlation. Computer Vision and. Pattern Recognition 1, 405–412 (2005)
Rodriguez, M.D., Ahmed, J., Shah, M.: Action MACH: A spatio-temporal maximum average correlation height filter for action recognition. In: Proc. CVPR (2008)
Jhuang, H., Serre, T., Wolf, L., Poggio, T.: A biologically inspired system for action recognition. In: IEEE International Conference on Computer Vision, pp. 257–267 (2007)
Schindler, K., Van Gool, L.: Action snippets: How many frames does action recognition require? In: Proc. CVPR, pp. 1–8 (2008)
Laxton, B., Lim, J., Kriegman, D.: Leveraging temporal, contextual and ordering constraints for recognizing complex activities in video. In: Proc. CVPR, pp. 1–8 (2007)
Blei, D.M., Lafferty, J.D.: Correlated topic models. Advances in Neural Information Processing Systems (NIPS) 18, 147–154 (2006)
Stauffer, C., Grimson, W.E.L.: Adaptive background mixture models for real-time tracking. In: CVPR, pp. 246–2527 (1999)
Hu, M.: Visual pattern recognition by moment invariants. IRE Tr. on. Inf. Theory, 179–187 (1962)
Alajlan, N., Kamel, M.S., Freeman, G.: Multi-object image retrieval based on shape and topology. Signal Processing: Image Communication 21, 904–918 (2006)
Domingos, P., Pazzani, M.: On the optimality of the simple bayesian classifier under zero-one loss. Machine Learning 29, 103–137 (1997)
Wang, Y., Mori, G.: Max-margin hidden conditional random fields for human action recognition. In: CVPR (2009)
Rapantzikos, K., Avrithis, Y., Kollias, S.: Dense saliency-based spatiotemporal feature points for action recognition. In: CVPR, pp. 1–8 (2009)
Ke, Y., Sukthankar, R., Hebert, M.: Efficient visual event detection using volumetric features. In: Proc. ICCV, vol. 1, pp. 166–173 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sadek, S., Al-Hamadi, A., Michaelis, B., Sayed, U. (2010). Human Activity Recognition: A Scheme Using Multiple Cues. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2010. Lecture Notes in Computer Science, vol 6454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17274-8_56
Download citation
DOI: https://doi.org/10.1007/978-3-642-17274-8_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17273-1
Online ISBN: 978-3-642-17274-8
eBook Packages: Computer ScienceComputer Science (R0)