Markerless human activity recognition method based on deep neural network model using multiple cameras

Published in CoDIT 2018, 2018

Most methods of multi-view human activity recognition can be classified as conventional computer vision approaches. Those approaches separate feature descriptor and discriminator. Hence, the feature extractor cannot learn from the mistakes made by the classifier. In this paper, a deep neural network (DNN) model for human activity estimation using multi-view sequences of raw images is presented. This approach incorporates features extractor and discriminator into a single model. The model comprises three parts, a convolutional neural network (CNN) block, MSLSTMRes, and a dense layer. This method enables discrimination of human activity such as “walk” and “sit down” by merely using sequences of raw images. Experimental results on IXMAS dataset using one-subject cross validation demonstrates high prediction rate that is comparable to other methods in the literature, which utilized preprocessed images such as silhouette and volumetric data and sophisticated feature extractor.

Recommended citation: Putra, P.U., Shima, K., & Shimatani, K. (2018). Markerless human activity recognition using multiple cameras. *CoDIT 2018*.

Share on

Bluesky Facebook LinkedIn Mastodon X (formerly Twitter)

Prasetia Utama Putra

Share on