Human action recognition based on 3D convolution neural networks from RGBD videos

Al-Akam, Rawya

Human action recognition based on 3D convolution neural networks from RGBD videos

dc.contributor.author	Al-Akam, Rawya
dc.contributor.author	Paulus, Dietrich
dc.contributor.author	Gharabaghi, Darius
dc.contributor.editor	Skala, Václav
dc.date.accessioned	2019-05-10T10:15:24Z
dc.date.available	2019-05-10T10:15:24Z
dc.date.issued	2018
dc.description.abstract	Human action recognition with color and depth sensors has received increasing attention in image processing and computer vision. This paper target is to develop a novel deep model for recognizing human action from the fusion of RGB-D videos based on a Convolutional Neural Network. This work is proposed a novel 3D Convolutional Neural Network architecture that implicitly captures motion information between adjacent frames, which are represented in two main steps: As a First, the optical flow is used to extract motion information from spatio-temporal domains of the different RGB-D video actions. This information is used to compute the features vector values from deep 3D CNN model. Secondly, train and evaluate a 3D CNN from three channels of the input video sequences (i.e. RGB, depth and combining information from both channels (RGB-D)) to obtain a feature representation for a 3D CNN model. For evaluating the accuracy results, a Convolutional Neural Network based on different data channels are trained and additionally the possibilities of feature extraction from 3D Convolutional Neural Network and the features are examined by support vector machine to improve and recognize human actions. From this methods, we demonstrate that the test results from RGB-D channels better than the results from each channel trained separately by baseline Convolutional Neural Network and outperform the state of the art on the same public datasets.	en
dc.format	9 s.	cs
dc.format.mimetype	application/pdf
dc.identifier.citation	WSCG 2018: poster papers proceedings: 26th International Conference in Central Europe on Computer Graphics, Visualization and Computer Visionin co-operation with EUROGRAPHICS Association, p. 18-26.	en
dc.identifier.doi	https://doi.org/10.24132/CSRN.2018.2803.3
dc.identifier.isbn	978-80-86943-42-8
dc.identifier.issn	2464-4617
dc.identifier.uri	wscg.zcu.cz/WSCG2018/!!_CSRN-2803.pdf
dc.identifier.uri	http://hdl.handle.net/11025/34633
dc.language.iso	en	en
dc.publisher	Václav Skala - UNION Agency	en
dc.relation.ispartofseries	WSCG 2018: poster papers proceedings	en
dc.rights	© Václav Skala - Union Agency	cs
dc.rights.access	openAccess	en
dc.subject	rozpoznání akce	cs
dc.subject	RGBD videa	cs
dc.subject	optický tok	cs
dc.subject	3D konvoluční neuronová síť	cs
dc.subject	podpora vektorového stroje	cs
dc.subject.translated	action recognition	en
dc.subject.translated	RGBD videos	en
dc.subject.translated	optical flow	en
dc.subject.translated	3D convolutional neural network	en
dc.subject.translated	support vector machines	en
dc.title	Human action recognition based on 3D convolution neural networks from RGBD videos	en
dc.type	konferenční příspěvek	cs
dc.type	conferenceObject	en
dc.type.status	Peer-reviewed	en
dc.type.version	publishedVersion	en

Files

Original bundle

Showing 1 - 1 out of 1 results

Name:: Al-Akam.pdf
Size:: 1.27 MB
Format:: Adobe Portable Document Format
Description:: Plný text

Download

License bundle

Showing 1 - 1 out of 1 results

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

WSCG 2018: Poster Papers Proceedings