Skip navigation
Please use this identifier to cite or link to this item: https://libeldoc.bsuir.by/handle/123456789/54304
Title: Hand Action Recognition
Authors: Nour Atamni
Said Naamneh
Jihad El-Sana
Keywords: материалы конференций;hand recognition;action recognition;action recognition dataset
Issue Date: 2023
Publisher: BSU
Citation: Nour Atamni. Hand Action Recognition / Nour Atamni, Said Naamneh, Jihad El-Sana // Pattern Recognition and Information Processing (PRIP'2023) = Распознавание образов и обработка информации (2023) : Proceedings of the 16th International Conference, October 17–19, 2023, Minsk, Belarus / United Institute of Informatics Problems of the National Academy of Sciences of Belarus. – Minsk, 2023. – P. 153–157.
Abstract: This paper presents a new dataset for hand action detection for manipulating (assembling and dismantling) mechanical devices and an action detection model based on Transformers. An entry in this dataset is a first-person-view video segment that shows hands performing an action. These hands may utilize a tool and act on an object of the device. These actions were categorized into 12 classes for simple representation. The deep learning model extracts features from each frame in a video, adds position embedding, and feeds the obtained feature vectors to a Transformer Encoder. The output vector goes through a fully connected network to obtain the final class. We have implemented our model and trained it using the presented dataset. We experimentally evaluate the learning and obtain encouraging results.
URI: https://libeldoc.bsuir.by/handle/123456789/54304
Appears in Collections:Pattern Recognition and Information Processing (PRIP'2023) = Распознавание образов и обработка информации (2023)

Files in This Item:
File Description SizeFormat 
Nour_Atamni_Hand.pdf2.54 MBAdobe PDFView/Open
Show full item record Google Scholar

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.