DC Field | Value | Language |
dc.contributor.author | Nour Atamni | - |
dc.contributor.author | Said Naamneh | - |
dc.contributor.author | Jihad El-Sana | - |
dc.coverage.spatial | Минск | en_US |
dc.date.accessioned | 2024-02-21T11:32:57Z | - |
dc.date.available | 2024-02-21T11:32:57Z | - |
dc.date.issued | 2023 | - |
dc.identifier.citation | Nour Atamni. Hand Action Recognition / Nour Atamni, Said Naamneh, Jihad El-Sana // Pattern Recognition and Information Processing (PRIP'2023) = Распознавание образов и обработка информации (2023) : Proceedings of the 16th International Conference, October 17–19, 2023, Minsk, Belarus / United Institute of Informatics Problems of the National Academy of Sciences of Belarus. – Minsk, 2023. – P. 153–157. | en_US |
dc.identifier.uri | https://libeldoc.bsuir.by/handle/123456789/54304 | - |
dc.description.abstract | This paper presents a new dataset for hand action detection for manipulating (assembling and dismantling)
mechanical devices and an action detection model based on Transformers. An entry in this dataset is a first-person-view video segment that shows hands performing an action. These hands may utilize a tool and act on an object of the device. These actions were categorized into 12 classes for simple representation.
The deep learning model extracts features from each frame in a video, adds position embedding, and feeds the obtained feature vectors to a Transformer Encoder. The output vector goes through a fully connected network to obtain the final class. We have implemented our model and trained it using the presented dataset. We experimentally evaluate the learning and obtain encouraging results. | en_US |
dc.language.iso | en | en_US |
dc.publisher | BSU | en_US |
dc.subject | материалы конференций | en_US |
dc.subject | hand recognition | en_US |
dc.subject | action recognition | en_US |
dc.subject | action recognition dataset | en_US |
dc.title | Hand Action Recognition | en_US |
dc.type | Article | en_US |
Appears in Collections: | Pattern Recognition and Information Processing (PRIP'2023) = Распознавание образов и обработка информации (2023)
|