Predictive hand gesture classification for real time robot control
Yuanda Hu, Jiacheng Xu, Zhiqiang Ma, Guitao Cao
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service (2018)
摘要/Abstract
在本文中,我们提出了一种基于视觉的人机交互手势识别系统。手势识别系统用于开发人类和我们的机械手之间的实时石头剪刀布游戏。我们的任务是通过使用高速摄像机尽快预测手势。由于计算复杂度,标准的基于长期循环卷积网络的动作分类系统无法满足于基于高速相机的分类任务。我们建议通过采用更高效的网络架构并使用基于阈值的方法来提前预测手势来解决这个问题。我们在石头剪刀布游戏的新手势数据集上验证了我们提出的方法。该模型能够成功学习持续时间和复杂程度不同的手势。对 CNN 和长期递归卷积网络进行了比较分析。我们报告的手势分类准确率为 97%,并且报告了每帧 7 毫秒的近乎实时的计算复杂性。
In this paper, we propose a vision-based hand gesture recognition system for human-computer interaction. The gesture recognition systems are employed in developing a rock-paper-scissors game between human and our robotic hands in realtime. Our task is to predict the gestures as soon as possible by using high-speed cameras. Due to the computational complexity, the standard long-term recurrent convolution network-based action classification system cannot be contented with classification tasks based on high-speed cameras. We propose to address this issue by employing a more efficient network architecture and using a threshold-based method to predict the gesture in advance. We validate our proposed method on the new gesture dataset for the rock-paper-scissors game. The model is able to successfully learn gestures varying in duration and complexity. A comparative analysis of CNN and long-term recurrent convolution network is performed. We report a gesture classification accuracy of 97% and report a near real-time computational complexity of 7 ms per frame.
相关信息/Info
作者/Authors
链接/Link
Yuanda Hu, Jiacheng Xu, Zhiqiang Ma, Guitao Cao
https://dl.acm.org/doi/abs/10.1145/3240876.3240914