针对复杂场景下手势分割准确性低,手势细粒度特征描述不充分和手势识别实时性差的问题,提出融合批量再标准化和YOLOv3的手势识别算法。首先,在复杂背景及不同光照条件下采集20种手势,运用数据增广策略进行样本扩充并建立标准手势库;然后通过K均值维度聚类获取训练集手势锚点框,负责对不同尺度手势进行检测;最后利用迁移学习和微调方法训练得到手势识别模型。为解决YOLOv3网络在手势训练阶段和预测阶段进行规范化处理时数据间可能存在较大偏差问题,采用批量再标准化方法提高手势识别准确性。手势识别过程具有快速、准确、非接触的优势,实验表明在正常实验环境下,手势平均识别率为97.6%,对于复杂背景下干扰较大的手势平均识别率达到89.2%以上,单次手势识别速度为0.04s。
Abstract
Aiming at the problem of low accuracy of gesture segmentation, inadequate description of fine-grained features and poor real-time gesture recognition in complex scenes, a hand gesture recognition algorithm combined with batch renormalization and YOLOv3 is proposed.Firstly, 20 gestures are collected under complex backgrounds and different lighting conditions, the data augmentation strategies are used for sample expansion, and a standard gesture library is established.Then, the anchor is obtained through K-means dimension clustering to detect the different scale hand types; Finally, the gesture recognition model is obtained by using the transform learning and fine-tuning.In order to solve the problem that there may be large deviations between the data when YOLOv3 network are standardized in the gesture training phase and the prediction phase, the batch renormalization method is adopted to improve the accuracy of hand gesture recognition.Experiments show that the average accuracy in normal experimental environment is 97.6%.the average recognition rate of gestures is over 89.2% in the complex environment, and the recognition speed is 0.04second.
关键词
计量学;手势识别;YOLOv3;批量再标准化;迁移学习 /
图像处理
Key words
metrology /
hand gesture recognition /
YOLOv3 /
batch renormalization /
transform learning /
image processing
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1]Badi H. Retraction: A Survey on Recent Vision-Based Gesture Recognition[J].Intelligent Industrial Systems, 2016, 2: 179-191.
[2]Sagayam K M, Hemanth D J. Hand posture and gesture recognition techniques for virtual reality applications: a survey[J].Virtual Reality, 2017, 21(2): 91-107.
[3]Pisharady P K, Saerbeck M.Recent methods and databases in vision-based hand gesture recognition[J].Computer Vision & Image Understanding, 2015, 141(C): 152-165.
[4]Chen D, Li G, Sun Y, et al. An Interactive Image Segmentation Method in Hand Gesture Recognition[J].Sensors, 2017, 17(2): 539-550.
[5]宋晓娜, 冯志全, 郝晓艳. 复杂背景下基于空间分布特征的手势识别算法[J].计算机辅助设计与图形学学报, 2010, 22(10): 1841-1848.
Song X N, Feng Z Q, He X Y. Gesture Recognition Algorithm Based on Spatial Distribution Features in Complex Background[J].Journal of Computer-Aided Design and Computer Graphics, 2010, 22(10): 1841-1848.
[6]张明达. 基于视觉的手势识别方法研究[D]. 兰州:兰州交通大学, 2017.
[7]Lu Z, Xiang C, Qiang L, et al. A Hand Gesture Recognition Framework and Wearable Gesture-Based Interaction Prototype for Mobile Devices[J].IEEE Transactions on Human-Machine Systems, 2017, 44(2): 293-299.
[8]陈杰, 尚丽. 基于核竞争学习算法的图像特征提取[J].计量学报, 2017, 38(5): 576-579.
Chen J, Shang L. Image feature extraction based on kernel competitive learning algorithm [J].Acta Metrologica Sinica, 2017,38(5): 576-579.
[9]熊娟, 文桦. 基于萤火虫搜索算法的图像纹理特征提取研究[J].计量学报, 2016, 37(3): 255-259.
Xiong J, Wen H. Research on Image Texture Feature Extraction Based on Firefly Search Algorithm [J].Acta Metrologica Sinica, 2016, 37(3): 255-259.
[10]Majdar R S, Ghassemian H. A probabilistic SVM approach for hyperspectral image classification using spectral and texture features[J].International Journal of Remote Sensing, 2017, 38(15): 4265-4284.
[11]程淑红, 马继勇, 张仕军, 等. 改进的混合高斯与YOLOv2融合烟雾检测算法[J]. 计量学报, 2019, 40(5): 798-803.
Cheng S H, Ma J Y, Zhang S J, et al. Smoke Detection Algorithm Combined with Improved Gaussian Mixture and YOLOv2[J]. Acta Metrologica Sinica, 2019, 40(5): 798-803.
[12]谭俊. 一个改进的YOLOv3目标识别算法研究[D].武汉: 华中科技大学, 2018.
[13]Wang L, Yang S, Yang S, et al. Automatic thyroid nodule recognition and diagnosis in ultrasound imaging with the YOLOv2 neural network[J].World Journal of Surgical Oncology, 2019, 17(1): 12-12.
[14]Redmon J, Farhadi A. YOLO9000: Better, Faster, Stronger[C]// IEEE Conference on Computer Vision & Pattern Recognition. Honolulu, HI, USA, 2017.
[15]Goussies N A, Mejail M. Transfer learning decision forests for gesture recognition[J].Journal of Machine Learning Research, 2017, 15(1): 3667-3690.
基金
国家自然科学基金(61601400);“十三五”装备预研共用技术项目(41412040302)