基于小样本深度学习的通风柜橱窗状态识别方法

马振伟; 何高奇; 袁玉波

doi:10.14135/j.cnki.1006-3080.20190412004

Fume Hood Window State Recognition Method Based on Few-Shot Deep Learning

Abstract

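Abstract: When laboratory personnel leave a chemical laboratory without promptly closing the fume hood window, the open window creates a serious safety hazard and wastes energy, and effective information-based management tools are currently lacking. Exploiting the non-contact nature and strong scalability of computer vision, this paper proposes a fume hood window state recognition method based on few-shot deep learning. First, the surveillance video is preprocessed and the fume hood window region is extracted using motion features and geometric priors. An improved multi-scale dilated prototypical network is then trained to accurately recognize the state of the fume hood window. In practical use, combining the method with an improved person detection algorithm effectively reduces the number of recognition runs. Experiments show that the method's accuracy is 10.95% higher than that of a convolutional neural network and that it is highly robust to illumination changes, so it can effectively meet the daily safety management requirements of chemical laboratories.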
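
The classification step of a prototypical network can be illustrated with a minimal sketch. It shows only the standard decision rule (each class prototype is the mean of its support embeddings, and a query takes the label of the nearest prototype by squared Euclidean distance), not the paper's improved multi-scale dilated network. The embedding network is omitted, and the function names, the `"open"`/`"closed"` labels, and the toy 2-D embeddings are all assumptions made for illustration.

```python
from collections import defaultdict


def class_prototypes(support_embeddings, support_labels):
    """Average the support embeddings of each class into one prototype."""
    grouped = defaultdict(list)
    for vec, label in zip(support_embeddings, support_labels):
        grouped[label].append(vec)
    # zip(*vecs) iterates over dimensions, so each prototype is a per-dimension mean.
    return {
        label: [sum(dim) / len(vecs) for dim in zip(*vecs)]
        for label, vecs in grouped.items()
    }


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def classify(query_embedding, prototypes):
    """Return the label whose prototype is closest to the query embedding."""
    return min(
        prototypes,
        key=lambda label: squared_distance(query_embedding, prototypes[label]),
    )


# Hypothetical 2-D embeddings standing in for the output of an embedding network.
support = [[0.9, 0.1], [1.1, -0.1], [-1.0, 0.2], [-0.8, 0.0]]
labels = ["open", "open", "closed", "closed"]

prototypes = class_prototypes(support, labels)
prediction = classify([0.7, 0.3], prototypes)
print(prediction)
```

Because the prototypes are plain class means, a new window state could in principle be added from a handful of labelled images without retraining a classifier head, which is the usual motivation for this rule when data are scarce.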

Abstract: In chemical laboratories, staff often forget to close the fume hood window before they leave, which creates safety hazards and wastes energy. A method for the safety management of fume hood windows is therefore needed. To the best of the authors' knowledge, existing work on window status recognition relies mainly on electronic control systems, which are not suitable for fume hood windows. Exploiting the non-contact nature and easy extensibility of computer vision, this paper proposes a novel safety management method for fume hood windows. First, the surveillance videos are preprocessed and the fume hood window regions are extracted using motion features and geometric priors, which reduces the influence of irrelevant regions on window status recognition. Because no public data set is available and the number of fume hood windows in a laboratory is limited, this paper constructs a new data set containing 400 window images and proposes a few-shot learning method for recognizing the status of fume hood windows. Compared with traditional few-shot learning data sets, fume hood window images have a higher resolution, which makes it difficult to extract effective features. To overcome this challenge, this paper applies dilated convolution to enlarge the receptive field and replaces the traditional convolution layer with an inception layer that uses multi-scale dilation rates. To avoid invalid detection of the window status while staff are still in the laboratory, the moving foreground region extracted by a Gaussian mixture model is used as the prior region for YOLOv3 (You Only Look Once version 3) object detection, which greatly reduces recognition errors. In the experiments, the proposed method is compared with traditional machine learning algorithms and a CNN (convolutional neural network). LBP (local binary pattern), PCA (principal component analysis), ColorHist (histogram of color), and HOG (histogram of oriented gradient) are selected as features for the machine learning methods, covering texture, dimensionality reduction, color, and shape. The experimental results show that the proposed method achieves 99.29% accuracy under normal illumination, 17.20% higher than the best traditional method (HOG combined with random forest) and 10.95% higher than the convolutional neural network. Under illumination changes the accuracy is 95.74%, a small change relative to the accuracy under normal illumination.
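
To make the receptive-field argument concrete, the short sketch below computes how far a single dilated convolution reaches. A kernel of size k with dilation rate d spans d(k - 1) + 1 input pixels while keeping the same k × k weights. The 3 × 3 kernel and the dilation rates 1, 2 and 4 are hypothetical, since the abstract does not state the rates used in the inception layer, and the helper names are illustrative only.

```python
def effective_kernel_size(kernel_size, dilation):
    """Input span covered by one dilated kernel: d * (k - 1) + 1."""
    return dilation * (kernel_size - 1) + 1


def same_padding(kernel_size, dilation):
    """Padding that preserves spatial size at stride 1 (odd kernel sizes)."""
    return dilation * (kernel_size - 1) // 2


# Hypothetical parallel branches of a multi-scale dilated layer:
# one 3x3 kernel per branch, each with a different dilation rate.
KERNEL_SIZE = 3
DILATION_RATES = (1, 2, 4)

branch_summary = [
    {
        "dilation": d,
        "span": effective_kernel_size(KERNEL_SIZE, d),
        "padding": same_padding(KERNEL_SIZE, d),
        "weights": KERNEL_SIZE * KERNEL_SIZE,  # unchanged by dilation
    }
    for d in DILATION_RATES
]

for row in branch_summary:
    print(row)
```

Under these assumed rates, the three branches cover spans of 3, 5 and 9 pixels with nine weights each, which is why dilation is attractive for high-resolution inputs: the receptive field grows without adding parameters.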
