基于MAML算法的YOLOv3目标检测模型

沈震宇; 朱昌明; 王喆

doi:10.14135/j.cnki.1006-3080.20201128002

基于MAML算法的YOLOv3目标检测模型

YOLOv3 Object Detection Model Based on MAML Algorithm

摘要

摘要: 作为典型的一体化卷积神经网络，YOLOv3模型的网路传输途径简单，检测速度相对较快，但检测精度较低。当遇到新的目标在训练数据集中存在的样本较少时，模型检测会更加不准确，甚至会出现检测不到的情况。本文基于与模型不相关的元学习算法（MAML）改进了YOLOv3主干网络的结构，使其具有内循环和外循环的梯度下降，在初始参数基础上进行多步的梯度调整，达到仅用小样本数据就能快速收敛的目的。实验结果表明，该方法使得YOLOv3模型的检测精度提升了5.24%，且可以使梯度下降保持稳定，有效地满足YOLOv3模型在小样本数据训练情况下识别目标位置的精准性和泛化性。

Abstract: Object detection has been a research hotspot in the field of computer vision in recent years. Due to the extensive application of deep learning, the target detection technology combined with deep learning has been developing and making continuous breakthroughs. In the field of target detection, it is difficult to solve the problem of target detection with few sample categories, and detect small targets with high accuracy via the training with few sample categories. By means of the model-agnostic meta-learning (MAML) algorithm in meta learning, this paper improves the information transmission form of the backbone network in YOLOv3 to make Darknet-53 achieve two stages of parameter internal update and external update in gradient descent. By multi-step gradient adjustment on the initial parameters, the trained weights can focus more on the feature information of the target. Even only via a small number of sample categories, it can also maintain the sensitivity to the target in the new task. It is shown via the experiment results that the mean average precision(mAP) value of YOLOv3 model attains 74.81%, and the mAP value of YOLOv3 model based on MAML algorithm can reach 80.05%, which improves the accuracy by 5.24%. The network structure and training mechanism of the modified YOLOv3 via MAML can improve the accuracy of detection in training and prediction, and the trained weights can make the model have high detection accuracy and high generalization.

HTML全文

参考文献(18)

施引文献

资源附件(0)