基于改进 ＤＰＧＮ的少样本图像分类算法研究

The distribution propagation graph network（DPGN）is a few-shot image classification algorithm based on deep learning.Unfortunately，the DPGN algorithm completely ignores semantic information，which is important for fine-grained classification.Therefore，it delivers poor classification performances.This paper proposes a new Few-shot learning algorithm based on the DPGN algorithm，SinAM-FRN＿layer-ODConv-DM＆EMD＿Distribution Propagation Graph Network（SFOD＿DPGN）.

First，to address the inability to extract image features by the feature extraction module of the DPGN algorithm，the SimAM attention mechanism is integrated into four residual blocks of the feature extraction network ResNet12.The SimAM attention mechanism can generate three-dimensional weights for feature maps from both spatial and channel dimensions，and then aggregates the generated weights with the feature maps to enable the improved ResNet12 to learn more and richer image features；Second，in view that the normalization method of the ResNet12 is affected by the number of images selected in training，the combination of batch normalization and the ReLu activation function in the main path of each residual block of the ResNet12 is changed to the combination of the filter response normalization（FRN）and the threshold linear unit activation function（TLU）.Because of the FRN without mean operation，it easily leads to activation with arbitrary bias far from zero.If the FRN combines with the ReLu activation function，this bias has adverse effects on training.This paper employs the TLU after the FRN to address the problem.The SFOD＿DPGN algorithm improves the classification accuracy and ensures its inference speed.Then，it optimizes the classifier module of the DPGN algorithm.To solve poor classification performance of the classifier module，the full dimensional dynamic convolution（ODConv）is selected to replace the common convolution in the classifier module.The ODconv employs a linear combination of n convolutional kernels and parallel strategies to introduce multidimensional attention mechanisms for dynamic weighting，making the convolution operation dependent on the input.The ODconv improves the robustness of the SFOD＿DPGN algorithm.Finally，the DPGN algorithm uses the L2 distance measurement method in the classifier module，easily causing errors in calculating the distance between samples.Based on the characteristics of distance measurement methods，the Mahalanobis Distance（MD）is suitable for calculating the distance between samples（point graphs）.The Earth Moves’s Distance（EMD）distance ismore suitable for calculating the distance between distribution graphs.This paper uses the MD and EMD to replace the L2 in order to improve the ability of the classifier to measure the distance between samples.It improves the classification accuracy of the SFOD＿DPGN algorithm.

Experiments on the CUB-200-2011 dataset shows the SFOD＿DPGN algorithm is superior to the DPGN algorithm over 5way-1shot and 5way-5shot classification tasks.The accuracy improves by 7.97% and 2.66% respectively.Meanwhile，ablation experiments are performed for each part to verify the effect of the improved ResNet12 and the classifier module.Compared to the DPGN algorithm，after the SimAM attention mechanism is integrated into the ResNet12，the accuracy improves by 2.77% and 1.16% over 5way-1shot and 5way-5shot classification tasks respectively.Furthermore，after the improving the normalization method and activation function of the ResNet12，the accuracy is 5.00% and 2.04% higher respectively over 5way-1shot and 5way-5shot classification tasks.After the further replacement of the common convolution with the ODconv，the accuracy is up by 7.25% and 2.42% respectively over 5way-1shot and 5way-5shot classification tasks.Our experimental results demonstrate all improvements are effective to improve classification accuracy of the SFOD＿DPGN algorithm.

[1]	钱枫, 胡桂铭, 祝能, 邓明星, 王洁, 许小伟. 基于改进扩散模型的图像去雨方法[J]. 重庆理工大学学报（自然科学）, 2024, 38(1): 59-66.
[2]	周桐, 李冬春, 田雨聃. 隧道场景下行人检测ＤＡ-Ｚｅｒｏ-ＤＣＥ图像增强算法[J]. 重庆理工大学学报（自然科学）, 2024, 38(1): 122-130.
[3]	王建荣, 尉向前, 辛彬彬, 高睿丰, 李国. 一种改进Ｕ-Ｎｅｔ网络的心电图分类算法研究[J]. 重庆理工大学学报（自然科学）, 2024, 38(1): 142-149.
[4]	汤文亮, 曾建杨, 何文晶. 一种基于知识蒸馏的轨道检测轻量化模型[J]. 重庆理工大学学报（自然科学）, 2023, 37(9): 173-179.
[5]	张本文, 高瑞玮, 乔少杰. 新型融合注意力机制的遮挡面部表情识别框架[J]. 重庆理工大学学报（自然科学）, 2023, 37(9): 217-226.
[6]	李舜酩, 陆建涛, 沈涛. 不平衡转子系统弯扭耦合复杂故障智能诊断[J]. 重庆理工大学学报（自然科学）, 2023, 37(7): 101-109.
[7]	林慧斌, 习慈羊, 丁康. 用于滚动轴承局部故障诊断的深度降采样方法[J]. 重庆理工大学学报（自然科学）, 2023, 37(7): 110-119.
[8]	贾远鹏, 陈学文, 哈瑞峰. 双注意力机制下自动驾驶汽车车道线深度感知研究[J]. 重庆理工大学学报（自然科学）, 2023, 37(7): 44-50.
[9]	张皓帝, 张瑞乾, 童亮. 基于改进ＹＯＬＯｖ５ｓ的车辆目标检测方法[J]. 重庆理工大学学报（自然科学）, 2023, 37(7): 80-89.
[10]	谢炅宏, 陈永鹏, 李嘉琳. 基于多传感器信号融合和残差神经网络的齿轮箱故障诊断[J]. 重庆理工大学学报（自然科学）, 2023, 37(7): 144-152.
[11]	邓天民, 王丽, 刘旭慧. 基于注意力及特征融合的红外行人检测算法[J]. 重庆理工大学学报（自然科学）, 2023, 37(6): 196-203.
[12]	闫路, 来佳丽, 王明辉. 多信息融合和自注意力识别新冠磷酸化位点[J]. 重庆理工大学学报（自然科学）, 2023, 37(6): 242-248.
[13]	邴其春, 张伟健, 沈富鑫, 胡嫣然, 高鹏, 刘东杰. 基于变分模态分解和ＬＳＴＭ的短时交通流预测[J]. 重庆理工大学学报（自然科学）, 2023, 37(5): 169-177.
[14]	王东, 李佩声. 融合胶囊网络的中文短文本情感分析[J]. 重庆理工大学学报（自然科学）, 2023, 37(5): 178-184.
[15]	刘政, 刘鑫, 刘伟. 面向家庭用电负荷分解的时间卷积注意力网络[J]. 重庆理工大学学报（自然科学）, 2023, 37(4): 209-216.

基于改进ＤＰＧＮ的少样本图像分类算法研究

Research on image classification algorithm w ith few-shot based on im proved DPGN

PDF (PC)

赞

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

Metrics

本文评价

推荐阅读 10