融合多尺度特征的改进YOLOv11隧道裂缝识别

Improved YOLOv11 with Multi-scale Feature Fusion for Tunnel Crack Detection

  • 摘要: 隧道裂缝的快速准确识别对保障隧道运营安全至关重要,但现有检测方法在面对裂缝尺度差异大、形态复杂及纹理模糊等实际工况时仍存在精度不足与漏检率高的问题。为此,提出一种融合多尺度视觉编码与尺度动态损失的改进YOLOv11裂缝检测模型。该模型通过引入显式视觉中心模块(Explicit Visual Center Block, EVCBlock)增强多尺度裂缝特征的表达能力,同时设计基于尺度动态加权的损失函数(Scale-based Dynamic Loss, SDLoss)动态调整各尺度裂缝目标的检测权重,从而增强模型对微小裂缝与复杂纹理背景的鲁棒性。试验结果表明,该方法在精确率、召回率、mAP0.5mAP0.5:0.95上均优于基准模型YOLOv11n,分别达到84.5%、78.6%、88.1%和62.7%,较基准模型分别提升3.8%、3.7%、4.0%和4.0%。该方法在隧道复杂场景下裂缝检测更可靠,整体性能优于YOLOv5n、YOLOv6n、YOLOv8n等主流轻量级目标检测模型。

     

    Abstract: Rapid and accurate identification of tunnel cracks is crucial for ensuring the operational safety. However, existing methods struggle with low accuracy and high missed detection rates under challenging conditions involving significant scale variations, complex morphologies, and blurred textures. To address these issues, an improved YOLOv11 model integrating multi-scale visual encoding and scale-dynamic loss is proposed. Specifically, an Explicit Visual Center Block (EVCBlock) is introduced to enhance feature representation of multi-scale crack features. Simultaneously, a Scale-based Dynamic Loss (SDLoss) is designed to adaptively adjust detection weights across different scales, thereby improving the model’s robustness against tiny cracks and complex backgrounds. Experimental results demonstrated that the proposed method outperforms the baseline YOLOv11n, achieving a Precision of 84.5%, Recall of 78.6%, mAP@0.5 of 88.1%, and mAP@0.5:0.95 of 62.7%, representing improvements of 3.8% ,3.7% , 4.0% and 4.0%, respectively., The proposed model provides more reliable crack detection in complex tunnel scenarios, demonstrating superior performance over mainstream lightweight models such as YOLOv5n, YOLOv6n, and YOLOv8n.

     

/

返回文章
返回