基于数据聚类与敏感度分析的TBM施工数据筛选方法

王华伟; 史林肯; 刘忠磊; 李庆民; 刘四进; 于世孟; 费延达

doi:10.13807/j.cnki.mtt.2026.01.002

基于数据聚类与敏感度分析的TBM施工数据筛选方法

In-situ Data Screening Method for Tunnel Boring Machine based on Data Clustering and Global Sensitivity Analysis

摘要

摘要: 隧道掘进机（TBM）运行数据的监测日益完善，但高昂的计算成本限制了数据驱动技术在隧道施工现场的部署与应用。为此，提出一种基于数据聚类与敏感度分析的施工数据筛选方法，旨在通过减少样本容量和参数数量，以降低TBM施工数据挖掘和分析的计算成本。该方法设计了基于欧氏距离和输出变量方差的样本差异度量，利用拉格朗日乘子法确定两者方差的权重，推导了聚类目标函数的求解策略，实现施工数据从样本容量维度的筛选；同时，采用全局敏感度分析参数贡献度，实现施工数据从参数维度的筛选，最终形成TBM施工数据核心数据集。通过某TBM的掘进速度和地质类型预测进行验证，结果表明，所提数据筛选方法可以显著降低计算成本，同时保持掘进速度和地质类型的预测精度。掘进速度预测模型和地质类型预测模型计算成本降低50.35%以上，预测精度与总样本的预测精度接近（降幅小于5%）。

Abstract: Tunnel boring machine (TBM) operation monitoring is becoming increasingly sophisticated. However, the high computational cost restricts the deployment and application of data-driven technologies in tunnel projects. To solve this issue, an in-situ data screening method is proposed for the tunnel boring machine based on data clustering and global sensitivity analysis, aiming to reduce the computational cost of data mining and analysis tasks by sample size reduction and parameter selection, respectively. This method designs a new sample difference metric based on Euclidean distance and output variable variance. The Lagrange multiplier method is used to determine the respective weights of Euclidean distance and output variable variance. A solution strategy for the clustering objective function is derived, enabling the in-situ data screening in terms of sample size. Global sensitivity analysis is used to locate and select key parameters, enabling the in-situ data screening in terms of parameter number, ultimately forming a core dataset of TBM in-situ data. The proposed method is validated through the advance rate and geology prediction tasks of a tunnel boring machine. The experimental results indicate that the proposed sample size reduction method can greatly reduce the computational cost and maintain the prediction accuracy at the same time for both the advance rate and geology prediction tasks. The computational cost of the advance rate prediction model and the geological type prediction model is reduced by over 50.35%, and the prediction accuracy approaches that of the total samples (with a reduction of less than 5%).

HTML全文

参考文献(25)

施引文献

资源附件(0)