基于随机森林算法的管道缺陷预测方法
Pipeline defect prediction method based on Random Forests algorithm
-
摘要: 根据国内外管道缺陷预测模型与随机森林(Random Forests)模型的研究现状, 分析其优势与存在的问题, 并基于陕京管道GIS系统对某段管道的监控和记录数据, 建立并优化随机森林模型, 对该管段进行缺陷等级预测。随机森林模型可用于分析各指标对管道缺陷的影响程度, 具有指标重要度评估功能, 模型的评判精度、分级结果准确, 数据挖掘能力很强。将随机森林模型与GIS技术结合, 能更好地预测管道缺陷, 从而采取相应的控制措施。Abstract: In this paper, the research status quo of pipeline defect prediction model and Random Forests model at home and abroad was investigated, and their advantages and problems were analyzed. Then, based on the data of some pipeline section monitored and recorded by GIS system of Shaanxi-Beijing Gas Pipeline, the Random Forests model was established and optimized, and the defect level of the pipeline section was predicted. It is shown that the Random Forests model has the function of index importance evaluation and it can be used to analyze the contribution degree of each index to the pipeline defect. This model is characterized by high evaluation accuracy, accurate ranking result and strong data mining capacity. By combining the Random Forests model with the GIS technology, pipeline defects can be predicted better and the control measures can be taken correspondingly.