CHEN Yiqi, WU Xianghua, LIU Peng, et al. 2022. Analysis of the Simulation Performances of Precipitation Statistical Forecasting Models [J]. Climatic and Environmental Research (in Chinese), 27 (5): 578−590. doi: 10.3878/j.issn.1006-9585.2022.21058

Analysis of the Simulation Performances of Precipitation Statistical Forecasting Models

  • Abstract: The performance of a precipitation forecast model depends on many factors. Besides the characteristics of the study area and the data, it is affected by the model's own algorithm, the statistical simulation method, and the performance metrics used. Based on routine surface meteorological data (daily precipitation, mean temperature, mean relative humidity, and other variables) from 28 stations in Heilongjiang Province, China, from 2015 to 2019, this study applies Monte Carlo statistical simulation methods, such as the hold-out and bootstrap methods, together with machine learning methods to examine systematically, for the first time, the performance of summer daily precipitation forecast models in Heilongjiang Province and the spatial distribution of that performance. The results show that, for the study area as a whole, the overall forecast performance of the BP (Back Propagation) neural network and the support vector machine (SVM) does not differ significantly; the area under the ROC (Receiver Operating Characteristic) curve exceeds 76% for both, which is significantly better than that of the decision tree. Model performance estimated by the bootstrap method is consistently better than that estimated by the hold-out method, and the bootstrap method helps improve the fidelity of the evaluation results. At individual stations, with a few exceptions, both the accuracy and the area under the ROC curve of the SVM exceed 80%, with larger values in the southeast and smaller values in the northwest, a pattern broadly consistent with the distribution of precipitation frequency. The overall forecast performance of the SVM is relatively good in the Xiao Hinggan (Lesser Khingan) and Zhangguangcai Mountains, intermediate in the Sanjiang Plain, and relatively poor in the Songnen Plain. Sensitivity is higher in mountainous areas than on the plains; it is high in the central and southern regions, intermediate in the east, and low in the west and north. The spatial distribution of specificity is the opposite of that of sensitivity.
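The evaluation quantities named in the abstract (accuracy, sensitivity, specificity, and the area under the ROC curve) and the two resampling schemes (hold-out and bootstrap) can be illustrated with a minimal Python sketch. It is not the authors' code. The function names, the 30% test fraction, and the use of out-of-bag samples as the bootstrap test set are all assumptions made for illustration; the abstract does not specify these details. Labels are assumed to be binary, with 1 marking a day with precipitation.

```python
import random


def confusion_counts(y_true, y_pred):
    """Return (tp, fn, fp, tn) for binary labels, where 1 = precipitation day."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fn, fp, tn


def accuracy(y_true, y_pred):
    """Fraction of all days classified correctly."""
    tp, fn, fp, tn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + fn + fp + tn)


def sensitivity(y_true, y_pred):
    """Fraction of observed precipitation days that were forecast as such."""
    tp, fn, _, _ = confusion_counts(y_true, y_pred)
    return tp / (tp + fn)


def specificity(y_true, y_pred):
    """Fraction of observed dry days that were forecast as dry."""
    _, _, fp, tn = confusion_counts(y_true, y_pred)
    return tn / (tn + fp)


def roc_auc(y_true, scores):
    """Area under the ROC curve, computed as the probability that a random
    positive case scores higher than a random negative case (ties count 1/2)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def holdout_split(n, test_fraction=0.3, rng=None):
    """Hold-out: shuffle the indices once and reserve a fixed share for testing.
    The 0.3 default is an illustrative assumption."""
    rng = rng or random.Random()
    idx = list(range(n))
    rng.shuffle(idx)
    n_test = int(round(n * test_fraction))
    return idx[n_test:], idx[:n_test]


def bootstrap_split(n, rng=None):
    """Bootstrap: draw n training indices with replacement; the samples that
    were never drawn (out-of-bag) serve as the test set (an assumption here)."""
    rng = rng or random.Random()
    train = [rng.randrange(n) for _ in range(n)]
    drawn = set(train)
    test = [i for i in range(n) if i not in drawn]
    return train, test
```

In a Monte Carlo evaluation, either split function would be called many times with different random states, a classifier would be trained on each training set, and the metrics above would be averaged over the resulting test sets.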

     
