Machine Learning Analysis of Impact of Western US Fires on Central US Hailstorms

Xinming LIN; Jiwen FAN; Yuwei ZHANG; Z. Jason HOU

doi:10.1007/s00376-024-3198-7

Article Contents

Article > ADVANCES IN ATMOSPHERIC SCIENCES > 2024 > In press

Machine Learning Analysis of Impact of Western US Fires on Central US Hailstorms

1.
Pacific Northwest National Laboratory, Richland, WA 99354, USA
2.
Argonne National Laboratory, Lemont, IL 60439, USA

doi: 10.1007/s00376-024-3198-7

Abstract
Full Text(HTML)
Figures(10) / Table(1)
References(36)
Related papers
PDF

Abstract:
Fires, including wildfires, harm air quality and essential public services like transportation, communication, and utilities. These fires can also influence atmospheric conditions, including temperature and aerosols, potentially affecting severe convective storms. Here, we investigate the remote impacts of fires in the western United States (WUS) on the occurrence of large hail (size: ≥ 2.54 cm) in the central US (CUS) over the 20-year period of 2001–20 using the machine learning (ML), Random Forest (RF), and Extreme Gradient Boosting (XGB) methods. The developed RF and XGB models demonstrate high accuracy (> 90%) and F1 scores of up to 0.78 in predicting large hail occurrences when WUS fires and CUS hailstorms coincide, particularly in four states (Wyoming, South Dakota, Nebraska, and Kansas). The key contributing variables identified from both ML models include the meteorological variables in the fire region (temperature and moisture), the westerly wind over the plume transport path, and the fire features (i.e., the maximum fire power and burned area). The results confirm a linkage between WUS fires and severe weather in the CUS, corroborating the findings of our previous modeling study conducted on case simulations with a detailed physics model.
- wildfire,
- severe convective storm,
- hailstorm,
- machine learning
摘要: 火灾（包括野火）会危害空气质量，以及交通、通信和公用事业等基本公共服务。这些火灾还可能影响大气条件，包括温度和气溶胶，从而可能影响到强对流风暴。在此，我们使用机器学习（ML）方法，随机森林（RF）和极端梯度提升（XGB）模型，研究了在过去20年（2001年至2020年）美国西部（WUS）火灾对美国中部（CUS）大冰雹（尺寸：≥ 2.54厘米）发生的远程影响。所开发的RF和XGB模型在预测WUS火灾和CUS冰雹风暴同时发生的准确率很高（90%），F1-分数高达0.78，尤其是在四个州（即怀俄明州WY，南达科他州SD，内布拉斯加州NE和堪萨斯州KS）。从这两个ML模型中确定的关键变量包括火灾地区的气象变量（温度和湿度）、传输路径上的西风以及火灾的特征（即最大火力和燃烧面积）。这些研究结果证实了WUS火灾与CUS的恶劣天气之间的联系，印证了我们之前用详细物理模型对案例进行模拟的研究结果。
- 野火,
- 对流风暴,
- 冰雹风暴,
- 机器学习

Figure 1. Map of fire states in the WUS and hail states in the CUS. The three fire states in the WUS (highlighted by red points) are WA, OR and CA. The CUS states are divided into two columns: the original CS1 (i.e., MT, WY, CO, and NM) and original CS2 (i.e., NE, SD, NE, KS, OK and TX). States scattered with blue points are those more likely to be affected by fires in the WUS. The green dashed rectangle denotes the region with westerly winds in general, which is considered as the plume transport path from the WUS to the CUS.

DownLoad: Full-Size Img PowerPoint

Figure 2. Time series of co-occurring events of WUS fires and CUS large hail identified with daily hail counts ≥ 20 and fire size ≥ 20 km².

DownLoad: Full-Size Img PowerPoint

Figure 3. (a) Large hail count (size ≥ 2.54 cm) for each CUS state without considering fire. (b) Large hail count for each CUS state with WUS fires of which the burned area is no less than 20 km² and occurred within 2–4 days before the occurrence of large hail. (c) Ratio of co-occurring hail counts to total hail counts for each state.

DownLoad: Full-Size Img PowerPoint

Figure 4. Precision (blue), recall (red), and F1 score (green) curves of the (a, b) RF and (c, d) XGB models for CS1 and CS2 with the classification threshold ranging from 0 to 1. The red solid line shows the optimal classification threshold.

DownLoad: Full-Size Img PowerPoint

Figure 5. Average precision, recall, F1, and accuracy scores from five-fold cross-validation of the RF (blue) and XGB (orange) models for predicting large hail occurrence in (a) CS1 and (b) CS2 states.

DownLoad: Full-Size Img PowerPoint

Figure 6. Average precision, recall, F1, and accuracy scores from five-fold cross-validation of the (a) RF and (b) XGB models for predicting large hail occurrence in each state in CS1 and CS2.

DownLoad: Full-Size Img PowerPoint

Figure 7. Top 10 most important variables for (a) WY, (b) SD, (c) NE, and (d) KS.

DownLoad: Full-Size Img PowerPoint

Figure 8. The SHAP values for selected variables (e.g., U250, T_max at 850 hPa, maxFRP, etc.) in (a, b) WY and (c, d) NE. The SHAP values for the selected variables in SD and KS show similar patterns as those in NE.

DownLoad: Full-Size Img PowerPoint

Figure 9. SHAP values for the most important variables from both the RF and XGB models of (a–d) WY and (e–h) NE when assuming independence (x-axis) versus dependence (y-axis). Similar patterns for the SHAP values of these variables are found in SD and KS. The color scheme represents the values of variables.

DownLoad: Full-Size Img PowerPoint

Figure 10. Correlation of smoke aerosols with burned area in (a) WA two days before and (b) OR three days before, and with maximum fire power in OR (c) four days and (d) three days before the large hail event.

DownLoad: Full-Size Img PowerPoint

Table 1. Target and predictor variables used in the ML models.

Target variables	Abbreviation	Temporal resolution	Data Source
Daily occurrence of hail with size ≥ 2.54 cm (0/1) in a state	Hail occurrence	daily	SPC
Daily hail count for hail with size ≥ 2.54 cm in a state	Hail count	daily	SPC
Predictor variables	Abbreviation	Temporal resolution	Data Source
Mean maxFRP for fire grids in three WUS states within t days before hail	maxFRP _m_COW_dt	daily	MODIS
Maximum maxFRP for fire grids in three WUS states within t days before hail	maxFRP_max_COW_dt	daily	MODIS
Mean maxFRP for fire grids in states within t days before hail	maxFRP_m_s_dt	daily	MODIS
Maximum maxFRP for fire grids in states within t days before hail	maxFRP_max_s_dt	daily	MODIS
Total number of fire grids in three WUS states within t days before hail	ngrids_COW_dt	daily	MODIS
Total number of fire grids in states within t days before hail	ngrids_s_dt	daily	MODIS
Temporal change of fire grids in three WUS states within t days before hail	gdiff_COW_dt	daily	MODIS
Temporal change of fire grids in states within t days before hail	gdiff_s_dt	daily	MODIS
Mean BC+OC over fire grids in three WUS states within t days before hail	BCOC_m_COW_dt	daily	MERRA-2
Maximum BC+OC for all grids in three WUS states within t days before hail	BCOC_max_COW_dt	daily	MERRA-2
Mean BC+OC over fire grids in states within t days before hail	BCOC_m_s_dt	daily	MERRA-2
Maximum BC+OC for all grids in states within t days before hail	BCOC_max_s_dt	daily	MERRA-2
Mean RH at 850 hPa over three WUS states within t days before hail	RH850_m _dt	daily	MERRA-2
Maximum RH at 850 hPa over three WUS states within t days before hail	RH850_max _dt	daily	MERRA-2
Mean air temperature at 850 hPa over three WUS states within t days before hail	T_m _dt	daily	MERRA-2
Maximum air temperature at 850 hPa over three WUS states within t days before hail	T_max _dt	daily	MERRA-2
Mean U-wind at 850 hPa for grids along fire path within t days before hail	U850_m_dt	daily	MERRA-2
Maximum U-wind at 850 hPa for grids along fire path within t days before hail	U850_max_dt	daily	MERRA-2
Mean U-wind at 250 hPa for grids along fire path within t days before hail	U250_m_dt	daily	MERRA-2
Maximum U-wind at 250 hPa for grids along fire path within t days before hail	U250_max_dt	daily	MERRA-2
Notes: t ∈[1,2] for U-wind; t ∈[2,4] for other variables; s∈[CA, OR, WA]; the fire transport region (38°–44°N, 125°–112°W)

DownLoad: CSV

References

Abatzoglou, J. T., and C. A. Kolden, 2013: Relationships between climate and macroscale area burned in the western United States. International Journal of Wildland Fire, 22 (7), 1003−1020, https://doi.org/10.1071/WF13019.

Blair, S. F., D. R. Deroche, J. M. Boustead, J. W. Leighton, B. L. Barjenbruch, and W. P. Gargan, 2011: A radar-based assessment of the detectability of giant hail. E-Journal of Severe Storms Meteorology, 6 (7), https://doi.org/10.55599/ejssm.v6i7.34.

Blair, S. F., and Coauthors, 2017: High-resolution hail observations: Implications for NWS warning operations. Weather and Forecasting, 32 (3), 1101−1119, https://doi.org/10.1175/WAF-D-16-0203.1.

Boulesteix, A.-L., S. Janitza, J. Kruppa, and I. R. König, 2012: Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics. WIREs Data Mining and Knowledge Discovery, 2 (6), 493−507, https://doi.org/10.1002/widm.1072.

Breiman, L., 2001: Random forests. Machine Learning, 45 (1), 5−32, https://doi.org/10.1023/A:1010933404324.

Chen, T. Q., and C. Guestrin, 2016: XGBoost: A scalable tree boosting system. Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, California, USA, ACM, https://doi.org/10.1145/2939672.2939785.

Cunningham, P., and M. J. Reeder, 2009: Severe convective storms initiated by intense wildfires: Numerical simulations of pyro-convection and pyro-tornadogenesis. Geophys. Res. Lett., 36 (12), L12812, https://doi.org/10.1029/2009GL039262.

Dennis, E. J., and M. R. Kumjian, 2017: The impact of vertical wind shear on hail growth in simulated supercells. J. Atmos. Sci., 74 (3), 641−663, https://doi.org/10.1175/JAS-D-16-0066.1.

Dennison, P. E., S. C. Brewer, J. D. Arnold, and M. A. Moritz, 2014. Large wildfire trends in the western United States, 1984–2011. Geophys. Res. Lett., 41 (8), 2928−2933, https://doi.org/10.1002/2014GL059576.

Fromm, M., A. Tupper, D. Rosenfeld, R. Servranckx, and R. McRae, 2006: Violent pyro-convective storm devastates Australia’s capital and pollutes the stratosphere. Geophys. Res. Lett., 33 (5), L05815, https://doi.org/10.1029/2005GL025161.

Gelaro, R., and Coauthors, 2017: The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2). J. Climate, 30, 5419−5454, https://doi.org/10.1175/JCLI-D-16-0758.1.

Grell, G., S. R. Freitas, M. Stuefer, and J. Fast, 2011: Inclusion of biomass burning in WRF-Chem: Impact of wildfires on weather forecasts. Atmospheric Chemistry and Physics, 11 (11), 5289−5303, https://doi.org/10.5194/acp-11-5289-2011.

Huang, X., M. Li, J. Li, and Y. Song, 2012: A high-resolution emission inventory of crop burning in fields in China based on MODIS thermal anomalies/fire products. Atmospheric environment, 50, 9−15, https://doi.org/10.1016/j.atmosenv.2012.01.017.

Jacobo, J., and G. Zee, 2021: Climate change may be causing an early start to fire season in the West. Retrieved from https://abcnews.go.com/US/climate-change-causing-early-start-fire-season-west/story?id=77737065.

Jain, P., X. Wang, and M. D. Flannigan, 2017: Trend analysis of fire season length and extreme fire weather in North America between 1979 and 2015. International Journal of Wildland Fire, 26 (12), 1009—1020, https://doi.org/10.1071/WF17008.

Janzing, D., L. Minorics, and P. Blöbaum, 2019: Feature relevance quantification in explainable AI: A causal problem. arXiv preprint arXiv: 1910.13413, https://doi.org/10.48550/arXiv.1910.13413.

Jeong, J.-H., J. W. Fan, C. R. Homeyer, and Z. S. Hou, 2020: Understanding hailstone temporal variability and contributing factors over the U.S. southern great plains. J. Climate, 33 (10), 3947−3966, https://doi.org/10.1175/Jcli-D-19-0606.1.

Jeong, J.-H., J. W. Fan, and C. R. Homeyer, 2021: Spatial and temporal trends and variabilities of hailstones in the United States Northern Great Plains and their possible attributions. J. Climate, 34 (16), 6819−6840, https://doi.org/10.1175/Jcli-D-20-0245.1.

Jolly, W. M., M. A. Cochrane, P. H. Freeborn, Z. A. Holden, T. J. Brown, G. J. Williamson, and D. M. Bowman, 2015: Climate-induced variations in global wildfire danger from 1979 to 2013. Nature Communications, 6 (1), 7537, https://doi.org/10.1038/ncomms8537.

Kablick III, G., and Coauthors, 2018: The great slave lake PyroCb of 5 August 2014: Observations, simulations, comparisons with regular convection, and impact on UTLS water vapor. J. Geophys. Res., 123 (21), 12 332−12 352, https://doi.org/10.1029/2018JD028965.

Lee, H., S.-J. Jeong, O. Kalashnikova, M. Tosca, S.-W. Kim, and J.-S. Kug, 2018: Characterization of wildfire‐induced aerosol emissions from the Maritime Continent peatland and Central African dry savannah with MISR and CALIPSO aerosol products. J. Geophys. Res., 123 (6), 3116−3125, https://doi.org/10.1002/2017JD027415.

Lee, H.-H., and C. Wang, 2020: The impacts of biomass burning activities on convective systems over the Maritime Continent. Atmospheric Chemistry and Physics, 20 (4), 2533−2548, https://doi.org/10.5194/acp-20-2533-2020.

Lindsey, D. T., and Fromm, M., 2008: Evidence of the cloud lifetime effect from wildfire‐induced thunderstorms. Geophys. Res. Lett., 35 (22), L22809, https://doi.org/10.1029/2008GL035680.

Liu, X. X., and Coauthors, 2017: Airborne measurements of western U.S. wildfire emissions: Comparison with prescribed burning and air quality implications. J. Geophys. Res., 122 (11), 6108−6129, https://doi.org/10.1002/2016JD026315.

Liu, Y. Q., S. L. Goodrick, and J. A. Stanturf, 2013: Future U.S. wildfire potential trends projected using a dynamically downscaled climate change scenario. Forest Ecology and Management, 294, 120−135, https://doi.org/10.1016/j.foreco.2012.06.049.

Logan, T., X. Q. Dong, and B. K. Xi, 2018: Aerosol properties and their impacts on surface CCN at the ARM Southern Great Plains site during the 2011 Midlatitude Continental Convective Clouds Experiment. Adv. Atmos. Sci., 35 (2), 224−233, https://doi.org/10.1007/s00376-017-7033-2.

Lu, Z., and I. N. Sokolik, 2013: The effect of smoke emission amount on changes in cloud properties and precipitation: A case study of Canadian boreal wildfires of 2007. J. Geophys. Res., 118 (20), 11 777−11 793, https://doi.org/10.1002/2013JD019860.

Lundberg, S. M., and S.-I. Lee, 2017: A unified approach to interpreting model predictions. Proc. 31st International Conference on Neural Information Processing Systems, Long Beach, California, USA, Curran Associates Inc., 4768−4777.

Lundberg, S. M., and Coauthors, 2020: From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2 (1), 56−67, https://doi.org/10.1038/s42256-019-0138-9.

Mueller, S. E., A. E. Thode, E. Q. Margolis, L. L. Yocom, J. D. Young, and J. M. Iniguez, 2020: Climate relationships with increasing wildfire in the southwestern US from 1984 to 2015. Forest Ecology and Management, 460, 117861, https://doi.org/10.1016/j.foreco.2019.117861.

Nohara, Y., K. Matsumoto, H. Soejima, and N. Nakashima, 2019: Explanation of machine learning models using improved Shapley Additive Explanation. Proc. 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, Niagara Falls, NY, USA, ACM, https://doi.org/10.1145/3307339.3343255.

Trentmann, J., and Coauthors, 2006: Modeling of biomass smoke injection into the lower stratosphere by a large forest fire (Part I): Reference simulation. Atmospheric Chemistry and Physics, 6 (12), 5247−5260, https://doi.org/10.5194/acp-6-5247-2006.

Zhang, Y. W., J. W. Fan, T. Logan, Z. Q. Li, and C. R. Homeyer, 2019: Wildfire impact on environmental thermodynamics and severe convective storms. Geophys. Res. Lett., 46 (16), 10 082−10 093, https://doi.org/10.1029/2019GL084534.

Zhang, Y. W., J. W. Fan, M. Shrivastava, C. R. Homeyer, Y. Wang, and J. H. Seinfeld, 2022: Notable impact of wildfires in the western United States on weather hazards in the central United States. Proceedings of the National Academy of Sciences of the United States of America, 119 (44), e2207329119, https://doi.org/10.1073/pnas.2207329119.

Wang, S. S.-C., and Y. Wang, 2020: Quantifying the effects of environmental factors on wildfire burned area in the south central US using integrated machine learning techniques. Atmospheric Chemistry and Physics, 20 (18), 11065—11087, https://doi.org/10.5194/acp-20-11065-2020.

Westerling, A. L., H. G. Hidalgo, D. R. Cayan, and T. W. Swetnam, 2006: Warming and earlier spring increase western US forest wildfire activity. Science, 313 (5789), 940−943, https://doi.org/10.1126/science.112883.

[1]	Haochen LI, Chen YU, Jiangjiang XIA, Yingchun WANG, Jiang ZHU, Pingwen ZHANG, 2019: A Model Output Machine Learning Method for Grid Temperature Forecasts in the Beijing Area, ADVANCES IN ATMOSPHERIC SCIENCES, 36, 1156-1170. doi: 10.1007/s00376-019-9023-z
[2]	Huiling YANG, Hui XIAO, Chunwei GUO, Guang WEN, Qi TANG, Yue SUN, 2017: Comparison of Aerosol Effects on Simulated Spring and Summer Hailstorm Clouds, ADVANCES IN ATMOSPHERIC SCIENCES, 34, 877-893. doi: 10.1007/s00376-017-6138-y
[3]	Nian LIU, Zhongwei YAN, Xuan TONG, Jiang JIANG, Haochen LI, Jiangjiang XIA, Xiao LOU, Rui REN, Yi FANG, 2022: Meshless Surface Wind Speed Field Reconstruction Based on Machine Learning, ADVANCES IN ATMOSPHERIC SCIENCES, 39, 1721-1733. doi: 10.1007/s00376-022-1343-8
[4]	Honghua Dai, 1996: Machine Learning of Weather Forecasting Rules from Large Meteorological Data Bases, ADVANCES IN ATMOSPHERIC SCIENCES, 13, 471-488. doi: 10.1007/BF03342038
[5]	Chao LIU, Shu YANG, Di DI, Yuanjian YANG, Chen ZHOU, Xiuqing HU, Byung-Ju SOHN, 2022: A Machine Learning-based Cloud Detection Algorithm for the Himawari-8 Spectral Image, ADVANCES IN ATMOSPHERIC SCIENCES, 39, 1994-2007. doi: 10.1007/s00376-021-0366-x
[6]	Michael B. RICHMAN, Lance M. LESLIE, Theodore B. TRAFALIS, Hicham MANSOURI, 2015: Data Selection Using Support Vector Regression, ADVANCES IN ATMOSPHERIC SCIENCES, 32, 277-286. doi: 10.1007/s00376-014-4072-9
[7]	Mingyue SU, Chao LIU, Di DI, Tianhao LE, Yujia SUN, Jun LI, Feng LU, Peng ZHANG, Byung-Ju SOHN, 2023: A Multi-Domain Compression Radiative Transfer Model for the Fengyun-4 Geosynchronous Interferometric Infrared Sounder (GIIRS), ADVANCES IN ATMOSPHERIC SCIENCES, 40, 1844-1858. doi: 10.1007/s00376-023-2293-5
[8]	Zhenchen LIU, Wen Zhou, Xin Wang, 2024: Extreme Meteorological Drought Events over China (1951—2022): migration pattern, diversity of temperature extremes, and decadal variations, ADVANCES IN ATMOSPHERIC SCIENCES. doi: 10.1007/s00376-024-4004-2
[9]	Jiangjiang XIA, Haochen LI, Yanyan KANG, Chen YU, Lei JI, Lve WU, Xiao LOU, Guangxiang ZHU, Zaiwen Wang, Zhongwei YAN, Lizhi WANG, Jiang ZHU, Pingwen ZHANG, Min CHEN, Yingxin ZHANG, Lihao GAO, Jiarui HAN, 2020: Machine Learning−based Weather Support for the 2022 Winter Olympics, ADVANCES IN ATMOSPHERIC SCIENCES, 37, 927-932. doi: 10.1007/s00376-020-0043-5
[10]	Yang LI, Yubao LIU, Rongfu SUN, Fengxia GUO, Xiaofeng XU, Haixiang XU, 2023: Convective Storm VIL and Lightning Nowcasting Using Satellite and Weather Radar Measurements Based on Multi-Task Learning Models, ADVANCES IN ATMOSPHERIC SCIENCES, 40, 887-899. doi: 10.1007/s00376-022-2082-6
[11]	WANG Donghai, Xiaofan LI, Wei-Kuo TAO, 2010: Responses of Vertical Structures in Convective and Stratiform Regions to Large-Scale Forcing during the Landfall of Severe Tropical Storm Bilis (2006), ADVANCES IN ATMOSPHERIC SCIENCES, 27, 33-46. doi: 10.1007/s00376-009-8139-y
[12]	Fei Shiqiang, Tan Zhemin, 2001: On the Helicity Dynamics of Severe Convective Storms, ADVANCES IN ATMOSPHERIC SCIENCES, 18, 67-86. doi: 10.1007/s00376-001-0005-5
[13]	Zhenglong LI, Jun LI, Pei WANG, Agnes LIM, Jinlong LI, Timothy J. SCHMIT, Robert ATLAS, Sid-Ahmed BOUKABARA, Ross N. HOFFMAN, 2018: Value-added Impact of Geostationary Hyperspectral Infrared Sounders on Local Severe Storm Forecasts——via a Quick Regional OSSE, ADVANCES IN ATMOSPHERIC SCIENCES, 35, 1217-1230. doi: 10.1007/s00376-018-8036-3
[14]	Pei WANG, Zhenglong LI, Jun LI, Timothy J. SCHMIT, 2021: Added-value of GEO-hyperspectral Infrared Radiances for Local Severe Storm Forecasts Using the Hybrid OSSE Method, ADVANCES IN ATMOSPHERIC SCIENCES, 38, 1315-1333. doi: 10.1007/s00376-021-0443-1
[15]	Xinlin YANG, Jianhua SUN, 2018: Organizational Modes of Severe Wind-producing Convective Systems over North China, ADVANCES IN ATMOSPHERIC SCIENCES, 35, 540-549. doi: 10.1007/s00376-017-7114-2
[16]	Wanli LI, Xiushu QIE, Shenming FU, Debin SU, Yonghai SHEN, 2016: Simulation of Quasi-Linear Mesoscale Convective Systems in Northern China: Lightning Activities and Storm Structure, ADVANCES IN ATMOSPHERIC SCIENCES, 33, 85-100. doi: 10.1007/s00376-015-4170-3
[17]	Yanqing Gao, Xiaofeng Wang, Wei Guo, 2024: Impact of Assimilating FY-4A Lightning Data with a Latent Heat Nudging Method on Short-Term Forecasts of Severe Convective Events in Eastern China, ADVANCES IN ATMOSPHERIC SCIENCES. doi: 10.1007/s00376-024-3339-z
[18]	Dongmei XU, Zhiquan LIU, Shuiyong FAN, Min CHEN, Feifei SHEN, 2021: Assimilating All-sky Infrared Radiances from Himawari-8 Using the 3DVar Method for the Prediction of a Severe Storm over North China, ADVANCES IN ATMOSPHERIC SCIENCES, 38, 661-676. doi: 10.1007/s00376-020-0219-z
[19]	CHEN Hua, GUO Jing, XIONG Wei, GUO Shenglian, Chong-Yu XU, 2010: Downscaling GCMs Using the Smooth Support Vector Machine Method to Predict Daily Precipitation in the Hanjiang Basin, ADVANCES IN ATMOSPHERIC SCIENCES, 27, 274-284. doi: 10.1007/s00376-009-8071-1
[20]	Lei HAN, Mingxuan CHEN, Kangkai CHEN, Haonan CHEN, Yanbiao ZHANG, Bing LU, Linye SONG, Rui QIN, 2021: A Deep Learning Method for Bias Correction of ECMWF 24–240 h Forecasts, ADVANCES IN ATMOSPHERIC SCIENCES, 38, 1444-1459. doi: 10.1007/s00376-021-0215-y

PDF

Get Citation+

Export:

Article Metrics

Article Views: 563 Times

PDF downloads: 52 Times

Cited by: Times

Proportional views

Manuscript History

Manuscript received: 26 August 2023

Manuscript revised: 19 January 2024

Manuscript accepted: 31 January 2024

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

HTML

1. Introduction

Fires, like other natural hazards such as extreme precipitation, have a substantial impact on both ecosystems and human communities, inflicting significant harm to the environment and our overall health and wellbeing. Under global warming, wildfire activities become more and more frequent globally (Jolly et al., 2015). In the western United States (WUS), wildfires have been increasing in size, frequency and severity over the last several decades (Dennison et al., 2014; Mueller et al., 2020). Previous studies have demonstrated that fire activities can significantly affect weather and climate by releasing substantial amounts of heat, gases, and aerosol particles into the atmosphere (Abatzoglou and Kolden, 2013; Liu et al., 2017; Lee et al., 2018; Zhang et al., 2019, 2022). The heat emitted from fires can increase low-level temperatures and dramatically impact environmental thermodynamics (Trentmann et al., 2006; Kablick III et al., 2018; Zhang et al., 2019); fire-induced aerosols can impact severe convective storms (SCSs) and climate through aerosol–radiation and aerosol–cloud interactions (Lindsey and Fromm, 2008; Lu and Sokolik, 2013; Logan et al., 2018; Zhang et al., 2019, 2022).

However, studies of the impacts of fire on SCSs have tended to focus on either pyrocumulonimbus clouds (Fromm et al., 2006; Cunningham and Reeder, 2009; Kablick III et al., 2018; Zhang et al., 2019) or the local impact of wildfire aerosols (Lindsey and Fromm, 2008; Grell et al., 2011; Lu and Sokolik, 2013; Lee and Wang, 2020). The remote impact of fires on SCSs has not yet been explored to a sufficient extent. For example, large WUS wildfires emit enormous quantities of aerosols and sensible heat during the wildfire season, which could impact the environment for severe weather in the central United States (CUS). However, WUS wildfires occur most often in late summer and fall, which do not coincide with the severe weather seasons (i.e., spring and summer) in the CUS. Nonetheless, it has been observed that wildfires in WUS have begun to start earlier and earlier under climate change (Westerling et al., 2006; Jain et al., 2017; Jacobo and Zee, 2021). For example, the fire season in 2018 started in May in both the WUS and CUS. Such an earlier start to the fire season extends its duration and leads to it more likely coinciding with the severe weather season in the CUS. During the week of 23–29 July 2018, there was an extreme co-occurring event with storms occurring on four to five consecutive days and large western wildfires (e.g., Carr Fire and Mendocino Complex Fire).

In an earlier study, we simulated this extreme case with detailed physics and explored the remote effects of western wildfires on precipitation and hail in the CUS (Zhang et al., 2022—hereafter referred to as Zhang2022). Model results showed that WUS wildfires notably increase the frequencies of heavy precipitation rate (> 40 mm h⁻¹) and significant severe hail (> 5.08 cm) in the CUS, through the effects of both aerosol and sensible heat from wildfires. The model results revealed a synoptic-scale change in weather caused by WUS wildfires; that is, enhanced westerly winds, which make the meteorological environment more conducive to SCSs and increase the transportation of aerosols. However, this modeling study based on cases had limitations in terms of generality, particularly considering the stochastic nature of convective storm simulations.

Following on Zhang2022, here, we systematically examine the impacts of WUS fires on CUS SCSs over a two-decade period from 2001 to 2020 using machine learning (ML) methods. Zhang2022 showed that co-occurring cases of western wildfires and central SCSs are limited during a 10-year period. Therefore, to increase the sample size for reliable ML analysis, we not only extend the study period to 20 years, but also consider all fire types, including prescribed and agricultural fires, in selecting the co-occurring events. ML models are built to explore the linkage between hailstones in the CUS and the features of WUS fires (e.g., fire size, fire intensity, and smoke aerosols), with consideration of both meteorological factors and smoke aerosols over the fire regions as well as along the transport path. Two tree-based ML models that use ensemble learning algorithms—namely, Random Forest (RF) (Breiman, 2001) and Extreme Gradient Boosting (XGB) (Chen and Guestrin, 2016)—are adopted and developed to extract the nonlinear relationships between WUS fires and CUS hailstorms and examine variable contributions for the prediction of hailstones. To gain robust feature rankings for the constructed ML models, we use Shapley additive explanation (SHAP) values (Nohara et al., 2019) from both the RF and XGB models to evaluate the contribution of each predictor.

The ensemble learning approaches in RF and XGB can address the limitations of traditional linear regression methods in representing complex nonlinear relationships with variable interactions and obtain a robust predictive understanding of the occurrence of hail in the CUS associated with WUS fires. The findings can provide insights for designing long-term infrastructure or mitigating risk associated with these extreme events.

The rest of paper is structured as follows: We first introduce the data and ML methodology in section 2. Section 3 presents the development and evaluation of the two ML classification models for the occurrence of large hail in different regions and states and discusses the contributions of the most important variables influencing the occurrence of large hail. We summarize the limitations and applicability of the ML model and future work in section 4.

4. Conclusion and discussion

In this study, we employed tree-based ML methods to study the relationship between WUS fires and the occurrence of large hail in the CUS using 20 years of fire and hail data (from 2001 to 2020). To do so, ML classification models were built to predict the occurrence of large hail in the CUS states, using the co-occurring WUS fire features and the related meteorological variables over the fire region and along the path of fire plumes as predictors. The resulting RF and XGB classification models can make accurate predictions for the occurrence of large hail in some central US states with ~90% accuracy and F1 scores up to 0.78. This indicates WUS fires are correlated with the occurrence of large hail in the CUS. The ML analysis also shows that, compared to the CS1 states, WUS wildfires could have a larger impact on some CS2 states (further downwind than CS1), which may be related to more frequent hailstorms in the CS2 states. Additionally, ML models perform the best in the four states (WY, SD, NE, and KS) that are within the path of fire plumes, with large hail occurrences impacted more by the fires in OR and WA than those in CA. For MT and ND, located in the northern part of the CUS, which deviates slightly from the path of westerly winds, the performances of the ML classification models are not as good as those for the other states mentioned above, indicating the impact of WUS fires may be insignificant.

The SHAP rankings of the RF and XGB models identify the low-level temperature and RH in the fire region and westerly winds, which are related to the transport of moisture and aerosols, as the most important variables for the prediction of large hail occurrence in CS1 and CS2. For the four states where the ML models perform the best, fire features such as maximum fire power and burned area, are identified as important variables by both RF and XGB. Smoke aerosol is also identified by XGB as a top-20 important variable. Although smoke aerosol is not shown among the top 10 most important variables in ML models, it is correlated with fire power and burned area, and thus its contribution might be taken into account through these variables in the ML models. In short, the ML analysis of these 20 years of data show a relationship between WUS fires and the occurrence of CUS large hail, which corroborates the modeling study of Zhang2022. Based on Zhang2022 and this study, we expect persistent fires in the western US may enhance the occurrence of large hail in the central US when hailstorms coexist.

The observed linkage between wildfires in the WUS and the occurrence of large hail in the CUS can also be explained based on physical mechanisms, which were discussed in our earlier modeling study (Zhang2022). The sensible heat emitted from WUS fires can increase low-level temperatures and contribute to stronger westerly winds. The intensified westerly winds then increase moisture transport from WUS to CUS and wind shear in the CUS, leading to a meteorological condition more conducive to SCSs. The intensified westerly winds also produce stronger aerosol transport to the CUS, contributing to the formation of large hail through aerosol–cloud interactions. Zhang2022 showed that smoke aerosols contribute just as importantly to the enhanced occurrence of large hail as the sensible heat released by fires from that particular case study. Here, the ML models do not identify its key role, which could be due to colinearity with other variables such as fire power and burned area as well as the complex interactions.

It should be noted that we did not consider the local meteorological variables in the CUS in building our ML models since our aim was to identify the correlation between WUS fires and CUS large hail. We carried out tests by adding the local meteorological variables of the CUS states into the ML models, and the results showed that the training performance was much better, while the testing performance showed no obvious improvement. Also, the local meteorological variables became the dominant variables in the variable rankings. This makes sense physically since the local meteorological variables in the CUS should be the first order of factors determining the occurrence of SCSs. The WUS fires can only be an additional factor that may impact the storm intensity and thus the occurrence of large hail. Therefore, the ML models built in this work are for examining the nonlinear relationship between WUS fires and the occurrence of large hail in the CUS, which is a better approach than transitional statistical methods that have limitations in representation of system complexity, including nonlearity and high dimensionality.

We also tried to build RF and XGB regression models to examine the relationship between the daily count (i.e., the number) of large hail in the CUS and WUS fires. However, these models performed poorly, with obvious overfitting problems and underestimation of the large hail count. Physically, the number of large hail events is also very difficult to predict owing to our limited understanding of the processes and factors impacting their formation (Dennis and Kumjian, 2017; Jeong et al., 2020, 2021). As greater understanding of the physical mechanisms is revealed in the future, more relevant variables may be added to the ML models to improve their performances. On the other hand, data imbalance may also be another factor affecting ML model performance. Currently, about 90% of the occurrence of large hail data is zero for any specific state we investigated. In the future, we may consider using other ML techniques such as data augmentation, which increases the number of examples in the minority class, or transfer learning, which leverages pre-trained models that have been trained on similar data, to improve the model’s performance on imbalanced data.

Author contributions Xinming LIN conducted the technical work. Jiwen FAN conceived the idea. Jiwen FAN and Z. Jason HOU guided the research. Yuwei ZHANG provided comments on technical details. All authors contributed to the writing of the manuscript.

Acknowledgements. This paper is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research program as part of the Regional and Global Model Analysis and Multi-Sector Dynamics program areas (Award Number DE-SC0016605). Argonne National Laboratory is operated for the DOE by UChicago Argonne, LLC, under contract DE-AC02-06CH11357. This research used resources of the National Energy Research Scientific Computing Center (NERSC). NERSC is a U.S. DOE Office of Science User Facility operated under Contract DE-AC02-05CH11231.

Reference

Blair, S. F., and Coauthors, 2017: High-resolution hail observations: Implications for NWS warning operations. Weather and Forecasting, 32 (3), 1101−1119, https://doi.org/10.1175/WAF-D-16-0203.1.

Breiman, L., 2001: Random forests. Machine Learning, 45 (1), 5−32, https://doi.org/10.1023/A:1010933404324.

Dennis, E. J., and M. R. Kumjian, 2017: The impact of vertical wind shear on hail growth in simulated supercells. J. Atmos. Sci., 74 (3), 641−663, https://doi.org/10.1175/JAS-D-16-0066.1.

Gelaro, R., and Coauthors, 2017: The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2). J. Climate, 30, 5419−5454, https://doi.org/10.1175/JCLI-D-16-0758.1.

Janzing, D., L. Minorics, and P. Blöbaum, 2019: Feature relevance quantification in explainable AI: A causal problem. arXiv preprint arXiv: 1910.13413, https://doi.org/10.48550/arXiv.1910.13413.

Lindsey, D. T., and Fromm, M., 2008: Evidence of the cloud lifetime effect from wildfire‐induced thunderstorms. Geophys. Res. Lett., 35 (22), L22809, https://doi.org/10.1029/2008GL035680.

Machine Learning Analysis of Impact of Western US Fires on Central US Hailstorms

Abstract:

References

Get Citation+

Share Article

Article Metrics

Proportional views

Manuscript History

Online:

通讯作者: 陈斌, bchen63@163.com