-
We first evaluated the GFS, ConvLSTM-ED, and different hybrid models against in situ observations (Figs. 3, 4). The ConvLSTM-ED model performed reasonably well in short-term predictions, with predictability comparable to that of the SMAP L4 data (i.e., the training target SM). However, its performance degraded dramatically as the forecast time scale increased (Fig. 3), particularly in southeastern China (Figs. 4g–i). The GFS model performed worse than the ConvLSTM-ED model at all forecast time scales, especially in short-term forecasting (Fig. 3). Although the ConvLSTM-ED model outperformed the GFS model at all forecast time scales, it underperformed the GFS model in wet regions (e.g., southeastern China) (Figs. 4d–f, 4g–i). Notably, we cannot conclusively determine from these results which model (GFS or ConvLSTM-ED) was superior; moreover, such a determination was beyond the scope of our paper (the reasons are given in Text S5).
Figure 3. The mean (a) R and (b) ubRMSE of the different forecast models at different forecast time scales. Dashed lines denote the performance of the SMAP L4 data evaluated against in situ observations. The abbreviations of the model names are the same as in section 3.
Figure 4. The spatial distribution of performance (R) in the 1-, 7- and 16-day forecasts of the different models. We used the average model as the baseline hybrid model against which to evaluate the performances of the different hybrid models. Panels (a–c) show the performance of the average model, while the remaining rows show the difference between the R of the target model and the R of the average model. Red points indicate improved performance compared to the average model, while blue points indicate a decline in performance.
The average model dramatically improved upon the ConvLSTM-ED model at nearly all forecast time scales, demonstrating the benefit of adding physical information to DL models. However, the performance of the average model still dropped dramatically for long-term forecasting (Fig. 3), particularly in southeastern China (Figs. 4a–c), consistent with the behavior of the ConvLSTM-ED model (i.e., decreasing performance in southeastern China as the forecast time scale increased). Moreover, the average model performed poorly in northern China because it inherited the bias of the GFS model in this region. These results indicate that simple averaging could not fully exploit the benefits of both the PB and DL models.
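The averaging scheme itself is straightforward; a minimal sketch (assuming the DL and PB forecasts are already on a common grid and in the same units) might look like:

```python
import numpy as np

def average_hybrid(dl_sm, gfs_sm):
    """Baseline hybrid: the per-pixel, per-lead-time mean of the
    ConvLSTM-ED (DL) and GFS (PB) soil moisture forecasts."""
    dl_sm, gfs_sm = np.asarray(dl_sm), np.asarray(gfs_sm)
    return 0.5 * (dl_sm + gfs_sm)
```

Because the weights are fixed at 0.5 regardless of region or lead time, the scheme necessarily carries the bias of whichever member is worse at a given location, which is consistent with the degraded performance seen in northern China.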
The condition model greatly improved the long-term predictability compared to the average model at nearly all stations (Fig. 3, Figs. 4j–l), demonstrating that adding the GFS SM forecasts as exogenous inputs to the decoder of the ConvLSTM-ED model can significantly improve long-term predictions. However, the performance of the condition model decreased significantly in short-term predictions compared to the average model (Fig. 3). In short-term predictions, the regions where the condition model underperformed (blue dots in Fig. S3b) were consistent with those of the GFS model (red dots in Fig. S3a in the ESM) when compared to the ConvLSTM-ED model. This result highlights a problem with the condition model: it introduced biases from the short-term predictions of the GFS model into the DL model. Notably, although the condition model could propagate short-term forecast errors, incorporating the PB SM evolution (i.e., sharpened predictions from GFS) in the decoder still improved the performance of the ConvLSTM-ED model significantly. This emphasizes the significance of integrating physically consistent predictions into pure DL models.
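The decoder conditioning can be pictured as a loop in which each decoding step receives the GFS forecast for that lead time as an exogenous input. The toy sketch below uses a hypothetical linear cell (with illustrative weights) standing in for the trained ConvLSTM decoder; it is meant only to show the interface, not the actual model:

```python
def toy_cell(x_prev, gfs_t, state):
    """Toy stand-in for a ConvLSTM decoder cell: blends the carried state
    with the exogenous GFS field. The 0.6/0.4 weights are illustrative;
    x_prev is unused by this toy cell."""
    state = 0.6 * state + 0.4 * gfs_t
    return state, state

def condition_decode(cell, init_state, gfs_forecasts):
    """Decoder loop conditioned on the GFS forecast at each lead time."""
    preds, state, x = [], init_state, init_state
    for gfs_t in gfs_forecasts:
        x, state = cell(x, gfs_t, state)
        preds.append(x)
    return preds
```

The sketch also makes the failure mode visible: because every step ingests the GFS field directly, any short-term GFS bias flows straight into the early decoder outputs.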
The attention model further improved the short-term predictability and matched the long-term predictability of the condition model (Fig. 3). Spatially, the attention model retained the high predictability of the ConvLSTM-ED model in northern China and effectively overcame its deficiencies in southeastern China (Figs. 5g–i). Moreover, the attention model outperformed the condition model in most regions (Figs. 5j–l), with particularly marked improvement in short-term predictions where the condition model had introduced the bias of the GFS model (Fig. 5). These findings indicate that the attention mechanism can adaptively learn to exploit the benefits of the ConvLSTM-ED and GFS models for different forecast time scales and soil conditions, thereby significantly improving model performance in both space and time.
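Conceptually, the attention mechanism learns, for each location and lead time, how much weight to place on the DL stream versus the PB stream. A minimal NumPy sketch of such a weighted fusion (the scores would come from a trained scoring network, which is not shown; here they are free inputs):

```python
import numpy as np

def softmax(s, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(s - s.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_fuse(dl_sm, gfs_sm, scores):
    """Blend the DL and GFS forecasts with attention-style weights.
    scores: array (..., 2) of unnormalized relevances for [DL, GFS]."""
    w = softmax(np.asarray(scores, dtype=float))
    return w[..., 0] * np.asarray(dl_sm) + w[..., 1] * np.asarray(gfs_sm)
```

With equal scores this reduces to the average model; with strongly one-sided scores it effectively selects a single member, which is how an adaptive scheme can favor the DL model at short leads and the PB model at long leads.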
Figure 5. The R of the (a–c) GFS, (d–f) ConvLSTM-ED, and (g–i) attention models, and (j–l) the improvement of the attention model compared to the condition model.
The ensemble model outperformed all the other hybrid models (in terms of both R and ubRMSE) at all forecast time scales (Fig. 3), especially in long-term predictions; for example, it improved the mean R by 65% (from 0.205 to 0.340) relative to the ConvLSTM-ED model for the 16-day predictions. Moreover, compared to the attention model, the ensemble model further reduced the bias in southeastern China inherited from the ConvLSTM-ED model (Figs. 4p–r), and it outperformed the ConvLSTM-ED model at 79.5% of the in situ stations. These results underline the value of ensemble methods and emphasize the exceptional spatiotemporal predictability of the ensemble model.
-
We further evaluated the predictability using gridded datasets via TCA, which evaluates performance with respect to an unknown truth. The spatial distribution of SNR is shown in Fig. 6. The results were similar to those of the in situ evaluation and likewise demonstrated the superior performance of the proposed hybrid models (attention and ensemble). For example, the condition model enhanced the long-term predictability but decreased the short-term predictability (Figs. 6j–l, 6p–r). In addition, the attention model further corrected the bias of the condition model in short-term predictions (Figs. 6j–l, 6m–o), whereas the ensemble model achieved the best performance among all hybrid models (Fig. 6). We also applied TCA to another collocated triplet (i.e., SMOS data and in situ data from the CMA); the result was again consistent with that of the in situ evaluation (Text S6, Fig. S4 in the ESM).
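For reference, the TCA-based SNR of one member of a triplet can be estimated from sample covariances, following the standard triple collocation formulation (this is a generic sketch under the usual TCA assumptions of linearly related datasets with mutually independent errors, not the exact code used in the study):

```python
import numpy as np

def tca_snr(x, y, z):
    """Triple-collocation SNR (in dB) of dataset x, given a triplet (x, y, z)
    of collocated time series with independent errors. The signal variance of
    x is estimated as cov(x,y)*cov(x,z)/cov(y,z); the rest of var(x) is noise.
    SNR > 0 dB means the signal variance exceeds the noise variance."""
    c = np.cov(np.vstack([x, y, z]))
    signal = c[0, 1] * c[0, 2] / c[1, 2]   # estimated signal variance of x
    noise = c[0, 0] - signal               # residual error variance of x
    return 10.0 * np.log10(signal / noise)
```

To score another member of the triplet, the arguments are simply rotated (e.g., `tca_snr(y, x, z)`).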
Figure 6. TCA-based SNR of different models. The triplets of the TCA are [*, ERA5-Land, SoMo.ml], where * denotes the forecast models. Panels (a–c) show the average model. The remaining rows show the difference between the SNR of the target model and the average model.
Figure 7 shows the SNR of the SMAP L4 data (i.e., the training target of the DL models) and of the ConvLSTM-ED and ensemble models. The ConvLSTM-ED model showed inferior performance compared with the SMAP L4 data in most regions, and the regions where it underperformed were consistent with those of the SMAP L4 data. This indicates that the performance of the ConvLSTM-ED model depended strongly on the quality of the SMAP L4 data, which acted as its performance ceiling. Notably, the ensemble model outperformed the SMAP L4 data in most regions for short-term predictions (Fig. 7), particularly in drought-prone areas (e.g., the North China Plain), suggesting that the ensemble of different hybrid models could “break” the performance ceiling imposed by the training data in some areas. This is attributable to the introduction of physical information into the pure DL models. The in situ validation of the SMAP L4 data and the ensemble model further confirmed this result (Fig. S5 in the ESM). However, the long-term predictability of the ensemble model was still far inferior to that of the SMAP L4 data. Moreover, in long-term predictions, none of the forecast models showed satisfactory performance (i.e., signal larger than noise, SNR > 0) in more than half of the regions (Fig. S6c in the ESM), underscoring the challenge of long-term SM forecasting, which necessitates further investigation.
-
We further evaluated the drought predictability of the different forecast models. Figure 8 illustrates the kernel density curves of the SWDI of the in situ observations and the different models. Surprisingly, the SWDI distribution of the in situ observations contained two peaks, located near SWDI values of −10 and −2. We further applied more stringent quality control to the in situ observations (Dorigo et al., 2013) and found the same two-peak structure (Fig. S7 in the ESM). These two peaks may be a unique property of the in situ observation datasets used in our study. The GFS and average models tended to reproduce the right-hand peak of the in situ observations (i.e., they gave relatively stable predictions and missed some extreme events, such as SWDI < −10). In contrast, the attention model captured the left-hand peak (i.e., extreme drought events) better than the other hybrid models, showing the effectiveness of the attention mechanism for extreme drought forecasting. Furthermore, although the ensemble model provided the best overall performance, it tended to forecast the mean SWDI of the observations. This result emphasizes that ensemble methods can provide a more stable prediction by correcting the biases of individual models but may also “remove” some extreme events, which may make them unsuitable for drought forecasting.
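For context, the SWDI is commonly computed from soil moisture together with two soil hydraulic reference values, the field capacity and the wilting point (a generic sketch of the usual definition; under this convention, values below about −10 are typically classed as extreme drought, consistent with the left-hand observational peak):

```python
def swdi(theta, theta_fc, theta_wp):
    """Soil Water Deficit Index: 10 * (theta - theta_fc) / (theta_fc - theta_wp),
    where theta is the soil moisture, theta_fc the field capacity, and
    theta_wp the wilting point (all in the same units, e.g., m3/m3).
    SWDI >= 0 indicates no water deficit; increasingly negative values
    indicate increasingly severe drought."""
    return 10.0 * (theta - theta_fc) / (theta_fc - theta_wp)
```

At field capacity the index is 0, and at the wilting point it is −10, so the two observational peaks near −2 and −10 correspond to mild-deficit and near-wilting-point conditions, respectively.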
Figure 8. The kernel density curves of the SWDI of the in situ observations and the different forecast models (lines with different colors) for the (a) week-1 and (b) week-2 forecasts.
We further evaluated the fraction of observed drought events that were correctly detected by the forecast models. Table 1 summarizes the POD values of the different models over the Köppen–Geiger major climate zones. Overall, the attention model accurately detected 60.6% and 56.8% of drought events in the 1- and 2-week forecasts, respectively, and achieved the best detection over the arid, temperate, cold, and polar regions. Moreover, the ensemble-average operation (see the average and ensemble models) consistently yielded drought-event detection skill intermediate among its members, reinforcing the prior results. Notably, the GFS model excelled in the temperate region but performed the worst of all the forecast models over the arid, cold, and polar regions, indicating a poor representation of SM dynamics in these regions.
Table 1. The probability of an accurate drought event detection (POD) by the different models over different climate regions, based on in situ SM observations. The abbreviations of the model names are the same as in Fig. 1. The week-1 and week-2 columns represent the ability to forecast 1-week and 2-week drought, respectively. n denotes the number of stations located in the target climate region.

Model | Tropical (n=16) | Arid (n=91) | Temperate (n=642) | Cold (n=350) | Polar (n=30)
 | Week 1 | Week 2 | Week 1 | Week 2 | Week 1 | Week 2 | Week 1 | Week 2 | Week 1 | Week 2
GFS | 0.578 | 0.493 | 0.511# | 0.477# | 0.665* | 0.582 | 0.506# | 0.469# | 0.396# | 0.370#
ConvLSTM | 0.720 | 0.661 | 0.573* | 0.521 | 0.605# | 0.560 | 0.575 | 0.532 | 0.656 | 0.637
average | 0.521 | 0.479 | 0.536 | 0.492 | 0.643 | 0.592 | 0.542 | 0.502 | 0.529 | 0.502
condition | 0.744* | 0.693* | 0.543 | 0.519 | 0.605# | 0.532# | 0.582 | 0.545 | 0.640 | 0.578
attention | 0.655 | 0.630 | 0.570 | 0.536* | 0.629 | 0.598* | 0.599* | 0.550* | 0.696* | 0.644*
ensemble | 0.506# | 0.474# | 0.551 | 0.531 | 0.613 | 0.564 | 0.571 | 0.538 | 0.622 | 0.577
*Best model to detect drought events over the target climate region.
#Worst model to detect drought events over the target climate region.
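The POD reported in Table 1 is the standard hit rate from the contingency table of forecast versus observed drought events; a minimal sketch over boolean event series:

```python
def pod(forecast_events, observed_events):
    """Probability of detection: hits / (hits + misses), where a hit is an
    observed drought event that the forecast also flags, and a miss is an
    observed event that the forecast fails to flag."""
    hits = sum(1 for f, o in zip(forecast_events, observed_events) if f and o)
    misses = sum(1 for f, o in zip(forecast_events, observed_events) if o and not f)
    return hits / (hits + misses)
```

Note that POD alone does not penalize false alarms; a model that flags drought everywhere attains POD = 1, so it is best read alongside the general skill scores in Fig. 3.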
-
It was found in this study that embedding physical information in DL models through suitable hybrid methods dramatically improved the SM predictability compared to pure DL models, which can be attributed to several possible reasons. Firstly, it is well known that pure DL models may produce unrealistic predictions because they lack physical consistency (e.g., mass and energy balance). For example, Fang et al. (2019) found that pure DL models produced non-physical, highly fluctuating simulations. Thus, physical information provided by PB models, which obey physical laws, can be used to correct the non-physical predictions of pure DL models. Secondly, pure DL models can benefit from the assimilation of high-quality observations in PB models (Fang and Shen, 2020); for example, a pure DL model cannot predict the corresponding SM variation if a rainfall event is missing from the forcing data. However, data assimilation can remedy such forcing errors with high-quality observations, resulting in a better temporal representation of SM dynamics. One benefit of using the GFS forecasts (which include data assimilation) in our study was thus to help the pure DL models correct the bias induced by forcing errors. Thirdly, Daw et al. (2022) pointed out that pure DL models rely heavily on the quality of the training data and can only depict the evolution of SM present in those data (Klocek et al., 2022). This may lead to significant biases over regions with poor-quality data (e.g., wet regions in the SMAP L4 data). In contrast, PB models can depict the dynamics of SM under different soil conditions (e.g., precipitation infiltrates more easily in regions with high soil porosity) and can provide stable and realistic simulations given high-quality rainfall forcing [e.g., wet regions; see Maggioni et al. (2012)].
In addition, the GFS model can simulate SM in different water states (e.g., solid, liquid) through soil dynamics, which pure DL models struggle to do accurately because of the poor quality of the training datasets during the freezing period. Thus, incorporating physical information into pure DL models might help to overcome the deficiencies derived from data (Daw et al., 2022).
Although we introduced PB features to improve the model, the proposed hybrid models still inherited the uncertainties from the supervised DL models, i.e., the uncertainty from the training data. In addition, another source of uncertainty came from the selection of hybrid schemes, as demonstrated in section 4. Furthermore, the quality of the PB models also contributed to the uncertainty. Parameterizations and inadequate representation of land processes can introduce uncertainties in hybrid models. However, when compared with the PB models, the hybrid models benefited from the fitting ability of the DL algorithm and the vast amount of data, which could partially correct systematic errors. Moreover, the introduction of PB features also alleviated the limitation of the training data when compared to the pure DL models. These findings suggest that hybrid models are a promising way of enhancing the prediction skill for meteorological and hydroclimatic variables (Slater et al., 2023).
The potential applications of SM forecasting models have been comprehensively discussed in Peng et al. (2021), and we highlighted two important application directions. Firstly, the proposed model could provide accurate initializations of land-surface conditions for numerical weather prediction (NWP) systems. Indeed, the integration of SM into several NWP models has been found to improve forecasts of atmospheric variables (Dharssi et al., 2011; Muñoz-Sabater et al., 2019; De Rosnay et al., 2020). Secondly, accurate predictions of SM could be utilized for monitoring, analyzing and providing early warnings of hydrometeorological disasters, including agricultural drought (Mishra et al., 2017) and floods (Li et al., 2018). Additionally, these predictions could inform decision-making processes, such as in watershed management (Heimhuber et al., 2017) and irrigation water management (Lawston et al., 2017).
In our study, we aimed to investigate the benefits of incorporating physical information into DL models, but exploring the interpretability of the proposed models is beyond the scope of the present paper. However, these complex hybrid models may have low interpretability and should be used with caution in practical applications. Explainable artificial intelligence (XAI) provides tools to aid decision-making when applying DL models in real-world applications. Several studies have explored the interpretability of DL SM forecasting models using XAI tools. For example, Huang et al. (2023) adopted various post-hoc interpretation methods to assess feature effects on SM predictions and showed that a comprehensive understanding of the relationship between input features and predicted SM could be achieved. The interpretation methods used in their study, such as Shapley values and partial dependence plots, could be used to investigate the contributions of different features (e.g., GFS forecasted values) to our proposed models, which deserves further exploration.
We end our discussion by pointing out some limitations of our study. Firstly, we did not identify the “best” hybrid scheme for achieving the “best” forecast (i.e., general performance and drought predictability) across different forecast time scales and spatial regions. For example, the ensemble model achieved the best general performance at all forecast time scales (section 4.1), but the ensemble method may “remove” some extreme drought events (section 4.3). Therefore, we highlight that the choice of hybrid method may depend on the application: for example, the ensemble model is suited to long-term, stable predictions that mainly concern the average state of SM, while the attention model is suited to forecasting extreme drought events. Secondly, we integrated GFS with the ConvLSTM-ED model because of its efficiency and widespread use (Fan and van den Dool, 2011; Yin et al., 2019). However, neither GFS nor ConvLSTM-ED is the “best” PB or DL model for SM prediction, so our results may not fully represent the properties of PB and DL models. Nonetheless, we demonstrated the improvements of the different hybrid methods based on these two widely used models. Thirdly, hybrid models can adopt different frameworks, e.g., physically guided DL (Willard et al., 2022a) or differentiable programming (Feng et al., 2022). In this study, we focused only on using PB model outputs and observational features in a hybrid modeling setup to generate strong-performing SM predictions; we did not introduce physical laws or principles to guide the DL models. Several “deep” hybrid frameworks have been developed (Read et al., 2019; Liu et al., 2022) that can “force” DL models to forecast with physical consistency, thereby possibly providing more realistic and stable predictions (Willard et al., 2022a).
Moreover, pre-training DL models on PB model outputs and fine-tuning them on the target data (i.e., transfer learning) may also exploit the physical information.