Advanced Search
Article Contents

A Physics-informed Deep-learning Intensity Prediction Scheme for Tropical Cyclones over the Western North Pacific


doi:  10.1007/s00376-024-3282-z

  • Accurate prediction of tropical cyclone (TC) intensity is challenging due to the complex physical processes involved. Here, we introduce a new TC intensity prediction scheme for the western North Pacific (WNP) based on a time-dependent theory of TC intensification, termed the energetically based dynamical system (EBDS) model, together with the use of a long short-term memory (LSTM) neural network. In time-dependent theory, TC intensity change is controlled by both the internal dynamics of the TC system and various environmental factors, expressed as environmental dynamical efficiency. The LSTM neural network is used to predict the environmental dynamical efficiency in the EBDS model trained using best-track TC data and global reanalysis data during 1982–2017 . The transfer learning and ensemble methods are used to retrain the scheme using the environmental factors predicted by the Global Forecast System (GFS) of the National Centers for Environmental Prediction during 2017–21. The predicted environmental dynamical efficiency is finally iterated into the EBDS equations to predict TC intensity. The new scheme is evaluated for TC intensity prediction using both reanalysis data and the GFS prediction data. The intensity prediction by the new scheme shows better skill than the official prediction from the China Meteorological Administration (CMA) and those by other state-of-art statistical and dynamical forecast systems, except for the 72-h forecast. Particularly at the longer lead times of 96 h and 120 h, the new scheme has smaller forecast errors, with a more than 30% improvement over the official forecasts.
    摘要: 热带气旋(TC)强度变化涉及复杂的物理过程,其准确预报是当前TC业务预报中一项重要的挑战。本文结合最新提出的TC增强理论(称为基于能量的动力系统模型EBDS)和长短时记忆(LSTM)神经网络,研制了一个针对西北太平洋(WNP)的TC强度预报新方法。在TC增强理论中,TC强度变化受到TC系统内部动力学和各种环境因子的共同控制,这些环境因子的作用可用环境动力效率表示。基于1982-2017年的TC最佳数据集和全球再分析数据,使用LSTM神经网络来预训练EBDS模型中的环境动力效率。进一步,基于2017-2021年美国国家环境预报中心全球预报系统(GFS)预报的环境因子,利用迁移学习和集成方法来微调模型。最后将预报的环境动力效率迭代到EBDS方程中来预报TC强度。使用再分析数据和GFS预报数据对该方法预报的TC强度进行评估。结果表明,除了72小时预报,新方法的强度预报较中国气象局(CMA)的官方预报和其它主流的统计及动力预报更优。特别是对于96小时和120小时预报,新方法的预报误差更小,相比官方预报有超过30%的改进。
  • 加载中
  • Figure 1.  Number of samples in the training (blue), validation (purple), and test (green) data at different forecast lead times with a 6-hour interval.

    Figure 2.  The 17-day EBDS model “predicted” intensity of Super Typhoon Noru (2017) and the corresponding CMA best-track intensity.

    Figure 3.  Transfer learning structure in the EBDS_LSTM model.

    Figure 4.  Schematic of the ensemble forecast in the EBDS_LSTM model. X(n) indicates predictors as listed in Table 1 and Y(1)m indicates TC intensity, where m is the number of ensemble members (20).

    Figure 5.  Schematic of the EBDS_LSTM forecast scheme.

    Figure 6.  (a) Averaged RMSEs (m s−1) of the 7-day intensity forecasts from the EBDS_LSTM scheme (red bars) and averaged RMSEs of the 5-day forecasts from the CMA (black bars) at 24-h intervals for all independent TC cases in 2017 based on the NCEP–NCAR reanalysis data. (b) Averaged RMSEs (m s−1) of the 5-day real-time intensity forecasts at 24-h intervals from the EBDS_LSTM scheme and the CMA for all independent cases in 2017. (c) Averaged RMSEs (m s−1) of the 5-day real-time intensity forecasts at 24-h intervals for all independent cases in 2022 from the EBDS_LSTM scheme (red), CMA (black), NCEP-GFS (green), ECMWF (light green), and WIPS (orange). The EBDS_LSTM model was trained on NCEP–NCAR reanalysis from 1982 to 2016. For (a), predictions were made based on reanalysis; for (b), on GFS forecast data for the year 2017 without transfer learning; and for (c), on GFS forecast data for the year 2022, using transfer learning with GFS data during 2019–21.

    Figure 7.  (a, b) Observed tracks and intensities of (a) Noru and (b) Lan in 2017. (c, d) The 7-day forecasts of the intensity for (c) Noru and (d) Lan at different forecast times with 6-h intervals based on NCEP–NCAR reanalysis data (dashed lines) and the corresponding CMA best-track intensity (red solid line). (e, f) Diagnosed (red) and predicted (black) dynamical efficiency factor E′ for (e) Noru and (f) Lan.

    Figure 8.  (a, b) Observed tracks and intensities for (a) Surigae and (b) Nyatoh in 2021. (c–f) The 5-day forecasts of the intensity for (c, e) Surigae and (d, f) Nyatoh at different forecast times with 6-h intervals from (c, d) the EBDS_LSTM forecasts (dashed lines) and (e, f) the CMA forecasts (dashed lines). In (c–f), red solid lines indicate the corresponding CMA best-track intensity.

    Table 1.  Predictors and corresponding descriptions.

    Predictor Units Description
    VWS m s−1 Averaged vertical wind shear between 200 and 850 hPa within a radius of 5° of the TC center
    AU m s−1 Averaged zonal wind at 200 hPa within a radius of 5° of the TC center
    AV85 s−1 Averaged absolute vorticity at 850 hPa within an annulus area between radii of 5° and 10° of the TC center
    DIV85 s−1 Averaged divergence at 850 hPa within an annulus area between radii of 5° and 10° of the TC center
    DIV20 s−1 Averaged divergence at 200 hPa within an annulus area between radii of 5° and 10° of the TC center
    H50 gpm 500-hPa geopotential height at the TC center
    VOR85_lon ° Longitude of the greatest vorticity at 850 hPa in the range of 2° (4°) of the TC center at 0−24 h (> 24 h) forecasts
    VOR85_lat ° Latitude of the greatest vorticity at 850 hPa in the range of 2° (4°) of the TC center at 0−24 h (> 24 h) forecasts
    RH8570 % Averaged relative humidity at 850–700 hPa within an annulus area between radii of 5° and 10° of the TC center
    RH5030 % Averaged relative humidity at 500–300 hPa within an annulus area between radii of 5° and 10° of the TC center
    MPI m s−1 Maximum potential intensity estimated using the SST averaged within a radius of 300 km from the TC center
    DownLoad: CSV

    Table 2.  RMSEs (m s−1) of intensity forecasts for Noru and Lan in 2017 at 24, 48, 72, 96, 120, 144, and 168 h from the EBDS_LSTM scheme and the CMA 5-day forecasts. Smaller RMSEs between the two methods are shown in boldface.

    Forecast time (h) RMSE (m s−1) in Noru RMSE (m s−1) in Lan
    EBDS_LSTM CMA EBDS_LSTM CMA
    24 4.23 7.32 3.82 5.40
    48 4.48 8.84 4.16 5.36
    72 4.36 9.20 4.40 5.23
    96 4.29 9.00 4.85 7.04
    120 4.31 8.14 5.36 9.51
    144 4.45 - 3.64 -
    168 4.61 - 2.83 -
    DownLoad: CSV

    Table 3.  RMSEs (m s−1) of intensity forecasts for Surigae and Nyatoh in 2021 at 24, 48, 72, 96 and 120 h forecasts from the EBDS_LSTM method and the CMA 5-day forecasts. Smaller RMSEs between the two methods are shown in boldface.

    Forecast Time (h) RMSE (m s−1) in Surigae RMSE (m s−1) in Nyatoh
    EBDS_LSTM CMA EBDS_LSTM CMA
    24 4.23 5.53 3.82 6.60
    48 4.48 8.64 4.16 11.21
    72 4.36 9.73 4.40 13.88
    96 4.29 9.12 4.85
    120 4.31 7.68 5.36
    DownLoad: CSV
  • Baik, J. J., and J. S. Paek, 1998: A climatology of sea surface temperature and the maximum intensity of western North Pacific tropical cyclones. J. Meteor. Soc. Japan, 76, 129−137, https://doi.org/10.2151/jmsj1965.76.1_129.
    Baik, J. J., and J. S. Paek, 2000: A neural network model for predicting typhoon intensity. J. Meteor. Soc. Japan, 78, 857−869, https://doi.org/10.2151/jmsj1965.78.6_857.
    Bender, M. A., I. Ginis, R. Tuleya, B. Thomas, and T. Marchok, 2007: The operational GFDL coupled hurricane-ocean prediction system and a summary of its performance. Mon. Wea. Rev., 135, 3965−3989, https://doi.org/10.1175/2007MWR 2032.1.
    Bi, K. F., L. X. Xie, H. H. Zhang, X. Chen, X. T. Gu, and Q. Tian, 2023: Accurate medium-range global weather forecasting with 3D neural networks. Nature, 619, 533−538, https://doi.org/10.1038/s41586-023-06185-3.
    Bozinovski, S., 2020: Reminder of the first paper on transfer learning in neural networks, 1976. Informatica, 44, 291−302, https://doi.org/10.31449/inf.v44i3.2828.
    Cangialosi, J. P., E. Blake, M. DeMaria, A. Penny, A. Latto, E. Rappaport, and V. Tallapragada, 2020: Recent progress in tropical cyclone intensity forecasting at the National Hurricane Center. Wea. Forecasting, 35, 1913−1922, https://doi.org/10.1175/WAF-D-20-0059.1.
    Chen, G. M., X. P. Zhang, L. N. Bai, and R. J. Wan, 2019: Verification on forecasts of tropical cyclones over western North Pacific and South China Sea in 2017. Meteorological Monthly, 45, 577−586, https://doi.org/10.7519/j.issn.1000-0526.2019.04.012.
    Chen, P. Y., H. Yu, and J. C. L. Chan, 2011: A western North Pacific tropical cyclone intensity prediction scheme. Acta Meteorologica Sinica, 25, 611−624, https://doi.org/10.1007/s13351-011-0506-9.
    Cloud, K. A., B. J. Reich, C. M. Rozoff, S. Alessandrini, W. E. Lewis, and L. D. Monache, 2019: A feed forward neural network based on model output statistics for short-term hurricane intensity prediction. Wea. Forecasting, 34, 985−997, https://doi.org/10.1175/WAF-D-18-0173.1.
    DeMaria, M., 2009: A simplified dynamical system for tropical cyclone intensity prediction. Mon. Wea. Rev., 137, 68−82, https://doi.org/10.1175/2008MWR2513.1.
    DeMaria, M., C. R. Sampson, J. A. Knaff, and K. D. Musgrave, 2014: Is tropical cyclone intensity guidance improving?. Bull. Amer. Meteor. Soc., 95, 387−398, https://doi.org/10.1175/BAMS-D-12-00240.1.
    DeMaria, M., M. Mainelli, L. K. Shay, J. A. Knaff, and J. Kaplan, 2005: Further improvements to the Statistical Hurricane Intensity Prediction Scheme (SHIPS). Wea. Forecasting, 20, 531−543, https://doi.org/10.1175/WAF862.1.
    Dong, L., S. Z. Gao, Y. L. Xu, X. Y. Lü, and Y. W. Huang, 2019: Analysis of characteristics and forecast difficulties of TCs on western North Pacific in 2017. Meteorological Monthly, 45, 1322−1334, https://doi.org/10.7519/j.issn.1000-0526.2019.09.012.
    Dvorak, V. F., 1975: Tropical cyclone intensity analysis and forecasting from satellite imagery. Mon. Wea. Rev., 103, 420−430, https://doi.org/10.1175/1520-0493(1975)103<04 20:TCIAAF>2.0.CO;2.
    Elman, J. L., 1990: Finding structure in time. Cognitive Science, 14, 179−211, https://doi.org/10.1207/s15516709cog1402_1.
    Hao, Z. K., S. M. Liu, Y. C. Zhang, C. Y. Ying, Y. Feng, H. Su, and J. Zhu, 2023: Physics-informed machine learning: A survey on problems, methods and applications. arXiv:2211.08064, https://doi.org/10.48550/arXiv.2211.08064.
    Heming, J. T., 2016: Met office unified model tropical cyclone performance following major changes to the initialization scheme and a model upgrade. Wea. Forecasting, 31, 1433−1449, https://doi.org/10.1175/WAF-D-16-0040.1.
    Heming, J. T., and Coauthors, 2019: Review of recent progress in tropical cyclone track forecasting and expression of uncertainties. Tropical Cyclone Research and Review, 8, 181−218, https://doi.org/10.1016/j.tcrr.2020.01.001.
    Hochreiter, S., and J. Schmidhuber, 1997: Long short-term memory. Neural Computation, 9, 1735−1780, https://doi.org/10.1162/neco.1997.9.8.1735.
    Jarvinen, B. R., and C. J. Neumann, 1979: Statistical forecasts of tropical cyclone intensity for the North Atlantic basin. NOAA Tech. Memo. NWS NHC-10.
    Kalnay, E., and Coauthors, 1996: The NCEP/NCAR 40-year reanalysis project. Bull. Amer. Meteor. Soc., 77, 437−472, https://doi.org/10.1175/1520-0477(1996)077<0437:TNYRP>2.0.CO;2.
    Karniadakis, G. E., I. G. Kevrekidis, L. Lu, P. Perdikaris, S. F. Wang, and L. Yang, 2021: Physics-informed machine learning. Nature Reviews Physics, 3, 422−440, https://doi.org/10.1038/s42254-021-00314-5.
    Knaff, J. A., C. R. Sampson, and M. DeMaria, 2005: An operational statistical typhoon intensity prediction scheme for the western North Pacific. Wea. Forecasting, 20, 688−699, https://doi.org/10.1175/WAF863.1.
    Knaff, J. A., M. DeMaria, C. R. Sampson, and J. M. Gross, 2003: Statistical, 5-day tropical cyclone intensity forecasts derived from climatology and persistence. Wea. Forecasting, 18, 80−92, https://doi.org/10.1175/1520-0434(2003)018<0080:SDTCIF>2.0.CO;2.
    Li, Z. C., L. Zhang, Q. F. Qian, S. H. Ma, J. Xu, K. Dai, Y. Shen, and Y. D. Wang, 2020: The development and consideration of typhoon forecast operation of National Meteorological Center. Transactions of Atmospheric Sciences, 43, 10−19, https://doi.org/10.13878/j.cnki.dqkxxb.20200110015.
    Pan, B., X. Xu, and Z. W. Shi, 2019: Tropical cyclone intensity prediction based on recurrent neural networks. Electronics Letters, 55, 413−415, https://doi.org/10.1049/el.2018.8178.
    Pan, S. J., and Q. Yang, 2010: A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22, 1345−1359, https://doi.org/10.1109/TKDE.2009.191.
    Raissi, M., P. Perdikaris, and G. E. Karniadakis, 2017: Physics informed deep learning (Part I): Data-driven solutions of nonlinear partial differential equations. arXiv:1711,10561, https://doi.org/10.48550/arXiv.1711.10561.
    Raissi, M., P. Perdikaris, and G. E. Karniadakis, 2019: Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys., 378, 686−707, https://doi.org/10.1016/j.jcp.2018.10.045.
    Srivastava, N., G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, 2014: Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15, 1929−1958.
    Tong, B., X. Wang, J. Y. Fu, P. W. Chan, and Y. C. He, 2022: Short-term prediction of the intensity and track of tropical cyclone via ConvLSTM model. Journal of Wind Engineering and Industrial Aerodynamics, 226, 105026, https://doi.org/10.1016/j.jweia.2022.105026.
    Velden, C. S., T. L. Olander, and R. M. Zehr, 1998: Development of an objective scheme to estimate tropical cyclone intensity from digital geostationary satellite infrared imagery. Wea. Forecasting, 13, 172−186, https://doi.org/10.1175/1520-0434(1998)013<0172:DOAOST>2.0.CO;2.
    Wang, C., G. Zheng, X. F. Li, Q. Xu, B. Liu, and J. Zhang, 2022: Tropical cyclone intensity estimation from geostationary satellite imagery using deep convolutional neural networks. IEEE Trans. Geosci. Remote Sens., 60, 4101416, https://doi.org/10.1109/TGRS.2021.3066299.
    Wang, Y., and C.-C. Wu, 2004: Current understanding of tropical cyclone structure and intensity changes-A review. Meteorol. Atmos. Phys., 87, 257−278, https://doi.org/10.1007/s00703-003-0055-6.
    Wang, Y. Q., Y.-L. Li, and J. Xu, 2021b: A new time-dependent theory of tropical cyclone intensification. J. Atmos. Sci., 78, 3855−3865, https://doi.org/10.1175/JAS-D-21-0169.1.
    Wang, Y. Q., Y.-L. Li, J. Xu, Z.-M. Tan, and Y.-L. Lin, 2021a: The intensity-dependence of tropical cyclone intensification rate in a simplified energetically based dynamical system model. J. Atmos. Sci., 78, 2033−2045, https://doi.org/10.1175/JAS-D-20-0393.1.
    Wu, Y. Q., X. Y. Geng, Z. L. Liu, and Z. W. Shi, 2022: Tropical cyclone forecast using multitask deep learning framework. IEEE Geoscience and Remote Sensing Letters, 19, 6503505, https://doi.org/10.1109/LGRS.2021.3132395.
    Xu, J., and Y. Q. Wang, 2022: Potential intensification rate of tropical cyclones in a simplified energetically based dynamical system model: An observational analysis. J. Atmos. Sci., 79, 1045−1055, https://doi.org/10.1175/JAS-D-21-0217.1.
    Xu, J., Y. Q. Wang, and C. Yang, 2023: Quantifying the environmental effects on tropical cyclone intensity change using a simple dynamically based dynamical system model. J. Atmos. Sci., 80, 2897−2913, https://doi.org/10.1175/JAS-D-23-0058.1.
    Xu, W. W., K. Balaguru, A. August, N. Lalo, N. Hodas, M. DeMaria, and D. Judi, 2021: Deep learning experiments for tropical cyclone intensity forecasts. Wea. Forecasting, 36, 1453−1470, https://doi.org/10.1175/WAF-D-20-0104.1.
    Yang, F. L., H. L. Pan, S. K. Krueger, S. Moorthi, and S. J. Lord, 2006: Evaluation of the NCEP global forecast system at the ARM SGP Site. Mon. Wea. Rev., 134, 3668−3690, https://doi.org/10.1175/MWR3264.1.
    Yang, F. L., and Coauthors, 2020: Model upgrade plan and initial results from a prototype NCEP Global Forecast System Version 16. 10th Conference on Transition of Research to Operations, Boston, MA, AMS.
    Ying, M., W. Zhang, H. Yu, X. Q. Lu, J. X. Feng, Y. X. Fan, Y. T. Zhu, and D. Q. Chen, 2014: An overview of the China Meteorological Administration tropical cyclone database. J. Atmos. Oceanic Technol., 31, 287−301, https://doi.org/10.1175/JTECH-D-12-00119.1.
    Yu, H., P.-Y. Chen, Q.-Q. Li, and B. Tang, 2013: Current capability of operational numerical models in predicting tropical cyclone intensity in the western North Pacific. Wea. Forecasting, 28, 353−367, https://doi.org/10.1175/WAF-D-11-001 00.1.
    Yuan, S. J., C. Wang, B. Mu, F. F. Zhou, and W. S. Duan, 2021: Typhoon intensity forecasting based on LSTM using the rolling forecast method. Algorithms, 14, 83, https://doi.org/10.3390/a14030083.
    Zhou, Y. C., J. W. Zhao, R.-F. Zhan, P. Y. Chen, Z. W. Wu, and L. Wang, 2021: A Logistic-growth-equation-based intensity prediction scheme for western North Pacific tropical cyclones. Adv. Atmos. Sci., 38, 1750−1762, https://doi.org/10.1007/s00376-021-0435-1.
  • [1] Yanchen ZHOU, Jiuwei ZHAO, Ruifen ZHAN, Peiyan CHEN, Zhiwei WU, Lan WANG, 2021: A Logistic-growth-equation-based Intensity Prediction Scheme for Western North Pacific Tropical Cyclones, ADVANCES IN ATMOSPHERIC SCIENCES, 38, 1750-1762.  doi: 10.1007/s00376-021-0435-1
    [2] ZHAO Haikun, WU Liguang*, and WANG Ruifang, 2014: Decadal Variations of Intense Tropical Cyclones over the Western North Pacific during 19482010, ADVANCES IN ATMOSPHERIC SCIENCES, 31, 57-65.  doi: 10.1007/s00376-013-3011-5
    [3] Chang-Hoi HO, Joo-Hong KIM, Hyeong-Seog KIM, Woosuk CHOI, Min-Hee LEE, Hee-Dong YOO, Tae-Ryong KIM, Sangwook PARK, 2013: Technical Note on a Track-pattern-based Model for Predicting Seasonal Tropical Cyclone Activity over the Western North Pacific, ADVANCES IN ATMOSPHERIC SCIENCES, 30, 1260-1274.  doi: 10.1007/s00376-013-2237-6
    [4] Ruifen ZHAN, Yuqing WANG, Yihui DING, 2022: Impact of the Western Pacific Tropical Easterly Jet on Tropical Cyclone Genesis Frequency over the Western North Pacific, ADVANCES IN ATMOSPHERIC SCIENCES, 39, 235-248.  doi: 10.1007/s00376-021-1103-1
    [5] ZHOU Yang, JIANG Jing, Youyu LU, and HUANG Anning, 2013: Revealing the effects of the El Nio-Southern oscillation on tropical cyclone intensity over the western North Pacific from a model sensitivity study, ADVANCES IN ATMOSPHERIC SCIENCES, 30, 1117-1128.  doi: 10.1007/s00376-012-2109-5
    [6] Xiaohao QIN, Wansuo DUAN, Pak-Wai CHAN, Boyu CHEN, Kang-Ning HUANG, 2023: Effects of Dropsonde Data in Field Campaigns on Forecasts of Tropical Cyclones over the Western North Pacific in 2020 and the Role of CNOP Sensitivity, ADVANCES IN ATMOSPHERIC SCIENCES, 40, 791-803.  doi: 10.1007/s00376-022-2136-9
    [7] Jihang LI, Zhiyan ZHANG, Lu LIU, Xubin ZHANG, Jingxuan QU, Qilin WAN, 2021: The Simulation of Five Tropical Cyclones by Sample Optimization of Ensemble Forecasting Based on the Observed Track and Intensity, ADVANCES IN ATMOSPHERIC SCIENCES, 38, 1763-1777.  doi: 10.1007/s00376-021-0353-2
    [8] LU Riyu*, DONG Huilin, SU Qin, and Hui DING, 2014: The 30-60-day Intraseasonal Oscillations over the Subtropical Western North Pacific during the Summer of 1998, ADVANCES IN ATMOSPHERIC SCIENCES, 31, 1-7.  doi: 10.1007/s00376-013-3019-x
    [9] Sining LING, Riyu LU, 2022: Tropical Cyclones over the Western North Pacific Strengthen the East Asia–Pacific Pattern during Summer, ADVANCES IN ATMOSPHERIC SCIENCES, 39, 249-259.  doi: 10.1007/s00376-021-1171-2
    [10] Haikun ZHAO, Chunzai WANG, Ryuji YOSHIDA, 2016: Modulation of Tropical Cyclogenesis in the Western North Pacific by the Quasi-Biweekly Oscillation, ADVANCES IN ATMOSPHERIC SCIENCES, 33, 1361-1375.  doi: 10.1007/s00376-016-5267-z
    [11] ZHAO Haikun, WU Liguang, ZHOU Weican, 2010: Assessing the Influence of the ENSO on Tropical Cyclone Prevailing Tracks in the Western North Pacific, ADVANCES IN ATMOSPHERIC SCIENCES, 27, 1361-1371.  doi: 10.1007/s00376-010-9161-9
    [12] Pan SONG, Jiang ZHU, Zhong ZHONG, Linlin QI, Xiaodan WANG, 2016: Impact of Atmospheric and Oceanic Conditions on the Frequency and Genesis Location of Tropical Cyclones over the Western North Pacific in 2004 and 2010, ADVANCES IN ATMOSPHERIC SCIENCES, 33, 599-613.  doi: 10.1007/s00376-015-5046-2
    [13] Shumin CHEN, Weibiao LI, Zhiping WEN, Mingsen ZHOU, Youyu LU, Yu-Kun QIAN, Haoya LIU, Rong FANG, 2018: Variations in High-frequency Oscillations of Tropical Cyclones over the Western North Pacific, ADVANCES IN ATMOSPHERIC SCIENCES, 35, 423-434.  doi: 10.1007/s00376-017-7060-z
    [14] Hong HUANG, Dan WU, Yuan WANG, Zhen WANG, Yu LIU, 2024: Track-Pattern-Based Characteristics of Extratropical Transitioning Tropical Cyclones in the Western North Pacific, ADVANCES IN ATMOSPHERIC SCIENCES, 41, 1251-1263.  doi: 10.1007/s00376-023-2330-4
    [15] Yining Gu, Ruifen Zhan, Xiaomeng Li, 2024: Environmental conditions conducive to the formation of Multiple Tropical Cyclones over the Western North Pacific, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-024-3237-4
    [16] Fumin REN, Yanjun XIE, Biwen YIN, Mingyang WANG, Guoping LI, 2020: Establishment of an Objective Standard for the Definition of Binary Tropical Cyclones in the Western North Pacific, ADVANCES IN ATMOSPHERIC SCIENCES, 37, 1211-1221.  doi: 10.1007/s00376-020-9287-3
    [17] SONG Yajuan, WANG Lei, LEI Xiaoyan, WANG Xidong, 2015: Tropical Cyclone Genesis Potential Index over the Western North Pacific Simulated by CMIP5 Models, ADVANCES IN ATMOSPHERIC SCIENCES, 32, 1539-1550.  doi: 10.1007/s00376-015-4162-3
    [18] ZHANG Ying, WANG Huijun, SUN Jianqi, Helge DRANGE, 2010: Changes in the Tropical Cyclone Genesis Potential Index over the Western North Pacific in the SRES A2 Scenario, ADVANCES IN ATMOSPHERIC SCIENCES, 27, 1246-1258.  doi: 10.1007/s00376-010-9096-1
    [19] Xiaoqin LU, Hui YU, Ming YING, Bingke ZHAO, Shuai ZHANG, Limin LIN, Lina BAI, Rijin WAN, 2021: Western North Pacific Tropical Cyclone Database Created by the China Meteorological Administration, ADVANCES IN ATMOSPHERIC SCIENCES, 38, 690-699.  doi: 10.1007/s00376-020-0211-7
    [20] Jie WU, Xuejie GAO, Yingmo ZHU, Ying SHI, Filippo GIORGI, 2022: Projection of the Future Changes in Tropical Cyclone Activity Affecting East Asia over the Western North Pacific Based on Multi-RegCM4 Simulations, ADVANCES IN ATMOSPHERIC SCIENCES, 39, 284-303.  doi: 10.1007/s00376-021-0286-9

Get Citation+

Export:  

Share Article

Manuscript History

Manuscript received: 23 October 2023
Manuscript revised: 07 February 2024
Manuscript accepted: 13 March 2024
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

A Physics-informed Deep-learning Intensity Prediction Scheme for Tropical Cyclones over the Western North Pacific

    Corresponding author: Ruifen ZHAN, zhanrf@fudan.edu.cn
    Corresponding author: Yuqing WANG, yuqing@hawaii.edu
  • 1. Department of Atmospheric and Oceanic Sciences & Institute of Atmospheric Sciences, Fudan University, Shanghai 200438, China
  • 2. Shanghai Typhoon Institute of China Meteorological Administration, Shanghai 200030, China
  • 3. Department of Atmospheric Sciences and International Pacific Research Center, University of Hawaii at Manoa, Honolulu 96822, USA
  • 4. Key Laboratory of Mesoscale Severe Weather/MOE, and School of the Atmospheric Sciences, Nanjing University, Nanjing 210023, China
  • 5. School of Computer Science, Fudan University, Shanghai 200438, China

Abstract: Accurate prediction of tropical cyclone (TC) intensity is challenging due to the complex physical processes involved. Here, we introduce a new TC intensity prediction scheme for the western North Pacific (WNP) based on a time-dependent theory of TC intensification, termed the energetically based dynamical system (EBDS) model, together with the use of a long short-term memory (LSTM) neural network. In time-dependent theory, TC intensity change is controlled by both the internal dynamics of the TC system and various environmental factors, expressed as environmental dynamical efficiency. The LSTM neural network is used to predict the environmental dynamical efficiency in the EBDS model trained using best-track TC data and global reanalysis data during 1982–2017 . The transfer learning and ensemble methods are used to retrain the scheme using the environmental factors predicted by the Global Forecast System (GFS) of the National Centers for Environmental Prediction during 2017–21. The predicted environmental dynamical efficiency is finally iterated into the EBDS equations to predict TC intensity. The new scheme is evaluated for TC intensity prediction using both reanalysis data and the GFS prediction data. The intensity prediction by the new scheme shows better skill than the official prediction from the China Meteorological Administration (CMA) and those by other state-of-art statistical and dynamical forecast systems, except for the 72-h forecast. Particularly at the longer lead times of 96 h and 120 h, the new scheme has smaller forecast errors, with a more than 30% improvement over the official forecasts.

摘要: 热带气旋(TC)强度变化涉及复杂的物理过程,其准确预报是当前TC业务预报中一项重要的挑战。本文结合最新提出的TC增强理论(称为基于能量的动力系统模型EBDS)和长短时记忆(LSTM)神经网络,研制了一个针对西北太平洋(WNP)的TC强度预报新方法。在TC增强理论中,TC强度变化受到TC系统内部动力学和各种环境因子的共同控制,这些环境因子的作用可用环境动力效率表示。基于1982-2017年的TC最佳数据集和全球再分析数据,使用LSTM神经网络来预训练EBDS模型中的环境动力效率。进一步,基于2017-2021年美国国家环境预报中心全球预报系统(GFS)预报的环境因子,利用迁移学习和集成方法来微调模型。最后将预报的环境动力效率迭代到EBDS方程中来预报TC强度。使用再分析数据和GFS预报数据对该方法预报的TC强度进行评估。结果表明,除了72小时预报,新方法的强度预报较中国气象局(CMA)的官方预报和其它主流的统计及动力预报更优。特别是对于96小时和120小时预报,新方法的预报误差更小,相比官方预报有超过30%的改进。

    • Tropical cyclones (TCs) are severe weather systems. Strong winds, heavy rain, and storm surges brought by TCs can cause serious economic losses and pose great threats to human lives in coastal regions, and even inland areas. In addition to that of the track, accurate prediction of TC intensity is key to the preparedness and prevention for a potentially coming TC. However, in the past three decades, while significant improvements have been achieved in TC track prediction (Heming et al., 2019), the progress in TC intensity prediction has been slow (Wang and Wu, 2004; DeMaria et al., 2014; Cangialosi et al., 2020), including for TCs over the western North Pacific (WNP) (Yu et al., 2013; Dong et al., 2019; Li et al., 2020). Since TC disasters are closely related to the TC’s intensity, improving the accuracy of TC intensity forecasting is a cutting-edge research area.

      Over the years, various methods have been developed for TC intensity forecasting, including simple extrapolation (Dvorak, 1975; Velden et al., 1998), statistical methods (Jarvinen and Neumann, 1979; Knaff et al., 2003), dynamical methods (Bender et al., 2007; Heming, 2016), statistical–dynamical methods (DeMaria et al., 2005; Knaff et al, 2005; Chen et al., 2011), and dynamical system model approaches (DeMaria, 2009). Among these methods, the dynamical system model approach developed by DeMaria (2009) stands out as exceptionally promising owing to its simplicity and reliability in applications. This approach is based on a logistic growth equation (LGE) that uses Statistical Hurricane Intensity Prediction Scheme (SHIPS) predictors to determine key parameters in the LGE. It has successfully guided operational intensity forecasts of TCs over both the North Atlantic and eastern North Pacific at the U.S. National Hurricane Center (Cangialosi et al., 2020).

      In the new era of big data, there is growing interest in using deep learning as a powerful and state-of-the-art tool for TC intensity prediction (e.g., Zhou et al. 2021; Tong et al., 2022). Many studies have explored various neural network frameworks, such as artificial neural networks (Baik and Paek, 2000), recurrent neural networks (Pan et al., 2019), convolutional neural networks (Conv; Wang et al., 2022), long short-term memory (LSTM) recurrent neural networks (Yuan et al., 2021), and ConvLSTM (Tong et al., 2022). However, these neural network frameworks have several limitations, including short forecast lead times (e.g., 24 h; Cloud et al., 2019; Wu et al., 2022), unsatisfactory application results (Xu et al., 2021), coarse time granularity of predictions (Pan et al., 2019), and lack of interpretability at the application level, and only pursue high goodness of fit whilst neglecting the physical basis (Wu et al., 2022).

      To address these limitations, a new framework called physics-informed deep learning has been proposed (Raissi et al., 2017, 2019; Karniadakis et al., 2021). This framework aims to incorporate physical laws into data-driven deep-learning models, improving prediction performance and the interpretability (Hao et al., 2023). Recently, Zhou et al. (2021) developed a new intensity prediction method for TCs over the WNP, in conjunction with a machine-learning method and the dynamical system model (LGE) used in DeMaria (2009). Both hindcasts and forecasts demonstrated that this physics-informed machine-learning scheme has better forecasting skill than current operational schemes and official forecasts for the basin. However, because the LGE was originally designed to model population growth, it lacks a real physical basis for TC intensity change. In addition, the LightGBM (Light Gradient Boosting Machine) model used in Zhou et al. (2021) failed to capture the time-dependent characteristic of TC intensity change. It is expected that a physics-based dynamical system model, together with the use of a deep-learning method, could be more attractive and possess even better skill in predicting TC intensity.

      In recent years, Wang et al. (2021a, b) developed both energetically based and dynamically based time-dependent theories of TC intensification, which are simply called energetically based dynamical system (EBDS) model and dynamically based dynamical system (DBDS) model, respectively. Although they originated from two independent theories, by modifying one of the assumptions, the DBDS equation can become the same as the EBDS. As such, in this study, we focus on the EBDS model. The EBDS model can be considered as a physics-based dynamical system model for TC intensity change. Despite being derived under some critical assumptions, the model can reproduce TC intensification in idealized full-physics model simulations (Wang et al., 2021a, b) and the TC potential intensification rate in observations (Xu and Wang, 2022). Wang et al. (2021b) also proposed that the environmental dynamical effects on TC intensity change can be included in the EBDS model by introducing an environmental dynamical efficiency. This has been recently attempted by Xu et al. (2023) to quantify the dynamical effects of various environmental factors on TC intensity change over the WNP.

      In the EBDS model, if the environmental dynamical efficiency can be accurately predicted, the TC intensity can be predicted using simple time integration of the model equation. The key to the construction of the prediction scheme is thus to effectively predict the environmental dynamical efficiency, which is a function of various key environmental factors. Since current global high-resolution numerical weather prediction models are skillful in predicting large-scale oceanic and atmospheric circulations and thus the environmental dynamical factors affecting TC intensity change (e.g., Heming et al., 2019), it is feasible to develop a new prediction scheme for TC intensity based on this physics-based dynamical system model (e.g., EBDS).

      The main objective of this study is to construct a novel prediction scheme for TC intensity over the WNP based on a combination of the EBDS model for TC intensity change and the LSTM recurrent neural network for predicting the time-varying environmental dynamical efficiency in the EBDS model. The LSTM neural network is particularly suitable for predicting time series (Pan et al., 2019; Karniadakis et al., 2021; Yuan et al., 2021; Tong et al., 2022; Hao et al., 2023). With the predicted time-varying environmental dynamical efficiency in the EBDS model, we achieve TC intensity prediction by time integration of the time-dependent differential equation (EBDS model), which describes TC intensity change controlled by both internal dynamics and the environmental dynamical effects. We demonstrate that our newly constructed prediction scheme (named the EBDS_LSTM scheme) has promising skill in predicting TC intensity over the WNP. The rest of the paper is organized as follows: The data and methods used in this study are briefly described in section 2. How the EBDS_LSTM scheme was constructed is detailed in section 3. The performance of the new scheme is evaluated in section 4. Lastly, the main results are summarized and discussed in section 5.

    2.   Data and methods
    • In this study, the TC best-track dataset during 1982–2022 was obtained from the Shanghai Typhoon Institute (STI) of the China Meteorological Administration (CMA) (Ying et al., 2014), which includes TC intensity (maximum sustained surface wind speed) and central position (longitude and latitude) at 6-hour intervals. All data over land were excluded since the maximum potential intensity (MPI) in TC intensity theory is limited to the ocean. To conduct forecasts, the official real-time forecast data for TC tracks from the CMA during 2019–22 were obtained from the TC operational database at the STI. To demonstrate the performance of our new prediction scheme, we also utilized the CMA TC intensity forecasts in 2017, 2021 and 2022, and TC intensity forecasts in 2022 from two dynamical models—the Global Forecast System (GFS) developed by the National Centers for Environmental Prediction (NCEP) (NCEP-GFS), and the European Centre for Medium-Range Weather Forecasts (ECMWF) model—and from the dynamical–statistical model of the WNP TC intensity forecasting scheme (WIPS) developed at CMA-STI, all obtained from the TC operational database at the STI.

      The TC data were processed as follows: (1) TC samples with near-surface wind speed below 15 m s−1 throughout the TC lifetime were excluded from our analysis to avoid measurement errors for weak TCs; (2) when TCs passed over small islands, a decaying coefficient of 0.95 was applied to the MPI immediately prior to island-crossing to replace the missing MPI data over the islands as an empirical approximation of TC weakening; (3) when the TC center was located north of 40°N followed by rapid MPI changes, samples were excluded to eliminate TCs experiencing extratropical transition. In relation to the first point, a threshold of 18 m s−1 was also evaluated, which shows similar results to those based on a 15 m s−1 threshold. Given that the LSTM model requires a look-back window for initialization, and considering the advantage of a larger sample size for improving the fitting performance, we ultimately set 15 m s−1 as the threshold. After such processing, the total number of TCs from 1982 to 2017 was 1044, with the maximum duration of 492 hours for one TC. The numbers of the training, validation, and test samples at different forecast lead times from 6 to 168 h at 6-h intervals are shown in Fig. 1, in which the training samples account for more than 80% of the total samples and the validation samples are close to 20%.

      Figure 1.  Number of samples in the training (blue), validation (purple), and test (green) data at different forecast lead times with a 6-hour interval.

      The environmental fields during 1982–2017 were abstracted from the reanalysis dataset of the NCEP and National Center for Atmospheric Research (NCEP–NCAR), which has a horizontal resolution of 2.5° × 2.5° (Kalnay et al., 1996). Additionally, the real-time forecast data from NCEP-GFS for the period 2017–22 were utilized to identify the environmental fields for forecast application (Yang et al., 2006; Yang et al., 2020). Considering that the GFS was upgraded in 2019, only the forecast data from 2019 to 2021 were applied to the transfer learning as described below.

    • The simple time-dependent EBDS model for TC intensification rate developed by Wang et al. (2021a, b) can be given as

      where $ {V}_{\mathrm{m}\mathrm{a}\mathrm{x}} $ is the maximum near-surface wind speed (TC intensity) at a given time t, $ \partial V_{\mathrm{m}\mathrm{a}\mathrm{x}}/\partial\mathrm{\mathit{t}} $ is the rate of TC intensity change, and $ {V}_{\mathrm{m}\mathrm{p}\mathrm{i}} $ is the MPI. $ {C}_{D} $ is the surface drag coefficient, with a value of 2.2 × 10−3, $ h $ is the atmospheric boundary layer height, often taken to be 2000 m, and $ \alpha $ represents the reduction factor (about 0.8) of the depth-averaged boundary layer wind speed to the 10-m wind speed. These three parameters are assumed to be constants, as in Xu et al. (2023). In Eq. (1), $ {E}^{*} $ represents the internal dynamical efficiency of the TC system, which is a function of the TC relative intensity defined as the intensity normalized by the corresponding MPI given by $ {E}^{*}={({V}_{\mathrm{m}\mathrm{a}\mathrm{x}}/{V}_{\mathrm{m}\mathrm{p}\mathrm{i}})}^{3/2} $, E′ represents the environmental dynamical efficiency determined by various environmental factors 1, 2, 3,$ \cdots $, which can be given as $ {E}^{{'}}={E}_{1}^{{'}}{E}_{2}^{{'}}{E}_{3}^{{'}}\cdots $. By definition, $ 0 < {E}^{{'}}\leqslant 1 $. At present, there is no theoretical or empirical quantitative expression available for E′ or any of its components $ {E}_{i}^{{'}} $ ($ i=\mathrm{1,2},3,\cdots $).

      For a given TC in the best-track data, the rate of its intensity change on the left-hand side of Eq. (1) is estimated using the time series of $ {V}_{\mathrm{m}\mathrm{a}\mathrm{x}} $, and its corresponding MPI is estimated from the underlying SST using the empirical formula given by Baik and Paek (1998), which is more stable with reasonable accuracy. Therefore, with the three constants and track being given, the environmental dynamical efficiency E′ can be calculated from Eq. (1) as follows:

      where $ \partial {v}_{\mathrm{m}\mathrm{a}\mathrm{x}}/\partial t $ was calculated from the best-track data using a 12-h centered time differencing scheme. By calculating all selected TC samples, a large dataset for $ {E}^{{'}} $ can be obtained. Such a dataset can be used to train the prediction scheme for $ {E}^{{'}} $ using the LSTM recurrent neural network as described below.

      Figure 2 shows the 17-day EBDS model “predicted” intensity of Super Typhoon Noru (2017) using the 6-h forward differencing scheme (first order) and the corresponding CMA best-track intensity. As expected, the EBDS model reproduces nearly every aspect of the intensity variation of Noru (2017) throughout its entire lifespan, including its double-peak intensity evolution, rapid intensification, and rapid decay, although the peak intensity is slightly underestimated. Furthermore, after long-term and multiple iterations, TC intensity can self-adaptively converge near the best-track intensity without significant error accumulation, demonstrating that the model can be used for real-time forecasting. The key to accurate real-time forecasts lies in how the environmental dynamical efficiency E′ is predicted.

      Figure 2.  The 17-day EBDS model “predicted” intensity of Super Typhoon Noru (2017) and the corresponding CMA best-track intensity.

    • The LSTM is used to predict E′ based on sequential data of various environmental factors. Recurrent neural networks (RNNs) are neural network structures capable of addressing sequential prediction problems (Elman, 1990). They establish connections between data in a temporal sequence. LSTM, introduced by Hochreiter and Schmidhuber (1997), is a special type of RNN that can overcome the limitations of an RNN in capturing long-term dependencies and mitigating the vanishing gradient problem. This is achieved by incorporating gate structures that enable selective memory, forgetfulness, input, and output across different units. The input data for the LSTM model includes three dimensions: the first dimension represents the number of samples, the second dimension is the variable sample time steps, and the third dimension represents the number of factors in the sample. The output can be time series of predicted values of arbitrary length.

      The depth of LSTM structures can be increased through layer stacking, which helps capture the complex patterns underlying long-term sequences more accurately. Since TC formation and development are influenced by multiple internal and environmental factors, making it a complex system with certain regularity, LSTM networks, capable of retaining long-term information, are suitable for fitting such types of data. Note that LSTM networks are prone to gradient vanishing or exploding problems, which can impact the training process. Thus, we normalized data before training LSTM deep learning models using $ {{x}}'_{i}{{}}={({x}}_{i}-{{x}}_{\mathrm{m}\mathrm{i}\mathrm{n}})/({{x}}_{\mathrm{m}\mathrm{a}\mathrm{x}}-{{x}}_{\mathrm{m}\mathrm{i}\mathrm{n}}) $, where $ {x}'_{i } $ is the normalized x at time i, and xmin and xmax are the minimum and maximum of time series x, respectively.

    • We used the same predictors as those used in WIPS developed at CMA-STI (Chen et al., 2011; see Table 1). WIPS is a dynamical–statistical scheme and has been operational for a decade and proven to be skillful among CMA’s operational intensity forecast schemes (Chen et al., 2019).

      Predictor Units Description
      VWS m s−1 Averaged vertical wind shear between 200 and 850 hPa within a radius of 5° of the TC center
      AU m s−1 Averaged zonal wind at 200 hPa within a radius of 5° of the TC center
      AV85 s−1 Averaged absolute vorticity at 850 hPa within an annulus area between radii of 5° and 10° of the TC center
      DIV85 s−1 Averaged divergence at 850 hPa within an annulus area between radii of 5° and 10° of the TC center
      DIV20 s−1 Averaged divergence at 200 hPa within an annulus area between radii of 5° and 10° of the TC center
      H50 gpm 500-hPa geopotential height at the TC center
      VOR85_lon ° Longitude of the greatest vorticity at 850 hPa in the range of 2° (4°) of the TC center at 0−24 h (> 24 h) forecasts
      VOR85_lat ° Latitude of the greatest vorticity at 850 hPa in the range of 2° (4°) of the TC center at 0−24 h (> 24 h) forecasts
      RH8570 % Averaged relative humidity at 850–700 hPa within an annulus area between radii of 5° and 10° of the TC center
      RH5030 % Averaged relative humidity at 500–300 hPa within an annulus area between radii of 5° and 10° of the TC center
      MPI m s−1 Maximum potential intensity estimated using the SST averaged within a radius of 300 km from the TC center

      Table 1.  Predictors and corresponding descriptions.

      The transfer learning method (Pan and Yang, 2010; Bozinovski, 2020) was employed to address the data availability limitation in training deep learning neural networks. In this study, we primarily utilized the NCEP/NCAR reanalysis data to pre-train the model. After obtaining a well-performing pretrained model, we transferred it to the GFS real-time forecast data, which has a smaller data volume, for further training of the model (as shown in Fig. 3). The model training was performed using the Keras API under the Tensor Flow framework.

      Figure 3.  Transfer learning structure in the EBDS_LSTM model.

      The commonly used regularization technique called “dropout” was employed to mitigate overfitting in the training process of deep-learning models (Srivastava et al., 2014). Since the results obtained from each training iteration may differ slightly due to factors such as different initial parameters, data perturbations, or the randomness of optimization algorithms, we trained 20 models and calculated their ensemble mean for final forecasts. The detailed framework is illustrated in Fig. 4.

      Figure 4.  Schematic of the ensemble forecast in the EBDS_LSTM model. X(n) indicates predictors as listed in Table 1 and Y(1)m indicates TC intensity, where m is the number of ensemble members (20).

    3.   Construction of the EBDS_LSTM model
    • Based on the selected predictors and the EBDS model in Eq. (1), a new TC intensity prediction scheme over the WNP was constructed using the TC best-track data and NCEP–NCAR reanalysis data during 1982–2016, and the CMA TC forecast track data and NCEP-GFS forecast data during 2019–21. Figure 5 summarizes the workflow in building the new TC intensity prediction scheme, termed the EBDS_LSTM scheme, which consists of two main parts: the model pretraining and transfer learning. The workflow for each part can be broadly divided into data preprocessing, hyperparameter optimization, model training, and validation and analysis of model performance. The data preprocessing was described in section 2. The modeling approach mainly involves stacking two LSTM layers, each with 128 neurons, followed by a fully connected layer with a single neuron as the output, representing the predictand at the required forecast time. Dropout layers are included between the layers and the dropout rates are separately set as a hyperparameter. The activation function used in the final fully connected layer is set as the second hyperparameter. Whether the former sequence of the predictand itself is included as a predictor (i.e., whether it is a rolling forecast) is controlled by a switch, which serves as the third hyperparameter. The model’s prediction target is set as a seven-day (five-day) forecast from 6 h to 168 h (120 h) at 6-h intervals for the reanalysis data (GFS forecast data), and the overall error in the test set is used to measure the model’s fitting and generalization ability.

      Figure 5.  Schematic of the EBDS_LSTM forecast scheme.

      It is found that linear activation (tanh) functions are difficult to fit and result in larger prediction errors, while the rectified linear activation function (relu) might suffer from gradient vanishing that prevents proper model fitting. In contrast, the activation function elu (Exponential Linear Unit) shows better stability and smaller errors. Finally, two dropout rates of 0.4, the activation function elu, and the Adam optimizer with an initial learning rate of 0.001, which helps the model to automatically adjust the learning rate, were determined in the model. As mentioned above, in the original model based on the reanalysis data, the training dataset during 1982–2016 was used to train the relationship between predictors and $E' $, and the testing dataset in 2017 was built to predict $E' $ from 6 to 168 h at 6-h intervals. In the target model based on the GFS forecast data, the training dataset during 1982–2017 using the reanalysis data was adopted to pretrain the relationship between predictors and E', the GFS forecast dataset during 2019–21 was used to build the transfer learning model, and the target dataset during 2021–22 was built to predict E' from 6 to 120 h at 6-h intervals. To conduct independent forecasts, we excluded the cases from the training set when performing the experimental forecasts for those cases in 2021. Note that using these two sets of models, we can predict E' at each forecast time. The internal dynamical efficiency E* can be determined using Vmax and MPI at the initial time t0. Subsequently, we can predict Vmax for the subsequent moment t1, using the EBDS equation with a forward-time-differencing scheme. Based on the Vmax and MPI at t1, we are able to iteratively calculate E* for t1, and this procedure is repeated for subsequent time points. Through this iterative process, we achieve the prediction of the corresponding E*, E', and Vmax across various forecast times.

    4.   Performance of the new scheme
    • The performance of the new scheme is evaluated based on independent forecasts for all TCs in 2017 using the reanalysis data and real-time forecasts for TC cases in 2021 and all TC cases in 2017 and 2022 based on the CMA TC track forecasts and the NCEP-GFS real-time forecast data. Note that the real-time GFS forecast data from 2017 was primarily used to determine whether transfer learning was necessary.

    • We first applied the newly developed scheme to the test set of the NCEP–NCAR reanalysis data in 2017. Figure 6a shows the root-mean-square errors (RMSEs) at different forecast times and the number of forecast samples of 7-day intensity forecasts from the EBDS_LSTM scheme and the 5-day forecasts from the CMA at 24-h intervals for all independent TC cases in 2017. In general, the RMSEs from the CMA forecasts increase with increasing forecast time. Interestingly, the RMSEs from our new model increase from 6 h to 72 h, but decrease rapidly afterwards. The performance of the EBDS_LSTM model is comparable to CMA within the forecast lead time of 72 h, while there are improvements of approximately 30% and 45.6% in the forecast accuracy at 96 h and 120 h, respectively. The good performance in medium and long-term forecasts demonstrates the superiority of the EBDS_LSTM scheme in capturing the major characteristics of TC intensity changes with the physical principles informed.

      Figure 6.  (a) Averaged RMSEs (m s−1) of the 7-day intensity forecasts from the EBDS_LSTM scheme (red bars) and averaged RMSEs of the 5-day forecasts from the CMA (black bars) at 24-h intervals for all independent TC cases in 2017 based on the NCEP–NCAR reanalysis data. (b) Averaged RMSEs (m s−1) of the 5-day real-time intensity forecasts at 24-h intervals from the EBDS_LSTM scheme and the CMA for all independent cases in 2017. (c) Averaged RMSEs (m s−1) of the 5-day real-time intensity forecasts at 24-h intervals for all independent cases in 2022 from the EBDS_LSTM scheme (red), CMA (black), NCEP-GFS (green), ECMWF (light green), and WIPS (orange). The EBDS_LSTM model was trained on NCEP–NCAR reanalysis from 1982 to 2016. For (a), predictions were made based on reanalysis; for (b), on GFS forecast data for the year 2017 without transfer learning; and for (c), on GFS forecast data for the year 2022, using transfer learning with GFS data during 2019–21.

      The two most representative TCs, Noru (2017) and Lan (2017), were selected for case verification from all TC data in the test set of 2017 based on the following reasons: (1) they reached the highest maximum intensity among all TCs in the test set; (2) both had a duration of more than one week; (3) the entire lifespan of their formation and development was in accordance with cognition; (4) TC Noru (2017) underwent two rapid intensification stages, while TC Lan (2017) experienced rapid intensification and rapid weakening within a short period of time, representing different types of TC intensity changes. Figures 7ad present the duration, track, and intensity of the two TCs. Noru formed as a tropical storm to the northeast of the Northern Mariana Islands on 20 July and reached its first peak intensity of 40 m s−1 on 24 July. After a brief weakening, it re-intensified and became a super typhoon on 30 July, with a maximum intensity of 52 m s−1, and then weakened back to a tropical storm before reaching Japan. Lan formed on 15 October in the southwestern NWP. It rapidly intensified into a super typhoon on 20 October, reaching its maximum intensity of 58 m s−1 a day later, and then rapidly weakened to a tropical storm and later dissipated within only two days.

      Figure 7.  (a, b) Observed tracks and intensities of (a) Noru and (b) Lan in 2017. (c, d) The 7-day forecasts of the intensity for (c) Noru and (d) Lan at different forecast times with 6-h intervals based on NCEP–NCAR reanalysis data (dashed lines) and the corresponding CMA best-track intensity (red solid line). (e, f) Diagnosed (red) and predicted (black) dynamical efficiency factor E′ for (e) Noru and (f) Lan.

      Figures 7c and d show the 7-day intensity forecasts for the two TCs from the EBDS_LSTM scheme based on the NCEP–NCAR reanalysis data, along with the best-track intensity. As expected, the EBDS_LSTM scheme reasonably captures the characteristics of intensity changes in different stages of the corresponding TCs. It not only accurately predicts the initial stage of intensification but also captures the rapid intensification and rapid weakening stages of both TCs. The EBDS_LSTM scheme also captures the peak intensity of the two TCs reasonably. Furthermore, we also calculated the RMSEs at different forecast times for the two cases and compared with the CMA forecasts (Table 2). The forecast errors from the CMA generally increase with increasing forecast time, while that from the EBDS_LSTM scheme remain small and even decrease at longer forecast times. Compared to the CMA forecasts, our EBDS_LSTM model improves by approximately 15.85%–52.6% for forecast times before 72 h. As for those beyond 72 h, our new scheme improves by about 31.1%–52.3%. This suggests that the accumulated errors may neutralize and lead to comparatively small errors in long-term forecasts because it is physics-based, and because of the use of the LSTM algorithm. In particular, the EBDS model yields promising results in long-term forecasts, which might be benefited by the constraints on physical equations, while it also achieves satisfactory short-term forecasts due to the powerful fitting and optimization capabilities of deep learning.

      Forecast time (h) RMSE (m s−1) in Noru RMSE (m s−1) in Lan
      EBDS_LSTM CMA EBDS_LSTM CMA
      24 4.23 7.32 3.82 5.40
      48 4.48 8.84 4.16 5.36
      72 4.36 9.20 4.40 5.23
      96 4.29 9.00 4.85 7.04
      120 4.31 8.14 5.36 9.51
      144 4.45 - 3.64 -
      168 4.61 - 2.83 -

      Table 2.  RMSEs (m s−1) of intensity forecasts for Noru and Lan in 2017 at 24, 48, 72, 96, 120, 144, and 168 h from the EBDS_LSTM scheme and the CMA 5-day forecasts. Smaller RMSEs between the two methods are shown in boldface.

      Figures 7e and f illustrate the diagnosed and predicted environmental dynamical efficiency (E') for the two TCs. In both cases, the diagnosed E' exhibits a similar evolution to the TC intensity but with a lead of 6 hours. The E' for Noru exhibits a prominent bimodal structure with two growth and decay stages, albeit with some jagged fluctuations, while Lan experiences one peak with some relatively large fluctuations. This suggests that the effect of E' is vital. For the predicted E', the model generally reproduces well the change in the diagnosed E', although some underestimations are visible when the TC is near its peak intensity. Note that the predicted E' and TC intensity are much smoother than the diagnosed E' and best-track intensity. This can be attributed partly to the noise in observational data, and partly to the model’s adjustments during training, which aim to minimize overfitting by targeting more generalized patterns.

    • Prior to utilizing GFS forecast data in our final real-time forecast model, we assessed the necessity of transfer learning. Here, the EBDS_LSTM was trained on reanalysis data from 1982 to 2016, and then made forecasts directly using GFS forecast data for 2017, with results shown in Fig. 6b. As anticipated, the forecast error at each forecast time exceeded 15 m s−1, significantly higher than that from the CMA forecast. It strongly suggests the necessity and importance of implementing transfer learning for tuning the model.

      Furthermore, based on the pretrained EBDS_LSTM scheme using the NCEP–NCAR reanalysis data, the scheme was further trained using the GFS forecast data from 2019 to 2021, except for two testing cases of Typhoon Surigae (202102) and Typhoon Nyatoh (202121), which experienced rapid intensification and were selected for real-time forecast verification. Figures 8a and b show the observed tracks and intensities of Surigae and Nyatoh. Surigae formed as a tropical storm over the WNP on 13 April and rapidly intensified within a few days to become a super typhoon with a peak intensity of 68 m s−1 on 17 April. It approached the Philippines and turned northeastward over the ocean, and then gradually weakened. Nyatoh formed on 30 November in the southwestern WNP, near the Philippines and north of Indonesia. After a few days of slow intensification, it rapidly intensified into a super typhoon with a maximum intensity of 55 m s−1 on 3 December. It then quickly weakened to a tropical storm and dissipated in a day.

      Figure 8.  (a, b) Observed tracks and intensities for (a) Surigae and (b) Nyatoh in 2021. (c–f) The 5-day forecasts of the intensity for (c, e) Surigae and (d, f) Nyatoh at different forecast times with 6-h intervals from (c, d) the EBDS_LSTM forecasts (dashed lines) and (e, f) the CMA forecasts (dashed lines). In (c–f), red solid lines indicate the corresponding CMA best-track intensity.

      Figures 8c and d show the intensity forecasts from the EBDS_LSTM scheme based on predictors obtained from the GFS real-time forecast data and the real-time track forecasts from the CMA. The EBDS_LSTM scheme captures well the characteristics of intensity changes in almost all stages of the two TCs, including the magnitude of intensification and weakening rates. It accurately captures both the initial intensification stage and the rapid intensification and rapid weakening stages of the two TCs. Compared with that from the CMA, forecasts from the EBDS_LSTM scheme show better transitions in the TC development stages and overall better skill in predicting the peak intensity. We also calculated the corresponding RMSEs of the forecasts for the two cases (see Table 3). The RMSEs of the CMA forecasts exhibit an increase with increasing forecast lead time, reaching a maximum at 72 h, followed by a slight decrease. In contrast, the RMSEs of the forecasts from the EBDS_LSTM scheme are small at all lead times, showing no significant increase with increasing forecast time. This indicates that the EBDS_LSTM scheme performs better in both short-term and long-term forecasts for both TCs than the CMA forecasts.

      Forecast Time (h) RMSE (m s−1) in Surigae RMSE (m s−1) in Nyatoh
      EBDS_LSTM CMA EBDS_LSTM CMA
      24 4.23 5.53 3.82 6.60
      48 4.48 8.64 4.16 11.21
      72 4.36 9.73 4.40 13.88
      96 4.29 9.12 4.85
      120 4.31 7.68 5.36

      Table 3.  RMSEs (m s−1) of intensity forecasts for Surigae and Nyatoh in 2021 at 24, 48, 72, 96 and 120 h forecasts from the EBDS_LSTM method and the CMA 5-day forecasts. Smaller RMSEs between the two methods are shown in boldface.

      To demonstrate the overall good performance in the forecast mode of our developed EBDS_LSTM scheme, we also conducted retrospective forecasts for all 23 TCs in 2022 (excluding Typhoon 2211 because of missing data). Figure 6c shows the RMSEs of 5-day intensity forecasts from the EBDS_LSTM scheme and a comparison with those from several main forecast agencies. One notable feature is that, except for the 72-h forecast, the EBDS_LSTM scheme performs better at all other forecast times. Particularly at the longer lead times of 96 h and 120 h, the EBDS_LSTM scheme has smaller forecast errors, with a 10% and 25% improvement over the NCEP-GFS forecasts and a 51% and 55% improvement over the ECMWF forecasts, respectively. Additionally, it is worth mentioning that despite utilizing the same predictors as WIPS, the EBDS_LSTM scheme exhibits a significant improvement. This suggests that the EBDS_LSTM model has the potential to contribute to improving TC intensity forecasts over the WNP.

    5.   Summary and discussion
    • We introduced a new prediction scheme for TC intensity over the WNP based on the cutting-edge EBDS model, together with the LSTM framework of deep learning. The latter is used to predict the environmental dynamical efficiency in the EBDS model. Since the LSTM algorithm exhibits an outstanding performance for time series predictions, it is suitable for use in the prediction of TC intensity. We also used other approaches in constructing the new prediction scheme, including optimization and tuning of model hyperparameters, transfer learning, and the ensemble method.

      Independent forecasts for all TCs in 2017 based on the NCEP–NCAR reanalysis data show that the EBDS_LSTM scheme has better skill in predicting TC intensity over the WNP than the CMA’s official operational forecasts, which is one of the best forecasts for the basin. The good performance of the newly developed EBDS_LSTM scheme for “real-time” forecasts was demonstrated with two TC cases in 2021 and all TC cases in 2022. The new scheme shows better prediction skill than the official prediction from the CMA and other state-of-art statistical and dynamical forecast systems, particularly for long lead-time forecasts of 96 h and 120 h.

      In comparison with traditional dynamical system model approaches, such as the LGE model, the EBDS_LSTM scheme is physics-based and shows superior forecasting capabilities, significantly reducing prediction errors, and offering a new independent TC intensity prediction scheme for the WNP. We also compare the current model to other popular deep-learning methods, including early attempts with artificial neural networks and more recent advancements using RNNs, multi-module-task models, LSTM, and Pangu-Weather (Pan et al., 2019; Baik and Paek, 2000; Yuan et al., 2021; Wang et al., 2022; Bi et al., 2023). These studies illustrate incremental improvements in TC intensity prediction, and generally show similar forecasting skill within 48 h to those in our new model. However, these methods highlight limitations such as forecast duration and a disregard for the underlying physical processes. As such, our model shows better skill for long lead-time forecasts (96 h and 120 h), aligning with the physical principles of TC intensity change.

      We would like to point out that, although the scope of this study has focused on prediction of TC intensity over the WNP, the new scheme can be potentially applied to other TC basins, such as the North Atlantic and eastern North Pacific, because the EBDS model is physics-based, which is independent of the basin. In addition, through sensitivity experiments by varying different environmental variables, the EBDS_LSTM scheme can also be used to identify key factors related to the rapid intensification and rapid weakening phases of TCs. This will potentially not only progressively enhance the forecasting skill, but also help explore unknown physical mechanisms from a theoretical perspective, bridging the gap between the black box of deep learning and practical applications of weather forecasting in a more transparent way.

      Acknowledgements. The authors appreciate Yinglong XU and Shijin YUAN for their constructive comments. This work was supported by the National Key R&D Program of China (Grant No. 2017YFC1501604) and the National Natural Science Foundation of China (Grant Nos. 41875114 and 41875057). The CMA best-track TC dataset was downloaded from http://tcdata.typhoon.org.cn/. The official real-time forecast data of the CMA and the GFS forecast fields were derived from the TC operational database at STI. The NCEP–NCAR reanalysis data were downloaded from https://www.esrl.noaa.gov/psd/data/gridded/data.ncep.reanalysis.html.

Reference

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return