Advanced Search
Article Contents

Correcting Climate Model Sea Surface Temperature Simulations with Generative Adversarial Networks: Climatology, Interannual Variability, and Extremes


doi:  10.1007/s00376-024-3288-6

  • Climate models are vital for understanding and projecting global climate change and its associated impacts. However, these models suffer from biases that limit their accuracy in historical simulations and the trustworthiness of future projections. Addressing these challenges requires addressing internal variability, hindering the direct alignment between model simulations and observations, and thwarting conventional supervised learning methods. Here, we employ an unsupervised Cycle-consistent Generative Adversarial Network (CycleGAN), to correct daily Sea Surface Temperature (SST) simulations from the Community Earth System Model 2 (CESM2). Our results reveal that the CycleGAN not only corrects climatological biases but also improves the simulation of major dynamic modes including the El Niño-Southern Oscillation (ENSO) and the Indian Ocean Dipole mode, as well as SST extremes. Notably, it substantially corrects climatological SST biases, decreasing the globally averaged Root-Mean-Square Error (RMSE) by 58%. Intriguingly, the CycleGAN effectively addresses the well-known excessive westward bias in ENSO SST anomalies, a common issue in climate models that traditional methods, like quantile mapping, struggle to rectify. Additionally, it substantially improves the simulation of SST extremes, raising the pattern correlation coefficient (PCC) from 0.56 to 0.88 and lowering the RMSE from 0.5 to 0.32. This enhancement is attributed to better representations of interannual, intraseasonal, and synoptic scales variabilities. Our study offers a novel approach to correct global SST simulations and underscores its effectiveness across different time scales and primary dynamical modes.
    摘要: 气候模式对于理解和预估全球气候变化及其影响至关重要。然而,当前气候模式仍存在不少模拟偏差,这限制了其历史模拟的准确性和未来预估的可信度。由于内部变率的存在,模式模拟结果与观测数据无法完全一一对应,这限制了传统基于监督学习的校正方法的应用。本文基于无监督的循环一致生成对抗网络(CycleGAN)来校正通用地球系统模式(CESM2)的日海表温度(SST)模拟结果。研究结果表明,CycleGAN 不仅能校正气候态偏差,还能显著改进对厄尔尼诺-南方涛动(ENSO)、印度洋偶极子(IOD)等动力模态以及海温极值的模拟能力。CycleGAN将全球平均均方根误差(RMSE)降低了 58%,同时显著减弱了ENSO SST 异常过度西伸的偏差,该偏差是西北太平洋异常反气旋乃至东亚夏季风模拟最主要的误差来源之一,而传统方法(如quantile map)很难纠正这一模式共同偏差。此外,它还改进了对海温极值的模拟,将模态相关系数(PCC)从 0.56 提高到 0.88,将RMSE从 0.5 降低到 0.32。这种改进可能与年际、季节内和天气尺度过程的改进有关。本研究提供了一种校正全球 SST 模拟的新方法,并证明了该方法对于提升动力模态模拟能力的有效性。
  • 加载中
  • Figure 1.  (a) Schematic of the CycleGAN. (b) The details of ResNet, which has nine residual blocks. (c) The details of the first residual block, and (d) other blocks.

    Figure 2.  Qualitative comparison of the intermittency in daily ST on the same date (3 November 2004), from the (a) observation, (b) CESM2 model, and (c) CycleGAN-based post-processing. White contour lines (with an interval of 1 K) denote the monthly SSTAs in November 2004.

    Figure 3.  The disparities between climatological annual mean SSTs in (a, c, e, g) CESM2 simulations, (b, d, f, h) GAN-corrected SSTs, and (a, b) HadISST, (c, d) ERSST, (e, f) NCEP, (g, h) OISST.

    Figure 4.  The disparities between climatological annual mean SSTs in (a) CESM2 simulations, (b) QM-corrected SST, and NCEP.

    Figure 5.  ENSO-related SSTAs in (a) observation, (b) CESM2, (c) QM, and (d) CycleGAN. Stippling indicates where the regressions are significant at the 95% confidence level, based on a Student’s t-test. The hatched areas represent significant differences in El Niño composite SSTs within the CycleGAN-corrected fields compared to CESM2. The black contour line denotes +0.2 K in observations.

    Figure 6.  (Left) The standard deviation of SST and (b) the IOD-related SSTA during SON in (a, e) observation, (b, f) CESM2, (c, g) QM, and (d, h) CycleGAN, respectively. Stippling indicates the regressions are significant at the 95% confidence level, based on a Student’s t-test. The hatched areas represent significant differences in positive IOD composite SSTs within the CycleGAN compared to CESM2.

    Figure 7.  Same as in Fig. 5, but for the 95th percentile of SSTAs.

    Figure 8.  PDFs of average tropical SSTAs for (a) NCEP and CESM2, and (b) NCEP and CycleGAN. Shading indicates areas where the SSTA is above the 90th percentile.

    Figure 9.  Same as in Fig. 8, but for the Northeast Pacific SSTAs.

    Figure 10.  Standard deviations of (left) interannual DJF SSTAs and (right) SSTAs filtered by a band of 2–90 days in (a, d) NCEP, (b, e) CESM2, and (c, f) CycleGAN, respectively.

  • Arjovsky, M., S. Chintala, and L. Bottou, 2017: Wasserstein generative adversarial networks. International Conference on Machine Learning, PMLR, 214−223.
    Bai, K. X., N. B. Chang, H. J. Yu, and W. Gao, 2016: Statistical bias correction for creating coherent total ozone record from OMI and OMPS observations. Remote Sensing of Environment, 182, 150−168, https://doi.org/10.1016/j.rse.2016.05.007.
    Capotondi, A., C. Deser, A. S. Phillips, Y. Okumura, and S. M. Larson, 2020: ENSO and pacific decadal variability in the community earth system model version 2. Journal of Advances in Modeling Earth Systems, 12 (12), e2019MS002022, https://doi.org/10.1029/2019MS002022.
    Chen, J., F. P. Brissette, and R. Leconte, 2011: Uncertainty of downscaling method in quantifying the impact of climate change on hydrology. J. Hydrol., 401 (3−4), 190−202, https://doi.org/10.1016/j.jhydrol.2011.02.020.
    Chen, J., F. P. Brissette, D. Chaumont, and M. Braun, 2013: Performance and uncertainty evaluation of empirical downscaling methods in quantifying the climate change impacts on hydrology over two North American river basins. J. Hydrol., 479, 200−214, https://doi.org/10.1016/j.jhydrol.2012.11.062.
    Chen, K., G. G. Gawarkiewicz, S. J. Lentz, and J. M. Bane, 2014: Diagnosing the warming of the Northeastern U.S. Coastal Ocean in 2012: A linkage between the atmospheric jet stream variability and ocean response. J. Geophys. Res.: Oceans, 119 (1), 218−227, https://doi.org/10.1002/2013JC009393.
    Danabasoglu, G., and Coauthors, 2020: The community earth system model version 2 (CESM2). Journal of Advances in Modeling Earth Systems, 12 (2), e2019MS001916, https://doi.org/10.1029/2019MS001916.
    Deser, C., L. Terray, and A. S. Phillips, 2016: Forced and Internal components of winter air temperature trends over North America during the past 50 years: Mechanisms and implications. J. Climate, 29 (6), 2237−2258, https://doi.org/10.1175/JCLI-D-15-0304.1 .
    Deser, C., A. Phillips, V. Bourdette, and H. Y. Teng, 2012: Uncertainty in climate change projections: The role of internal variability. Climate Dyn., 38 (3−4), 527−546, https://doi.org/10.1007/s00382-010-0977-x.
    Doi, T., C. X. Yuan, S. K. Behera, and T. Yamagata, 2015: Predictability of the California Niño/Niña. J. Climate, 28 (18), 7237−7249, https://doi.org/10.1175/JCLI-D-15-0112.1
    Echevin, V., F. Colas, D. Espinoza-Morriberon, L. Vasquez, T. Anculle, and D. Gutierrez, 2018: Forcings and evolution of the 2017 coastal El Niño off Northern Peru and Ecuador. Frontiers in Marine Science, 5, 367, https://doi.org/10.3389/fmars.2018.00367.
    Eyring, V., S. Bony, G. A. Meehl, C. A. Senior, B. Stevens, R. J. Stouffer, and K. E. Taylor, 2016: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization. Geoscientific Model Development, 9 (5), 1937−1958, https://doi.org/10.5194/gmd-9-1937-2016.
    Goodfellow, I. J., J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, 2014: Generative adversarial nets. Proc. 27th International Conf. on Neural Information Processing Systems, Montreal, Canada, MIT Press, 2672−2680.
    Hawkins, E., and R. Sutton, 2011: The potential to narrow uncertainty in projections of regional precipitation change. Climate Dyn., 37 (1−2), 407−418, https://doi.org/10.1007/s00382-010-0810-6.
    Hess, P., S. Lange, C. Schötz, and N. Boers, 2023: Deep learning for bias-correcting CMIP6-class Earth system models. Earth's Future, 11, e2023EF004002, https://doi.org/10.102 9/2023EF004002.
    Hess, P., M. Drüke, S. Petri, F. M. Strnad, and N. Boers, 2022: Physically constrained generative adversarial networks for improving precipitation fields from Earth system models. Nature Machine Intelligence, 4 (10), 828−839, https://doi.org/10.1038/s42256-022-00540-1.
    Hoffman, J., E. Tzeng, T. Park, J. Y. Zhu, P. Isola, K. Saenko, A. A. Efros, and T. Darrell, 2017: CyCADA: Cycle-consistent adversarial domain adaptation. arXiv:1711.03213, https://doi.org/10.48550/arXiv.1711.03213.
    Holbrook, N. J., and Coauthors, 2019: A global assessment of marine heatwaves and their drivers. Nature Communications, 10, 2624, https://doi.org/10.1038/s41467-019-102 06-z.
    Hu, K. M., G. Huang, and S.-P. Xie, 2019: Assessing the internal variability in multi-decadal trends of summer surface air temperature over East Asia with a large ensemble of GCM simulations. Climate Dyn., 52, 6229−6242, https://doi.org/10.1007/s00382-018-4503-x.
    Huang, B. Y., C. Y. Liu, V. Banzon, E. Freeman, G. Graham, B. Hankins, T. Smith, and H.-M. Zhang, 2021: Improvements of the Daily Optimum Interpolation Sea Surface Temperature (DOISST) version 2.1. J. Climate, 34 (8), 2923−2939, https://doi.org/10.1175/JCLI-D-20-0166.1.
    Huang, B., and Coauthors, 2017: Extended Reconstructed Sea Surface Temperature, Version 5 (ERSSTv5): Upgrades, validations, and intercomparisons. J. Climate, 30, 8179−8205, https://doi.org/10.1175/JCLI-D-16-0836.1.
    Huang, P., and J. Ying, 2015: A multimodel ensemble pattern regression method to correct the tropical pacific SST change patterns under global warming. J. Climate, 28 (12), 4706−4723, https://doi.org/10.1175/JCLI-D-14-00833.1.
    Jakob Themeßl, M., A. Gobiet, and A. Leuprecht, 2011: Empirical-statistical downscaling and error correction of daily precipitation from regional climate models. International Journal of Climatology, 31 (10), 1530−1544, https://doi.org/10.1002/joc.2168.
    Jiang, W. P., P. Huang, G. Huang, and J. Ying, 2021: Origins of the excessive westward extension of ENSO SST simulated in CMIP5 and CMIP6 Models. J. Climate, 34 (8), 2839−2851, https://doi.org/10.1175/JCLI-D-20-0551.1.
    Jiang, W. P., G. Huang, P. Huang, R. G. Wu, K. M. Hu, and W. Chen, 2019: Northwest Pacific anticyclonic anomalies during Post–El Niño summers determined by the pace of El Niño decay. J. Climate, 32 (12), 3487−3503, https://doi.org/10.1175/JCLI-D-18-0793.1.
    Johnson, N. C., and Y. Kosaka, 2016: The impact of eastern equatorial Pacific convection on the diversity of boreal winter El Niño teleconnection patterns. Climate Dyn., 47 (12), 3737−3765, https://doi.org/10.1007/s00382-016-3039-1.
    Kanamitsu, M., W. Ebisuzaki, J. Woollen, S.-K.Yang, J. J. Hnilo, M. Fiorino, and G. L. Potter, 2002: NCEP–DOE AMIP-II Reanalysis (R-2). Bulletin of the American Meteorological Society, 83 (11), 1631−1644, https://doi.org/10.1175/bams-83-11-1631(2002)083<1631:nar>2.3.co;2
    Li, D. L., J. L. Feng, Z. H. Xu, B. S. Yin, H. Y. Shi, and J. F. Qi, 2019: Statistical bias correction for simulated wind speeds over CORDEX-East Asia. Earth and Space Science, 6 (2), 200−211, https://doi.org/10.1029/2018EA000493.
    Liu, F., B. Wang, Y. Ouyang, H.Wang, S. Qiao, G. Chen, and W. Dong, 2022a: Intraseasonal variability of global land monsoon precipitation and its recent trend. npj Climate and Atmosphe ric Sciences, 5, 30, https://doi.org/10.1038/s41612-022-00253-7.
    Liu, F., and Coauthors, 2022b: Tropical volcanism enhanced the East Asian summer monsoon during the last millennium. Nature Communications, 13, 3429, https://doi.org/10.1038/s41467-022-31108-7.
    Li, G., and S. P. Xie, 2012: Origins of tropical-wide SST biases in CMIP multi-model ensembles. Geophys. Res. Lett., 39, L22703, https://doi.org/10.1029/2012GL053777.
    Li, G., and S. P. Xie, 2014: Tropical biases in CMIP5 multimodel ensemble: The excessive equatorial Pacific cold tongue and double ITCZ problems. J. Climate, 27 (4), 1765−1780, https://doi.org/10.1175/JCLI-D-13-00337.1.
    Liu, L., S.-P. Xie, X.-T. Zheng, T. Li, Y. Du, G. Huang, and W.-D. Yu, 2014: Indian Ocean variability in the CMIP5 multi-model ensemble: The zonal dipole mode. Climate Dyn., 43 (5−6), 1715−1730, https://doi.org/10.1007/s00382-013-2000-9.
    Liu, Z. C., W. Zhou, and Y. Yuan, 2023: 3D DBSCAN detection and parameter sensitivity of the 2022 Yangtze river summertime heatwave and drought. Atmos. Ocean. Sci. Lett., 16 (4), 100324, https://doi.org/10.1016/j.aosl.2022.100324.
    Liu, Z. C., W. Zhou, R. H. Zhang, Y. Zhang, and Y. Wang, 2022: Global-scale interpretable drought reconstruction utilizing anomalies of atmospheric dynamics. Journal of Hydrometeorology, 23 (9), 1507−1524, https://doi.org/10.1175/JHM-D-22-0006.1.
    Oliver, E. C. J., J. A. Benthuysen, N. L. Bindoff, A. J. Hobday, N. J. Holbrook, C. N. Mundy, and S. E. Perkins-Kirkpatrick, 2017: The unprecedented 2015/16 Tasman Sea marine heatwave. Nature Communications, 8, 16101, https://doi.org/10.1038/ncomms16101.
    Oliver, E. C. J., J. A. Benthuysen, S. Darmaraki, M. G. Donat, A. J. Hobday, N. J. Holbrook, R. W. Schlegel, and A. Sen Gupta, 2021: Marine heatwaves. Annual Review of Marine Science, 13, 313−342, https://doi.org/10.1146/annurev-marine-032720-095144.
    Oliver, E. C. J., and Coauthors, 2018: Longer and more frequent marine heatwaves over the past century. Nature Communications, 9, 1324, https://doi.org/10.1038/s41467-018-03732-9.
    Pan, B. X., G. J. Anderson, A. Goncalves, D. D. Lucas, C. J. W. Bonfils, J. Lee, Y. Tian, and H. Y. Ma, 2021: Learning to correct climate projection biases. Journal of Advances in Modeling Earth Systems, 13 (10), e2021MS002509, https://doi.org/10.1029/2021MS002509.
    Tang, H. S., G. Huang, K. M. Hu, W. P. Jiang, W. C. Tao, Y. Wang, and H. Y. Hou, 2023: Weak persistence of Northwest Pacific anomalous anticyclone during post-El Niño summers in CMIP5 and CMIP6 models. Climate Dyn., 61 (7), 3805−3830, https://doi.org/10.1007/s00382-023-06772-0.
    Tao, W. C., G. Huang, K. M. Hu, X. Qu, G. H. Wen, and H. N. Gong, 2015: Interdecadal modulation of ENSO teleconnections to the Indian Ocean Basin Mode and their relationship under global warming in CMIP5 models. International Journal of Climatology, 35 (3), 391−407, https://doi.org/10.1002/joc.3987.
    Tao, W. C., G. Huang, R. G. Wu, K. M. Hu, P. F. Wang, and H. N. Gong, 2018: Origins of biases in CMIP5 models simulating Northwest Pacific summertime atmospheric circulation anomalies during the decaying phase of ENSO. J. Climate, 31 (14), 5707−5729, https://doi.org/10.1175/JCLI-D-17-0289.1.
    Tao, W. C., G. Huang, P. F. Wang, Y. Liu, G. H. Wen, and D. H. Dong, 2019: Dominant modes of CMIP3/5 models simulating northwest Pacific circulation anomalies during post-ENSO summer and their SST dependence. Theor. Appl. Climatol., 138 (3−4), 1809−1820, https://doi.org/10.1007/s00704-019-02936-3.
    Taylor, K. E., R. J. Stouffer, and G. A. Meehl, 2012: An overview of CMIP5 and the experiment design. Bull. Amer. Meteor. Soc., 93 (4), 485−498, https://doi.org/10.1175/BAMS-D-11-00094.1.
    Teutschbein, C., and J. Seibert, 2012: Bias correction of regional climate model simulations for hydrological climate-change impact studies: Review and evaluation of different methods. J. Hydrol., 456−457 , 12−29, https://doi.org/10.1016/j.jhydrol.2012.05.052.
    Trenberth, K. E., G. W. Branstator, D. Karoly, A. Kumar, N. C. Lau, and C. Ropelewski, 1998: Progress during TOGA in understanding and modeling global teleconnections associated with tropical sea surface temperatures. J. Geophys. Res.: Oceans, 103 (C7), 14 291−14 324, https://doi.org/10.1029/97JC01444.
    Wang, Y., G. Huang, and K. M. Hu, 2020: Internal variability in multidecadal trends of surface air temperature over antarctica in austral winter in model simulations. Climate Dyn., 55 (9−10), 2835−2847, https://doi.org/10.1007/s00382-020-05412-1.
    Wang, Y., K. M. Hu, G. Huang, and W. C. Tao, 2021: Asymmetric impacts of El Niño and La Niña on the Pacific-North American teleconnection pattern: The role of subtropical jet stream. Environmental Research Letters, 16 (11), 114040, https://doi.org/10.1088/1748-9326/ac31ed.
    Wang, Y., K. M. Hu, G. Huang, and W. C. Tao, 2023: The role of nonlinear energy advection in forming asymmetric structure of ENSO teleconnections over the North Pacific and North America. Geophys. Res. Lett., 50 (17), e2023GL105277, https://doi.org/10.1029/2023GL105277.
    Wang, Y., G. Huang, K. M. Hu, W. C. Tao, H. N. Gong, K. Yang, and H. S. Tang, 2022: Understanding the eastward shift and intensification of the ENSO teleconnection over South Pacific and Antarctica under greenhouse warming. Frontiers in Earth Science, 10, 916624, https://doi.org/10.3389/feart.2022.916624.
    Yi, Z. L., H. Zhang, P. Tan, and M. L. Gong, 2017: DualGAN: Unsupervised dual learning for image-to-image translation. 2017 IEEE International Conf. on Computer Vision (ICCV), Venice, Italy, IEEE, 2868−2876, https://doi.org/10.1109/ICCV.2017.310.
    Zheng, Y. X., J. L. Lin, and T. Shinoda, 2012: The equatorial Pacific cold tongue simulated by IPCC AR4 coupled GCMs: Upper ocean heat budget and feedback analysis. J. Geophys. Res.: Oceans, 117, C05024, https://doi.org/10.1029/2011jc007746.
    Zhu, J. Y., T. Park, P. Isola, and A. A. Efros, 2017: Unpaired image-to-image translation using cycle-consistent adversarial networks. 2017 IEEE International Conf. on Computer Vision (ICCV), Venice, Italy, IEEE, 2242−2251, https://doi.org/10.1109/ICCV.2017.244.
  • [1] Tingyu WANG, Ping HUANG, Xianke YANG, 2024: Understanding the Low Predictability of the 2015/16 El Niño Event Based on a Deep Learning Model, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-024-3238-3
    [2] Lei HAN, Mingxuan CHEN, Kangkai CHEN, Haonan CHEN, Yanbiao ZHANG, Bing LU, Linye SONG, Rui QIN, 2021: A Deep Learning Method for Bias Correction of ECMWF 24–240 h Forecasts, ADVANCES IN ATMOSPHERIC SCIENCES, 38, 1444-1459.  doi: 10.1007/s00376-021-0215-y
    [3] Hanxiao Yuan, Yang Liu, Qiuhua TANG, Jie LI, Guanxu CHEN, Wuxu CAI, 2024: ST-LSTM-SA:A new ocean sound velocity fields prediction model based on deep learning, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-024-3219-6
    [4] Temesgen Gebremariam ASFAW, Jing-Jia LUO, 2024: Downscaling Seasonal Precipitation Forecasts over East Africa with Deep Convolutional Neural Networks, ADVANCES IN ATMOSPHERIC SCIENCES, 41, 449-464.  doi: 10.1007/s00376-023-3029-2
    [5] Jing Ma, Haiming Xu, Changming Dong, Jingjia Luo, 2023: Forecast Skills and Predictability Source of the Marine Heatwaves in the NUIST-CFS1.0 Hindcasts, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-023-3139-x
    [6] Yunqing LIU, Lu YANG, Mingxuan CHEN, Linye SONG, Lei HAN, Jingfeng XU, 2024: A Deep Learning Approach for Forecasting Thunderstorm Gusts in the Beijing–Tianjin–Hebei Region, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-023-3255-7
    [7] Jiang HUANGFU, Zhiqun HU, Jiafeng ZHENG, Lirong WANG, Yongjie ZHU, 2024: Study on Quantitative Precipitation Estimation by Polarimetric Radar Using Deep Learning, ADVANCES IN ATMOSPHERIC SCIENCES, 41, 1147-1160.  doi: 10.1007/s00376-023-3039-0
    [8] YANG Yang, REN Rongcai, Ming CAI, RAO Jian, 2015: Attributing Analysis on the Model Bias in Surface Temperature in the Climate System Model FGOALS-s2 through a Process-Based Decomposition Method, ADVANCES IN ATMOSPHERIC SCIENCES, 32, 457-469.  doi: 10.1007/s00376-014-4061-z
    [9] Kanghui ZHOU, Jisong SUN, Yongguang ZHENG, Yutao ZHANG, 2022: Quantitative Precipitation Forecast Experiment Based on Basic NWP Variables Using Deep Learning, ADVANCES IN ATMOSPHERIC SCIENCES, 39, 1472-1486.  doi: 10.1007/s00376-021-1207-7
    [10] Lu LI, Yongjiu DAI, Zhongwang WEI, Wei SHANGGUAN, Nan WEI, Yonggen ZHANG, Qingliang LI, Xian-Xiang LI, 2024: Enhancing Deep Learning Soil Moisture Forecasting Models by Integrating Physics-based Models, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-023-3181-8
    [11] Xiaoran DONG, Yafei NIE, Jinfei WANG, Hao LUO, Yuchun GAO, Yun WANG, Jiping LIU, Dake CHEN, Qinghua YANG, 2024: Deep Learning Shows Promise for Seasonal Prediction of Antarctic Sea Ice in a Rapid Decline Scenario, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-024-3380-y
    [12] Banglin ZHANG, Vijay TALLAPRAGADA, Fuzhong WENG, Jason SIPPEL, Zaizhong MA, 2016: Estimation and Correction of Model Bias in the NASA/GMAO GEOS5 Data Assimilation System: Sequential Implementation, ADVANCES IN ATMOSPHERIC SCIENCES, 33, 659-672.  doi: 10.1007/ s00376-015-5155-y
    [13] Pumeng LYU, Tao TANG, Fenghua LING, Jing-Jia LUO, Niklas BOERS, Wanli OUYANG, Lei BAI, 2024: ResoNet: Robust and Explainable ENSO Forecasts with Hybrid Convolution and Transformer Networks, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-024-3316-6
    [14] Chentao SONG, Jiang ZHU, Xichen LI, 2024: Assessments of Data-Driven Deep Learning Models on One-Month Predictions of Pan-Arctic Sea Ice Thickness, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-023-3259-3
    [15] Tingyu WANG, Ping HUANG, 2024: Superiority of a Convolutional Neural Network Model over Dynamical Models in Predicting Central Pacific ENSO, ADVANCES IN ATMOSPHERIC SCIENCES, 41, 141-154.  doi: 10.1007/s00376-023-3001-1
    [16] Xianghui FANG, Fei ZHENG, Kexin LI, Zeng-Zhen HU, Hongli REN, Jie WU, Xingrong CHEN, Weiren LAN, Yuan YUAN, Licheng FENG, Qifa CAI, Jiang ZHU, 2023: Will the Historic Southeasterly Wind over the Equatorial Pacific in March 2022 Trigger a Third-year La Niña Event?, ADVANCES IN ATMOSPHERIC SCIENCES, 40, 6-13.  doi: 10.1007/s00376-022-2147-6
    [17] Pengfei LIN, Zhipeng YU, Hailong LIU, Yongqiang YU, Yiwen LI, Jirong JIANG, Wei XUE, Kangjun CHEN, Qian YANG, Bowen ZHAO, Jilin WEI, Mengrong DING, Zhikuo SUN, Yaqi WANG, Yao MENG, Weipeng ZHENG, Jinfeng MA, 2020: LICOM Model Datasets for the CMIP6 Ocean Model Intercomparison Project, ADVANCES IN ATMOSPHERIC SCIENCES, 37, 239-249.  doi: 10.1007/s00376-019-9208-5
    [18] Ruth GEEN, Marianne PIETSCHNIG, Shubhi AGRAWAL, Dipanjan DEY, F. Hugo LAMBERT, Geoffrey K. VALLIS, 2023: The Relationship between Model Biases in East Asian Summer Monsoon Rainfall and Land Evaporation, ADVANCES IN ATMOSPHERIC SCIENCES, 40, 2029-2042.  doi: 10.1007/s00376-023-2297-1
    [19] Gang HUANG, Ya WANG, Yoo-Geun HAM, Bin MU, Weichen TAO, Chaoyang XIE, 2024: Toward a Learnable Climate Model in the Artificial Intelligence Era, ADVANCES IN ATMOSPHERIC SCIENCES.  doi: 10.1007/s00376-024-3305-9
    [20] Ge SONG, Rongcai REN, 2023: The Subsurface and Surface Indian Ocean Dipoles and Their Association with ENSO in CMIP6 models, ADVANCES IN ATMOSPHERIC SCIENCES, 40, 975-987.  doi: 10.1007/s00376-022-2086-2

Get Citation+

Export:  

Share Article

Manuscript History

Manuscript received: 25 October 2023
Manuscript revised: 21 December 2023
Manuscript accepted: 31 January 2024
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Correcting Climate Model Sea Surface Temperature Simulations with Generative Adversarial Networks: Climatology, Interannual Variability, and Extremes

    Corresponding author: Gang HUANG, hg@mail.iap.ac.cn
  • 1. State Key Laboratory of Numerical Modeling for Atmospheric Sciences and Geophysical Fluid Dynamics, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China
  • 2. Laboratory for Regional Oceanography and Numerical Modeling, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China
  • 3. College of Earth and Planetary Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
  • 4. Earth System Modelling, School of Engineering and Design, Technical University of Munich, Ottobrunn 85521, Germany
  • 5. Potsdam Institute for Climate Impact Research, Potsdam 14412, Germany
  • 6. Department of Mathematics and Global Systems Institute, University of Exeter, Exeter EX4 4QE, UK
  • 7. CMA Earth System Modeling and Prediction Centre (CEMC), Chinese Academy of Meteorological Sciences, Beijing 100081, China

Abstract: Climate models are vital for understanding and projecting global climate change and its associated impacts. However, these models suffer from biases that limit their accuracy in historical simulations and the trustworthiness of future projections. Addressing these challenges requires addressing internal variability, hindering the direct alignment between model simulations and observations, and thwarting conventional supervised learning methods. Here, we employ an unsupervised Cycle-consistent Generative Adversarial Network (CycleGAN), to correct daily Sea Surface Temperature (SST) simulations from the Community Earth System Model 2 (CESM2). Our results reveal that the CycleGAN not only corrects climatological biases but also improves the simulation of major dynamic modes including the El Niño-Southern Oscillation (ENSO) and the Indian Ocean Dipole mode, as well as SST extremes. Notably, it substantially corrects climatological SST biases, decreasing the globally averaged Root-Mean-Square Error (RMSE) by 58%. Intriguingly, the CycleGAN effectively addresses the well-known excessive westward bias in ENSO SST anomalies, a common issue in climate models that traditional methods, like quantile mapping, struggle to rectify. Additionally, it substantially improves the simulation of SST extremes, raising the pattern correlation coefficient (PCC) from 0.56 to 0.88 and lowering the RMSE from 0.5 to 0.32. This enhancement is attributed to better representations of interannual, intraseasonal, and synoptic scales variabilities. Our study offers a novel approach to correct global SST simulations and underscores its effectiveness across different time scales and primary dynamical modes.

摘要: 气候模式对于理解和预估全球气候变化及其影响至关重要。然而,当前气候模式仍存在不少模拟偏差,这限制了其历史模拟的准确性和未来预估的可信度。由于内部变率的存在,模式模拟结果与观测数据无法完全一一对应,这限制了传统基于监督学习的校正方法的应用。本文基于无监督的循环一致生成对抗网络(CycleGAN)来校正通用地球系统模式(CESM2)的日海表温度(SST)模拟结果。研究结果表明,CycleGAN 不仅能校正气候态偏差,还能显著改进对厄尔尼诺-南方涛动(ENSO)、印度洋偶极子(IOD)等动力模态以及海温极值的模拟能力。CycleGAN将全球平均均方根误差(RMSE)降低了 58%,同时显著减弱了ENSO SST 异常过度西伸的偏差,该偏差是西北太平洋异常反气旋乃至东亚夏季风模拟最主要的误差来源之一,而传统方法(如quantile map)很难纠正这一模式共同偏差。此外,它还改进了对海温极值的模拟,将模态相关系数(PCC)从 0.56 提高到 0.88,将RMSE从 0.5 降低到 0.32。这种改进可能与年际、季节内和天气尺度过程的改进有关。本研究提供了一种校正全球 SST 模拟的新方法,并证明了该方法对于提升动力模态模拟能力的有效性。

    • Climate model simulations play an indispensable role in contemporary climate science, serving as fundamental resources for climate change detection, attribution, and projection. Their simulations, especially from the Coupled Model Intercomparison Project (CMIP), underpin the assessments conducted by the Intergovernmental Panel on Climate Change (IPCC), offering comprehensive evaluations and pivotal references that support the advancement of global climate science and guide critical decision-making (Taylor et al., 2012; Eyring et al., 2016). However, the utility of CMIP models in both detection and attribution of climate change, as well as climate projections, is significantly impeded by various persistent model biases (Li and Xie, 2012, 2014; Zheng et al., 2012; Liu et al., 2014; Tao et al., 2018; Jiang et al., 2019, 2021; Capotondi et al., 2020; Danabasoglu et al., 2020).

      Among these biases, a prominent concern centers on the representation of sea surface temperatures (SSTs) in coupled general circulation models (CGCMs), especially the persistent cold tongue bias found in the equatorial Pacific (Li and Xie, 2012, 2014; Zheng et al., 2012). This phenomenon, characterized by a cold bias in the climatological SSTs, has long perplexed the climate community due to its potential repercussions on climatological precipitation patterns and the accurate simulation of the El Niño-Southern Oscillation (ENSO), thereby affecting the fidelity of global climate simulations (Tao et al., 2018; Jiang et al., 2019; Tang et al., 2023). Additionally, substantial biases manifest themselves at interannual scales, with ENSO SST variability exhibiting an excessive westward extension into the equatorial western Pacific, consequently impacting ENSO decay processes and regional climate variability. These biases, commonly present in most of the CMIP models, are hardly canceled out, even by multi-model means (Huang and Ying, 2015; Jiang et al., 2021; Tang et al., 2023), thus impacting historical simulations and future projections.

      Efforts to rectify these climate model biases have spawned the development of various methodologies. Traditional methods predominantly center around the local adjustment of specific statistical characteristics. These adjustments typically target parameters like the mean and variance (involving variance adjustment; Chen et al., 2011, 2013; Teutschbein and Seibert, 2012; Li et al., 2019), and frequency distributions (via techniques like quantile mapping; QM; Jakob Themeβl et al., 2011). However, these approaches fall short of providing precise corrections for model errors at the daily timescale and in dynamical climate variabilities. Traditional methods primarily focus on specific statistical features or target adjustments at individual grid points and can by design not correct spatial patterns (Hess et al., 2022, 2023).

      The challenge of climate model bias correction is exacerbated by the internal variability of the Earth system, characterized by inherent nonlinearity (Hawkins and Sutton, 2011; Deser et al., 2012, 2016; Hu et al., 2019; Wang et al., 2020). In particular, discrepancies arise due to internal variability in model simulations and observations at corresponding times. Consequently, discerning whether disparities between model fields and observational datasets arise from internal variability or genuine model biases poses a substantial challenge; identifying and, on that basis, correcting the bias for a particular day in model data is inherently difficult. This presents a challenge when trying to employ traditional statistical methods and supervised learning techniques to correct model simulations.

      The advent of deep learning, particularly the development of unsupervised and semi-supervised learning techniques, has introduced novel avenues for addressing climate model biases. Generative Adversarial Networks (GANs), pioneered by Goodfellow et al. (2014), leverage a generator-discriminator framework to generate images that are in their characteristics and statistical properties virtually indistinguishable from target images. Subsequently, Hoffman et al. (2017), Yi et al. (2017), and Zhu et al. (2017) expanded upon the GAN methodology, introducing the Cycle-consistent GAN (CycleGAN), which incorporates a cycle-consistent loss to facilitate the bidirectional transformation of data styles. Pan et al. (2021) have explored the application of this method to rectify precipitation data in CGCM simulations, yielding promising results in correcting precipitation biases in the United States. Moreover, the extension of this method to global precipitation simulations demonstrated its efficacy in correcting spatial patterns and in ameliorating the Pacific double Intertropical Convergence Zone (ITCZ) bias (Hess et al., 2022, 2023), a prevalent issue across most CGCMs. These applications have predominantly concentrated on evaluating the enhancement of fundamental statistical aspects of precipitation.

      The critical inquiry here is whether this method can also address bias-correct modeled SST fields. Furthermore, it is unclear whether it can correct the dynamical oceanic modes and adapt to variations driven by distinct physical mechanisms at various timescales. These unclarified issues presently lack a definitive resolution.

      In this paper, we employ a CycleGAN-based approach to correct global daily SST fields to address these concerns. We find that our method not only addresses biases at climatological averages but also tackles errors associated with ENSO, the Indian Ocean Dipole (IOD), and SST extremes. The remainder of this paper is structured as follows. Section 2 provides introductions of the datasets and methodology employed, section 3 unveils the primary results, and section 4 discusses and summarizes our findings.

    2.   Data and methods
    • We employ daily surface temperature (ST) data from the National Center for Environmental Prediction-Department of Energy Atmospheric Reanalysis (NCEP; Kanamitsu et al., 2002) at a resolution of 2.5° × 2.5° for the period 1950–2014. Modeled daily ST fields from 1950–2014 are obtained from the Community Earth System Model 2 (CESM2; Danabasoglu et al., 2020), a widely used and well-organized model, which can be considered a state-of-the-art climate model. The global gridded monthly SST datasets from Extended Reconstructed Sea Surface Temperature (ERSST.v5; Huang et al. 2017), the NOAA 1/4° Daily Optimum Interpolation SST (OISST; Huang et al., 2021), and the Hadley Centre Global Sea Ice and SST (HadISST) are utilized. These three SST datasets and NCEP ST, ranging from 1991−2014, are utilized for model testing. All datasets are interpolated to a resolution of 2.5° × 2.5°.

      To extract ENSO-related SST variability, we utilize the unstandardized Niño-3.4 index. This index is defined as the SST anomalies (SSTA) averaged over the region between 5°S–5°N, and 120°–170°W from December to February (DJF). We performed a regression analysis of this unstandardized Niño-3.4 index on the interannual SSTAs, thus revealing the underlying ENSO-driven SST variability. The IOD SST is obtained in a similar way to that of the ENSO SST but uses the Dipole Mode Index, defined as the anomalous SST gradient between the western equatorial Indian Ocean (50°–70°E and 10°S–10°N) and the southeastern equatorial Indian Ocean (90°–110°E and 10°S–0°) during September–October–November (SON).

    • CycleGAN is a groundbreaking deep-learning model designed for unpaired image-to-image translation. It enables the transformation of images from one domain to another even when paired training data is unavailable (Zhu et al., 2017), such as in the case addressed in the present study. Unpaired image-to-image translation involves the mapping of images from one domain (the source domain, denoted by X) to another (the target domain, denoted by Y) without requiring a direct one-to-one correspondence between the images in the two domains. In our case, we apply the principles of unpaired image-to-image translation for the correction of CGCM daily SST data. Specifically, we consider the CESM2 SSTs as domain X and NCEP STs as domain Y. The training and validation data comprises the CESM2 and NCEP ST data from 1950 to 1990, while the testing data spans from 1991 to 2014. A normalization process is employed to scale the data.

      Figure 1 gives the schematic of the CycleGAN. Within the CESM2, biases exist in the internal variability and externally forced responses within model simulations on any given day. However, the internal variability in observations on the same day may not correspond to that in the model. For example, the model might simulate an El Niño year, while observations indicate a La Niña year. Therefore, directly adjusting the model to match the observational state would be profoundly inaccurate. Our goal is to eliminate biases in the internal variability and externally forced responses while simultaneously preserving the model's internal variability (e.g., the ENSO phase).

      Figure 1.  (a) Schematic of the CycleGAN. (b) The details of ResNet, which has nine residual blocks. (c) The details of the first residual block, and (d) other blocks.

      To achieve this objective, a generator [G(x)] and a discriminator are first employed in this model (Fig. 1a). The primary function of the generator is to transform the CESM2 data to match the observational SST, while the discriminator assesses whether the generated outcomes originate from observations. A balance between these two components yields the best results, wherein the discriminator should be incapable of distinguishing between images generated by the generator and observations. This phase aids in correcting the GCM simulations. Subsequently, a secondary generator [F(y)] and a discriminator are utilized to retain the internal variability. The secondary generator, F(y), is tasked with reconverting the previously generated images back into the original CESM2 data. This process ensures that the generated images by G(x) retain the essential characteristics of CESM2. From a mathematical perspective, the architecture of F(y) ensures a bidirectional mapping between CESM2 and the observational data. In a certain context, this secondary generator facilitates the preservation of internal variability within CESM2, as internal variability is a predominant aspect of a daily timescale.

      The loss function of this model is as follows:

      The loss includes the adversarial loss of XY [$ {\mathcal{L}}_{\mathrm{G}\mathrm{A}\mathrm{N}}\left(G,{D}_{Y},X,Y\right) $, see Eq. (2)], YX [$ {\mathcal{L}}_{\mathrm{G}\mathrm{A}\mathrm{N}} \left(F, {D}_{X}, Y,X\right) $, see Eq. (3)], the consistency loss [$ {\mathcal{L}}_{\mathrm{c}\mathrm{y}\mathrm{c}}\left(G,F\right) $], and the identity loss [$ {\mathcal{L}}_{\mathrm{i}\mathrm{d}}\left(G,F\right) $]. G denotes a generator that tries to generate images G(x) that are indistinguishable from those in Y, F represents a generator that tries to generate images F(y) similar to those in X. $ {D}_{Y} $ and $ {D}_{X} $ denote the discriminators aiming to distinguish between G(x) and y, and F(y) and x, respectively.

      Equations (2) and (3) show the adversarial loss, where $ \mathbb{E} $ means the expected value. Moreover, following Zhu et al. (2017), the consistency loss and identity loss [Eqs. (4) and (5)] are further added to help retain the internal variability and avoid introducing additional bias, the weights of these terms are controlled by $ {\lambda }_{c} $ and $ {\lambda }_{i} $, which are set to 10 and 0.5, respectively.

      The generative networks consist of three components: down-sampling with two convolutional layers, followed by nine residual blocks comprising a total of 21 convolutional layers, and finally two up-sampling layers (as depicted in Fig. 1). The first residual block, represented in Fig. 1c, contains five convolutional layers, while the subsequent blocks, as shown in Fig. 1d, each consist of two convolutional layers. This deep generative network helps generate finer SST images. The architecture of the discriminator aligns with the model presented by Zhu et al. (2017). Gradient clipping is employed to optimize the training of the model. In training the GAN, the Wasserstein loss is utilized, chosen for its known stability (Arjovsky et al., 2017).

      In the following development, we employ linear regression and composite analysis and determine the statistical significance according to a two-tailed Student’s t-test.

    3.   Results
    • Figure 2 presents a qualitative comparison of daily ST on the same date (3 November 2004). CycleGAN essentially preserves most features present in CESM2 results, notably the La Niña-like SSTAs (Figs. 2b, c), while also exhibiting a greater resemblance to observations (Figs. 2a, c). This improvement is particularly notable in regions such as the tropical Atlantic, South Africa, and western South America. This comparison effectively signifies that the data post-processed by CycleGAN, in addition to correcting model outcomes, preserves the internal variability of the model.

      Figure 2.  Qualitative comparison of the intermittency in daily ST on the same date (3 November 2004), from the (a) observation, (b) CESM2 model, and (c) CycleGAN-based post-processing. White contour lines (with an interval of 1 K) denote the monthly SSTAs in November 2004.

      In the subsequent analysis, we evaluate the correction from three key perspectives: climatology, interannual variability, and extreme events. To validate the effectiveness of the correction, we performed comparisons between results obtained from CESM2, GAN-corrected SSTs, and four reference datasets (HadISST, ERSST, NCEP, OISST).

    • Figure 3 illustrates the disparities between CESM2 model data, GAN-corrected outcomes, and observational datasets. Overall, CESM2 displays a substantial warm bias when compared to the four SST datasets. There is a distinct dipole bias in SSTs in the North Atlantic, particularly along the Gulf Stream and North Atlantic Current, where warm and cold biases coexist in proximity. A similar dipole bias is observed in the Southern Ocean at mid-to-high latitudes. Notably, prominent warm biases are noticeable near eastern boundary upwelling regions, encompassing areas close to California, South America, and Africa (Figs. 3a, c, e, g). These biases are likely associated with discrepancies in wind stress patterns within these regions and the spatial resolution of the model (Capotondi et al., 2020). Additionally, a significant warm bias is evident in the tropical Pacific. This warm bias is distinct from the well-known cold tongue bias observed in most CMIP5 models and even differs from CESM1, signifying disparities between CESM2 and its predecessors, a feature also acknowledged in previous CESM2 assessment studies (e.g., Capotondi et al., 2020).

      Figure 3.  The disparities between climatological annual mean SSTs in (a, c, e, g) CESM2 simulations, (b, d, f, h) GAN-corrected SSTs, and (a, b) HadISST, (c, d) ERSST, (e, f) NCEP, (g, h) OISST.

      After the correction by the CycleGAN, there is a substantial reduction in the bias in the climatological SSTs (Figs. 2b, d, f, h). The dipole biases in the North Atlantic and the Southern Hemisphere are notably reduced. Only minor cold biases persist in the vicinity of eastern Africa and to the east of New Zealand. Furthermore, the warm biases in the tropical Pacific are significantly attenuated. The overall bias diminishes from 1.25°C (1.19°C to 1.28°C) in CESM2 to 0.52°C (0.4°C to 0.57°C). Through a comparative analysis involving four sets of SST data, we find that the biases, both in magnitude and spatial pattern, are highly consistent across the different datasets. This robust agreement underscores the reliability of our evaluation. In addition, CycleGAN exhibits clear advantages over traditional methods. We compare its performance with the widely utilized modified QM approach (Jakob Themeßl et al., 2011; Bai et al., 2016), widely employed for the correction of historical simulations and future projections within CMIP. When applied to SST correction, the modified QM yields an error of 0.6, which is inferior to the results obtained by CycleGAN (Fig. 4). It is noteworthy that the performance of QM and CycleGAN is remarkably close in tropical regions, and in the North Pacific, QM even outperforms it in the latter. However, QM exhibits larger errors in the high-latitude regions of the Southern Ocean.

      Figure 4.  The disparities between climatological annual mean SSTs in (a) CESM2 simulations, (b) QM-corrected SST, and NCEP.

      The errors calculated based on the NCEP data align closely with those from other datasets, further affirming the suitability of this dataset for model training. Given the substantial convergence in results across three SST datasets (HadISST, ERSST, OISST), for the sake of brevity, we will exclusively employ HadISST as the observational reference in the subsequent sections.

    • In addition to addressing biases at the climatological scale, an essential aspect of climate model correction, and perhaps even more crucial, is the capability to simulate the interannual dynamical variability such as ENSO and the IOD. The accurate representation of these variabilities directly influences the ability to capture global and regional climate variability. Figure 5 displays the DJF ENSO SSTA, in observations, CESM2, and CycleGAN-corrected SST. Notably, CESM2 exhibits an excessive westward bias in the equatorial Pacific, with a pronounced warm bias in the equatorial western Pacific. This bias is well-known and prevalent in many models, constituting a common bias in most CMIP5 and CMIP6 models (Tao et al., 2015, 2018, 2019; Jiang et al., 2021). It is worth noting that this substantial warm bias in ENSO SSTAs can be primarily attributed to the climatological cold tongue bias (Li and Xie, 2012, 2014; Jiang et al., 2021). The cold tongue bias manifests as a phenomenon in the model simulation, signifying that the climatological annual mean (SST) in the central-eastern tropical Pacific is colder than observed. This is characterized by an excessively strong and westward-extending cold tongue in the equatorial Pacific. The presence of the cold tongue bias can influence SSTAs in this region through its impact on temperature advection and other related processes. Despite the warm bias in the western Pacific in CESM2, its climatology also exhibits a warm bias. This suggests that the warm bias in the western Pacific of CESM2 may be attributed to other mechanisms.

      Figure 5.  ENSO-related SSTAs in (a) observation, (b) CESM2, (c) QM, and (d) CycleGAN. Stippling indicates where the regressions are significant at the 95% confidence level, based on a Student’s t-test. The hatched areas represent significant differences in El Niño composite SSTs within the CycleGAN-corrected fields compared to CESM2. The black contour line denotes +0.2 K in observations.

      The modified QM method exhibits limited efficacy in the correction of ENSO SSTs. Its RMSE and PCC closely align with CESM2, and is ineffective in addressing distributional biases, such as excessive westward bias (Fig. 5c). After correction by the CycleGAN, the warm bias is greatly diminished in the equatorial-western Pacific, with its intensity comparable to observations. Furthermore, the warm bias in the central and eastern Pacific, as observed in CESM2, is also notably reduced. A significance test was performed for composite El Niño SSTs, revealing significant differences between CESM2 and the CycleGAN-corrected results in several regions, including the equatorial western Pacific, the equatorial sides of the central Pacific, and the tropical eastern Pacific (Fig. 5d). This indicates a statistically reliable improvement by the CycleGAN in addressing warm biases in CESM2. Overall, upon applying the correction, we observe a significant reduction in the excessive westward bias of ENSO SST variability, with the distribution closely resembling observations. The RMSE decreases from 0.14 in CESM2 to 0.06, and the PCC increases from 0.89 to 0.95, corresponding to a 57% reduction and a 6.7% improvement, respectively.

      Figure 6 shows the standard deviation of SSTs in the Indian Ocean and the IOD SSTA. In the CESM2 simulation, there is a significant overestimation of variability in the southeastern and western equatorial Indian Ocean, with the center of variability extending too far westward in the southeastern Indian Ocean when compared to observations. Similar to its limited efficacy in ENSO correction, the modified QM method demonstrates constrained effectiveness in the correction of SST variability. While the CycleGAN markedly reduces the biases in the southeastern and western equatorial Indian Ocean compared to CESM2, the excessive westward extension is substantially attenuated. The IOD mode also exhibits bias, for example, the standard deviation of SSTs (Fig. 6f). In CESM2, the IOD SSTA in the southeastern Indian Ocean is significantly stronger and extends too far westward compared to observations. In observations, negative anomalies roughly extend to 85°E, while in CESM2, the negative anomaly extends to around 70°E. CycleGAN successfully reduces this bias, with the negative anomaly located east of 80°E, significantly closer to observations. As such, its performance is noticeably superior to modified QM, with the latter making only marginal adjustments to the intensity of SSTA. The RMSE of CycleGAN is substantially decreased compared to CESM2, from 0.34 to 0.21, while the PCC increases from 0.82 to 0.92.

      Figure 6.  (Left) The standard deviation of SST and (b) the IOD-related SSTA during SON in (a, e) observation, (b, f) CESM2, (c, g) QM, and (d, h) CycleGAN, respectively. Stippling indicates the regressions are significant at the 95% confidence level, based on a Student’s t-test. The hatched areas represent significant differences in positive IOD composite SSTs within the CycleGAN compared to CESM2.

    • In addition to interannual dynamical variability, marine heatwaves are a recent focus in oceanic research. Frequent marine heatwaves significantly impact marine ecosystems, particularly marine fisheries and coral reefs (Oliver et al., 2017, 2018, 2021; Holbrook et al., 2019; Liu et al., 2022, 2023). We primarily use the 95th percentile of daily SSTA to quantify SST extremes and evaluate the performance of CESM2 and the GAN-correction in representing them. To conduct the evaluation, we utilized NCEP data spanning from 1991 to 2014 for testing. Figure 7 illustrates the 95th percentile of NCEP, CESM2, and CycleGAN-corrected SSTA, respectively. In the NCEP data, regions with high values of the 95th percentile are concentrated in the western boundary current extension regions, the central and eastern equatorial Pacific, and the northeast Pacific Ocean, consistent with previous studies (Chen et al., 2014; Echevin et al., 2018; Oliver et al., 2017, 2021). When compared to NCEP, the spatial distribution of the 95th percentile in CESM2 is generally similar. However, CESM2 significantly overestimates the intensity of extreme SSTAs in the central and eastern tropical Pacific while underestimating it in the northwest Pacific. In contrast, CycleGAN-corrected extreme SSTA shows better agreement with observations in terms of intensity and spatial distribution. The PCC improves from 0.56 to 0.88, and the RMSE decreases from 0.5 to 0.32, slightly outperforming those obtained with modified QM.

      Figure 7.  Same as in Fig. 5, but for the 95th percentile of SSTAs.

      Particularly noteworthy is the significant reduction of the overestimation bias in the central and eastern tropical Pacific and the enhancement in the underestimation in the northeast Pacific, making CycleGAN's results more consistent with NCEP. Figures 8 and 9 present probability density functions (PDFs) of SSTAs in the tropical (5°S–5°N) and northeast (40°–60°N; 180°–120°W) Pacific, respectively. On average, CESM2 exhibits a wider distribution in the tropics, indicating a stronger variability. CESM2 has a 95th percentile of 1.41 degrees for the tropical region, while NCEP only has 1.18 degrees, resulting in a 19.5% overestimation. This aligns with the earlier result of the overestimation of SSTAs in the tropical region, especially in the central and eastern Pacific (Fig. 7). After correction, the distribution of SSTA in the tropics closely matches observations, with 95th percentiles of approximately 1.17, indicating a substantial reduction in the overestimation bias of CESM2 in the tropics. A similar situation is observed in the northeast Pacific. In this area, the SSTA distribution in CESM2 is overall narrower than observations, indicating a lower variability. Correspondingly, its 95th percentile is lower than that of NCEP. After correction, the distribution and the 95th percentile of SSTAs are much closer to NCEP.

      Figure 8.  PDFs of average tropical SSTAs for (a) NCEP and CESM2, and (b) NCEP and CycleGAN. Shading indicates areas where the SSTA is above the 90th percentile.

      Figure 9.  Same as in Fig. 8, but for the Northeast Pacific SSTAs.

      Unlike methods such as modified QM that directly adjust data by quantiles to make the local distribution closer to observations, the CycleGAN learns and transforms high-dimensional data features. An important question naturally arises: which key processes does CycleGAN capture that result in the improvement of simulated SST extremes? To address this question, we further perform an analysis of variability at various timescales, involving interannual variabilities and frequencies that less than 90 days (encompassing the Madden−Julian Oscillation (MJO), Intraseasonal Oscillation (ISO), and synoptic-scale eddies). Figure 10a presents the interannual standard deviation of DJF SSTs based on observations. It primarily exhibits an ENSO-like pattern in the equatorial central and eastern Pacific. Moreover, notable high-variability regions are evident in the Northeast Pacific, North Pacific, mid-latitudes in the South Pacific, and adjacent to western boundary currents. In general, the spatial distribution of SST interannual variability in observations closely resembles the distribution of the 95th percentile of SSTA (shown in Fig. 7), indicating the direct influence of interannual variability, especially ENSO, on extreme SSTs as in previous studies (Doi et al., 2015; Oliver et al., 2021). Additionally, ENSO can induce SST variations in the Northeast Pacific and North Atlantic regions by exciting Rossby wave trains (Wang et al., 2021, 2022, 2023), further influencing extreme events in those areas (Trenberth et al., 1998; Johnson and Kosaka, 2016). When compared to observations, CESM2 exhibits an overall overestimation of interannual variability, particularly pronounced in the central-eastern Pacific. This overestimation is likely related to the higher ENSO variability within CESM2 (Capotondi et al., 2020). The consistency of the stronger interannual variability in the tropical central-eastern Pacific and the larger bias of extreme SSTAs in Fig. 7 suggest that the overestimation of interannual variability is a key factor contributing to the high bias in extremes in CESM2. In contrast to CESM2, SST fields corrected by the CycleGAN show a substantial reduction in the error in interannual variability, with the RMSE decreasing from 0.31 to 0.1. CycleGAN-corrected results exhibit a closer match to observations in terms of intensity, particularly in the tropical Pacific, South Pacific, Atlantic, and Indian Ocean. While the intensity in the tropical central-eastern Pacific remains higher than observed, notable improvements are observed compared to CESM2, consistent with the enhancement seen in the extreme SSTAs for this region in Fig. 7. Concerning intraseasonal and synoptic-scale variability, the variability in NCEP is predominantly concentrated in the tropical Pacific and western boundary current regions (Fig. 10d), likely associated with boundary currents, subseseanal variations (Liu et al., 2022a, 2022b), and active mesoscale eddy activity (Oliver et al., 2021). CESM2 simulations generally underestimate variability at these scales (Fig. 10e). In contrast, CycleGAN-corrected SSTs exhibit substantial improvements, especially in the Northeast Pacific, where the distribution is in closer agreement with NCEP (Fig. 10f). In summary, the CycleGAN demonstrates significant improvements in SST variability at different timescales, facilitating more accurate simulation of the complex oceanic dynamic processes, particularly regarding interannual variability and variability at intraseasonal and synoptic scales.

      Figure 10.  Standard deviations of (left) interannual DJF SSTAs and (right) SSTAs filtered by a band of 2–90 days in (a, d) NCEP, (b, e) CESM2, and (c, f) CycleGAN, respectively.

    4.   Conclusion and discussion
    • In this study, we employed CycleGAN to correct the daily SSTs from CESM2 historical simulations. We conducted a comprehensive assessment of this model, considering various aspects such as climatology, interannual variability, and extremes. Our findings reveal significant improvements across these evaluation dimensions. At the climatological scale, the CycleGAN substantially reduced bias in annual mean climatological SST. Specifically, there is a 58% reduction in RMSE relative to CESM2, from 1.25 (1.19–1.28) degrees in CESM2 to 0.52 (0.4–0.57) degrees. At the interannual scale, involving two primary tropical modes, ENSO and IOD, we observe significant enhancements in simulating these modes with the CycleGAN. For ENSO SSTAs, the RMSE decreases from 0.14 to 0.06, corresponding to a reduction of 57%. CycleGAN effectively addresses a common bias of ENSO SST found in many climate models, known as the excessive westward bias in the equatorial Pacific that traditional methods, like quantile mapping, struggle to rectify. In the case of IOD, CESM2 tends to produce excessively strong and westward-extending anomalies in the southeastern Indian Ocean. After the correction, these biases are substantially reduced, resulting in an increased PCC of up to 0.92.

      Moreover, we investigate the performance in simulating SST extremes. The CycleGAN corrects the overestimation in extremes in most regions and addresses the underestimation in the Northeast Pacific. The improved performance of the CycleGAN in simulating the distribution of SSTAs and their associated extremes can be attributed to its ability to capture different temporal scales of variability, including interannual variability and variability at intraseasonal and synoptic scales, encompassing periods shorter than 90 days.

      In summary, our study demonstrates that the CycleGAN offers comprehensive enhancements across various timescales and physical processes. Its utility extends beyond merely correcting specific statistical measures such as first and second-order moments locally; it also enhances the simulation of critical air-sea coupling modes like ENSO and IOD. These findings underscore the significant potential of CycleGAN as a valuable tool for climate model correction and climate projection.

      We employ NCEP data for the evaluation of SST extremes. It is crucial to highlight the potential inconsistency among various observational datasets at the daily timescale, particularly in oceanic data. Consequently, it is essential to clarify that the presented results regarding SST extremes should be interpreted as an assessment of the capacity of CycleGAN to enhance the simulation of SST extremes rather than its precision in replicating observations. Achieving the latter would demand a more extensive dataset, particularly observational station data.

      Moreover, it is crucial to acknowledge that, beyond the correction of historical simulations, the central consideration is the adaptability of these methods to future projections. Recent studies, exemplified by Hess et al. (2022), suggest that CycleGAN, through post-processing or incorporating constraints like global mean values, can reproduce trend signals under global warming. Further research is needed to refine the capability of capturing regional-scale warming responses. Subsequent investigations should also involve the development of correction datasets encompassing multiple models and variables, accompanied by a comprehensive analysis of physical mechanisms and laws in the corrected data. Comparisons with methods integrated with climate models, such as surface flux adjustments, should be explored. Furthermore, the direct coupling of the proposed correction method with climate models represents a critical avenue for future inquiry.

      Acknowledgements. The study is jointly supported by the National Natural Science Foundation of China (Grant Nos. 42141019 and 42261144687), the Second Tibetan Plateau Scientific Expedition and Research (STEP) program (Grant No. 2019QZKK0102), the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDB42010404), the National Natural Science Foundation of China (Grant No. 42175049) and the Guangdong Meteorological Service Science and Technology Research Project (Grant No. GRMC2021M01). Ya WANG would like to thank the National Key Scientific and Technological Infrastructure project “Earth System Science Numerical Simulator Facility” (EarthLab) for computational support and Prof. Shiming XIANG for many useful discussions. Niklas BOERS acknowledges funding from the Volkswagen foundation. The code of CycleGAN can be found in https://github.com/junyanz/CycleGAN.

Reference

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return