Advanced Search
Article Contents

Background Error Covariance Statistics of Hydrometeor Control Variables Based on Gaussian Transform

Fund Project:

This research was funded by National Key Research and Development Program of China (2017YFC1502102), National Natural Science Foundation of China (41675102), and Graduate Research and Innovation Projects of Jiangsu Province (KYCX20_0910)


doi: 10.1007/s00376-021-0271-3

  • Use of data assimilation to initialize hydrometeors plays a vital role in numerical weather prediction (NWP). To directly analyze hydrometeors in data assimilation systems from cloud-sensitive observations, hydrometeor control variables are necessary. Common data assimilation systems theoretically require that the probability density functions (PDFs) of analysis, background, and observation errors should satisfy the Gaussian unbiased assumptions. In this study, a Gaussian transform method is proposed to transform hydrometeors to more Gaussian variables, which is modified from the Softmax function and renamed as Quasi-Softmax transform. The Quasi-Softmax transform method then is compared to the original hydrometeor mixing ratios and their logarithmic transform and Softmax transform. The spatial distribution, the non-Gaussian nature of the background errors, and the characteristics of the background errors of hydrometeors in each method are studied. Compared to the logarithmic and Softmax transform, the Quasi-Softmax method keeps the vertical distribution of the original hydrometeor mixing ratios to the greatest extent. The results of the D′Agostino test show that the hydrometeors transformed by the Quasi-Softmax method are more Gaussian when compared to the other methods. The Gaussian transform has been added to the control variable transform to estimate the background error covariances. Results show that the characteristics of the hydrometeor background errors are reasonable for the Quasi-Softmax method. The transformed hydrometeors using the Quasi-Softmax transform meet the Gaussian unbiased assumptions of the data assimilation system, and are promising control variables for data assimilation systems.
    摘要: 水凝物变量的初始化在数值模式预报中起到了非常重要的作用。资料同化理论上要求分析场、背景场以及观测误差概率密度函数满足高斯无偏分布的假定。针对水凝物变量的非高斯问题,本研究提出了一个适用于水凝物变量的高斯转换方法,并分别对不同转换方法获得的水凝物变量的空间分布特征、误差的高斯性以及背景误差协方差的特征进行了分析。高斯诊断分析表明,与其它转换方法相比,本方法转换后的水凝物变量比其它方法转换的变量更接近高斯分布,同时该方法保持了水凝物变量本身的空间分布特征。进一步地,作者将该方法引入到控制变量转换中,并统计分析了控制变量转换后的水凝物变量的背景误差协方差,结果表明,转换后的水凝物变量背景误差协方差的结构特征是合理的。本研究表明,经过本文方法高斯转换后的水凝物变量可以满足资料同化系统中背景误差高斯无偏分布的假定,可以作为资料同化系统的控制变量。
  • 加载中
  • Figure 1.  (a) Observed 12-h accumulated precipitation (units: mm) from 0600 UTC to 1800 UTC 2 July 2016 in the study domain, (b) the brightness temperature (K) of channel 8 from Himawari-8 AHI valid at 1800 UTC 2 July 2016, and (c) the vertical profiles of qc, qi, qr, and qs (g kg−1) from one ensemble member valid at 1800 UTC 2 July 2016.

    Figure 2.  Transformed ${q_{\rm{s}}}$ at model level 25 (~ 300 hPa) for (a) Origin (10−5 kg kg−1), (b) Log10 (logarithmic transform; kg kg−1), (c) Softmax (10−6 kg kg−1) and (d) Q_softmax (10−5 kg kg−1) from one sample.

    Figure 3.  The vertical profiles of (a) qc, (b) qi, (c) qr, and (d) qs for Origin (10−5 kg kg−1), Log10 (kg kg−1), Softmax (10−6 kg kg−1) and Q_softmax (10−5 kg kg−1) from one sample.

    Figure 4.  K2 of qs at model level 25 (~ 300 hPa) for (a) Origin, (b) Log10, (c) Softmax and (d) Q_Softmax.

    Figure 5.  K2 of qi at model level 25 (~ 300 hPa) for (a) Origin, (b) Log10, (c) Softmax and (d) Q_softmax.

    Figure 6.  Vertical profiles of K2 of (a) qc, (b) qi, (c) qr, and (d) qs for the four experiments. For each level, values are averaged over the horizontal domain.

    Figure 7.  Horizontal standard deviation of the transformed qs at the 25th model level for (a) Origin (10−4 kg kg−1), (b) Log10 (kg kg−1), (c) Softmax (10−7 kg kg−1) and (d) Q_Softmax (10−6 kg kg−1).

    Figure 8.  Vertical standard deviation profile of (a) qc, (b) qi, (c) qr, and (d) qs for Origin (10−5 kg kg−1), Log10 (10 kg kg−1), Softmax (10−8 kg kg−1) and Q_softmax (10−7 kg kg−1).

    Figure 9.  Horizontal length scale (km) for (a) qc, (b) qi, (c) qr, and (d) qs for Origin, Log10, Softmax and Q_softmax.

    Table 1.  Four experiments and their hydrometeor control variables.

    ExperimentsHydrometeor control variables
    Origin${q_{i,j,k}}$
    Log10$\lg \left(\dfrac{{{q_{i,j,k}}}}{{{q_{\rm{0}}}}}\right);{q_0} = {10^{ - 3}}{\rm{kg}}\;{\rm{k}}{{\rm{g}}^{ - 1}} $
    Softmax$\dfrac{{\exp (\beta {q_{i,j,k}})}}{{\displaystyle\sum {\exp (\beta {q_{i,j,k}})} }}$
    Q_softmax$\frac{{\exp (\beta {q_{i,j,k}})}}{{\displaystyle\sum\limits_{{{\overline q }_{i,j}} > 0} {\exp (\beta {{\overline q }_{i,j}}){\rm{ - }}\displaystyle\sum\limits_{{q_{i,j,k}} > 0} {\exp (\beta {{\overline q }_{i,j}})} } }},{\overline q _{i,j}} = \dfrac{1}{K}\displaystyle\sum\limits_{k = 1}^K {{q_{i,j,k}}} $
    DownLoad: CSV
  • Auligné, T., 2014: Multivariate minimum residual method for cloud retrieval. Part I: Theoretical aspects and simulated observation experiments. Mon. Wea. Rev., 142, 4383−4398, https://doi.org/10.1175/MWR-D-13-00172.1.
    Auligné, T., A. Lorenc, Y. Michel, T. Montmerle, A. Jones, M. Hu, and J. Dudhia, 2011: Toward a new cloud analysis and prediction system. Bull. Amer. Meteor. Soc., 92, 207−210, https://doi.org/10.1175/2010BAMS2978.1.
    Barker, D. M., W. Huang, Y. R. Guo, A. J. Bourgeois, and Q. N. Xiao, 2004: A three-dimensional variational data assimilation system for MM5: Implementation and initial results. Mon. Wea. Rev., 132, 897−914, https://doi.org/10.1175/1520-0493(2004)132<0897:ATVDAS>2.0.CO;2.
    Bauer, P., and Coauthors, 2011: Satellite cloud and precipitation assimilation at operational NWP centres. Quart. J. Roy. Meteor. Soc., 137, 1934−1951, https://doi.org/10.1002/qj.905.
    Bishop, C., 1995: Neural Networks for Pattern Recognition. Oxford University Press, 482 pp.
    Bocquet, M., C. A. Pires, and L. Wu, 2010: Beyond Gaussian statistical modeling in geophysical data assimilation. Mon. Wea. Rev., 138, 2997−3023, https://doi.org/10.1175/2010MWR3164.1.
    Boukabara, S. A., and Coauthors, 2011: MiRS: An all-weather 1DVAR satellite data assimilation and retrieval system. IEEE Trans. Geosci. Remote Sens., 49(9), 3249−3272, https://doi.org/10.1109/TGRS.2011.2158438.
    Bridle, J. S., 1990: Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition. Neurocomputing: Algorithms, Architectures and Applications, F. F. Soulie and J. Hérault, Eds., Springer, 227−236, https://doi.org/10.1007/978-3-642-76153-9_28.
    Buehner, M., and D. Jacques, 2020: Non-Gaussian deterministic assimilation of radar-derived precipitation accumulations. Mon. Wea. Rev., 148, 783−808, https://doi.org/10.1175/MWR-D-19-0199.1.
    Chen, H. Q., Y. D. Chen, J. D. Gao, T. Sun, and J. T. Carlin, 2020: A radar reflectivity data assimilation method based on background-dependent hydrometeor retrieval: An observing system simulation experiment. Atmospheric Research, 243, 105022, https://doi.org/10.1016/j.atmosres.2020.105022.
    Chen, Y. D., H. L. Wang, J. Z. Min, X. Y. Huang, P. Minnis, R. Z. Zhang, J. Haggerty, and R. Palikonda, 2015: Variational assimilation of cloud liquid/ice water path and its impact on NWP. J. Appl. Meteorol. Climatol., 54, 1809−1825, https://doi.org/10.1175/JAMC-D-14-0243.1.
    Chen, Y. D., R. Z. Zhang, D. M. Meng, J. Z. Min, and L. N. Zhang, 2016: Variational assimilation of satellite cloud water/ice path and microphysics scheme sensitivity to the assimilation of a rainfall case. Adv. Atmos. Sci., 33(10), 1158−1170, https://doi.org/10.1007/s00376-016-6004-3.
    D′Agostino, R. B., 1970: Transformation to normality of the null distribution of g1. Biometrika, 57, 679−681, https://doi.org/10.1093/biomet/57.3.679.
    Dowell, D. C., L. J. Wicker, and C. Snyder, 2011: Ensemble Kalman filter assimilation of radar observations of the 8 May 2003 Oklahoma City Supercell: Influences of reflectivity observations on storm-scale analyses. Mon. Wea. Rev., 139, 272−294, https://doi.org/10.1175/2010MWR3438.1.
    Errico, R. M., L. Fillion, D. Nychka, and Z. Q. Lu, 2000: Some statistical considerations associated with the data assimilation of precipitation observations. Quart. J. Roy. Meteor. Soc., 126, 339−359, https://doi.org/10.1002/qj.49712656217.
    Errico, R. M., P. Bauer, and J. Mahfouf, 2007: Issues regarding the assimilation of cloud and precipitation data. J. Atmos. Sci., 64, 3785−3798, https://doi.org/10.1175/2006JAS2044.1.
    Fabry, F., 2010: For how long should what data be assimilated for the mesoscale forecasting of convection and why? Part II: On the observation signal from different sensors Mon. Wea. Rev., 138, 256−264, https://doi.org/10.1175/2009MWR2884.1.
    Fabry, F. and J. Z. Sun, 2010: For how long should what data be assimilated for the mesoscale forecasting of convection and why? Part I: On the propagation of initial condition errors and their implications for data assimilation Mon. Wea. Rev., 138, 242−255, https://doi.org/10.1175/2009MWR2883.1.
    Fletcher, S. J., and M. Zupanski, 2007: Implications and impacts of transforming lognormal variables into normal variables in VAR. Meteorol. Z., 16, 755−765, https://doi.org/10.1127/0941-2948/2007/0243.
    Gao, J. D., and D. J. Stensrud, 2012: Assimilation of reflectivity data in a convective-scale, cycled 3DVAR Framework with hydrometeor classification. J. Atmos. Sci., 69, 1054−1065, https://doi.org/10.1175/JAS-D-11-0162.1.
    Gao, J. D., and D. J. Stensrud, 2014: Some observing system simulation experiments with a hybrid 3DEnVAR system for storm-scale radar data assimilation. Mon. Wea. Rev., 142, 3326−3346, https://doi.org/10.1175/MWR-D-14-00025.1.
    Hólm, E., and J. D. Gong, 2010: Use of cloud condensate in the background error formulation. ECMWF-JCSDA Workshop, June 2010, 111−119.
    Hólm, E. V., E. Andersson, A. Beljaars, P. Lopez, J. F. Mahfouf, A. Simmons, and J. N. Thépaut, 2002: Assimilation and modelling of the hydrological cycle: ECMWF’s status and plans. European Centre for Medium-Range Weather Forecasts (ECMWF), Technical Memorandum, https://doi.org/10.21957/kry8prwuq.
    Hu, M., M. Xue, and K. Brewster, 2006: 3DVAR and cloud analysis with WSR-88D level-II data for the prediction of the fort worth, Texas, Tornadic thunderstorms. Part I: Cloud analysis and its impact. Mon. Wea. Rev., 134, 675−698, https://doi.org/10.1175/MWR3092.1.
    Huang, Y. J., and Coauthors, 2018: Forecasting severe convective storms with WRF-based RTFDDA radar data assimilation in Guangdong, China. Atmospheric Research, 209, 131−143, https://doi.org/10.1016/j.atmosres.2018.03.010.
    Jones, T. A., D. J. Stensrud, P. Minnis, and R. Palikonda, 2013: Evaluation of a forward operator to assimilate cloud water path into WRF-DART. Mon. Wea. Rev., 141, 2272−2289, https://doi.org/10.1175/MWR-D-12-00238.1.
    Kawabata, T., and G. Ueno, 2020: Non-Gaussian probability densities of convection initiation and development investigated using a particle filter with a storm-scale numerical weather prediction model. Mon. Wea. Rev., 148, 3−20, https://doi.org/10.1175/MWR-D-18-0367.1.
    Kerr, C. A., D. J. Stensrud, and X. G. Wang, 2015: Assimilation of cloud-top temperature and radar observations of an idealized splitting supercell using an Observing System Simulation Experiment. Mon. Wea. Rev., 143, 1018−1034, https://doi.org/10.1175/MWR-D-14-00146.1.
    Kotsuki, S., T. Miyoshi, K. Terasaki, G. Y. Lien, and E. Kalnay, 2017: Assimilating the global satellite mapping of precipitation data with the Nonhydrostatic Icosahedral Atmospheric Model (NICAM). J. Geophys. Res., 122, 631−650, https://doi.org/10.1002/2016JD025355.
    Lawson, W. G. and J. A. Hansen, 2005: Alignment error models and ensemble-based data assimilation. Mon. Wea. Rev., 133, 1687−1709, https://doi.org/10.1175/MWR2945.1.
    Legrand, R., Y. Michel, and T. Montmerle, 2016: Diagnosing non-Gaussianity of forecast and analysis errors in a convective-scale model. Nonlinear Processes in Geophysics, 23, 1−12, https://doi.org/10.5194/npg-23-1-2016.
    Li, X. L., J. R. Mecikalski, and D. Posselt, 2017: An ice-phase microphysics forward model and preliminary results of polarimetric radar data assimilation. Mon. Wea. Rev., 145, 683−708, https://doi.org/10.1175/MWR-D-16-0035.1.
    Lien, G. Y., E. Kalnay, and T. Miyoshi, 2013: Effective assimilation of global precipitation: Simulation experiments. Tellus A, 65, 19915, https://doi.org/10.3402/tellusa.v65i0.19915.
    Liu, C. S., M. Xue, and R. Kong, 2020: Direct variational assimilation of radar reflectivity and radial velocity data: Issues with nonlinear reflectivity operator and solutions. Mon. Wea. Rev., 148, 1483−1502, https://doi.org/10.1175/MWR-D-19-0149.1.
    Liu, Z. Q., X. Y. Zhang, T. Auligné, and H. C. Lin, 2009. Variational analysis of hydrometeors with satellite radiance observations: A simulated study. Proc. 10th WRF Users’ Workshop, 23−26 June 2009, Boulder, CO.
    Meng, D. M., Y. D. Chen, H. L. Wang, Y. F. Gao, R. Potthast, and Y. B. Wang, 2019: The evaluation of EnVar method including hydrometeors analysis variables for assimilating cloud liquid/ice water path on prediction of rainfall events. Atmospheric Research, 219, 1−12, https://doi.org/10.1016/j.atmosres.2018.12.017.
    Michel, Y., T. Auligné, and T. Montmerle, 2011: Heterogeneous convective-scale background error covariances with the inclusion of hydrometeor variables. Mon. Wea. Rev., 139, 2994−3015, https://doi.org/10.1175/2011MWR3632.1.
    Montmerle, T., and L. Berre, 2010: Diagnosis and formulation of heterogeneous background-error covariances at the mesoscale. Quart. J. Roy. Meteor. Soc., 136, 1408−1420, https://doi.org/10.1002/qj.655.
    Pan, S. J., J. D. Gao, D. J. Stensrud, X. G. Wang, and T. A. Jones, 2018: Assimilation of radar radial velocity and reflectivity, satellite cloud water path, and total precipitable water for convective-scale NWP in OSSEs. J. Atmos. Oceanic Technol., 35, 67−89, https://doi.org/10.1175/JTECH-D-17-0081.1.
    Putnam, B., Xue, M., Jung, Y., Snook, N., and Zhang, G., 2019: Ensemble Kalman Filter Assimilation of Polarimetric Radar Observations for the 20 May 2013 Oklahoma Tornadic Supercell Case. Mon. Wea. Rev., 147, 2511−2533, https://doi.org/10.1175/MWR-D-18-0251.1.
    Poterjoy, J., 2016: A localized particle filter for high-dimensional nonlinear systems. Mon. Wea. Rev., 144, 59−76, https://doi.org/10.1175/MWR-D-15-0163.1.
    Ravela, S., K. Emanuel, D. McLaughlin, 2007: Data assimilation by field alignment. Physica D: Nonlinear Phenomena, 230(1−2), 127−145, https://doi.org/10.1016/j.physd.2006.09.035.
    Shen, Y., P. Zhao, Y. Pan, and J. J. Yu, 2014: A high spatiotemporal gauge-satellite merged precipitation analysis over China. J. Geophys. Res., 119, 3063−3075, https://doi.org/10.1002/2013JD020686.
    Skamarock, W. C., and Coauthors, 2008: A description of the advanced research WRF version 3. NCAR/TN-475+STR, 113 pp. https://doi.org/10.5065/D68S4MVH.
    Sun, J. Z., and N. A. Crook, 1997: Dynamical and microphysical retrieval from Doppler radar observations using a cloud model and its adjoint. Part I: Model development and simulated data experiments. J. Atmos. Sci., 54, 1642−1661, https://doi.org/10.1175/1520-0469(1997)054<1642:DAMRFD>2.0.CO;2.
    Thode, H. C. Jr., 2002: Testing for Normality. Marcel Dekker, 368 pp.
    Tong, M. J., and M. Xue, 2005: Ensemble Kalman filter assimilation of Doppler radar data with a compressible nonhydrostatic model: OSS experiments. Mon. Wea. Rev., 133, 1789−1807, https://doi.org/10.1175/MWR2898.1.
    Toth, Z., S. Albers, and Y. F. Xie, 2012: Analysis of fine-scale weather phenomena. Bull. Amer. Meteor. Soc., 93, ES35−ES38, https://doi.org/10.1175/BAMS-D-11-00148.1.
    Wang, H. L., J. Z. Sun, S. Y. Fan, and X. Y. Huang, 2013: Indirect assimilation of radar reflectivity with WRF 3D-var and its impact on prediction of four summertime convective events. J. Appl. Meteorol. Climatol., 52, 889−902, https://doi.org/10.1175/JAMC-D-12-0120.1.
    Wang, H. L., and Coauthors, 2018: Incorporating geostationary lightning data into a radar reflectivity based hydrometeor retrieval method: An observing system simulation experiment. Atmospheric Research, 209, 1−13, https://doi.org/10.1016/j.atmosres.2018.03.002.
    Wang, Y. M., and X. G. Wang, 2017: Direct assimilation of radar reflectivity without tangent linear and adjoint of the nonlinear observation operator in the GSI-based EnVar system: Methodology and experiment with the 8 May 2003 Oklahoma City tornadic supercell. Mon. Wea. Rev., 145, 1447−1471, https://doi.org/10.1175/MWR-D-16-0231.1.
    Xiao, Q. N., Y. H. Kuo, J. Z. Sun, W. C. Lee, D. M. Barker, and E. Lim, 2007: An approach of radar reflectivity data assimilation and its assessment with the inland QPF of Typhoon Rusa (2002) at landfall. J. Appl. Meteorol. Climatol., 46, 14−22, https://doi.org/10.1175/JAM2439.1.
    Yang, C., Z. Q. Liu, J. Bresch, S. R. H. Rizvi, X. Y. Huang, and J. Z. Min, 2016: AMSR2 all-sky radiance assimilation and its impact on the analysis and forecast of Hurricane Sandy with a limited-area data assimilation system. Tellus A, 68, 30917, https://doi.org/10.3402/tellusa.v68.30917.
  • [1] ZHENG Xiaogu, WU Guocan, ZHANG Shupeng, LIANG Xiao, DAI Yongjiu, LI Yong, , 2013: Using Analysis State to Construct a Forecast Error Covariance Matrix in Ensemble Kalman Filter Assimilation, ADVANCES IN ATMOSPHERIC SCIENCES, 30, 1303-1312.  doi: 10.1007/s00376-012-2133-5
    [2] FU Weiwei, ZHOU Guangqing, WANG Huijun, 2004: Ocean Data Assimilation with Background Error Covariance Derived from OGCM Outputs, ADVANCES IN ATMOSPHERIC SCIENCES, 21, 181-192.  doi: 10.1007/BF02915704
    [3] Jie HE, Xulin MA, Xuyang GE, Juanjuan LIU, Wei CHENG, Man-Yau CHAN, Ziniu XIAO, 2021: Variational Quality Control of Non-Gaussian Innovations in the GRAPES m3DVAR System: Mass Field Evaluation of Assimilation Experiments, ADVANCES IN ATMOSPHERIC SCIENCES, 38, 1510-1524.  doi: 10.1007/s00376-021-0336-3
    [4] LIU Juanjuan, WANG Bin, WANG Shudong, 2010: The Structure of Background-error Covariance in a Four-dimensional Variational Data Assimilation System: Single-point Experiment, ADVANCES IN ATMOSPHERIC SCIENCES, 27, 1303-1310.  doi: 10.1007/s00376-010-9067-6
    [5] YAN Changxiang, ZHU Jiang, XIE Jiping, 2015: An Ocean Data Assimilation System in the Indian Ocean and West Pacific Ocean, ADVANCES IN ATMOSPHERIC SCIENCES, 32, 1460-1472.  doi: 10.1007/s00376-015-4121-z
    [6] Xiaogu ZHENG, 2009: An Adaptive Estimation of Forecast Error Covariance Parameters for Kalman Filtering Data Assimilation, ADVANCES IN ATMOSPHERIC SCIENCES, 26, 154-160.  doi: 10.1007/s00376-009-0154-5
    [7] Bohua Huang, James L. Kinter III, Paul S. Schopf, 2002: Ocean Data Assimilation Using Intermittent Analyses and Continuous Model Error Correction, ADVANCES IN ATMOSPHERIC SCIENCES, 19, 965-992.  doi: 10.1007/s00376-002-0059-z
    [8] LIU Ye, YAN Changxiang, 2010: Application of a Recursive Filter to a Three-Dimensional Variational Ocean Data Assimilation System, ADVANCES IN ATMOSPHERIC SCIENCES, 27, 293-302.  doi: 10.1007/s00376-009-8112-9
    [9] BAI Yulong, LI Xin, and HUANG Chunlin, 2013: Handling error propagation in sequential data assimilation using an evolutionary strategy, ADVANCES IN ATMOSPHERIC SCIENCES, 30, 1096-1105.  doi: 10.1007/s00376-012-2115-7
    [10] GAO Feng*, Peter P. CHILDS, Xiang-Yu HUANG, Neil A. JACOBS, and Jinzhong MIN, 2014: A Relocation-based Initialization Scheme to Improve Track-forecasting of Tropical Cyclones, ADVANCES IN ATMOSPHERIC SCIENCES, 31, 27-36.  doi: 10.1007/s00376-013-2254-5
    [11] Zhiyong MENG, Eugene E. CLOTHIAUX, 2022: Contributions of Fuqing ZHANG to Predictability, Data Assimilation, and Dynamics of High Impact Weather: A Tribute, ADVANCES IN ATMOSPHERIC SCIENCES, 39, 676-683.  doi: 10.1007/s00376-021-1362-x
    [12] Yong LI, Siming LI, Yao SHENG, Luheng WANG, 2018: Data Assimilation Method Based on the Constraints of Confidence Region, ADVANCES IN ATMOSPHERIC SCIENCES, 35, 334-345.  doi: 10.1007/s00376-017-7045-y
    [13] Banglin ZHANG, Vijay TALLAPRAGADA, Fuzhong WENG, Jason SIPPEL, Zaizhong MA, 2015: Use of Incremental Analysis Updates in 4D-Var Data Assimilation, ADVANCES IN ATMOSPHERIC SCIENCES, 32, 1575-1582.  doi: 10.1007/s00376-015-5041-7
    [14] Fuqing ZHANG, Meng ZHANG, James A. HANSEN, 2009: Coupling Ensemble Kalman Filter with Four-dimensional Variational Data Assimilation, ADVANCES IN ATMOSPHERIC SCIENCES, 26, 1-8.  doi: 10.1007/s00376-009-0001-8
    [15] Jidong GAO, Keith BREWSTER, Ming XUE, 2006: A Comparison of the Radar Ray Path Equations and Approximations for Use in Radar Data Assimilation, ADVANCES IN ATMOSPHERIC SCIENCES, 23, 190-198.  doi: 10.1007/s00376-006-0190-3
    [16] Ting WANG, Jianfang FEI, Xiaoping CHENG, Xiaogang HUANG, Jian ZHONG, 2018: Estimating the Correlated Observation-Error Characteristics of the Chinese FengYun Microwave Temperature Sounder and Microwave Humidity Sounder, ADVANCES IN ATMOSPHERIC SCIENCES, 35, 1428-1441.  doi: 10.1007/s00376-018-8014-9
    [17] Xiaoli XIA, Jinzhong MIN, Feifei SHEN, Yuanbing WANG, Chun YANG, 2019: Aerosol Data Assimilation Using Data from Fengyun-3A and MODIS: Application to a Dust Storm over East Asia in 2011, ADVANCES IN ATMOSPHERIC SCIENCES, 36, 1-14.  doi: 10.1007/s00376-018-8075-9
    [18] HOU Tuanjie, Fanyou KONG, CHEN Xunlai, LEI Hengchi, HU Zhaoxia, 2015: Evaluation of Radar and Automatic Weather Station Data Assimilation for a Heavy Rainfall Event in Southern China, ADVANCES IN ATMOSPHERIC SCIENCES, 32, 967-978.  doi: 10.1007/s00376-014-4155-7
    [19] Yaodeng CHEN, Jie SHEN, Shuiyong FAN, Deming MENG, Cheng WANG, 2020: Characteristics of Fengyun-4A Satellite Atmospheric Motion Vectors and Their Impacts on Data Assimilation, ADVANCES IN ATMOSPHERIC SCIENCES, 37, 1222-1238.  doi: 10.1007/s00376-020-0080-0
    [20] Guifu ZHANG, Jidong GAO, Muyun DU, 2021: Parameterized Forward Operators for Simulation and Assimilation of Polarimetric Radar Data with Numerical Weather Predictions, ADVANCES IN ATMOSPHERIC SCIENCES, 38, 737-754.  doi: 10.1007/s00376-021-0289-6

Get Citation+

Export:  

Share Article

Manuscript History

Manuscript received: 12 August 2020
Manuscript revised: 11 January 2021
Manuscript accepted: 19 January 2021
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Background Error Covariance Statistics of Hydrometeor Control Variables Based on Gaussian Transform

    Corresponding author: Yaodeng CHEN, keyu@nuist.edu.cn
  • Key Laboratory of Meteorological Disaster of Ministry of Education (KLME) / Collaborative Innovation Center on Forecast and Evaluation of Meteorological Disasters, Nanjing University of Information Science and Technology, Nanjing 210044, China

Abstract: Use of data assimilation to initialize hydrometeors plays a vital role in numerical weather prediction (NWP). To directly analyze hydrometeors in data assimilation systems from cloud-sensitive observations, hydrometeor control variables are necessary. Common data assimilation systems theoretically require that the probability density functions (PDFs) of analysis, background, and observation errors should satisfy the Gaussian unbiased assumptions. In this study, a Gaussian transform method is proposed to transform hydrometeors to more Gaussian variables, which is modified from the Softmax function and renamed as Quasi-Softmax transform. The Quasi-Softmax transform method then is compared to the original hydrometeor mixing ratios and their logarithmic transform and Softmax transform. The spatial distribution, the non-Gaussian nature of the background errors, and the characteristics of the background errors of hydrometeors in each method are studied. Compared to the logarithmic and Softmax transform, the Quasi-Softmax method keeps the vertical distribution of the original hydrometeor mixing ratios to the greatest extent. The results of the D′Agostino test show that the hydrometeors transformed by the Quasi-Softmax method are more Gaussian when compared to the other methods. The Gaussian transform has been added to the control variable transform to estimate the background error covariances. Results show that the characteristics of the hydrometeor background errors are reasonable for the Quasi-Softmax method. The transformed hydrometeors using the Quasi-Softmax transform meet the Gaussian unbiased assumptions of the data assimilation system, and are promising control variables for data assimilation systems.

摘要: 水凝物变量的初始化在数值模式预报中起到了非常重要的作用。资料同化理论上要求分析场、背景场以及观测误差概率密度函数满足高斯无偏分布的假定。针对水凝物变量的非高斯问题,本研究提出了一个适用于水凝物变量的高斯转换方法,并分别对不同转换方法获得的水凝物变量的空间分布特征、误差的高斯性以及背景误差协方差的特征进行了分析。高斯诊断分析表明,与其它转换方法相比,本方法转换后的水凝物变量比其它方法转换的变量更接近高斯分布,同时该方法保持了水凝物变量本身的空间分布特征。进一步地,作者将该方法引入到控制变量转换中,并统计分析了控制变量转换后的水凝物变量的背景误差协方差,结果表明,转换后的水凝物变量背景误差协方差的结构特征是合理的。本研究表明,经过本文方法高斯转换后的水凝物变量可以满足资料同化系统中背景误差高斯无偏分布的假定,可以作为资料同化系统的控制变量。

    • The initialization of hydrometeors plays a vital role in numerical weather prediction (NWP) due to their participation in the microphysical process related to clouds and precipitation (Errico et al., 2007; Bauer et al., 2011; Kerr et al., 2015). Various methods have been applied to improve the initialization of hydrometeors in NWP, such as the cloud analysis technique (Hu et al., 2006; Toth et al., 2012), the nudging-based technique (Huang et al., 2018; Wang et al., 2018) and some more advanced data assimilation techniques, like variational assimilation method (Sun and Crook, 1997; Xiao et al., 2007; Gao and Stensrud, 2012; Wang et al., 2013; Chen et al., 2015, 2016; Chen et al., 2020), Ensemble Kalman Filter (EnKF) technique (Tong and Xue, 2005; Dowell et al., 2011; Jones et al., 2013; Putnam et al., 2019), and the hybrid Ensemble-Variational (EnVar) method (Gao and Stensrud, 2014; Wang and Wang, 2017; Pan et al., 2018; Meng et al., 2019).

      In the variational and EnKF-based data assimilation techniques, the analysis is obtained by finding a maximum likelihood of the probability density functions (PDFs) of the true state of atmosphere when observations and a priori background estimation are given. Common data assimilation systems theoretically require that the PDFs of analysis, background, and observation errors satisfy the Gaussian unbiased distribution. If the assumption is not satisfied, unrealistic analysis will arise in the data assimilation process (Errico et al., 2000; Ravela et al., 2007).

      A non-Gaussian (NG) nature of the background errors could result from the time integration of the model nonlinearity (Bocquet et al., 2010), especially the highly nonlinear physical processes in NWP (Auligné et al., 2011). The displacement errors of meteorological features may also lead to NG in the background errors (Lawson and Hansen, 2005). Among the model variables which are commonly used in the data assimilation systems, hydrometeors tend to have the highest degree of nonlinearity and the lowest predictability (Fabry and Sun, 2010; Fabry, 2010). Thus, NG is inevitable when the control variables of data assimilation systems include hydrometeors, which is of vital importance for assimilating cloud sensitive observations, like radar reflectivity and cloudy satellite radiances (Errico et al., 2007).

      Various studies have focused on how to include hydrometeors as control variables in the data assimilation systems in order to directly analyze hydrometeors from the cloud sensitive observations. The total water mixing ratio (TWMR), which is the sum of the humidity-related variable and hydrometeors, is used as the control variable in many studies (Xiao et al., 2007; Liu et al., 2009; Yang et al., 2016; Li et al., 2017). However, TWMR is indeed a humidity control variable, and it is often limited to the simple and incomplete microphysical process employed to separate the hydrometeor increments from the total humidity increments. Hydrometeor mixing ratios are also chosen as the control variables in some data assimilation systems (Gao and Stensrud, 2012; Wang et al., 2013; Chen et al., 2015). Hydrometeor mixing ratios are easy to implement in a data assimilation system, but the NG of the hydrometeor background errors has not been taken into consideration in past studies. Some other studies chose the logarithm of hydrometeor mixing ratios as the control variables in the data assimilation systems (Boukabara et al., 2011; Michel et al., 2011; Liu et al., 2020), but did not include a thorough discussion of the NG aspect of the logarithm of hydrometeors. Recently, some researchers have used reflectivity as the control variable in data assimilation system (Wang and Wang, 2017), but its application is limited to radar reflectivity assimilation with pure ensemble background error covariances. To obtain more Gaussian hydrometeor control variables, Hólm and Gong (2010) explored how to extend the humidity control variable transform method (Hólm, 2002) of the European Centre for Medium-Range Weather Forecasts (ECMWF) to include hydrometeors. In their research, the normalized hydrometeors were selected as the control variable candidates, but the exact formulation of the normalization was not given and needs further investigation.

      In this study, a new Gaussian transform method is proposed with the objective to construct more Gaussian hydrometeor control variables in variational data assimilation systems. The new Gaussian transform is modified based on the Softmax function (Bridle, 1990), and is named the Quasi-Softmax function in this study. This article will be organized as follows. In section 2, the Quasi-Softmax method, the D′Agostino test (D′Agostino, 1970), the configuration of the experiments, and the description of statistical samples are presented. In section 3, the discussion of the transformed hydrometeors from the perspective of spatial distribution, NG and characteristics of background errors is given. Finally, the conclusions are drawn in section 4.

    2.   Data and Methodology
    • For a set of samples ${x_1},{x_2}, \cdots,{x_n}, \cdots,{x_N}$, the normalization of their exponential functions are ${{\rm{x}}_1},{{\rm{x}}_2}, \cdots,{{\rm{x}}_n}, \cdots,{{\rm{x}}_N}$, where ${{\rm{x}}_n}$ is defined as

      This transformation is called the Softmax function (Bridle, 1990), which is commonly used in neural networks. The numerator of Eq. (1) is the exponential function of ${x_n}$, and the denominator is the sum of the exponentials of all the samples; β is a parameter that controls the degree of increase in the contrast of the Softmax function. The Softmax function is a normalized exponential function and is often used in neural networks for classification problems. The Softmax function can be used to represent the probability of class membership for parameters with exponential distributions such as the Gaussian distribution (Bishop, 1995). The Softmax function has been employed to transform cloud fraction to a more Gaussian-like control variable in retrieving cloud fraction from satellite radiances (Auligné, 2014).

      Considering that the magnitude of hydrometeor mixing ratios is relatively small and the typical non-precipitation region may cover a large area, the calculated denominator in Softmax function may be very close at different levels, making it possible that the vertical distribution characteristics of hydrometeor mixing ratios may be lost after transformation. To handle this issue, a modification has been made to the Softmax function, renamed as the Quasi-Softmax function. With the Quasi-Softmax function, the original hydrometeor mixing ratio ${q_{i,j,k}}$ is transformed to ${Q_{i,j,k}}$:

      In Eq. (2), ${q_{i,j,k}}$ is the corresponding hydrometeor at the coordinate position $(i,j,k)$, and ${\bar q_{i,j}}$ is the average of the vertical profiles of hydrometeors at horizontal positions $(i,j)$, which is defined as:

      where K is the number of vertical levels. Compared to the original Softmax function, the denominator of Quasi-Softmax function becomes the sum of the exponential function of ${\bar q_{i,j}}$ in certain areas rather than the whole model domain. The sum is calculated over an area where ${\bar q_{i,j}}$ > 0 and ${q_{i,j,k}}$ = 0. To increase the contrast after transformation, β is set to 100 for cloud water mixing ratio (${q_{\rm{c}}}$) and rain water mixing ratio (${q_{\rm{r}}}$), and 1000 for cloud ice mixing ratio (${q_{\rm{i}}}$) and snow mixing ratio (${q_{\rm{s}}}$). In this study, the Quasi-Softmax function is applied to the full variable states rather than the perturbations.

    • The degree to which samples deviate from being truly Gaussian can be detected from the PDF’s skewness and kurtosis. The skewness measures asymmetry of the PDF about its mean, while kurtosis is a measure of how peaked is the distribution. For a given sample ${x_1},{x_2}, \cdots,{x_n}, \cdots,{x_N}$, its skewness and kurtosis can be calculated as:

      where ${G_3}$ and ${G_4}$ are the skewness and kurtosis of the sample, respectively, and $\bar x$ is the mean of the sample. For a Gaussian distribution, skewness is zero, whereas positive (negative) ${G_3}$ values indicate a median of PDF that is smaller (larger) than its mean and with a large right (left) tail. The kurtosis will be 3 if the distribution is Gaussian, with larger tails and a narrow modal peak resulting in larger ${G_4}$ values. Sample skewness and kurtosis can be used together to detect deviations from being exactly Gaussian, but in NWP, the ensemble number is relatively small (typically <100), making the normality of skewness and kurtosis often difficult to attain with sufficient accuracy (Thode, 2002). Therefore, we introduce the D′Agostin test (hereafter K2 test; D′Agostin et al., 1970) to diagnosis the degree of NG of samples. The K2 test is a univariate statistical test which combines the transformed skewness and kurtosis, and it can be used to test the NG of samples with number > 20 (Thode, 2002). In the K2 test, ${G_3}$ and ${G_4}$ are transformed to ${f_3}({G_3})$ and ${f_4}({G_4})$, respectively, where ${f_3}({G_3})$ is defined as:

      and ${f_4}({G_4})$ is defined as

      Positive (negative) ${f_3}({G_3})$ values mean that the PDF distribution of sample has a median smaller (higher) than the mean with a longer right (left) tail, while positive (negative) ${f_4}({G_4})$ values indicate that the PDF has a larger (smaller) modal peak than the Gaussian distribution. Finally, ${f_3}({G_3})$ and ${f_4}({G_4})$ are combined to produce an omnibus test ${K^2}$:

      The ${K^2}$ (hereafter K2) is zero when the PDF of the sample is a Gaussian distribution. The higher the calculated K2 value is, the greater the NG of the sample will be. Legrand et al. (2016) used the K2 test to diagnose NG of forecast and analysis errors in a convective-scale model, and the NG of common variables relating to wind, temperature and humidity fields were well quantified by the K2 test. Therefore, in this study, the K2 test is employed to diagnose the NG of background errors of hydrometeors as well as that of the transformed hydrometeors. The detailed description of K2 test can be found in Thode (2002) and Legrand et al. (2016).

    • In this study, we focus more on the variational DA method, in which the background error covariance is static, homogeneous, and isotropic. The control variable transform (CVTs) method (Barker et al., 2004), which is common employed to model the background error covariance in variational DA systems, is used in this study. With the CVTs method, the square root of the background error ${{B}}$ matrix is decomposed into a series of sub-matrices:

      where ${{{U}}_{\rm{p}}}$, ${{{U}}_{\rm{v}}}$, and ${{{U}}_{\rm{h}}}$ are physical, vertical, and horizontal transforms, respectively. In this study, the cross-variable correlations among hydrometeors and other control variables are not considered in the physical transform ${{{U}}_{\rm{p}}}$; a recursive iterative filter is employed to calculate the vertical auto-correlations in the vertical transform ${{{U}}_{\rm{v}}}$; the horizontal auto-correlations are calculated with the application of recursive filters in horizontal transforms ${{{U}}_{\rm{h}}}$. In this study, a Gaussian transform ${{{U}}_{\rm{g}}}$ is added before the existing three transforms, and then the square root of the ${{B}}$ matrix is expressed as the product:

      The Gaussian transform is conducted before the physical transform, and it is applied to the full model variables rather than perturbations.

    • In this study, a heavy rainfall case that occurred in the middle and lower reaches of the Yangtze River from late June to early July 2016 was studied. This event resulted in great economic losses in China. The period from 0600 to 1800 UTC 2 July 2016 was selected as the period of interest. The 12-h accumulated precipitation for this period in the simulation domain is shown in Fig. 1a, as reported by the China Hourly Merged Precipitation Analysis (CHMPA; Shen et al., 2014). Figure 1b shows the brightness temperature of the channel 8 of the Himawari-8 Advanced Himawari Imager (AHI) valid at 1800 UTC 2 July 2016, where the cold colors indicate the cloudy regions, corresponding well to the precipitation areas shown in Fig. 1a.

      Figure 1.  (a) Observed 12-h accumulated precipitation (units: mm) from 0600 UTC to 1800 UTC 2 July 2016 in the study domain, (b) the brightness temperature (K) of channel 8 from Himawari-8 AHI valid at 1800 UTC 2 July 2016, and (c) the vertical profiles of qc, qi, qr, and qs (g kg−1) from one ensemble member valid at 1800 UTC 2 July 2016.

      The Weather Research and Forecasting (WRF) model V3.8.1 (Skamarock et al., 2008) is used as the NWP model in this study. The horizontal grid spacing is 4 km, and the number of horizontal grid points is 550×450. The number of vertical levels is 51, and the model top set to 10 hPa. The following physics parameterization schemes are adopted: the WRF single-moment 6-class microphysics scheme (WSM6); the Rapid Radiative Transfer Model for GCMs (RRTMG) shortwave and longwave radiation schemes; the Mellor-Yamada-Janjić (MYJ) boundary layer scheme. No cumulus parameterization is employed.

      Considering that hydrometeors evolve rapidly with time, in this study we chose to use the ensemble sample to calculate hydrometeor background errors, as employed in previous studies (Michel et al., 2011; Legrand et al., 2016). In order to obtain the statistical samples of background errors for hydrometeors, an 80-member ensemble forecast was carried out, which was initialized from an 80-member ensemble analysis valid at 0600 UTC 2 July 2016. The 80-member ensemble analysis was provided by the EnKF system of NCEP’s operational Global Data Assimilation System (GDAS). The 12-h forecasts of the 80-member ensemble valid at 1800 UTC 2 July 2016 were used as the statistical samples, and the background errors of hydrometeors were approximated by the deviations of each ensemble member from the ensemble mean. Figure 1c shows the vertical profiles of ${q_{\rm{c}}}$, ${q_{\rm{i}}}$, ${q_{\rm{r}}}$, and ${q_{\rm{s}}}$ from one ensemble member. The liquid hydrometeor mixing ratios (${q_{\rm{c}}}$ and ${q_{\rm{r}}}$) are confined primarily to levels below 500 hPa, while the ice particle mixing ratios (${q_{\rm{i}}}$ and ${q_{\rm{s}}}$) are only found in the middle and upper levels between 700 and 150 hPa. The magnitude of the four hydrometeors is about 10−5 kg kg−1, so only the levels at which the mean value of each hydrometeor is greater than 10−6 kg kg−1 are diagnosed in this study.

      This study aims to find a Gaussian transform method to construct more Gaussian hydrometeor control variables in data assimilation systems. Four experiments are designed, and the details of the four experiments are shown in Table 1. The experiment Origin uses the original hydrometeors as a benchmark. It has been pointed out that the logarithmic transform, like denary logarithmic (Log10), can bring the PDFs of background errors for some variables closer to Gaussian (Errico et al., 2007; Fletcher and Zupanski., 2007), so the experiment Log10 employs the logarithm of hydrometeors as in Michel et al. (2011). The Softmax function is used in experiment Softmax in this study; The newly constructed Quasi-Softmax function is employed in the experiment Q_softmax.

      ExperimentsHydrometeor control variables
      Origin${q_{i,j,k}}$
      Log10$\lg \left(\dfrac{{{q_{i,j,k}}}}{{{q_{\rm{0}}}}}\right);{q_0} = {10^{ - 3}}{\rm{kg}}\;{\rm{k}}{{\rm{g}}^{ - 1}} $
      Softmax$\dfrac{{\exp (\beta {q_{i,j,k}})}}{{\displaystyle\sum {\exp (\beta {q_{i,j,k}})} }}$
      Q_softmax$\frac{{\exp (\beta {q_{i,j,k}})}}{{\displaystyle\sum\limits_{{{\overline q }_{i,j}} > 0} {\exp (\beta {{\overline q }_{i,j}}){\rm{ - }}\displaystyle\sum\limits_{{q_{i,j,k}} > 0} {\exp (\beta {{\overline q }_{i,j}})} } }},{\overline q _{i,j}} = \dfrac{1}{K}\displaystyle\sum\limits_{k = 1}^K {{q_{i,j,k}}} $

      Table 1.  Four experiments and their hydrometeor control variables.

    3.   Results
    • In order to evaluate the impacts of different transform methods on the spatial distribution of hydrometeors, in this subsection the horizontal and vertical distribution of hydrometeors before and after transformation are studied.

      Figure 2 shows the horizontal distribution of the various transformed ${q_{\rm{s}}}$ fields at model level 25 (~300 hPa). The horizontal distribution of ${q_{\rm{s}}}$ in the four experiments are similar but display different values, but because all three transform methods are mathematical, this ensures that the values of the variables before and after the transformation can correspond one to one. The values of ${q_{\rm{s}}}$ in Log10 has been transformed to be negative ranging from −3 to 0, and the larger the original ${q_{\rm{s}}}$ is, the closer to 0 the transformed value in Log10 it will be. Softmax and Q_softmax have similar horizontal distribution characteristics to that of ${q_{\rm{s}}}$, but with different values due to the denominators in the transform function. It is also worth noticing that Softmax and Q_softmax have transformed ${q_{\rm{s}}}$ from zero to positive values. Their impact is discussed below.

      Figure 2.  Transformed ${q_{\rm{s}}}$ at model level 25 (~ 300 hPa) for (a) Origin (10−5 kg kg−1), (b) Log10 (logarithmic transform; kg kg−1), (c) Softmax (10−6 kg kg−1) and (d) Q_softmax (10−5 kg kg−1) from one sample.

      Compared to the horizontal distribution, the characteristics of the vertical distribution of hydrometeors are greatly changed. Figure 3 shows the vertical profiles of ${q_{\rm{c}}}$, ${q_{\rm{i}}}$, ${q_{\rm{r}}}$, and ${q_{\rm{s}}}$ for the four experiments. The vertical distribution of the transformed hydrometeors in Log10 has changed a lot when compared to that in Origin. It can be seen from Fig. 3b that the peaks of the transformed hydrometeors in Log10 are at the levels where the original hydrometeors are less, while the levels with more hydrometeors in Origin cannot be easily distinguished in Log10. The values of the transformed hydrometeors in Softmax are nearly the same at all levels. This can be explained by the fact that the denominators of the Softmax function, which is the sum over the horizontal domain, are very similar at each level. Compared with Log10 and Softmax, the shape of the vertical profiles of transformed hydrometeors in Quasi-Softmax are much closer to that in Origin, indicating that the vertical distribution characteristics of hydrometeor mixing ratios are kept to some extent after using the Quasi-Softmax transformation, and that our modification to the denominator of Softmax function is reasonable.

      Figure 3.  The vertical profiles of (a) qc, (b) qi, (c) qr, and (d) qs for Origin (10−5 kg kg−1), Log10 (kg kg−1), Softmax (10−6 kg kg−1) and Q_softmax (10−5 kg kg−1) from one sample.

    • The spatial distribution of the hydrometeors for the four experiments were compared, and the results show that the characteristics of the distribution of the original hydrometeors are similar with that in the Q_softmax. The NG of the background errors of hydrometeors for different transform methods are further studied in this subsection.

      An example of the horizontal structures of NG is given for ${q_{\rm{s}}}$ at the model level 25 (~300 hPa) of the four experiments by Fig. 4. The horizontal K2 values of original ${q_{\rm{s}}}$ (Fig. 4a) are large and the maximum value is > 180, indicating high NG of ${q_{\rm{s}}}$. The greatest K2 values in Origin are in the intersections of the clear sky and cloudy regions, while the values are relatively smaller inside the clouds, roughly between 25 and 75. As is shown in Fig. 4b, the NG of the transformed ${q_{\rm{s}}}$ in Log10 is decreased in the intersection regions of cloudy and clear sky, but the K2 values in the cloudy regions increase considerably when compared to that in Origin. For the Softmax method (Fig. 4c), the NG in the intersections of cloudy and clear area is reduced much, but the K2 values in cloudy regions are nearly unchanged. Almost all K2 values in Q_softmax (Fig. 4d) have been decreased to < 25, indicating that the ${q_{\rm{s}}}$ has been transformed into a more Gaussian-like variable.

      Figure 4.  K2 of qs at model level 25 (~ 300 hPa) for (a) Origin, (b) Log10, (c) Softmax and (d) Q_Softmax.

      The horizontal distribution of K2 for ${q_{\rm{i}}}$ at model level 25 (~300 hPa) is shown in Fig. 5. In Origin, the NG of ${q_{\rm{i}}}$ is decreased considerably when compared to ${q_{\rm{s}}}$, but the NG is still large in the intersection of clear and cloudy regions. Similar with ${q_{\rm{s}}}$, the NG of ${q_{\rm{i}}}$ is not decreased either in Log10 or Softmax. However, after transformed by the Q_softmax method, the K2 of ${q_{\rm{i}}}$ in most regions are decreased to < 25, indicating the NG has been decreased appreciably. It is also worth noticing that, in the area where there are small K2 values in either Origin and Log10, the K2 values in both Softmax and Q_Softmax are not null. This can be explained by the differences in transforming zero-value quantities among different ensembles, as mentioned previously. To avoid this phenomenon, an effective solution is to calculate the background error covariance in cloudy and clear regions as in Montmerle and Berre (2010), though this is beyond discussion in this study.

      Figure 5.  K2 of qi at model level 25 (~ 300 hPa) for (a) Origin, (b) Log10, (c) Softmax and (d) Q_softmax.

      The vertical profiles of K2 for the four experiments related to NG are shown in Fig. 6 for ${q_{\rm{c}}}$, ${q_{\rm{i}}}$, ${q_{\rm{r}}}$, and ${q_{\rm{s}}}$. The original hydrometeors show large deviation from being Gaussian with K2 values > 80 at almost every level. Among the four hydrometeors in Origin, ${q_{\rm{i}}}$ is the most Gaussian-like variable since the K2 values range from 50 to 80, though still indicating great NG. Those levels with less hydrometeors have greater NG while the levels with more hydrometeors are relatively more Gaussian-like, indicating that the NG of hydrometeors may be related to the distribution of hydrometeors themselves. The Log10 method slightly decreases the NG of ${q_{\rm{r}}}$, ${q_{\rm{s}}}$, and ${q_{\rm{i}}}$ above 300 hPa, but it increases the NG of ${q_{\rm{c}}}$, and ${q_{\rm{i}}}$ below 300 hPa, meaning that the logarithm of hydrometeors still has great NG. When compared to Origin and Log10, the NG of the transformed hydrometeors is decreased considerably in Softmax, except for ${q_{\rm{c}}}$ near 975 hPa and ${q_{\rm{i}}}$ above 250 hPa. The K2 values are decreased to < 60 in Softmax for the four hydrometeors, but it is still far from being Gaussian. The Quasi-Softmax method shows the most promising results in that the K2 values of all hydrometeors at almost every level are reduced to < 10, meaning that the hydrometeors have been transformed into more Gaussian-like variables. Even though the K2 values of ${q_{\rm{s}}}$ near 200 hPa and 400 hPa are a little larger, it still outperforms the other three methods. The results show that, with the Quasi-Softmax transform, the hydrometeors can be transformed into more Gaussian-like variables, which therefore can act as the hydrometeor control variable candidates in data assimilation systems.

      Figure 6.  Vertical profiles of K2 of (a) qc, (b) qi, (c) qr, and (d) qs for the four experiments. For each level, values are averaged over the horizontal domain.

    • In the previous two subsections, the spatial distribution characteristics and NG of the background errors of the hydrometeors for the four experiments were discussed. It was shown that the transformed hydrometeors in Q_Softmax exhibit a reasonable spatial distribution and are the most Gaussian variables among the four experiments. In this subsection, the background error characteristics of hydrometeors for the four experiments are discussed to further evaluate whether the background errors of the transformed hydrometeors are reasonable. The horizontal, vertical variances and the horizontal length scale are discussed, respectively.

      In the data assimilation, the weight of observations to analysis depends on the relative size of the background errors and the observation errors, so the variance of background errors plays a vital role in the data assimilation. Figure 7 shows the horizontal standard deviation (SD) of ${q_{\rm{s}}}$ at the model level 25 for the four experiments. As is shown in Fig. 7a, the SD of the original hydrometeors are larger in the cloudy regions (Fig. 2b) than that in the clear area, indicating that the uncertainties are greater in cloudy regions. The situation reverses for the experiment Log10 (fig. 7b), where larger SD exists in the clear air while being relatively smaller in cloudy regions. This may be explained by the fact that zero-values were not transformed in Log10 in some members, thus the contrast among the 80 members was increased after transformation. For experiments Softmax and Q_softmax, the characteristics of the horizontal distribution of SD are similar with that in Origin, but the values are relatively smaller.

      Figure 7.  Horizontal standard deviation of the transformed qs at the 25th model level for (a) Origin (10−4 kg kg−1), (b) Log10 (kg kg−1), (c) Softmax (10−7 kg kg−1) and (d) Q_Softmax (10−6 kg kg−1).

      Figure 8 shows the vertical profiles of the SD of the hydrometeors for the four experiments. For the original hydrometeor mixing ratios, the vertical distribution of SD is similar to the vertical distribution of hydrometeors themselves. The SD is larger at the levels where the hydrometeors are greater, which means that the uncertainty of hydrometeors are larger at these levels. The values of the SD of the experiment Log10 are almost the same at all levels for the four hydrometeors, though this may not properly represent the vertical characteristics of the background errors of hydrometeors. The vertical characteristics of SD in Softmax is similar to that in Origin. The vertical profile of SD for hydrometeors in Quasi-Softmax is also close to that in Origin except for ${q_{\rm{c}}}$ between 700 hPa and 500 hPa and ${q_{\rm{r}}}$ near 600 hPa. Generally, the hydrometeors transformed by the Quasi-Softmax function reasonably represent the vertical characteristics of background errors for hydrometeors. It is noted that the vertical structures are relatively sharper in Q_softmax, which may be not good for BE modeling. This may be explained by the large differences in denominators of the Quasi-Softmax function in different levels. To handle this problem, one potential solution is to introduce a vertical smoother in Quasi-Softmax function, such as averaging the denominators in Quasi-Softmax function of current level with its near upper and lower levels.

      Figure 8.  Vertical standard deviation profile of (a) qc, (b) qi, (c) qr, and (d) qs for Origin (10−5 kg kg−1), Log10 (10 kg kg−1), Softmax (10−8 kg kg−1) and Q_softmax (10−7 kg kg−1).

      The horizontal length scale is an important parameter which determines how far the observations can be spread in the control variable space. Figure 9 shows the horizontal length scales of the four hydrometeors for the four experiments. The length scales of the original hydrometeors are all < 8 km (2 model grid lengths), which means that the observations containing hydrometeor information will spread a much shorter distance when compared to the common control variables like wind, temperature, and humidity. For ${q_{\rm{c}}}$, the length scales for the four experiments at all levels are less than 5 km, with the scale in Origin being the largest. The structures of the length scale in Softmax and Q_softmax are similar to that in Origin. For ${q_{\rm{i}}}$, the length scale in Origin is nearly 8 km, while the characteristics of length scale in Q_Softmax method is the closest to that in Origin. The length scale of ${q_{\rm{i}}}$ in Softmax is very close at each level, which may be explained by the fact that the spatial correlation of transformed ${q_{\rm{i}}}$ is very similar using the Softmax transform method. When it comes to ${q_{\rm{r}}}$, the length scales in Softmax closely approximate those in the Origin method, and the shape in Q-softmax is similar to those two but with relatively larger length scales. The length scales of ${q_{\rm{s}}}$ in Q_Softmax are similar to that in Origin except for the levels near model level 28. Considering that the length scales of hydrometeors are relatively smaller, the differences among the four experiments can be neglected when they are applied to the data assimilation systems.

      Figure 9.  Horizontal length scale (km) for (a) qc, (b) qi, (c) qr, and (d) qs for Origin, Log10, Softmax and Q_softmax.

    4.   Conclusions
    • A new Gaussian transform method, Qusai-Softmax transform, was proposed to construct more Gaussian hydrometeor control variables satisfying the Gaussian unbiased assumptions of the data assimilation systems. The new Gaussian transform method was compared with the other three transform methods from the perspectives of the spatial distribution, the NG of background errors and the characteristics of background error covariances. A precipitation case from the rainy season of China was selected and an 80-member ensemble of 12-h forecasts was generated to obtain statistical samples of the background errors of hydrometeors.

      Firstly, the horizontal and vertical distribution characteristics of hydrometeors for the four different transform methods were discussed. The characteristics of the horizontal distribution characteristics of original hydrometeor mixing ratios were kept in experiments Log10, Softmax and Q_softmax, but the vertical distribution characteristics varied a lot in these three experiments. The Log10 and Softmax methods changed the vertical distribution greatly, while the Quasi-Softmax method basically kept the characteristics of vertical structures of hydrometeors.

      The D′Agostino test was used to diagnose the NG of background errors of the transformed hydrometeors. The original hydrometeors showed great NG, especially in the intersection areas between cloudy and clear regions, where the greatest uncertainty occurred. The Log10 method slightly improves the NG in the intersection areas but increased the NG in cloudy areas. The Softmax method improved the NG considerably in the intersection areas between cloudy and clear regions, but did not help in the cloudy area. The Quasi-Softmax decreased the NG of hydrometeors significantly and the transformed hydrometeors were much closer to a Gaussian distribution. The Softmax and Quasi-Softmax methods produced a new problem that the hydrometeors in clear areas are transformed such as not to have null values, indicating that the transformation should not be carried out in clear areas. These and other issues should be taken into consideration and will be explored in future work.

      The new Gaussian transform was added to the CVTs of the background error covariances, and the characteristics of ${{B}}$ for different transformed hydrometeors were compared. The Log10 method greatly changed the variance distribution both vertically and horizontally, while the characteristics of the variances of transformed hydrometeors using the Softmax and Quasi-Softmax methods showed reasonable results both in the horizontal and vertical variances. The horizontal length-scales for the three transformed hydrometeor types were all reduced when compared to original hydrometeor mixing ratios. The horizontal length-scales only covered < 2 model grid-lengths, so the differences among those transformed methods make little sense by tuning the length scales.

      In this study, the new Gaussian transform method was only evaluated by measuring the NG of the transformed hydrometeors and diagnosing the characteristics of the background errors. The new Gaussian transform method will be implemented to the data assimilation system, and its application to the assimilation of radar reflectivity or satellite radiance retrievals will be studied further in the near future. We also noticed that some Gaussian transform methods have been applied to other fields, like the Gaussian anamorphosis method applied to precipitation variables (Lien et al., 2013; Kotsuki et al., 2017). It is worth exploring application of this method to hydrometeors. Besides, some assimilation techniques based on the non-Gaussian framework, like particle filters, has been developed and applied in the recent decade (Poterjoy, 2016; Buehner and Jacques, 2020; Kawabata and Ueno, 2020), and it is also worth exploring their handling of NG in the assimilation of cloud-sensitive observations at convective scales.

      Acknowledgements. This research was funded by National Key Research and Development Program of China (Grant No. 2017YFC1502102), National Natural Science Foundation of China (Grant No. 42075148), and Graduate Research and Innovation Projects of Jiangsu Province (Grant No. KYCX20_0910). The numerical calculations of this study are supported by the High-Performance Computing Center of Nanjing University of Information Science and Technology (NUIST).

Reference

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return