Suppose an unknown term E exists, which can be considered as the overall effect of different sources of model . As a result, the NWP model can be rewritten as an inverse problem together with past observations described in (Xue et al., 2015).
We further discretize the model error E into series of datasets and denote Ei (i=-n,-(n-1),
,-1) as the ME in the past interval between iδ and (i+1)δ. For brevity, the following discussion focuses on one state variable of the model equation at the space-discretized point, as in (Xue et al., 2015). In the past arbitrary interval (-iδ,-(i-1)δ) [where i=-n,-(n-1),
,-1], the ME \(\overlineE_i\) in the corresponding interval can be easily obtained iteratively by using the method described in (Xue et al., 2015).
Suppose that, for a moment, the ME is dependent on the time and of the polynomial form of m order as E(t)=a0+a1t+a2t2+
+amtm. a0,a2,
,am are coefficients. Integrating the model Eq. (2a) in (Xue et al., 2015) for each interval δ yields the following equations:
\begin{equation} \label{eq2} \overline{E}_i\delta=a_0\delta\!+\!a_1\dfrac{(i+1)^2-i^{2}}{2}\delta^2+\cdots+a_m\dfrac{(i+1)^{m+1}{\rm -\emph{i}}^{m+1}}{n+1}\delta^m , \end{equation}
where i=-1,-2,
,-n, and m is a positive integer smaller than n. It is noted that there are n number of equations and m+1 number of unknown coefficients, which can be solved by the least-squares method. Minimizing the norm:
\begin{eqnarray} \label{eq3} J(a_0,a_1,\cdots,a_m)&=&\sum_{i=-n}^{-1}\bigg[\overline{E}_i\delta-\bigg(a_0\delta+a_1\dfrac{(i+1)^2-{\it i}^{2}}{2}\delta^2\nonumber\\ &&+\cdots+a_m\dfrac{(i+1)^{{\rm m}+1}{\rm -\emph{i}}^{{\rm m+1}}}{n+1}\delta^m\bigg)\bigg]^2 ,\quad\ \end{eqnarray} results in the following equation set:
\beginsubequations \begin{align} \label{eq4} &\dfrac{\partial {\it J}(a_0,a_1,\cdots,a_m)}{\partial a_0}=0 ,\\ \label{eq5} &\dfrac{\partial {\it J}(a_0,a_1,\cdots,a_m)}{\partial a_1}=0 ,\\ &\cdots\cdots\nonumber \end{align} \endsubequations \begin{equation*} \label{eq6} \dfrac{\partial {\it J}(a_0,a_1,\cdots,a_m)}{\partial a_{m}}=0 . \eqno{({\rm 4bm})} \end{equation*}
Obviously, an explicit formula of ME can be solved from the equation set, which is actually a kind of curve-fitting result. One may argue that this formula works in the past model integration time, but whether it does in model forecast time is not confirmed. However, for some ME components of stable statistical character, e.g., climatological systematic errors, discussed in section 2, the statistical formula result from the past will work in future. The climatological systematic error is the difference between model forecasts and observations, which maintain in a specific season and increase linearly in short-term prediction (Danforth et al., 2007). That means a constant online corrector is enough for the long-existent linear-increasing climatological systematic errors. It was discussed in (Xue et al., 2015) that direct application of a constant model correction in the NWP model should impact the model at the very beginning of model integration, which makes the iteration divergent. Here, we are not concerned with the convergent problem, but the impact of the NWP model at the very beginning of model integration may also lead to some uncertainties. In reality, to deal with this problem technically, the correction is increased linearly from zero to its true magnitude in a few steps of model integration and then maintained at a constant in the following steps. Based on this discussion, the order of the polynomial should be zero (m=0), and the solution of the equation set is easily solved as
\begin{equation} \label{eq7} E=a_0=\dfrac{1}{n}\sum_{i=-n}^{-1}\overline{E}_i . \end{equation}
Actually, Eq. (8) shows that the systematic model error is none other than the time average of the MEs in past intervals. According to Eq. (2), the systematic errors are a kind of mean error, but the sample amount usually requires a long series of data (e.g., decades of annual data). However, to save computational resources (plus, the purpose of this study is not real operational use of the method), we use two-month samples to calculate the mean forecast error. Although this is too short to totally represent the systematic errors, the results show that the patterns and evolutions of the two-month mean errors are similar to the systematic error (5-year averaged errors). Therefore, the two-month mean errors are approximately represented by the systematic errors in the following sections.
Another factor that should be considered is the increasing rate of systematic errors. We used 2-month MEs to estimate the mean errors and to approximately represent the systematic errors in (Xue et al., 2015). The patterns and evolution of mean error corrections estimated from Eq. (8) have been compared offline to the mean forecast errors (Xue et al., 2015). The results show that the patterns of estimated error are well matched to the mean forecast errors for both the Northern Hemisphere and the tropics, but the increasing rates of estimated error are over-abrupt compared to mean forecast errors. Hence, a weight α is necessary to account for the overestimated error corrector, and the corrector becomes \begin{equation} \label{eq8} E=\dfrac{\alpha}{n}\sum_{i=-n}^{-1}\overline{E}_i . \end{equation}
It is expected that a large amount of systematic errors will be offset by choosing the weight reasonably. However, there is another parameter that should be considered: the number of past MEs, n. Empirically, the climatological systematic error corrector should result statistically from the past data and model outputs of multiple years (Glahn and Lowry, 1972; Carter et al., 1989). Here, for the purpose of reducing the computational cost, we use the spectrum analysis method to obtain a proper amount of past data. Thus, averaged spectra of the MEs of horizontal velocity components u and v, perturbation from reference potential temperature θ' and Exner pressure perturbation \(\Pi'\) (blue) at 3rd model level and the 14th model level for GRAPES-GFS are shown in Fig. 4. Estimates are based on the MEs resulting from iteration for January-February 2010 with 6 h intervals. Apparently, the spectra of the four variables show three peaks in 60 days: the first is a one-day period, which might be associated with the diurnal cycle; the second is greater than a 3-day period, which may be related to the synoptic process; and the third one is greater than a 30-day period, which may represent the long-existent climatological systematic errors. Finally, we use 30 days' past data (n=120) to determine the correction in the following forecast tests. In this part, for the purpose of simply testing the new method, only long-existent climatological systematic errors were considered in the following experiments.
To determine the weight factor α, a set of experiments with changing α from 0 to 1 were conducted for January 2010 firstly, based on the past MEs produced by iteration in (Xue et al., 2015). Before applying the model correction through model integration, the ME for the forecast period should be calculated based on its previous 30-day MEs. Then, the ME is introduced into the model as a tendency term and forces the model at every time step. As different values of the weight act on the ME, the influence of the correction could be balanced. Secondly, according to the results of the first set of experiments, a linear decayed factor was used to weight the correction. Then, the linear weight correction was applied for July 2009 (JUL2009) and January 2010 (JAN2010). The experiments were all initialized by 1°× 1° NCEP final (FNL) analyses at 0000 UTC and performed for 8-day forecasts. The model configuration and physical processes The resolution is set as 1°, and the model dynamics and physics are default.
Results