We developed the statistical forecast guidance on ternary KDD in the springtime for each month separately. A three-grade ordinal logistic regression model was applied to generate a trinomial distribution using the final predictors given in Table 6. The optimal thresholds were determined based on skill scores. Some results were obtained in the following way.
First, the variable selection was performed by Fisher's scoring method offered by the statistical package called SAS. The final predictors, selected at the significance level 0.05, were the following: BS12, NOI7, and SOI4 for March; CT7, BR5, and NOI1 for April; and CT4, AR12, and AWS11 for May.
Second, the optimal thresholds were determined based on skill scores, which were 0.3 and 0.65 for March; 0.2 and 0.6 for April; and 05 and 0.5 for May.
Third, forecast guidance generated a binary forecast and a ternary forecast together. The proposed guidance for March was as follows:
x = 0.1223× BS12 - 1.8894× NOI7 - 1.918× SOI4;
e0 = exp(-8.5420 + x); e1 = exp(-3.7485 + x);
p0=e0 / (1 + e0);
p1=e1 / (1 + e1)-p0;
p2 = 1 - p0-p1.
IF p0 > 0.3 THEN
Ternary_forecast=0;
ELSE IF p1/(p1+p2) > 0.65 THEN
Ternary_forecast =1;
ELSE Ternary_forecast =2;
IF Ternary_forecast=0 OR Ternary_forecast=1
THEN Binary_forecast=1;
ELSE Binary_forecast=2.
The proposed guidance for April was as follows:
x = -3.5088× CT7 - 11.1385× BR5 +2.2744× NOI10;
e0 = exp(85.8864+ x); e1 = exp(88.0663+ x);
p0=e0/ (1 + e0);
p1=e1 / (1 + e1)-p0.
P2 = 1 - p0-p1;
IF p0 > 0.2 THEN
Ternary_forecast=0;
ELSE IF p1/(p1+p2) > 0.6 THEN
Ternary_forecast =1;
ELSE Ternary_forecast =2;
IF Ternary_forecast=0 OR Ternary_forecast=1 THEN
Binary_forecast=1;
ELSE Binary_forecast=2.
The proposed guidance for May was as follows:
x = 0.9213× CT4+41.0877× AR12-1.9697× AWS11;
e0 = exp(x); e1 = exp(4.176 + x);
p0=e0/(1 + e0);
p1=e1/(1 + e1)-p0;
p0 = 1 - p0 -p1;
IF p0 > 0.5 THEN
Ternary_forecast=0;
ELSE IF p1/(p1+p2) > 0. 5 THEN
Ternary_forecast =1;
ELSE Ternary_forecast =2;
IF Ternary_forecast=0 OR Ternary_forecast=1 THEN
Binary_forecast=1;
ELSE Binary_forecast=2;
Forth, the results of binary forecast were summarized in 2×2 contingency tables (Table 10). According to contingency tables, the computed skill scores were as follows: HR2 = 0.9, POD2 = 1.0, FAR2 = 0.22 for March; HR2 = 0.9, POD2 = 1.0, FAR2 = 0.2 for April; and HR2 = 1.0, POD2 = 1.0, FAR2 = 0.22 for May. Table 11 shows the results of cross-validation for binary forecast as follows; HR2 = 0.85, POD2 = 0.71, FAR2 = 0.16 for March; HR2 = 0.85, POD2 = 0.75, FAR2 = 0.14 for April; and HR2 = 1.0, POD2 = 1.0, FAR2 = 0.22 for May. These scores indicate that each forecast guidance may be reliable and useful.
Fifth, the results of ternary forecast were summarized in the 3×3 contingency tables (Table 12). Based on cross tables, the computed skill scores were as follows: HR3 = 0.85, wHR3 = 0.93 for March; HR3 = 0.85, wHR3 = 0.93 for April; and HR3 = 0.9, wHR3 = 0.95 for May. Table 13 shows the results of cross-validation for ternary forecast and their skill scores were as follows: HR3 = 0.75, wHR3 = 0.88 for March; HR3 = 0.8, wHR3 = 0.9 for April; and HR3 = 0.9, wHR3 = 0.95 for May. These scores indicate that the proposed forecast guidance may be reliable.
Sixth, we applied the proposed models to forecast KDD in the years 2011 and 2012. The results are summarized in Table 14. In 2011, the actual category and forecasted category of KDD were (normal, above normal), (below normal, below normal) and (above normal, above normal) for March, April, and May, respectively. The rate of correct forecast was 67%. In 2012, they are (below normal, normal), (below normal, below normal) and (below normal, normal) for March, April and May respectively. Unfortunately, the rate of correct forecast was only 33% in this case. In April and May 2012, North Pacific high enlarged unusually to the Korean peninsula, which might have suppressed dust transport to South Korea.