-
The correlation-coefficient field r is given in the field area
$ \mathrm{\Omega } $ , the significant-correlation in the region$ \widehat{\mathrm{\Omega }}(\widehat{\mathrm{\Omega }}\in \mathrm{\Omega }) $ is$ (\left|r\right|\geqslant {r}_{\alpha }) $ , and$ D\left(\widehat{D}\right) $ is the area of$ \mathrm{\Omega }\left(\widehat{\mathrm{\Omega }}\right) $ .$ \widehat{\mathrm{\Omega }} $ is the significant-correlation region of r, and$ \widehat{D} $ is the significant-correlation area of r. -
Fields of r used in atmospheric circulation or climate analysis are obtained from the discretized data given on the lattice (site) network, and the most commonly used lattice network is a rectangular network with uniform spacing on plane
$ \lambda -\phi $ ($ \lambda $ is the longitude, ϕ is the latitude). Figure 1a is a uniform grid distance rectangular lattice network in the Northern Hemisphere, and the network area is$ D=2\pi {a}^{2} $ (a is the radius). Figure 1b shows a 160-station network in China with an area of about 9.6 million km2. Obviously, the distribution of lattice points on the network is uneven. In order to calculate$ \widehat{D} $ , it is first necessary to give the measurement method of the lattice network inhomogeneity. Luo et al. (2011) introduced the element area$ {d}_{s} $ of the s-th lattice (site), which is the part of the domain area D represented by the s-th lattice (site). The larger (smaller)$ {d}_{s} $ is, the sparser (denser) the dot distribution is near point s.$ {d}_{s} $ is the basis for calculating the area of the significant-correlation area, and its algorithm should be given first.Figure 1. Example of lattice (site) network. (a) Uniform grid distance rectangular lattice network in the Northern Hemisphere(
$ \mathrm{\Delta }\lambda ,\Delta \phi =\pi /18 $ );(b) 160-station network in China (Wang et al., 2011)(1) Calculation of element area
$ {d}_{s} $ Chung and Nigam (1999) pointed out that the element area of the lattice point located at latitude
$ \varphi $ on the lattice network in Fig. 1a was proportional to$\mathrm{c}\mathrm{o}\mathrm{s}\varphi$ . Figure 1a shows the uniform spacing on the$ \lambda -\varphi $ plane of the Northern Hemisphere ($\Delta \lambda \;\mathrm{a}\mathrm{n}\mathrm{d}\;\Delta \varphi$ are constant). The zonal lattice number is$ u(\Delta \lambda =2\pi /u) $ , and the meridional lattice number is$(v+1\left)\right(\Delta \varphi =\pi /2v)$ . The two-dimensional lattice point ordinal number$ \left(i,j\right)=\left(\mathrm{0,0}\right) $ is taken on the intersection point$ \left(\lambda ,\varphi \right)=\left(\mathrm{0,0}\right) $ between the prime meridian and the equator, and by using Fig. 2, the corresponding element area can be easily written (i is the longitude coordinates, j is the latitude coordinates):Figure 2. A schematic diagram of the calculation of the area of s lattice element. The shaded area of the Arctic is only
$ {d}_{s}/4 $ .where the units of
$ \Delta \lambda $ and$ \Delta \varphi $ are radians (rad), and the units of d(i,j) and${d}\left(i,j\right)$ are the radius of the Earth squared. Because$ \Delta \lambda $ is constant,$ d\left(i,j\right) $ only changes with j. Using the relation S~(i, j),where
$ d\left(i,j\right) $ can be easily converted to$ {d}_{s} $ . Note that the North Pole is processed into$ u $ lattice points,${d}_{s}=\overline {1,m}$ ,$ m=u\left(v+1\right) $ ,$\Delta \lambda =\Delta \varphi =\pi /$ 18,$ u $ =36,${v}$ =9, and$m=u (v+1)$ =360.Luo et al. (2011) presented the calculation scheme of S station area
$ {d}_{s} $ on a Chinese 160-station network (Fig. 1b), which is briefly introduced here with the help of Fig. 3. In the figure, S represents the test station located at$ {(\lambda }_{s},{\phi }_{s}) $ . The shaded area in Fig. 3 is the spherical part of the spherical crown with S as the pole, and the area$ {S}_{0} $ is called the area of the crown [(${S}_{0}=2\pi (1-\cos\theta ){a}^{2}$ , where$ a $ is the radius of the Earth and$ \theta ={\angle SOS}^{*} $ is the center angle corresponding to the pole of the spherical cap to the edge point)]. For the selected$ {S}_{0} $ , S station crown area$ {\mathrm{\Omega }}_{s} $ is determined, and its area is$ {S}_{0} $ . The number of stations in the statistics is determined to be$ {\widehat{m}}_{s} $ , and then the area$ {D}_{s} $ that belongs to the coverage area of station network$ {\mathrm{\Omega }}_{s} $ is calculated. Thus, the area$ {{d}}_{s} $ of station element S and the density of the station network are obtained, which are defined as:Figure 3. A schematic diagram of the calculation of the area of s station. The shaded area is
$ {\mathrm{\Omega }}_{s} $ .Figure 4 is the diagram representing the calculation of
$ {{d}}_{s} $ and${{m}}_{s}$ for the Lanzhou (inland), Altay (land border), and Shanghai (ocean boundary) stations. For the inland station,${{D}}_{s}$ =1, and for the boundary station,$ {{D}}_{s} $ <1. Table 1 shows${{d}}_{s}$ ,${{m}}_{s}$ , and the intermediate calculation results of$ {\widehat{m}}_{s} $ and${{D}}_{s}$ for the three stations.Figure 4. A schematic diagram of the structure of three types of stations. The elliptical line is the
$ {\mathrm{\Omega }}_{s} $ boundary, the shallow shadow area is the ocean, the deep shadow area is the land, and the white curve is the land boundary/sea land demarcation line. (a) Lanzhou (inland) station, (b) Aletai (land border) station, and (c) Shanghai (ocean boundary) station.City Number (s) ${\lambda }_{s}{(}^{\circ }{\rm{E}})$ ${\phi }_{s}{(}^{\circ }{\rm{N}})$ $ {\widehat{m}}_{s} $ $ {D}_{s} $ $ {d}_{s} $ $ {m}_{s} $ Lanzhou 138 103.88 36.05 9 1.000 0.111 9.00 Aletay 156 88.08 47.73 1 0.454 0.454 2.20 Shanghai 56 121.46 31.41 10 0.413 0.041 24.21 Table 1. The
$ {\widehat{m}}_{s} $ ,$ {D}_{s} $ ,$ {d}_{s} $ , and$ {m}_{s} $ of the Lanzhou, Altay, and Shanghai stations.(2) Calculate of the significant-correlation area D
Figure 1 shows the correlation-coefficient field r in the lattice (site) network composed of m phase relationship values
$ {{r}}_{s} $ . The significance test of reliability$ \alpha $ is carried out for each${{r}}_{s}$ , where k-th$ {r}_{s} $ are significant. We introduce 0 and 1 functions,in which we take 1 through the test lattice (site) point
$ {p}_{s} $ ; otherwise, we take 0. Thus, the formula for calculating the area of the significant-correlation area in the correlation-coefficient field is obtained aswhere
$ \widehat{D} $ is the new statistic used in this paper for the significance test of correlation-coefficient field. -
The empirical Monte Carlo (EMC) method for significance testing a correlation-coefficient field r on a lattice network (Fig. 1a) and site network (Fig. 1b) is the same as using statistic
$ \widehat{D} $ (area of significant-correlation area) to replace k (number of significant-correlation points) in the Livezey method. Here, the EMC method is given by taking the significance test of the correlation-coefficient field r of a 160-station network in China (Fig. 1b) as an example. Meanwhile, the Livezey method is introduced in order to compare it (Livezey and Chen, 1983; Wang et al., 2007, 2011) with the test results. -
r is the correlation-coefficient distribution fields of the time series x and field series Y or the local correlation-coefficient distribution fields of the sequence of fields X and Y in the same grid (site) network at the same time (or delayed) in time series
$ q=1,\dots ,n $ . x, X, and Y are expressed aswhere
$ {x}_{q} $ is the t-th time value of sequence x, and$ {x}_{q} $ and$ {y}_{q} $ are the$ t $ -th time fields of field orders X and Y. The test steps of reliability$ \alpha $ of r using the EMC method are as follows:(1) According to the method described in section 2, the significant-correlation area
$ \widehat{D} $ of sample correlation-coefficient field r is found.(2) The sample sequence
$ {x}_{q},q=1,\dots ,n $ or$ {\mathit{x}}_{q}, $ $q=1,\dots ,n $ is randomly selected L times by a random function, where L = 1000 or more. Then, the correlation-coefficient field of$ {\mathit{x}}_{l} $ (or$ {\mathit{X}}_{l} $ ),$ l=1,\dots ,L $ , and Y are found, and the following is obtained:(3) The significant area
$ {\widehat{D}}_{l},l=1,\dots ,L $ of$ {r}_{l},l=1,\dots ,L $ is found, and it is arranged in non-ascending order to get$ {\widehat{D}}_{l},l=1,\dots ,L $ . Then, making${l'}_{\alpha }=\alpha \times L$ ,${\widehat{D}}_{{{l}^{{'}}}_{\alpha }}$ is the critical value of$ \alpha $ for the reliability of$ {\widehat{D}}_{l} $ . For$ \alpha $ =0.05, L=1000,${l'}_{0.05}$ =50,$ {\widehat{D}}_{50} $ is a value that the reliability of$ \widehat{D} $ is the critical value of 0.05.(4) The significance of r is tested with statistic
$ \widehat{D} $ : If$ \widehat{D}\geqslant {\widehat{D}}_{{l}_{\alpha }} $ , then r is significant with respect to$ \alpha $ under the reliability, and x(X) and Y are from the relevant parent; otherwise, r is not significant, and x(X) and Y come from an uncorrelated parent.The key of the EMC method is to use the random sequence of the sample sequence x(X) to generate its simulated sequence
$ {\mathit{x}}_{l}\left({\mathit{X}}_{l}\right) $ , which is the basis of the word “empirical” in its name. This process makes the statistical properties of$ {\mathit{x}}_{l}\left({\mathit{X}}_{l}\right) $ closest to the sample and the statistical properties of the sample closest to its parent. For example, the$ {\mathit{r}}_{l} $ obtained by the EMC method in step (2) should be well organized. Therefore, the EMC significance test of r considers the spatial correlation of X (or Y). -
For the 160-station network of China, when the reliability
$ \alpha $ =0.05 and L=1000, the critical value of k for the number of significantly correlated stations of in the correlogram is$ {k}_{0.05} $ =13 (Wang et al., 2007). When$ {k > k}_{0.05} $ , r passes the significance test, and x(X) and Y come from the correlation matrix. -
According to the method of Sun et al. (2015), we obtained the intensity sequence
$ {P}_{t},t=1,\dots ,70 $ (P) of five atmospheric activity centers (ACAs) during a 70-yr period (1951–2020) in the Northern Hemisphere in winter, which is x in Eq. (5). Then, the temperature and precipitation field sequences (T and R) were obtained at 160 stations in China during the same period, which is Y in Eq. (5). Figures 5 and 6 show the simultaneous correlation-coefficient field r of x-T and x-R, respectively.Figure 5. Correlations between ACA intensity index in the Northern Hemisphere and temperature at 160 stations in China in winter from 1951 to 2020. The shaded area is the significant-correlation area that passes the t test at the
$ \alpha =0.05 $ level. (a) Iceland low. (b) North Atlantic High. (c) Mongolia High. (d) Aleutian Low. (e) North Pacific High.Figure 6. Correlations between ACA intensity index in the Northern Hemisphere and precipitation at 160 stations in China in winter from 1951 to 2020. The shaded area is the significant-correlation area that passes the t-test at the
$ \alpha =0.05 $ level. (a) Iceland low. (b) North Atlantic High. (c) Mongolia High. (d) Aleutian Low. (e) North Pacific High.Table 2 shows the significance test results of the EMC method with the area
$ \widehat{D} $ and$ \alpha $ = 0.05 of the significance test area in Figs. 4 and 5. Two qualitative ACAs in the North Atlantic (Iceland low and North Atlantic High) and the Mongolia High intensity in eastern Eurasia are significantly correlated with the corresponding winter temperature in China. Quantitatively, the correlation between the Mongolian High and temperature is the strongest, followed by the Icelandic Low and the North Atlantic High. The correlation between intensity and China’s winter precipitation in the same period is much weaker, and only the Icelandic Low intensity is significantly correlated with China’s precipitation in the same period. The above conclusions are consistent with the size of the significant-correlation area in Figs. 5 and 6.Climate Field Intensity Index (P) LIC HNA HMO LAL HNP T 11.363 5.911 15.645 0.156 1.633 R 5.711 2.048 1.490 2.278 0.971 Note: Bolded values indicate significance test by the EMC method of $ \alpha $ = 0.05. LIC is Iceland low, HNA is North Atlantic High, HMO is Mongolia High, LAL is Aleutian Low, and HNP is North Pacific High. Table 2. Area with significant correlations between ACA intensity indices and temperature/precipitation from 1951 to 2020 and the result of the EMC method test.
Table 3 shows the Livezey significance test results of the number of significantly correlated stations k and
$ \alpha $ =0.05 in Figs. 5 and 6. It can be seen that the test results of P-T are consistent. However, the test results of P-R vary greatly. Regarding P-R, in addition to the Icelandic Low, the Mongolian High, Aleutian Low, and North Pacific High were also significantly correlated. As can be seen from Figs. 6c–e, although the area of the significant-correlation area above them is small, they are all located in the dense area of the site in Fig. 1b. The inhomogeneity of the station network seriously affects the significance test of r.Climate Field Intensity Index (P) LIC HNA HMO LAL HNP T 100 45 144 1 7 R 74 10 19 26 9 Note: Bolded values indicate significance test by the Livezey method of $ \alpha $ = 0.05. LIC is Iceland low, HNA is North Atlantic High, HMO is Mongolia High, LAL is Aleutian Low, and HNP is North Pacific High. Table 3. The number of the stations with significant correlations between ACA intensity indices and temperature/precipitation from 1951 to 2020 and the result of Livezey method.
By comparing Tables 2 and 3, it is reasonable and effective to test the significance of the correlation-coefficient field r of an example by using the EMC method and through the significant-correlation area
$ \widehat{D} $ .
City | Number (s) | ${\lambda }_{s}{(}^{\circ }{\rm{E}})$ | ${\phi }_{s}{(}^{\circ }{\rm{N}})$ | $ {\widehat{m}}_{s} $ | $ {D}_{s} $ | $ {d}_{s} $ | $ {m}_{s} $ |
Lanzhou | 138 | 103.88 | 36.05 | 9 | 1.000 | 0.111 | 9.00 |
Aletay | 156 | 88.08 | 47.73 | 1 | 0.454 | 0.454 | 2.20 |
Shanghai | 56 | 121.46 | 31.41 | 10 | 0.413 | 0.041 | 24.21 |