-
Three cases are used to demonstrate the rationality and effectiveness of the improved DBSCAN method in cloud classification. In cases 1 and 2, convective clouds with strong development are embedded in large-scale stratiform clouds, and melting layer echoes are obvious. The contrast images are screenshot from the operational software and classified according to the fuzzy logic method (Xiao and Liu, 2007). The classification results of the two methods are basically the same, but some HCSs near the melting layer are often identified as SFCs by the fuzzy logic method. Instead of judging only by one point in the fuzzy logic method, clouds are marked according to their spatial characteristics of multiple points with the same properties in the improved DBSCAN method. Therefore, the improved DBSCAN method obtains better recognition results than the fuzzy logic method.
Case 3 is a hail and thunderstorm gale weather event, and the identification results of convective clouds can clearly show the development process of disastrous weather.
-
A multicell precipitation weather process observed with the SA radar in Nanjing at 1000 UTC on 28 April 2015 is taken as an example. Figures 4a–c show images of the 0.5°, 1.5°, and 2.4° PPIs (plan position indicators), which are blocked around the azimuth at 135°. Figures 4d and e are the results of the cloud classifications conducted with the 2D and 3D models corresponding to Fig. 4a, respectively, in which eight colors are used to represent the eight types of clouds, namely, the core and boundary areas of the DCC, SCC, HCS, and SFC types. It can be seen from Figs. 4d and e that there are many convective clouds (marked with purple and red) embedded in large-scope HCSs and SFCs (marked with yellow and green, respectively), and lightning (marked with black plus signs) occurs in the convective cloud regions.
Figure 4. The reflectivity PPI at 0.5°, 1.5°, and 2.4° observed with the Nanjing SA radar at 1000 UTC on 28 April 2015 (a–c), and the results of the cloud classification performed with the 2D model (d) and 3D model (e) in which the symbol _c represents the core area, and _b represents the bound area of each type of cloud cluster. The black plus signs are the locations of lightning, the black and red lines are the locations of sections corresponding to Fig. 5, and each circle is 50 km apart (same below). To compare the recognition effect, classification PPIs of the convective cloud (red area) and stratiform cloud (blue area) are attached to (f), as identified by Xiao et al. (2007) according to the fuzzy logic method.
Clouds often occur above the 0.5° elevation height of the radar detection and exist at different elevations. Therefore, a 3D model with reflectivity at elevations of 0.5°, 1.5°, and 2.4° is more accurate for identifying cloud clusters than a 2D model. Figures 4a–c show the PPIs of the lowest three elevation angles. The echo height increases with elevation, and the 3D model can distinguish the core and boundary of the cloud cluster well.
Figures 4d and e show that the cloud classifications performed with the 2D and 3D models are basically the same, but differences are seen in some areas. For example, almost no clouds are seen in Fig. 4d, but full, weak SFCs are seen in Fig. 4e in the two boxes indicated by the black dotted line, suggesting that a weak SFC exists above the 0.5° elevation height that cannot be obtained with the 2D model. In addition, developing DCCs are marked by black ellipses in Fig. 4d, in which the core area of the DCC is still relatively small, but significantly increases in Fig. 4e, in which it is almost marked as the core of the DCC. Therefore, the 3D model has great advantages in identifying SFCs and developing DCCs. However, some developing convective clouds of small areas at the low level cannot be identified with the 3D model because they are covered by thick stratiform clouds. Therefore, the classification results may be more reasonable when the two models are combined to identify clouds together.
To further verify the classification process, two VPRs corresponding to the black (Fig. 5a) and red (Fig. 5b) lines in Fig. 4 are demonstrated, in which the horizontal axis denotes the length of the section, the vertical axis represents the height, and the black dotted line shows the height of the 0ºC layer according to the atmospheric reanalysis data. In Fig. 5a, two DCCs are located in the horizontal distance area from 0 km to 35 km; these clouds have obvious columnar structures with large vertical thicknesses and uneven echo tops (Zhang et al., 2010), and echo intensity heights larger than 45 dBZ extend over 7 km. The maximum intensities from 70 km to 95 km and from 105 km to 115 km are over 45 dBZ, and the 35 dBZ height is above 5 km, much higher than the 3.8 km 0ºC-level height; therefore, it is reasonable to identify HCSs using the improved DBSCAN algorithm in these areas with obvious convective structures; however, these areas are identified as SFCs with the fuzzy logic method (Fig. 4f). In addition, the intensities from 50 km to 70 km in Fig. 5a are relatively weak and are correctly identified as SFCs with the improved DBSCAN algorithm.
Figure 5. The reflectivity of the vertical sections corresponding to the black line (a) and the red line (b) in Fig. 4 (Range is horizontal distance, height is echo height, and the black dotted line is the 3.8 km height of the 0ºC layer in Nanjing at 1200 UTC on 28 April 2015).
In Fig. 5b, there is a strong convective cloud area from 5 km to 35 km in which the reflectivity of the echo center is larger than 65 dBZ and the 45 dBZ height is more than 12 km; this area is identified as DCCs in Fig. 4. In addition, there are several convective bubbles from 60 km to 120 km. The bubble from 60 km to 70 km is weak, and the 35 dBZ echo extends to a height of approximately 5 km and is classified as a SCC in Fig. 4. The bubble at approximately 110 km is relatively strong, showing an obvious columnar structure, and the 35 dBZ height is over 9 km, but the cloud is only marked as the SCC boundary because there are not sufficient core points in the SCC. However, this area further evolves to a core SCC area after six minutes (picture is omitted); therefore, the improved DBSCAN algorithm suggests a developing convective cloud.
According to the above analysis, some differences exist in cloud classifications between the 2D and 3D models. To further illustrate these differences, the number of each cloud type point is calculated and shown in Fig. 6. The total points in the 2D and 3D models are 34 316 and 42 501, respectively. The numbers of convective points are slightly different between the 2D and 3D models, but the total number of points identified with the 3D model is nearly 8 000 larger than that identified with the 2D model, which is mainly caused by the difference in SFCs. There are two reasons why the DCC, HCS, and SFC areas identified with the 3D model are larger than those identified with the 2D model. One is that the 3D model is based on the reflectivity factor at three elevation angles, allowing a larger height scope for identifying points. The other is that the range of the 3D model is divided into several sections to set different density threshold values that can mark more points at far distances. In addition, the number of each type of cluster point under different parameter settings is counted, while two parameters (density threshold and ε neighborhood) of the improved DBSCAN algorithm are set differently. The SCC area identified with the 2D model is slightly higher than that identified with the 3D model, which may be caused by the weak development of the SCC during the event.
The parameter change in the 2D model is taken as an example to illuminate the impact of the parameters on the classification. In terms of the density threshold (Table 1), the number of echo points is listed, while the density is set to 3, 4, 5, 6, and 7 points within a one-kilometer neighborhood. With the increase in the density threshold, it is found that the more severe the conditions for identifying clouds are, the smaller the echo area of the cloud cluster. In terms of the ε neighborhood (Table 2), the number of points is listed, while the threshold values of ε are given as 1 km, 2 km, 3 km, and 4 km. Contrary to the changing trend of the density threshold, the larger the ε neighborhood is, the larger the echo area of the identified cloud cluster. It is obvious from Tables 1 and 2 that the values of the density threshold and ε neighborhood greatly influence the classification results.
Threshold of density Cloud type 3 4 5 6 7 Core of DCC 953 812 733 725 699 Boundary of DCC 598 566 508 496 444 Core of SCC 812 779 797 795 664 Boundary of SCC 1491 1356 1252 1096 914 Core of HCS 2355 2100 1992 2041 1935 Boundary of HCS 2505 2233 2067 1837 1524 Core of SFC 22467 21560 20543 19085 15921 Boundary of SFC 6179 6276 6424 6541 6906 Table 1. Point number of each cloud type obtained under different density thresholds.
ε neighborhoods Cloud type 1 2 3 4 Core of DCC 733 857 1015 1128 Boundary of DCC 508 1097 1695 2362 Core of SCC 797 413 278 234 Boundary of SCC 1252 2132 2705 3440 Core of HCS 1992 2020 1874 1667 Boundary of HCS 2067 3503 4584 5482 Core of SFC 20 543 22 592 22 435 22 277 Boundary of SFC 6424 9476 12161 13877 Table 2. Point number of each cloud type in different ε neighborhoods.
-
Another radar-based dataset taken as an example was observed with the SA radar in Xuzhou at 0754 UTC on 6 July 2019. The identifications of cloud clusters with the DBSCAN and fuzzy logic algorithms are shown in Fig. 7. The large-scope purple areas indicate that this was a strong convective weather event, and the convective clouds accompanied by lightning were very energetic. Both the 2D and 3D models can obviously classify the convective and stratiform features, and similar to case one, the 2D model marks more SCC points while the 3D model marks more SFC points.
Figure 7. Similar to Fig. 4, but the radar data were observed in Xuzhou at 0754 UTC 6 July 2019; the black line is the location of the section corresponding to Fig. 8.
Comparing the classifications between adjacent times 0754 UTC and 0800 UTC (figure omitted), the evolution and dissipation of convective clouds can be found in both the 2D and 3D models; these results can improve our understanding of the mechanism of precipitation and allow us to better estimate precipitation.
In the azimuth area from 55° to 95° and ranging from 200 km to 230 km in the 0.5° reflectivity PPI, there is a strong region within the echo over 35 dBZ; this region is marked as an HCS by the 2D and 3D models but as an SFC or as an uncertain cloud cluster by the fuzzy logic method, which considers that these strong echoes are caused by a melting layer. To verify this classification, a vertical section (Fig. 8) is made along the echo region (black line in Fig. 7a). As shown in Fig. 8, most echoes above 35 dBZ are higher than the 4.5-km height of the melting layer; therefore, it is reasonable that the cloud is marked as an HCS by the improved DBSCAN method. In addition, although the intensity is greater than 35 dBZ at the bottom of the black line shown in Fig. 7a, the cloud is marked as an SFC by both the 2D and 3D models. Figure 8 also verifies that the echo in this region is not strong.
-
Affected by the cold vortex, a large-scale severe convective weather event occurred in Shandong Province on 17 May 2020. Disasterous weather, including thunderstorms, gales of magnitude 8–10, and hail, occurred in many places that day. Taking data from the Qingdao SA radar as an example, the clouds are classified by a 3D model. There is no super-refraction within the radar detection range, as found by examining the radial velocity and the intensity vertical texture. The points of the DCC and SCC are drawn in Fig. 9a. From 0940 UTC, thunderstorms and gales appeared, while convective clouds were detected in the radar detection range. Convective clouds developed to their most energetic stage from 1300 UTC to 1500 UTC and then weakened and completely dissipated at approximately 1900 UTC; disastrous weather also dissipated at that time.
Figure 9. The changing trend of the number of cloud clusters identified by the 3D model; (a) 0940–1904 UTC and (b) 1128–1220 UTC (scale of the point number of the SFC boundary is on the right side of the Y-axis, and the other seven types of clouds are on the left).
To analyze the relationship between the evolution of convection and the size of the cloud boundary area, the variations in the core and boundary points of each type of cloud from 1128 UTC to 1220 UTC are shown in Fig. 9b, and the corresponding PPI images of reflectivity are shown in Fig. 10. As shown in Fig. 10, various types of cloud clusters continued to develop during the eastward movement of the system. From 1128 UTC, the boundary of the SFC increased rapidly, which indicated that the weather system had entered the radar detection range. The core and boundary areas of other clouds increased slowly, and there was an opposite phase change between the curves of the core and boundary with time. In particular, the inverse phase change between the core and boundary curves of the DCC was more obvious, which clearly showed the dynamic evolutionary process of the convective system from occurrence to extinction. In addition, the convective clouds in classification images (Figs. 10e–h) are clearer than those in the PPIs (Figs. 10a–d). At this distance, the strong echoes caused by the melting layer are correctly recognized as stratiform clouds, which can aid in the better location of disastrous weather.
Threshold of density | Cloud type | ||||
3 | 4 | 5 | 6 | 7 | |
Core of DCC | 953 | 812 | 733 | 725 | 699 |
Boundary of DCC | 598 | 566 | 508 | 496 | 444 |
Core of SCC | 812 | 779 | 797 | 795 | 664 |
Boundary of SCC | 1491 | 1356 | 1252 | 1096 | 914 |
Core of HCS | 2355 | 2100 | 1992 | 2041 | 1935 |
Boundary of HCS | 2505 | 2233 | 2067 | 1837 | 1524 |
Core of SFC | 22467 | 21560 | 20543 | 19085 | 15921 |
Boundary of SFC | 6179 | 6276 | 6424 | 6541 | 6906 |