3.1. Experimental data
In Hunan Province, China lies Jingping village - an ancient settlement that boasts a heritage steeped in history and culture. As a proud recipient of National Key Cultural Relics Protection Unit recognition as well as being designated a Chinese Traditional Village of the fourth batch, this village has been well-preserved for over a millennium featuring splendid clusters of Ming and Qing dynasty era structures. With its open spatial arrangement, Jingping village presents a remarkable blend of water, fields, and woodlands - illustrating the transcendent beauty of peaceful coexistence between mankind and nature while highlighting the quintessential features of western Hunan's traditional settlements.
In August 2019, the village MVS point cloud data was meticulously captured by the author through fieldwork employing a Zhonghaida iFly D1 quadcopter UAV integrated with an iCam Q5 mini five-lens tilt camera and a UAV-PPK receiver, equipped with an impressive 120 million effective camera pixel count (24 million x 5). Extensive explorations were carried out in the experimental area, control points were intentionally laid, and routes were strategically planned. Through this effort, orthophotos with a resolution of around 1.5cm were obtained, along with approximately 320 million point cloud data of the village. Following the requisite crop, the main building area measures approximately 0.21km
2, inclusive of around 180 million building point clouds. The flight parameters utilized for this study are presented in
Table 1.
3.3. Roof extraction and slope segmentation
During the process of removing vegetation point cloud, it was discovered that taller trees obscured local information of the roofs due to limitations in the UAV tilt photography technology. To address this issue, one more complete building from each roof type shown in
Figure 7 was selected for the outline extraction experiment. The impact of the GR value on the extraction effect of the roofs in the CSF algorithm was examined and found to have a significant influence. As an example, I-shaped roofs were used and changing the GR value while keeping other values constant resulted in the best extraction effect being achieved when GR was set to 0.1, as demonstrated in
Figure 8. This approach was then applied to obtain all types of roof point clouds, presented in
Figure 9. However, upon closer examination, redundant point cloud was noted at the eaves of different roofs. The reason for this is that elevation information in this area cannot be captured during tilt photography, resulting in absence of clear point cloud data during the aerial triangulation solution. Additionally, the ancestral hall of Jingping village, depicted in
Figure 9d, has higher surrounding walls than the roof, leading to partial voids in the area of the walls. Consequently, the application of the CSF algorithm to extract the roof of the building did not guarantee roof integrity.
To mitigate the influence of threshold setting on plane segmentation observed in previous studies, we propose a novel approach whereby point cloud data is projected onto a flat raster and smoothed before slope calculation. In this study, we exclusively experimented with the first three types of roofs mentioned above, as the extraction performance for "回"-shaped roofs was poor and hill walls prevented slope segmentation. Since the traditional village buildings' roof tiles are concave and convex, our slope direction calculation produced eight initial directions, which were subsequently reclassified into a 2-slope or 4-slope map. For example, regarding L-shaped roofs (
Figure 10), we partitioned each raster pixel into one of eight slope directions, then divided them into four categories based on morphological classification and
Figure 11. Nonetheless, the reclassified raster slope directions may contain a small number of misclassified stray points at edges and intersections, which require additional smoothing and denoising to avoid negatively affecting downstream detection. Our method's segmentation results were compared to those yielded by traditional RANSAC and region segmentation algorithms to verify their effectiveness.
The accuracy of slope segmentation was assessed through the utilization of equations (1)-(3), with the corresponding results illustrated in
Figure 12. From the provided visualizations, it is apparent that the precision of the surface slope, which was extracted using a directional segmentation approach, exceeded 99.6%, indicating a remarkably satisfactory outcome.
where TP is the number of correctly segmented raster pixels, FN is the number of missed raster pixels, FP is the number of incorrectly segmented raster pixels, R is the recall rate, P is the accuracy rate, and F is the measure.
Table 2.
Correct slope segmentation for each type of roof.
Table 2.
Correct slope segmentation for each type of roof.
Index |
I-shaped |
L-shaped |
U-shaped |
TP |
30587 |
6665 |
26675 |
FN |
0 |
0 |
0 |
FP |
126 |
47 |
53 |
R |
1 |
1 |
1 |
P |
99.59% |
99.30% |
99.80% |
F |
99.79% |
99.65% |
99.90% |
Figure 12.
Comparison of different types of roof surface segmentation:(a)RANSAC algorithms; (b)region segmentation algorithm; (c)slope segmentation algorithm; (d)Slope segmentation algorithm+noise removal.
Figure 12.
Comparison of different types of roof surface segmentation:(a)RANSAC algorithms; (b)region segmentation algorithm; (c)slope segmentation algorithm; (d)Slope segmentation algorithm+noise removal.
It is noteworthy that, owing to the high precision of the data, the re-categorized slope direction raster might exhibit a marginal number of erroneously classified outlying points at the edges and intersections of two slope directions, namely, inaccurately segmented raster pixels. These artifacts may exert a certain impact on the ensuing detection procedures and thus must be subjected to subsequent smoothing and denoising processes. To ascertain the effectiveness of this approach,
Figure 12 and
Table 3 were produced by comparing its segmentation outcomes with the conventional RANSAC and region segmentation algorithms.
Based upon the results presented in
Figure 12 and
Table 3, our proposed method demonstrates significant improvements over both the RANSAC algorithm and the region segmentation algorithm. Firstly, all three algorithms exhibit comparable performance in terms of computational efficiency due to the small size of the input data. Secondly, regarding segmentation accuracy, neither the RANSAC algorithm nor the region segmentation algorithm succeeded in accurately segmenting the facets of simple I-shaped or slightly more complex U-shaped roofs. Instead, these algorithms exhibited varying degrees of under- and over-segmentation on either type of roof. In addition, based on point cloud data analysis, the region segmentation algorithm demonstrated a higher tendency toward under-segmentation when compared to the RANSAC algorithm. For instance, as shown in
Figure 12, inner sides of both L-shaped and U-shaped roofs were segmented into single planes and the resulting segmented roofs showed greater noise due to restricted data accuracy. Furthermore, during the experiment, it was observed that the RANSAC algorithm produced inconsistent results under identical parameter settings, indicating significant deviations and instability.
Using the direction of slope, successful segmentation of all slopes was achieved in all three types of roof segmentation, resulting in improved outcomes. Though there were several instances of mis-segmentation, such occurrences did not adversely affect the overall effects compared to the area segmentation algorithm. Furthermore, after denoising, a complete representation of the various roof slopes was obtained. It should be noted that during experimentation, the first two algorithms were directly based on point cloud segmentation, resulting in neater planar edges. In contrast, slope segmentation relied on raster images after point cloud projection, leading to rougher edge contours due to lower raster resolution. However, this issue can be eliminated by regularization in subsequent feature line extraction.