Extraction of Pole-like Road Objects from MMS Point Clouds Using Deep Learning and Geometric-Topological Feature Fusion

Su, Shu; Shirai, Masataka; Yokota, Hiroyuki

doi:10.5194/isprs-annals-XI-2-2026-145-2026

Articles | Volume XI-2-2026

https://doi.org/10.5194/isprs-annals-XI-2-2026-145-2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

03 Jul 2026

Keywords: Point Clouds, Pole-like Objects, Deep Learning, Geometric-Topological Fusion, Cross-Domain Generalization

Abstract. This paper presents a fusion framework for automatically extracting pole-like road objects (traffic lights, road signs, streetlights, and utility poles) from Mobile Mapping System (MMS) point clouds. The proposed method combines KPConv-based semantic segmentation with geometric-topological reasoning, which enables structural completion and heuristic filtering of nearby clutter without retraining or additional annotated data. The framework was trained on 8 km of manually annotated MMS data collected in the Kinki region of Japan and evaluated on two large-scale datasets: (i) a 26 km MMS dataset from Hokkaido (≈2.53 billion points) acquired with the same LiDAR sensor, and (ii) the Paris-Lille-3D benchmark (France), captured with a different LiDAR sensor. Quantitative evaluation shows that the fusion framework outperforms the KPConv baseline on both evaluation datasets, particularly in recall and F₁-score. On the Hokkaido dataset, recall improved from 0.7952 to 0.8924 (+0.0972) and the F₁-score from 0.8263 to 0.8689 (+0.0426), reflecting successful reconstruction of lamp tops, signal arms, and previously unseen snow delineator posts (snow poles). On the Paris-Lille-3D benchmark, which represents a cross-sensor and cross-domain scenario, recall improved from 0.5109 to 0.6656 (+0.1547) and the F₁-score from 0.6230 to 0.7032 (+0.0802). In terms of computational efficiency, the 26 km Hokkaido dataset was processed in under 13 hours on a single NVIDIA Quadro RTX 8000. Overall, these results indicate that fusing deep learning with geometric and topological reasoning achieves high accuracy, robust generalization, and practical scalability for large-scale road-asset mapping and digital-twin generation.
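The reported gains can be checked with a few lines of arithmetic. The sketch below is an illustration, not the authors' evaluation code: it recomputes the recall and F₁ differences from the figures quoted in the abstract and, assuming the F₁-score is the standard harmonic mean of precision and recall computed over the same set of predictions, solves F₁ = 2PR / (P + R) for the precision P that each recall and F₁ pair implies. The function names and the `REPORTED` table are hypothetical helpers introduced here; the abstract itself does not report precision.

```python
# Assumption: F1 is the standard harmonic mean of precision and recall.
# Only the recall and F1 values below come from the abstract.

def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def implied_precision(recall: float, f1: float) -> float:
    """Solve F1 = 2PR / (P + R) for P, given R and F1."""
    denom = 2 * recall - f1
    if denom <= 0:
        raise ValueError("no valid precision for this recall/F1 pair")
    return f1 * recall / denom


# (recall, F1) pairs as quoted in the abstract.
REPORTED = {
    "Hokkaido": {"baseline": (0.7952, 0.8263), "fusion": (0.8924, 0.8689)},
    "Paris-Lille-3D": {"baseline": (0.5109, 0.6230), "fusion": (0.6656, 0.7032)},
}


def summarize(reported):
    """Return (dataset, d_recall, d_f1, p_baseline, p_fusion) per dataset."""
    rows = []
    for name, runs in reported.items():
        r0, f0 = runs["baseline"]
        r1, f1 = runs["fusion"]
        rows.append((
            name,
            r1 - r0,                    # recall gain
            f1 - f0,                    # F1 gain
            implied_precision(r0, f0),  # derived, not reported
            implied_precision(r1, f1),  # derived, not reported
        ))
    return rows


if __name__ == "__main__":
    for name, d_r, d_f, p0, p1 in summarize(REPORTED):
        print(f"{name}: recall {d_r:+.4f}, F1 {d_f:+.4f}, "
              f"implied precision {p0:.3f} -> {p1:.3f}")
```

Under that assumption, the derived precision values show whether the recall gain came with a change in precision. They are a consistency check on the published figures only and should not be read as results reported by the paper.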
