<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">ISPRS-Annals</journal-id>
<journal-title-group>
<journal-title>ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences</journal-title>
<abbrev-journal-title abbrev-type="publisher">ISPRS-Annals</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">2194-9050</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/isprs-annals-XI-2-2026-145-2026</article-id>
<title-group>
<article-title>Extraction of Pole-like Road Objects from MMS Point Clouds Using Deep Learning and Geometric-Topological Feature Fusion</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Su</surname>
<given-names>Shu</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Shirai</surname>
<given-names>Masataka</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Yokota</surname>
<given-names>Hiroyuki</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>AERO TOYOTA CORPORATION, Japan</addr-line>
</aff>
<pub-date pub-type="epub">
<day>03</day>
<month>07</month>
<year>2026</year>
</pub-date>
<volume>XI-2-2026</volume>
<fpage>145</fpage>
<lpage>154</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2026 Shu Su et al.</copyright-statement>
<copyright-year>2026</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://isprs-annals.copernicus.org/articles/XI-2-2026/145/2026/isprs-annals-XI-2-2026-145-2026.html">This article is available from https://isprs-annals.copernicus.org/articles/XI-2-2026/145/2026/isprs-annals-XI-2-2026-145-2026.html</self-uri>
<self-uri xlink:href="https://isprs-annals.copernicus.org/articles/XI-2-2026/145/2026/isprs-annals-XI-2-2026-145-2026.pdf">The full text article is available as a PDF file from https://isprs-annals.copernicus.org/articles/XI-2-2026/145/2026/isprs-annals-XI-2-2026-145-2026.pdf</self-uri>
<abstract>
<p>This paper presents a fusion framework for the automatic extraction of pole-like road objects, including traffic lights, road signs, streetlights, and utility poles, from Mobile Mapping System (MMS) point clouds. The proposed method combines KPConv-based semantic segmentation with geometric-topological reasoning, enabling structural completion and heuristic filtering of nearby clutter without retraining or additional annotated data. The framework was trained on 8 km of manually annotated MMS data collected in the Kinki region of Japan and evaluated on two large-scale datasets: (i) a 26 km MMS dataset from Hokkaido (&amp;asymp;2.53 billion points) acquired using the same LiDAR sensor, and (ii) the Paris-Lille-3D benchmark (France) captured with a different LiDAR sensor. Quantitative evaluation demonstrates that the proposed fusion framework consistently outperforms the KPConv baseline across all datasets, particularly in recall and F₁-score. On the Hokkaido dataset, recall improved from 0.7952 to 0.8924 (+0.0972), and the F₁-score increased from 0.8263 to 0.8689 (+0.0426), reflecting successful reconstruction of lamp tops, signal arms, and previously unseen snow delineator posts (snow poles). On the Paris-Lille-3D benchmark, representing a cross-sensor and cross-domain scenario, recall improved from 0.5109 to 0.6656 (+0.1547), while the F&lt;sub&gt;1&lt;/sub&gt;-score increased from 0.6230 to 0.7032 (+0.0802). In terms of computational efficiency, the 26 km Hokkaido dataset was processed in under 13 hours on a single NVIDIA Quadro RTX 8000. Overall, these results confirm that the proposed deep- learning-geometry-topology fusion framework achieves high accuracy, robust generalization, and practical scalability for large-scale road-asset mapping and digital-twin generation.</p>
</abstract>
<counts><page-count count="10"/></counts>
</article-meta>
</front>
<body/>
<back>
</back>
</article>