<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">ISPRS-Annals</journal-id>
<journal-title-group>
<journal-title>ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences</journal-title>
<abbrev-journal-title abbrev-type="publisher">ISPRS-Annals</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">2194-9050</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/isprs-annals-XI-1-2026-437-2026</article-id>
<title-group>
<article-title>Automatic Scan-to-BIM: The Impact of Semantic Segmentation Accuracy on Opening Detection</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Patil</surname>
<given-names>Jidnyasa</given-names>
<ext-link>https://orcid.org/0009-0005-3248-0031</ext-link>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Sowmya</surname>
<given-names>Arcot</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Kalantari</surname>
<given-names>Mohsen</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>School of Civil and Environmental Engineering, University of New South Wales, Sydney, 2052, NSW, Australia</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>School of Computer Science and Engineering, University of New South Wales, Sydney, 2052, NSW, Sydney, Australia</addr-line>
</aff>
<aff id="aff3">
<label>3</label>
<addr-line>School of Civil and Environmental Engineering, University of New South Wales, Sydney, 2052, NSW, Australia</addr-line>
</aff>
<pub-date pub-type="epub">
<day>03</day>
<month>07</month>
<year>2026</year>
</pub-date>
<volume>XI-1-2026</volume>
<fpage>437</fpage>
<lpage>445</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2026 Jidnyasa Patil et al.</copyright-statement>
<copyright-year>2026</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://isprs-annals.copernicus.org/articles/XI-1-2026/437/2026/isprs-annals-XI-1-2026-437-2026.html">This article is available from https://isprs-annals.copernicus.org/articles/XI-1-2026/437/2026/isprs-annals-XI-1-2026-437-2026.html</self-uri>
<self-uri xlink:href="https://isprs-annals.copernicus.org/articles/XI-1-2026/437/2026/isprs-annals-XI-1-2026-437-2026.pdf">The full text article is available as a PDF file from https://isprs-annals.copernicus.org/articles/XI-1-2026/437/2026/isprs-annals-XI-1-2026-437-2026.pdf</self-uri>
<abstract>
<p>The automation of Scan-to-BIM remains a major challenge within the Architecture, Engineering, and Construction industry, particularly in the detection and geometric characterisation of architectural openings such as doors and windows. Although recent advances in 3D semantic segmentation have improved the classification of architectural elements, the effect of segmentation accuracy on downstream geometric detection and reconstruction is still under study. This work compares five state-of-the-art deep learning models, PointNeXt, PointMetaBase, Point Transformer V1, Point Transformer V3, and Swin3D, on opening detection in Scan-to-BIM. A unified evaluation framework integrating DBSCAN clustering with axis-aligned bounding box fitting is introduced to generate per-instance geometric representations. The models are assessed using semantic metrics and geometric reliability indicators, including centroid error, dimensional deviation and 3D IoU. Experiments on the S3DIS Area 5 dataset, reveal notable performance differences across models. Swin3D achieved the highest door detection rate of 96.9%, followed by PointMetaBase at 92.9%, PointNeXt at 87.4%, PTV3 at 85.0%, and PTV1 at 81.9%. Window detection proved more challenging, with Swin3D and PTV3 both achieving 75.0%, PTV1 at 71.2%, and PointNeXt and PointMetaBase at 67.3%. Notably, PointMetaBase produced strong geometric accuracy for doors despite lower semantic scores. These results suggest that high segmentation accuracy does not always lead to precise geometric reconstruction. To assess generalisation, the trained models were applied to 11 Matterport3D rooms, confirming that the observed patterns extend across different scanning environments. This study concludes that in Scan-to-BIM workflows, greater emphasis should be placed on geometric reconstruction algorithms than segmentation performance alone.</p>
</abstract>
<counts><page-count count="9"/></counts>
</article-meta>
</front>
<body/>
<back>
</back>
</article>