<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">ISPRS-Annals</journal-id>
<journal-title-group>
<journal-title>ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences</journal-title>
<abbrev-journal-title abbrev-type="publisher">ISPRS-Annals</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">2194-9050</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/isprs-annals-XI-2-2026-751-2026</article-id>
<title-group>
<article-title>6D Strawberry Pose Estimation: Real-time and Edge AI Solutions Using Purely Synthetic Training Data</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Sinha</surname>
<given-names>Saptarshi Neil</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Kühn</surname>
<given-names>Paul Julius</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Goschke</surname>
<given-names>Mika Silvan</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Weinmann</surname>
<given-names>Michael</given-names>
</name>
<xref ref-type="aff" rid="aff4">
<sup>4</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Fraunhofer IGD, Fraunhoferstr. 5, 64283 Darmstadt, Germany</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Fraunhofer IGD, Fraunhoferstr. 5, 64283 Darmstadt, Germany</addr-line>
</aff>
<aff id="aff3">
<label>3</label>
<addr-line>Fraunhofer IGD, Fraunhoferstr. 5, 64283 Darmstadt, Germany</addr-line>
</aff>
<aff id="aff4">
<label>4</label>
<addr-line>Delft university of technology, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands</addr-line>
</aff>
<pub-date pub-type="epub">
<day>03</day>
<month>07</month>
<year>2026</year>
</pub-date>
<volume>XI-2-2026</volume>
<fpage>751</fpage>
<lpage>758</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2026 Saptarshi Neil Sinha et al.</copyright-statement>
<copyright-year>2026</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://isprs-annals.copernicus.org/articles/XI-2-2026/751/2026/isprs-annals-XI-2-2026-751-2026.html">This article is available from https://isprs-annals.copernicus.org/articles/XI-2-2026/751/2026/isprs-annals-XI-2-2026-751-2026.html</self-uri>
<self-uri xlink:href="https://isprs-annals.copernicus.org/articles/XI-2-2026/751/2026/isprs-annals-XI-2-2026-751-2026.pdf">The full text article is available as a PDF file from https://isprs-annals.copernicus.org/articles/XI-2-2026/751/2026/isprs-annals-XI-2-2026-751-2026.pdf</self-uri>
<abstract>
<p>Automated and selective harvesting of fruits is increasingly vital due to high costs and seasonal labor shortages especially in advanced economies. This paper focuses on 6D pose estimation of strawberries using purely synthetic data generated through a procedural pipeline for photorealistic rendering. We employ the single-shot YOLOX-6D-Pose algorithm that leverages the YOLOX backbone (i.e., a specific deep convolution network that extracts hierarchical image features used for object detection), known for its balance between speed and accuracy, and support for edge inference. To address the lacking availability of training data, we present a flexible pipeline for generating realistic synthetic data from various 3D strawberry models via the procedural Blender pipeline, enhancing its value for training pose estimation algorithms. Quantitative evaluations show YOLOX-6D-Pose algorithm achieve comparable accuracy on both the NVIDIA RTX 3090 and Jetson Orin Nano, measured by several ADD-S metrics, which measure 6D object pose estimation accuracy by computing the average closest-point distance between model points under predicted and ground-truth poses (for symmetric objects) and evaluating it against chosen thresholds. The RTX 3090 offers superior processing speed, while the Jetson Orin Nano is ideal for resource-constrained environments, suitable for agricultural robotics. Qualitative results confirm the model&amp;rsquo;s ability to accurately infer poses of ripe and partially ripe strawberries, though challenges remain with unripe specimens. This indicates potential for future enhancements, particularly in detecting unripe strawberries by exploring color variations. The methodology can also be adapted for other fruits like apples, peaches, and plums, broadening its impact in agricultural automation.</p>
</abstract>
<counts><page-count count="8"/></counts>
</article-meta>
</front>
<body/>
<back>
</back>
</article>