Mapping of urban tree canopy in high-resolution aerial imagery using deep neural networks

Machado, Brian Leite; Kikuti, Rafael Ochi; Osco, Lucas Prado; Marcato Junior, José; Gonçalves, Wesley Nunes; Ramos, Ana Paula Marques

doi:10.5194/isprs-annals-X-3-W4-2025-219-2026

Articles | Volume X-3/W4-2025

https://doi.org/10.5194/isprs-annals-X-3-W4-2025-219-2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/isprs-annals-X-3-W4-2025-219-2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume X-3/W4-2025

13 Mar 2026

| 13 Mar 2026

Mapping of urban tree canopy in high-resolution aerial imagery using deep neural networks

Brian Leite Machado, Rafael Ochi Kikuti, Lucas Prado Osco, José Marcato Junior, Wesley Nunes Gonçalves, and Ana Paula Marques Ramos

Keywords: Urban forestry, Semantic segmentation, DeepLabV3, Aerial high-resolution imagery, Deep learning

Abstract. While deep learning has proven effective for urban tree mapping, there is a critical lack of validated benchmarks and comparative methodological studies for the diverse urban landscapes of Brazil. To address this gap, this work presents a deep-learning workflow that produces such maps from 25 cm RGB orthophotos. Images covering ten São Paulo cities were compiled; seven were used for training/validation and three for independent testing. The DeepLabV3 architecture with a ResNet-152 backbone was assessed under three loss configurations: (i) Balanced Cross-Entropy (BCE) baseline, (ii) BCE plus PointRend boundary refinement, and (iii) BCE combined with a 0.5-weighted Dice term. The BCE baseline delivered the top mean IoU (0.83) and F1-Score (0.91). PointRend increased recall but introduced systematic false positives in heterogeneous roofs and shaded riparian zones. The BCE+Dice variant recovered recall without raising commission error, achieving the highest balanced accuracy (0.96). The workflow delineates canopy with fine spatial detail and processes 2.8 × 10⁶ m² in under 30 minutes on a single RTX 4000 Ada workstation, demonstrating a practical, scalable solution for statewide tree-inventory production.

Mapping of urban tree canopy in high-resolution aerial imagery using deep neural networks

Useful Links

Useful External Links

Our Contact