The P<sup>3</sup> Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization

Sulzer, Raphael; Duan, Liuyun; Girard, Nicolas; Lafarge, Florent

doi:10.5194/isprs-annals-XI-2-2026-349-2026

Articles | Volume XI-2-2026

https://doi.org/10.5194/isprs-annals-XI-2-2026-349-2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/isprs-annals-XI-2-2026-349-2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume XI-2-2026

03 Jul 2026

| 03 Jul 2026

The P³ Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization

Raphael Sulzer, Liuyun Duan, Nicolas Girard, and Florent Lafarge

Keywords: 3D Point Clouds, Aerial Imagery, Building Segmentation, Building Vectorization, Data Fusion

Abstract. We present P^3, a large-scale multimodal dataset for building vectorization, including aerial LiDAR point clouds, aerial images, and vectorized 2D building outlines, collected across three continents. P³ contains over 10 billion LiDAR points with decimeter-level accuracy and RGB images at a ground sampling distance of 25 centimeters. While many existing datasets focus on the image modality, P³ offers a complementary perspective by incorporating dense 3D information. We demonstrate that LiDAR point clouds serve as a robust modality for predicting building polygons, both in hybrid and end-to-end learning frameworks. Moreover, fusing LiDAR and imagery further improves accuracy and geometric quality of predicted polygons. The P³ dataset is publicly available, along with code and pretrained weights of three state-of-the-art models for building polygon prediction at https://github.com/raphaelsulzer/PixelsPointsPolygons.

The P³ Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization

Useful Links

Useful External Links

Our Contact

The P3 Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization

Useful Links

Useful External Links

Our Contact

The P³ Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization