TY - JOUR
T1 - Benchmarking tree species classification from proximally sensed laser scanning data
T2 - Introducing the FOR-species20K dataset
AU - Puliti, Stefano
AU - Lines, Emily R.
AU - Müllerová, Jana
AU - Frey, Julian
AU - Schindler, Zoe
AU - Straker, Adrian
AU - Allen, Matthew J.
AU - Winiwarter, Lukas
AU - Rehush, Nataliia
AU - Hristova, Hristina
AU - Murray, Brent
AU - Calders, Kim
AU - Coops, Nicholas
AU - Höfle, Bernhard
AU - Irwin, Liam
AU - Junttila, Samuli
AU - Krůček, Martin
AU - Krok, Grzegorz
AU - Král, Kamil
AU - Levick, Shaun R.
AU - Luck, Linda
AU - Missarov, Azim
AU - Mokroš, Martin
AU - Owen, Harry J.F.
AU - Stereńczak, Krzysztof
AU - Pitkänen, Timo P.
AU - Puletti, Nicola
AU - Saarinen, Ninni
AU - Hopkinson, Chris
AU - Terryn, Louise
AU - Torresan, Chiara
AU - Tomelleri, Enrico
AU - Weiser, Hannah
AU - Astrup, Rasmus
N1 - Publisher Copyright:
© 2025 The Author(s). Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Society.
PY - 2025
Y1 - 2025
N2 - Proximally sensed laser scanning presents new opportunities for automated forest ecosystem data capture. However, a gap remains in deriving ecologically pertinent information, such as tree species, without additional ground data. Artificial intelligence approaches, particularly deep learning (DL), have shown promise towards automation. Progress has been limited by the lack of large, diverse, and, most importantly, openly available labelled single-tree point cloud datasets. This has hindered both (1) the robustness of the DL models across varying data types (platforms and sensors) and (2) the ability to effectively track progress, thereby slowing the convergence towards best practice for species classification. To address the above limitations, we compiled the FOR-species20K benchmark dataset, consisting of individual tree point clouds captured using proximally sensed laser scanning data from terrestrial (TLS), mobile (MLS) and drone laser scanning (ULS). Compiled collaboratively, the dataset includes data collected in forests mainly across Europe, covering Mediterranean, temperate and boreal biogeographic regions. It includes scattered tree data from other continents, totaling over 20,000 trees of 33 species and covering a wide range of tree sizes and forms. Alongside the release of FOR-species20K, we benchmarked seven leading DL models for individual tree species classification, including both point cloud (PointNet++, MinkNet, MLP-Mixer, DGCNNs) and multi-view 2D-based methods (SimpleView, DetailView, YOLOv5). 2D Image-based models had, on average, higher overall accuracy (0.77) than 3D point cloud-based models (0.72). Notably, the performance was consistently >0.8 across scanning platforms and sensors, offering versatility in deployment. The top-scoring model, DetailView, demonstrated robustness to training data imbalances and effectively generalized across tree sizes. The FOR-species20K dataset represents an important asset for developing and benchmarking DL models for individual tree species classification using proximally sensed laser scanning data. As such, it serves as a crucial foundation for future efforts to classify accurately and map tree species at various scales using laser scanning technology, as it provides the complete code base, dataset, and an initial baseline representative of the current state-of-the-art of point cloud tree species classification methods.
AB - Proximally sensed laser scanning presents new opportunities for automated forest ecosystem data capture. However, a gap remains in deriving ecologically pertinent information, such as tree species, without additional ground data. Artificial intelligence approaches, particularly deep learning (DL), have shown promise towards automation. Progress has been limited by the lack of large, diverse, and, most importantly, openly available labelled single-tree point cloud datasets. This has hindered both (1) the robustness of the DL models across varying data types (platforms and sensors) and (2) the ability to effectively track progress, thereby slowing the convergence towards best practice for species classification. To address the above limitations, we compiled the FOR-species20K benchmark dataset, consisting of individual tree point clouds captured using proximally sensed laser scanning data from terrestrial (TLS), mobile (MLS) and drone laser scanning (ULS). Compiled collaboratively, the dataset includes data collected in forests mainly across Europe, covering Mediterranean, temperate and boreal biogeographic regions. It includes scattered tree data from other continents, totaling over 20,000 trees of 33 species and covering a wide range of tree sizes and forms. Alongside the release of FOR-species20K, we benchmarked seven leading DL models for individual tree species classification, including both point cloud (PointNet++, MinkNet, MLP-Mixer, DGCNNs) and multi-view 2D-based methods (SimpleView, DetailView, YOLOv5). 2D Image-based models had, on average, higher overall accuracy (0.77) than 3D point cloud-based models (0.72). Notably, the performance was consistently >0.8 across scanning platforms and sensors, offering versatility in deployment. The top-scoring model, DetailView, demonstrated robustness to training data imbalances and effectively generalized across tree sizes. The FOR-species20K dataset represents an important asset for developing and benchmarking DL models for individual tree species classification using proximally sensed laser scanning data. As such, it serves as a crucial foundation for future efforts to classify accurately and map tree species at various scales using laser scanning technology, as it provides the complete code base, dataset, and an initial baseline representative of the current state-of-the-art of point cloud tree species classification methods.
KW - biodiversity
KW - deep learning
KW - lidar
KW - point cloud classification
KW - remote sensing
KW - single-tree inventory
UR - http://www.scopus.com/inward/record.url?scp=85216845860&partnerID=8YFLogxK
U2 - 10.1111/2041-210X.14503
DO - 10.1111/2041-210X.14503
M3 - Article
AN - SCOPUS:85216845860
SN - 2041-210X
JO - Methods in Ecology and Evolution
JF - Methods in Ecology and Evolution
ER -