You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: _pages/publications/2026/cross_dataset_evaluation_of_visual_semantic_segmentation_models_for_off_road_autonomous_driving.md
Intelligent autonomous driving in off-road environments is an emerging field with great potential to impact areas such as agriculture, forestry, and rescue operations. Perception in these scenarios presents unique challenges due to the diversity of elements and weather conditions, along with the inherent ambiguity in class definitions. Consequently, off-road visual semantic segmentation datasets remain underdeveloped, roughly ten times smaller than their urban counterparts, hindering dependable performance assessment and potentially compromising the safety of autonomous systems. To address these challenges, we present a comprehensive cross-dataset evaluation of visual semantic segmentation models for autonomous off-road navigation. We propose a unified ontology that harmonizes class definitions across relevant datasets, enabling their combination for both training and testing. This approach ensures fair model comparisons and reliable assessment of generalization to unseen domains. We further benchmark models on the original datasets, analyze the impact of different ontology harmonization criteria and conversion strategies, and evaluate the trade-off between segmentation performance and computational cost. Results show that Transformer-based architectures achieve the most consistent segmentation performance across datasets. While often computationally demanding, some variants maintain real-time inference (≈12 ms) with top-tier accuracy. The unified ontology simplifies the segmentation task, yielding more reliable models and about 40% faster training convergence. Cross-dataset training further enhances generalization, improving mean IoU by up to +20% on RUGD and +13% on WildScenes compared to RELLIS-3D-only training. Overall, this study provides valuable insights for developing robust perception modules for off-road autonomous vehicles.
Examples of the ontology conversion proposed for enabling cross-dataset evaluation.
42
+
</p>
43
+
</div>
44
+
<div style="width: 100%;">
45
+
<img src="https://ars.els-cdn.com/content/image/1-s2.0-S0957417426015691-gr2_lrg.jpg" alt="Overview of our cross-dataset training and evaluation pipeline." style="width: 100%; height: auto;">
<img src="https://ars.els-cdn.com/content/image/1-s2.0-S0957417426015691-gr11_lrg.jpg" alt="mIoU vs. average inference time per image." style="width: 100%; height: auto;">
mIoU vs. average inference time per image. Models trained on the combined RELLIS-3D and GOOSE train datasets, and evaluated on RUGD (a) and WildScenes (b) complete datasets. Bubble size represents the number of parameters for each model. Labels indicate model names. Bold labels highlight Pareto-optimal models.
Copy file name to clipboardExpand all lines: _pages/publications/publications.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,7 +18,7 @@ Journals, congress papers and research publications can be found below:
18
18
19
19
# 2025
20
20
21
-
*[Deep Learning-Based Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving in Unstructured Environments, Proceedings of the XXV International Workshop on Physical Agents (WAF). Félix Martínez, David Pascual-Hernández, Daniel Borja Fernández, Inmaculada Mora Jiménez, José María Cañas]
21
+
* Deep Learning-Based Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving in Unstructured Environments, Proceedings of the XXV International Workshop on Physical Agents (WAF). Félix Martínez, David Pascual-Hernández, Daniel Borja Fernández, Inmaculada Mora Jiménez, José María Cañas.
0 commit comments