Editorial: Localization and scene understanding in urban environments

Ballardini, Augusto Luis; Cattaneo, Daniele; Sorrenti, Domenico G.; Parra Alonso, Ignacio

doi:10.3389/frobt.2024.1509637

EDITORIAL article

Front. Robot. AI, 08 November 2024

Sec. Field Robotics

Volume 11 - 2024 | https://doi.org/10.3389/frobt.2024.1509637

This article is part of the Research TopicLocalization and Scene Understanding in Urban EnvironmentsView all 5 articles

Editorial: Localization and scene understanding in urban environments

Augusto Luis Ballardini¹*

Daniele Cattaneo²

Domenico G. Sorrenti³

Ignacio Parra Alonso¹

¹Computer Engineering Department, Universidad de Alcalá, Alcalá de Henares, Spain
²Department of Computer Science, University of Freiburg, Freiburg, Germany
³Department of Informatics, Systems and Communication DISCO, Università degli Studi di Milano - Bicocca, Milan, Italy

Editorial on the Research Topic
Localization and scene understanding in urban environments

Urban settings represent one of the main challenges for self-driving vehicles today. Navigating a vehicle in these environments with only Global Navigation and Satellite Systems (GNSS) can lead to potentially dangerous situations that could jeopardize the safety of both onboard passengers and road users. All navigation systems rely on two main aspects: a reference map and a localization system, which uses the latter for identifying the position of the vehicle.

Regarding the reference map, nowadays different proposals from mapping companies such as TomTom¹ and Here², as well as self-driving companies like Waymo³, are paving the way for reliable sources of information. On the other hand, open-source community-created geospatial projects like OpenStreetMap are also a valuable source of information that have been extensively used for research purposes over the last decade. Localization algorithms are responsible for accurate localization of the vehicle within the map. This is achieved using different on-board sensors, starting with the most obvious: GNSS. Despite these systems’ ability to provide very good localization accuracies, up to the order of 0.01 m, their reliability Research Topic place them in the danger tier, especially in urban areas. This is due to the need for a clear line of sight to a significant part of the sky, which can be often obstructed by trees, buildings or urban canyons or even thick clouds.

Matteo Frosi et al. investigated this aspect in their first contribution, focusing on the evaluation of the positioning accuracy and precision of localization systems that are not based on GNSS sensors. Besides presenting also a novel Simultaneous Localization and Mapping (SLAM) system that achieves real-time performance, the authors benchmarked a set of algorithms on manually collected datasets, demonstrating the active interest of the scientific community in the Research Topic. They presented a comparison of algorithms that use LiDAR data coupled with inertial measurements units, a technique that has been extensively investigated over the last decade. Results from both outdoor and indoor scenarios show very promising results for real-world applications such as autonomous driving.

In a second contribution, a different group also led by Frosi et al., proposed an algorithm that directly connects with the HD mapping concept previously described. Among all the features that HD maps integrate, they interestingly used one of the major sources of problems: the buildings themselves. They extended a SLAM system to exploit mapped buildings in OpenStreetMap by matching single LiDAR scan inputs. Their contribution also presented the re-localization capabilities of their systems. This approach is interesting and paves the way to further investigation in the field, as similar techniques could also be used for other elements of mapped environments.

In this perspective, the work of the group led by Mentasti et al. proposes a pipeline to enrich HD-maps with traffic lights, enhancing their detection with specific features like shape, orientation, and pictogram. Unlike the previous two contributions, their focus is leveraging image information from onboard cameras on surveying vehicles and state-of-the-art deep neural networks for traffic light detection. Furthermore, to accommodate GNSS errors during the mapping process, the authors proposed to use Kalman filtering techniques to provide the most accurate traffic light position possible.

Even though localization is, in a way, a lower scale of complexity compared to SLAM algorithms, as they exploit already available maps, all the algorithms presented so far use special techniques that require high computational resources. Most of the time, parts of these algorithms include high-consuming subprocesses due to repetitive massive sets of operations performed on the sensed data. This can be the case of either LiDAR input with thousands of 3D points as well as imagery data, processed using deep neural networks. While specific hardware is available to speedup inference and classification of images, the same hardware can be used to parallelize consolidated filtering techniques. This is demonstrated in the last work presented in this collection, where Mochurad proposed an implementation of a parallel Kalman filter algorithm to speedup vehicle localization based on a specific usage of the LiDAR data. Using the CUDA programming language and NVIDIA hardware, her proposal achieved a 3.8x speedup compared to the original un-parallelized implementation of the Kalman algorithm, which is remarkable as it allows easy scaling with the number of 3D LiDAR points used for localization.

This Research Topic sought to explore the vehicle localization field from a variety of perspectives, stimulating the scientific community to exploit different cues from common urban settings. We foresee that the future of self-driving will strongly rely on localization algorithms that exploit a comprehensive understanding of the surrounding environment. Only through global awareness will it be possible to preemptively identify dangerous conditions that might lead to non-fatal injuries or deaths. We hope that this short collection can inspire the readers and researchers to push the boundaries in the realm of vehicle localization.

Author contributions

AB: Writing–original draft, Writing–review and editing. DC: Writing–review and editing. DS: Writing–review and editing. IP: Writing–review and editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

¹https://www.tomtom.com/products/hd-map (Accessed October 29, 2024)

²https://www.here.com/platform/HD-live-map (Accessed October 29, 2024)

³https://waymo.com/blog/2016/12/building-maps-for-self-driving-car (Accessed October 29, 2024)

Keywords: GNSS-denied environments, openstreetmap, localization, HD-map, traffic light, extended Kalman filter, CUDA, self-driving

Citation: Ballardini AL, Cattaneo D, Sorrenti DG and Parra Alonso I (2024) Editorial: Localization and scene understanding in urban environments. Front. Robot. AI 11:1509637. doi: 10.3389/frobt.2024.1509637

Received: 11 October 2024; Accepted: 28 October 2024;
Published: 08 November 2024.

Edited and reviewed by:

Dongbing Gu, University of Essex, United Kingdom

Copyright © 2024 Ballardini, Cattaneo, Sorrenti and Parra Alonso. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Augusto Luis Ballardini, YXVndXN0by5iYWxsYXJkaW5pQHVhaC5lcw==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.