Building Maps Using Monocular Image-feeds from Windshield-mounted Cameras in a Simulator Environment

Authors

  • Mátyás Szántó
    Affiliation
    Department of Control Engineering and Information Technology, Faculty of Electrical Engineering and Informatics, Budapest University of Technology and Economics, 2. Magyar tudósok Blvd., H-1117 Budapest
  • Sándor Kobál
    Affiliation
    Department of Control Engineering and Information Technology, Faculty of Electrical Engineering and Informatics, Budapest University of Technology and Economics, 2. Magyar tudósok Blvd., H-1117 Budapest
  • László Vajta
    Affiliation
    Department of Control Engineering and Information Technology, Faculty of Electrical Engineering and Informatics, Budapest University of Technology and Economics, 2. Magyar tudósok Blvd., H-1117 Budapest
  • Viktor Győző Horváth
    Affiliation
    Department of Photogrammetry and Geoinformatics, Faculty of Civil Engineering, Budapest University of Technology and Economics, 3 Műegyetem rkp., K building first floor 31. Budapest, H-1111, Hungary
  • János Máté Lógó
    Affiliation
    Department of Photogrammetry and Geoinformatics, Faculty of Civil Engineering, Budapest University of Technology and Economics, 3 Műegyetem rkp., K building first floor 31. Budapest, H-1111, Hungary
  • Árpád Barsi
    Affiliation
    Department of Photogrammetry and Geoinformatics, Faculty of Civil Engineering, Budapest University of Technology and Economics, 3 Műegyetem rkp., K building first floor 31. Budapest, H-1111, Hungary
https://doi.org/10.3311/PPci.21500

Abstract

3-dimensional, accurate, and up-to-date maps are essential for vehicles with autonomous capabilities, whose functionality is made possible by machine learning-based algorithms. Since these solutions require a tremendous amount of data for parameter optimization, simulation-to-reality (Sim2Real) methods have been proven immensely useful for training data generation. For creating realistic models to be used for synthetic data generation, crowdsourcing techniques present a resource-efficient alternative. In this paper, we show that using the Carla simulation environment, a crowdsourcing model can be created that mimics a multi-agent data gathering and processing pipeline. We developed a solution that yields dense point clouds based on monocular images and location information gathered by individual data acquisition vehicles. Our method provides scene reconstructions using the robust Structure-from-Motion (SfM) solution of Colmap. Moreover, we introduce a solution for synthesizing dense ground truth point clouds originating from the Carla simulator using a simulated data acquisition pipeline. We compare the results of the Colmap reconstruction with the reference point cloud after aligning them using the iterative closest point algorithm. Our results show that a precise point cloud reconstruction was feasible with this crowdsourcing-based approach, with 54\% of the reconstructed points having an error under 0.05 m, and a weighted root mean square error of 0.0449 m for the entire point cloud.

Keywords:

environmental reconstruction, crowdsourcing, SfM, automotive simulator, image feed processing

Citation data from Crossref and Scopus

Published Online

2023-03-28

How to Cite

Szántó, M., Kobál, S., Vajta, L., Horváth, V. G., Lógó, J. M., Barsi, Árpád “Building Maps Using Monocular Image-feeds from Windshield-mounted Cameras in a Simulator Environment”, Periodica Polytechnica Civil Engineering, 67(2), pp. 457–472, 2023. https://doi.org/10.3311/PPci.21500

Issue

Section

Research Article