Geolocation on Cartographic Maps with Multi-Modal Fusion

Mengjie Zhou, Liu Liu, Yiran Zhong*, Andrew Calway

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

8 Downloads (Pure)

Abstract

We explore the geolocation problem, aiming to localize ground-view images on cartographic maps, without the need of any GPS priors. This task mimics the human wayfinding ability and offers high scalability and robustness by using the compact and semantic representations of maps. Current methods often rely on 2D maps to encode dense contextual information for ground-to-map matching. In this paper, we lift ground-to-map matching to a 2.5D space, where heights of structures (e.g. buildings) provide richer geometric information to guide the matching process. We propose a new approach to learning representative embeddings from multi-modal data. Specifically, we establish a projection relationship between 2D and 2.5D space. The projection is further used to combine multi-modal features from the 2D and 2.5D maps using an effective pixel-to-point fusion method. By encoding crucial geometric cues, our method learns discriminative location embeddings for matching panoramic images and maps. Additionally, we construct the first large-scale multi-modal geolocation dataset to validate our method and facilitate future research. Both single-image based and route based geolocation experiments are conducted to test our method. Extensive experiments demonstrate that the proposed method achieves significantly higher geolocation accuracy and faster convergence than previous 2D map-based approaches.
Original languageEnglish
Title of host publication 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Pages5589-5596
Number of pages8
ISBN (Electronic)9798350377705
ISBN (Print)9798350377712
DOIs
Publication statusPublished - 25 Dec 2024
Event2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 - Abu Dhabi, United Arab Emirates
Duration: 14 Oct 202418 Oct 2024
http://iros2024-abudhabi.org/

Publication series

NameIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
PublisherIEEE
ISSN (Print)2153-0858
ISSN (Electronic)2153-0866

Conference

Conference2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period14/10/2418/10/24
Internet address

Keywords

  • geolocation, multi-modal fusion

Fingerprint

Dive into the research topics of 'Geolocation on Cartographic Maps with Multi-Modal Fusion'. Together they form a unique fingerprint.

Cite this