High level 3D structure extraction from a single image using a CNN-based approach

J. A.de Jesús Osuna-Coutiño*, Jose Martinez-Carranza

*Corresponding author for this work

Research output: Contribution to journalArticle (Academic Journal)peer-review

4 Citations (Scopus)
217 Downloads (Pure)


High-Level Structure (HLS) extraction in a set of images consists of recognizing 3D elements with useful information to the user or application. There are several approaches to HLS extraction. However, most of these approaches are based on processing two or more images captured from different camera views or on processing 3D data in the form of point clouds extracted from the camera images. In contrast and motivated by the extensive work developed for the problem of depth estimation in a single image, where parallax constraints are not required, in this work, we propose a novel methodology towards HLS extraction from a single image with promising results. For that, our method has four steps. First, we use a CNN to predict the depth for a single image. Second, we propose a region-wise analysis to refine depth estimates. Third, we introduce a graph analysis to segment the depth in semantic orientations aiming at identifying potential HLS. Finally, the depth sections are provided to a new CNN architecture that predicts HLS in the shape of cubes and rectangular parallelepipeds.

Original languageEnglish
Article number563
Number of pages18
JournalSensors (Switzerland)
Issue number3
Publication statusPublished - 29 Jan 2019


  • 3D vision
  • CNN
  • Depth data analysis
  • High level 3D structure extraction
  • Single image

Fingerprint Dive into the research topics of 'High level 3D structure extraction from a single image using a CNN-based approach'. Together they form a unique fingerprint.

Cite this