Abstract
Efficient and accurate semantic segmentation is particularly important especially for applications like autonomous driving which requires real-time inference speed and high performance. Many works try to compromise spatial resolution to achieve real-time inference speed, which leads to poor performance. As a result, real-time segmentation task for embedded devices is still an open problem. In this paper, we focus on building a network with better performance possible while still achieve real-time inference speed. We first use a pyramid kernel size to capture more spatial information instead of using just a 3×3 kernel size for DWConvolution in ShuffleNet v2. Meanwhile, an efficient Vortex Pooling module is employed to aggregate the contextual information and generate high-resolution features. Compared with other state-of-the-art real-time semantic segmentation networks, the proposed network achieves similar inference speed and better performance on embedded device. Specifically, we achieve state-of-the-art 73.46% mean IoU on Cityscapes test dataset, for a 768×1024 input, a speed of 46.1 frames per second on NVIDIA Jetson AGX Xavier embedded development board is achieved.
Original language | English |
---|---|
Title of host publication | 31st International Conference on Tools with Artificial Intelligence (ICTAI2019) |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Number of pages | 8 |
DOIs | |
Publication status | Published - 13 Feb 2020 |
Event | International Conference on Tools with Artificial Intelligence - Portland, United States Duration: 4 Nov 2019 → 6 Nov 2019 Conference number: 31 http://www.ictai2019.org/ |
Conference
Conference | International Conference on Tools with Artificial Intelligence |
---|---|
Abbreviated title | ICTAI2019 |
Country/Territory | United States |
City | Portland |
Period | 4/11/19 → 6/11/19 |
Internet address |
Keywords
- semantic segmentation
- real-time
- embedded