Abstract
Efficient and accurate semantic segmentation is particularly important especially for applications like autonomous driving which requires real-time inference speed and high performance. Many works try to compromise spatial resolution to achieve real-time inference speed, which leads to poor performance. As a result, real-time segmentation task for embedded devices is still an open problem. In this paper, we focus on building a network with better performance possible while still achieve real-time inference speed. We first use a pyramid kernel size to capture more spatial information instead of using just a 3×3 kernel size for DWConvolution in ShuffleNet v2. Meanwhile, an efficient Vortex Pooling module is employed to aggregate the contextual information and generate high-resolution features. Compared with other state-of-the-art real-time semantic segmentation networks, the proposed network achieves similar inference speed and better performance on embedded device. Specifically, we achieve state-of-the-art 73.46% mean IoU on Cityscapes test dataset, for a 768×1024 input, a speed of 46.1 frames per second on NVIDIA Jetson AGX Xavier embedded development board is achieved.
| Original language | English |
|---|---|
| Title of host publication | 31st International Conference on Tools with Artificial Intelligence (ICTAI2019) |
| Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
| Number of pages | 8 |
| DOIs | |
| Publication status | Published - 13 Feb 2020 |
| Event | International Conference on Tools with Artificial Intelligence - Portland, United States Duration: 4 Nov 2019 → 6 Nov 2019 Conference number: 31 http://www.ictai2019.org/ |
Conference
| Conference | International Conference on Tools with Artificial Intelligence |
|---|---|
| Abbreviated title | ICTAI2019 |
| Country/Territory | United States |
| City | Portland |
| Period | 4/11/19 → 6/11/19 |
| Internet address |
Keywords
- semantic segmentation
- real-time
- embedded