Abstract
Recent advancements in convolutional neural networks based object detection have enabled analyzing the mounting video data with high accuracy. However, inference speed is a major drawback of these video analysis system because of the
heavy object detectors. To address the computational and practicability challenges of video analysis, we propose FastQ, a system for efficient querying over video at scale. Given a target video, FastQ can automatically label the category and number of objects for each frame. We introduce a novel lightweight object detector named FDet to improve the efficiency of query system. First, a difference detector filters the frames whose difference is less than the threshold. Second, FDet is employed to efficiently label the remaining frames. To reduce inference time, FDet detects a center keypoint and a pair of corners from the feature map generated by a lightweight backbone to predict the bounding boxes. FDet completely avoid the complicated computation related to anchor boxes. Compared with state-of-the-art real-time detectors, FDet achieves superior performance with 29.1% AP on COCO benchmark at 25.3ms. Experiments show that FastQ achieves 150× to 300× speed-ups while maintaining more than 90% accuracy in video queries.
heavy object detectors. To address the computational and practicability challenges of video analysis, we propose FastQ, a system for efficient querying over video at scale. Given a target video, FastQ can automatically label the category and number of objects for each frame. We introduce a novel lightweight object detector named FDet to improve the efficiency of query system. First, a difference detector filters the frames whose difference is less than the threshold. Second, FDet is employed to efficiently label the remaining frames. To reduce inference time, FDet detects a center keypoint and a pair of corners from the feature map generated by a lightweight backbone to predict the bounding boxes. FDet completely avoid the complicated computation related to anchor boxes. Compared with state-of-the-art real-time detectors, FDet achieves superior performance with 29.1% AP on COCO benchmark at 25.3ms. Experiments show that FastQ achieves 150× to 300× speed-ups while maintaining more than 90% accuracy in video queries.
Original language | English |
---|---|
Title of host publication | ACM International Conference on Multimedia Retrieval (ICMR)'2020 |
Publisher | Association for Computing Machinery (ACM) |
Pages | 548-554 |
Number of pages | 7 |
DOIs | |
Publication status | Published - 8 Jun 2020 |
Event | ACM International Conference on Multimedia Retrieval (ICMR) , 2020 - Dublin, Dublin, Ireland Duration: 8 Jun 2020 → 11 Jun 2020 http://icmr2020.org/index.html |
Conference
Conference | ACM International Conference on Multimedia Retrieval (ICMR) , 2020 |
---|---|
Country/Territory | Ireland |
City | Dublin |
Period | 8/06/20 → 11/06/20 |
Internet address |
Keywords
- object detection
- real-time
- anchor-free
- keypoint-based
- high-resolution representation