Projects per year
Abstract
Fine-grained action recognition datasets exhibit environmental bias, where even the largest datasets contain sequences from a limited number of environments due to the challenges of large-scale data collection. We show that multi-modal action recognition models suffer with changes in environment, due to the differing levels of robustness of each modality. Inspired by successes in adversarial training for unsupervised domain adaptation, we propose a multi-modal approach for adapting action recognition models to novel environments. We employ late fusion of the two modalities commonly used in action recognition (RGB and Flow), with multiple domain discriminators, so alignment of modalities is jointly optimised with recognition. We test our approach on EPIC Kitchens, proposing the first benchmark for domain adaptation of fine-grained actions. Our multi-modal method outperforms single-modality alignment as well as other alignment methods by up to 3%.
Original language | English |
---|---|
Title of host publication | 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
Subtitle of host publication | CVPR 2020 |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 119-129 |
Number of pages | 11 |
ISBN (Electronic) | 978-1-7281-7168-5 |
DOIs | |
Publication status | E-pub ahead of print - 5 Aug 2020 |
Event | Computer Vision and Pattern Recognition - Duration: 14 Jun 2020 → 19 Jun 2020 |
Publication series
Name | Conference on Computer Vision and Pattern Recognition (CVPR) |
---|---|
Publisher | IEEE |
ISSN (Electronic) | 2575-7075 |
Conference
Conference | Computer Vision and Pattern Recognition |
---|---|
Period | 14/06/20 → 19/06/20 |
Fingerprint
Dive into the research topics of 'Multi-Modal Domain Adaptation for Fine-Grained Action Recognition'. Together they form a unique fingerprint.Projects
- 1 Finished
-
LOCATE: LOcation adaptive Constrained Activity recognition using Transfer learning
4/07/16 → 3/05/18
Project: Research
Equipment
-
HPC (High Performance Computing) Facility
Sadaf R Alam (Manager), Steven A Chapman (Manager), Polly E Eccleston (Other), Simon H Atack (Other) & D A G Williams (Manager)
Facility/equipment: Facility
Profiles
-
Professor Dima Damen
- School of Computer Science - Professor in Computer Vision
- Bristol Vision Institute
- Visual Information Laboratory
Person: Academic , Member