Self-Supervised Multimodal Fusion Transformer for Passive Activity Recognition

Armand K. Koupai, Mohammud J. Bocus*, Raul Santos‐rodriguez, Robert J. Piechocki, Ryan Mcconville

*Corresponding author for this work

Research output: Contribution to journalArticle (Academic Journal)peer-review

7 Citations (Scopus)
111 Downloads (Pure)

Abstract

The pervasiveness of Wi-Fi signals provides significant opportunities for human sensing and activity recognition in fields such as healthcare. The sensors most commonly used for passive Wi-Fi sensing are based on passive Wi-Fi radar (PWR) and channel state information (CSI) data, however current systems do not effectively exploit the information acquired through multiple sensors to recognise the different activities. In this paper, we explore new properties of the Transformer architecture for multimodal sensor fusion. We study different signal processing techniques to extract multiple image-based features from PWR and CSI data such as spectrograms, scalograms and Markov transition field (MTF). We first propose the Fusion Transformer, an attention-based model for multimodal and multi-sensor fusion. Experimental results show that our Fusion Transformer approach can achieve competitive results compared to a ResNet architecture but with much fewer resources. To further improve our model, we propose a simple and effective framework for multimodal and multi-sensor self-supervised learning (SSL). The self-supervised Fusion Transformer outperforms the baselines, achieving a F1-score of 95.9%. Finally, we show how this approach significantly outperforms the others when trained with as little as 1% (2 minutes) of labelled training data to 20% (40 minutes) of labelled training data.
Original languageEnglish
Pages (from-to)149-160
Number of pages12
JournalIET Wireless Sensor Systems
Volume12
Issue number5-6
DOIs
Publication statusPublished - 10 Nov 2022

Bibliographical note

Funding Information:
This work was performed as a part of the OPERA Project, funded by the UK Engineering and Physical Sciences Research Council (EPSRC), Grant EP/R018677/1.

Publisher Copyright:
© 2022 The Authors. IET Wireless Sensor Systems published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology.

Keywords

  • eess.SP
  • cs.CV
  • cs.HC
  • cs.LG

Fingerprint

Dive into the research topics of 'Self-Supervised Multimodal Fusion Transformer for Passive Activity Recognition'. Together they form a unique fingerprint.

Cite this