From Detection to Anticipation: Online Understanding of Struggles across Various Tasks and Activities

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

Abstract

Understanding human skill performance is essential for intelligent assistive systems, with struggle recognition offering a natural cue for identifying user difficulties. While prior work focuses on offline struggle classification and localization, real-time applications require models capable of detecting and anticipating struggle online. We reformulate struggle localization as an online detection task and further extend it to anticipation—predicting struggle moments before they occur. We adapt two off-the-shelf models as baselines for online struggle detection and anticipation. Online struggle detection achieves 70–80% per-frame mAP, while struggle anticipation up to 2 seconds ahead yields comparable performance with slight drops. We further examine generalization across tasks and activities and analyse the impact of skill evolution. Despite larger domain gaps in activity-level generalization, models still outperform random baselines by 4–20%. Our feature-based models run at up to 143 FPS, and the whole pipeline, including feature extraction, operates at around 20 FPS — sufficient for realtime assistive applications.
Original languageEnglish
Title of host publication2026 IEEE/CVF Winter Conference on Applications of Computer Vision
PublisherIEEE Computer Society
Publication statusAccepted/In press - 6 Mar 2026
EventThe IEEE/CVF Winter Conference on Applications of Computer Vision 2026 - JW Marriott Starpass, Tucson, United States
Duration: 6 Mar 202610 Mar 2026
https://wacv.thecvf.com/

Publication series

NameIEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
PublisherIEEE
ISSN (Print)2472-6737
ISSN (Electronic)2642-9381

Conference

ConferenceThe IEEE/CVF Winter Conference on Applications of Computer Vision 2026
Abbreviated titleWACV 2026
Country/TerritoryUnited States
CityTucson
Period6/03/2610/03/26
Internet address

Keywords

  • Struggle determination
  • Deep learning
  • Datasets
  • Egocentric Action Recognition
  • Egocentric Vision
  • Action Recognition
  • Pattern Recognition, Visual

Fingerprint

Dive into the research topics of 'From Detection to Anticipation: Online Understanding of Struggles across Various Tasks and Activities'. Together they form a unique fingerprint.

Cite this