Influence reasoning capabilities of large language models in social environments

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

15 Downloads (Pure)

Abstract

We ask whether state-of-the-art large language models can provide a viable alternative to human annotators for detecting and explaining behavioural influence online. Working with a large corpus of online interactions retrieved from the social media platform Mastodon, we cross-examine a dataset containing 11,000 LLM influence labels and explanations across nine state-of-the-art large language models from 312 scenarios. We use a range of resolution categories and four stages of shot prompting to further measure the importance of context to language model performance. We also consider the impact of model architecture, and how social media content and features from the explanation impact model labelling accuracy. Our experiment shows that whilst most large language models struggle to identify the correct framing of influence from an interaction, at lower label resolutions, models like Flan and GPT-4 Turbo perform with an accuracy of 70%-80%, demonstrating encouraging potential for future social influence identification and explanation, and contributing to our understanding of the general social reasoning capabilities of large language models.
Original languageEnglish
Title of host publicationProceedings of the 2024 AAAI Fall Symposia
PublisherAAAI Press
Pages40-47
Number of pages8
ISBN (Electronic)9781577358947
DOIs
Publication statusPublished - 8 Nov 2024
EventAI Trustworthiness and Risk Assessment for Challenged Contexts: AAAI 2024 Fall Symposium - Westin Arlington Gateway, Arlington, United States
Duration: 7 Nov 20249 Nov 2024
https://sites.google.com/view/aaai-atracc

Publication series

NameProceedings of the AAAI Symposium Series
PublisherAAAI
Number1
Volume4
ISSN (Electronic)2994-4317

Conference

ConferenceAI Trustworthiness and Risk Assessment for Challenged Contexts
Abbreviated titleATRACC
Country/TerritoryUnited States
CityArlington
Period7/11/249/11/24
Internet address

Research Groups and Themes

  • Cyber Security

Fingerprint

Dive into the research topics of 'Influence reasoning capabilities of large language models in social environments'. Together they form a unique fingerprint.

Cite this