Data Hazards: An open-source vocabulary of ethical hazards for data-intensive projects

Natalie Zelenka, Nina H. Di Cara*, Euan Bennet, Phil Clatworthy, Huw Day, Ismael Kherroubi Garcia, Susana Roman Garcia, Vanessa Aisyahsari Hanschke, Emma Siân Kuwertz

*Corresponding author for this work

Research output: Contribution to journal, Article (Academic Journal), peer-reviewed

Abstract

Understanding the potential for downstream harms from data-intensive technologies requires strong collaboration across disciplines and with the public. Having shared vocabularies of concerns reduces the communication barriers inherent in this work. The Data Hazards project [url] contains an open-source, controlled vocabulary of 11 hazards associated with data science work, presented as ‘labels’. Each label has (i) an icon, (ii) a description, (iii) examples, and, crucially, (iv) suggested safety precautions. A reflective discussion format and resources have also been developed. These have been created over three years with feedback from interdisciplinary contributors, and their use evaluated by participants (N=47). The labels include concerns often out-of-scope for ethics committees, like environmental impact. The resources can be used as a structure for interdisciplinary harms discovery work, for communicating hazards, collecting public input or in educational settings. Future versions of the project will develop through feedback from open-source contributions, methodological research and outreach.
Original language: English
Article number: 100110
Number of pages: 13
Journal: Journal of Responsible Technology
Volume: 21
Early online date: 12 Feb 2025
Publication status: Published - 1 Mar 2025

Bibliographical note

Publisher Copyright:
© 2025 The Authors
