Finding light in dark archives: Using AI to connect context and content in email

Stephanie Decker*, David Kirsch, Santhilata Kuppili Venkata, Adam Nix

*Corresponding author for this work

Research output: Contribution to journal, Article (Academic Journal), peer-review

5 Citations (Scopus)


Email archives are important historical resources, but access to such data poses a unique archival challenge, and many born-digital collections remain dark while questions of how they should be effectively made available remain to be answered. This paper contributes to the growing interest in preserving access to email by addressing the needs of users, in readiness for when such collections become more widely available. We argue that for the content of email to be meaningfully accessed, the context of email must form part of this access. In exploring this idea, we focus on discovery within large, multi-custodian archives of organisational email, where emails’ network features are particularly apparent. We introduce our prototype search tool, which uses AI-based methods to support user-driven exploration of email. Specifically, we integrate two distinct AI models that generate systematically different types of results, one based upon simple phrase matching and the other upon more complex BERT embeddings. Together, these provide a new pathway to contextual discovery that accounts for the diversity of future archival users, their interests and their levels of experience.
Original language: English
Pages (from-to): 859-872
Number of pages: 14
Journal: AI and Society
Issue number: 3
Early online date: 31 Dec 2021
Publication status: Published - 2022

Bibliographical note

Funding Information:
We gratefully acknowledge funding support by the Arts & Humanities Research Council (UK) and National Endowment for the Humanities (USA) as part of the US-UK Partnership Development Grants, Grant AH/T013060/1.

Publisher Copyright:
© 2021, The Author(s).

Research Groups and Themes

  • MGMT Strategy International Management and Business and Entrepreneurship
  • Digital Societies
  • MGMT theme Innovation and Digitalisation
  • Cultural Work


  • EMCODIST: A Context-based Search Tool for Email Archives

    Kuppili Venkata, S., Decker, S., Kirsch, D. & Nix, A., 13 Jan 2022, Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021. Chen, Y., Ludwig, H., Tu, Y., Fayyad, U., Zhu, X., Hu, X. T., Byna, S., Liu, X., Zhang, J., Pan, S., Papalexakis, V., Wang, J., Cuzzocrea, A. & Ordonez, C. (eds.). Institute of Electrical and Electronics Engineers (IEEE), p. 2281-2290 10 p. (Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021).

    Research output: Chapter in Book/Report/Conference proceeding, Conference Contribution (Conference Proceeding)

    Open Access
    2 Citations (Scopus)
    103 Downloads (Pure)
