Modeling clusters from the ground up: A web data approach

Christoph Stich, Emmanouil Tranos*, Max Nathan

*Corresponding author for this work

Research output: Contribution to journalArticle (Academic Journal)peer-review

5 Citations (Scopus)
77 Downloads (Pure)

Abstract

This paper proposes a new methodological framework to identify economic clusters over space and time. We employ a unique open source dataset of geolocated and archived business webpages and interrogate them using Natural Language Processing to build bottom-up classifications of economic activities. We validate our method on an iconic UK tech cluster – Shoreditch, East London. We benchmark our results against existing case studies and administrative data, replicating the main features of the cluster and providing fresh insights. As well as overcoming limitations in conventional industrial classification, our method addresses some of the spatial and temporal limitations of the clustering literature.
Original languageEnglish
Pages (from-to)1-24
Number of pages24
JournalEnvironment and Planning B: Urban Analytics and City Science
Early online date17 Jun 2022
DOIs
Publication statusE-pub ahead of print - 17 Jun 2022

Bibliographical note

Funding Information:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors acknowledge funding from the Consumer Data Research Centre (CDRC) and Engineering and Physical Sciences Research Council (ESRC). This paper represents the views of the authors, not the funders or data providers.

Publisher Copyright:
© The Author(s) 2022.

Fingerprint

Dive into the research topics of 'Modeling clusters from the ground up: A web data approach'. Together they form a unique fingerprint.

Cite this