Modelling Clusters From The Ground Up: A Web Data Approach

Christoph Stich, Emmanouil Tranos, Max Nathan

Research output: Other contribution


This paper proposes a new methodological framework to identify economic clusters over space and time. We employ a unique open source dataset of geolocated and archived business webpages and interrogate them using Natural Language Processing to build bottom-up classi- fications of economic activities. We validate our method on an iconic UK tech cluster – Shoreditch, East London. We benchmark our results against existing case studies and admin- istrative data, replicating the main features of the cluster and providing fresh insights. As well as overcoming limitations in conventional industrial classification, our method addresses some of the spatial and temporal limitations of the clustering literature.
Original languageEnglish
PublisherUniversity College London
Number of pages57
Publication statusPublished - 22 Jul 2021

Publication series

NameUCL Working Papers Series
PublisherUCL Centre for Advanced Spatial Analysis
ISSN (Print)1467-1298


Dive into the research topics of 'Modelling Clusters From The Ground Up: A Web Data Approach'. Together they form a unique fingerprint.

Cite this