Automated analysis of the US presidential elections using Big Data and network analysis

Saatviga Sudhahar, Giuseppe A. Veltri, Nello Cristianini

Research output: Contribution to journal › Article (Academic Journal) › peer-review



The automated parsing of 130,213 news articles about the 2012 US presidential elections produces a network formed by the key political actors and issues, which are linked by relations of support and opposition. The nodes are formed by noun phrases and the links by verbs, which directly express the action of one node upon the other. This network is studied by applying insights from several theories and techniques, and by combining existing tools in an innovative way, including graph partitioning, centrality, assortativity, hierarchy and structural balance. The analysis yields several patterns. First, we observe that the fundamental split between the Republican and Democrat camps can be easily detected by network partitioning, which provides a strong validation check of the approach adopted, as well as a sound way to assign actors and topics to one of the two camps. Second, we identify the most central nodes of the political camps. We also find that Clinton played a more central role than Biden in the Democrat camp; that the overall campaign focused heavily on the economy and rights; that the Republican Party (Grand Old Party or GOP) was the most divisive subject in the campaign and was portrayed more negatively than the Democrats; and that, overall, the media reported positive statements more frequently for the Democrats than for the Republicans. This is the first study in which political positions are automatically extracted and derived from a very large corpus of online news, generating a network that goes well beyond traditional word-association networks by means of richer linguistic analysis of texts.
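As a rough illustration of the kind of pipeline the abstract describes, the sketch below builds a small signed network from subject-verb-object triplets and then computes a normalised degree centrality, a triangle-based structural balance count, and a two-camp split. It is a minimal sketch under stated assumptions, not the authors' implementation: the triplets, the verb polarity lists, and all function names are hypothetical, and the sign-propagation split stands in for whatever partitioning method the paper actually uses.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical (subject, verb, object) triplets, invented for illustration only.
TRIPLETS = [
    ("Obama", "support", "Health Care"),
    ("Romney", "oppose", "Health Care"),
    ("Obama", "criticise", "Romney"),
    ("Biden", "praise", "Obama"),
    ("Biden", "attack", "Romney"),
    ("Ryan", "back", "Romney"),
    ("Ryan", "oppose", "Obama"),
]

# Assumed verb polarity lexicon (not taken from the paper).
POSITIVE = {"support", "praise", "back"}
NEGATIVE = {"oppose", "criticise", "attack"}


def build_signed_network(triplets):
    """Map each undirected node pair to +1 (support) or -1 (opposition)."""
    weights = defaultdict(int)
    for subj, verb, obj in triplets:
        if verb in POSITIVE:
            sign = 1
        elif verb in NEGATIVE:
            sign = -1
        else:
            continue  # verbs outside the lexicon carry no polarity here
        weights[frozenset((subj, obj))] += sign
    # Keep the majority sign; drop pairs whose statements cancel out.
    return {edge: (1 if w > 0 else -1) for edge, w in weights.items() if w != 0}


def degree_centrality(edges):
    """Fraction of the other nodes that each node is linked to."""
    degree = defaultdict(int)
    for edge in edges:
        for node in edge:
            degree[node] += 1
    n = len(degree)
    return {node: d / (n - 1) for node, d in degree.items()}


def triangle_balance(edges):
    """Count balanced and unbalanced triangles (balanced: sign product > 0)."""
    nodes = sorted({node for edge in edges for node in edge})
    balanced = unbalanced = 0
    for a, b, c in combinations(nodes, 3):
        sides = [frozenset(pair) for pair in ((a, b), (b, c), (a, c))]
        if all(side in edges for side in sides):
            product = edges[sides[0]] * edges[sides[1]] * edges[sides[2]]
            if product > 0:
                balanced += 1
            else:
                unbalanced += 1
    return balanced, unbalanced


def two_camps(edges, seed):
    """Propagate signs from a seed node: friends share a camp, opponents do not.

    This is a simplification that is only consistent on a balanced network.
    """
    camp = {seed: 1}
    frontier = [seed]
    while frontier:
        node = frontier.pop()
        for edge, sign in edges.items():
            if node in edge and len(edge) == 2:
                (other,) = edge - {node}
                if other not in camp:
                    camp[other] = camp[node] * sign
                    frontier.append(other)
    return camp


network = build_signed_network(TRIPLETS)
centrality = degree_centrality(network)
balance = triangle_balance(network)
camps = two_camps(network, "Obama")

print(sorted(centrality.items(), key=lambda item: -item[1]))
print("balanced, unbalanced triangles:", balance)
print(camps)
```

On a real corpus the triplets would come from a parser rather than a hand-written list, and the polarity of each verb would need a much larger lexicon; the toy data here is chosen only so that every step is easy to follow by hand.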
Original language: English
Pages (from-to): 1-28
Number of pages: 28
Journal: Big Data and Society
Issue number: 1
Early online date: 2 Mar 2015
Publication status: Published - 1 May 2015


  • Big Data
  • network analysis
  • structural balance
  • computational social science
  • mediascape
  • subject-verb-object

