9 Jan 2023 – 13 Jan 2023
There will be 3–4 hours of sessions per day, scheduled between 11am and 5pm
Online


Applications have now closed.

The Social Data School is an application-only, online intensive teaching programme structured around the life-cycle of a digital research project. It covers principles of research design, data collection, cleaning and preparation, methods of analysis and visualisation, and data management and preservation practices.


Theme: Visualising Data and Investigating Images

At this Social Data School, we will focus on the image as a place of inquiry for investigations. We will extract data from social networks, and multiple video and photographic sources, and learn how computer vision and automation can help in the investigation of public interest stories and projects. In short, we will develop a critical toolkit for interrogating the image-based cultures of the digital age.

Modules will cover the following content:

  • Methodology for Digital Investigations
  • Video and Image Data Analysis
  • Geolocation and Open Source Investigations
  • Introduction to Critical AI
  • Introduction to Social Media data mining, analysis and visualisation
  • Data Spatialisation using Python and Blender (TBC)

Who can apply?

The school welcomes applications from all backgrounds.

You might work in journalism, marketing, academia, an NGO, activism, a trade union, a civil society organisation or the civil service. Anyone who works with social data is welcome to apply.

No previous experience of coding is required and there are no specific academic requirements; however, the course content is broadly suitable for those with an undergraduate degree or equivalent professional experience. The School is taught in English. You will need a reliable internet connection to join in, and the ability to download free, open software for use during the School.

We are committed to facilitating participation by women and by black and minority ethnic candidates, as they have historically been under-represented in the technology and data science sector. We also welcome applications from outside the UK, provided applicants can attend the live workshop slots between 11am and 4pm GMT. Sessions will not be recorded, so live attendance is required.

When and Where

This school will be held online: 9–13 January 2023. Data School live sessions are timetabled daily from 11am–4pm (GMT). To convert this to your timezone you can use this Time Zone Converter.

During the course you will be provided with links to our virtual learning environment (Moodle) where we will publish course content and links to our online video delivery platform for teaching and social interactions.

An in-person Social Data School is also scheduled for March 2023, and those interested in attending under that format will be welcome to apply once the application process opens.


Sessions will include live-taught instruction, demonstrations and discussions online, with access to self-paced study materials and support via email-based discussion groups between sessions. Participants will need a laptop or desktop computer and internet access to participate in the sessions. Some sessions will require software installation – full instructions will be provided but please ensure you have access rights to install software on the device you will be using.

You can download a PDF version of the programme here: Social Data School Jan 2023


  • £245 per person

This fee covers 13 hours of teaching, access to resources, discussion groups with top practitioners and technical drop-in sessions.

Cambridge Digital Humanities is committed to democratising access to digital methods and tools, and is offering subsidised participation fees to encourage applications from those who do not normally have access to this type of training. There is a limited number of concessionary places for students, unemployed people, community projects, unfunded projects and Global South residents. In addition, a small number of bursaries are available to those who can demonstrate financial need. You can apply for a bursary on the application form.

The deadline for payment is two weeks before the start of the School.

Q&A Session

Do you have questions about how the school will be run, the content, and how to apply? On 5 December, we held a Q&A session with the school’s convenors. You can watch the recording here:

How to apply

Fill in the application form by 11 December 2022. You will hear whether your application was successful or not by 14 December 2022.

The Social Data School is application-only with limited places. In your application, make the best use of the free-text sections to explain your current experience and what you would gain from attending the School.

Apply here: https://docs.google.com/forms/d/e/1FAIpQLSfyd3Eiv9oSxAQMQEwr80QhsVU8PUYnbz-_bciHxsc9Cix4jA/viewform?usp=sf_link


Monday 9 January, 1:35–2:00pm

Introduction and welcome

Monday 9 January, 2:00–3:00pm

Session 1: Methodology for Digital Investigations

This module addresses fundamental aspects of investigative practice in digital environments and dwells on the importance of using one or more methodologies for data inquiry. Researchers who conduct investigations using Open Source Intelligence (OSINT) tools and data collection and analysis, or who develop automated tools for investigations, will benefit from this module. It critically reflects on the essential phases of digital investigations at large: Identification of a Problem (formulation of hypotheses), Information Gathering, Preservation, Verification, Analysis, and Dissemination.

By the end of the module, participants will have the principles needed to conduct investigations that effectively identify, prove, and strategically disseminate issues in the public interest, with fairness and rigour. The module is meant to be applied alongside the rest of the tools and methods from the SDS 2022 modules.

Monday 9 January, 3:15–4:15pm

Session 2: (Pre-recorded Lecture) Video Data Analysis I

This module aims to highlight how Video Data Analysis can be used for explorative and investigative research. It covers how the method has evolved since the 1990s, to examine how it can be used and critically understood today. The two sessions focus on how to approach videos for research, data gathering, and storytelling/content creation purposes, and will provide analytical techniques for extrapolating insights from video content. These techniques will include how to code and sample video content, sequence videos, and construct concepts from situational and spatial dynamics. The module will also present a short case study of how it is possible, in some scenarios, to triangulate videos with other videos via a ‘ready-made multi-camera’ method.

Importantly, the course will also address future trends in Video Data Analysis and propose an experimental methodology for capturing, documenting, and analysing livestreamed videos that are synchronic, ephemeral, and therefore difficult to analyse.

Tuesday 10 January, 11:00am–12:00pm

Incubator AM


Tuesday 10 January, 2:00–3:00pm

Session 3: (Workshop Module) Social Network Analysis with Digital Data I

“Social network” has become a catch-all term for the online spaces where we connect with other people and trade information in exchange for our personal data and attention. Considering the societal impacts of data-driven economics and politics, knowing how to reclaim and reappropriate these data to trace the form and content of online social networks is a vital skill for journalists, civil society and academics alike.

This module will provide a gentle introduction to the field of social network analysis (SNA) with digital data. Social Data School participants will be given the opportunity to “learn by doing” the process of digital data collection as well as the basics of social network visualisation and analysis. After being introduced to the fundamental concepts of SNA, participants will explore all stages of a social network analysis project, including research design, data collection, data wrangling, graph visualisation, and analysis with essential network measures. The focus will be on the retrieval of electronic archival data (e.g., from social media platforms) for non-programmers, and on practical examples of network analysis with specialised software (e.g., Gephi). At the end of the two sessions, participants will be equipped with the basic tools to perform meaningful visualisations and analyses of network data. Typical use cases of SNA range from investigative journalism to NGO monitoring and academic research.
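
For readers curious about what "essential network measures" can look like in practice, here is a minimal Python sketch. It is not part of the course materials: the edge list and account names are invented for illustration, and the sessions themselves use specialised software such as Gephi rather than code. The sketch computes two common measures, degree centrality and density, for a small undirected network.

```python
from collections import defaultdict

# Hypothetical edge list: each pair is an undirected tie between two accounts.
EDGES = [
    ("ana", "ben"),
    ("ana", "cho"),
    ("ana", "dee"),
    ("ben", "cho"),
]


def build_adjacency(edges):
    """Return a dict mapping each node to the set of its neighbours."""
    adjacency = defaultdict(set)
    for source, target in edges:
        adjacency[source].add(target)
        adjacency[target].add(source)
    return dict(adjacency)


def degree_centrality(adjacency):
    """Share of the other nodes that each node is directly connected to."""
    n = len(adjacency)
    if n < 2:
        return {node: 0.0 for node in adjacency}
    return {node: len(neighbours) / (n - 1) for node, neighbours in adjacency.items()}


def density(adjacency):
    """Ratio of existing ties to all possible ties in an undirected network."""
    n = len(adjacency)
    if n < 2:
        return 0.0
    tie_count = sum(len(neighbours) for neighbours in adjacency.values()) / 2
    return tie_count / (n * (n - 1) / 2)


if __name__ == "__main__":
    graph = build_adjacency(EDGES)
    for node, score in sorted(degree_centrality(graph).items()):
        print(f"{node}: {score:.2f}")
    print(f"density: {density(graph):.2f}")
```

In this toy network, "ana" is tied to every other account and so has the highest degree centrality, while the density shows that four of the six possible ties are present.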

Tuesday 10 January, 4:30–5:30pm

Incubator PM

Wednesday 11 January, 2:00–3:00pm

Session 4: (Workshop Module) Geolocation and open source investigations I

This session will cover geolocation, a crucial stage of any open-source investigation. Geolocation seeks to answer a key question: where did the events depicted happen? We will explore the basic principles of geolocation and introduce participants to a range of tools and techniques. We will cover essential resources including Google Earth Pro and Mapillary, and highlight the advantages of different data sources in a platform-agnostic manner.

This workshop aims to encourage a reflexive and critical approach to open-source data, introducing practical skills while emphasising the importance of ethical and transparent research methods. Drawing from the human rights sphere, this methodology is useful for scholars and citizens using open source data such as social media content, online databases and satellite images.

By the end of this session, participants will be able to identify useful clues in online content, perform reverse image searches and combine satellite information with street-view data.

Wednesday 11 January, 3:15–4:15pm

Session 5: Introduction to Critical AI

The current generation of machine learning Artificial Intelligence systems is now widely deployed in contexts as diverse as recommender systems for online shopping and streaming music and video services, facial recognition and biometric systems used by state and private security agencies, and the analysis, summarisation and generation of texts and images. This module will present the technical fundamentals of machine learning systems, exploring the challenges of structural bias, lack of transparency and the impact that the design of contemporary AI has on communities and individuals who face structural discrimination. We will demonstrate web-based platforms for creating Machine Learning models and learn about experimental techniques for exploring their potential and limitations.

Wednesday 11 January, 4:30–5:30pm

Session 6: Automations for Investigations (Public Event)

Thursday 12 January, 2:00–3:00pm

Session 7: (Workshop Module) Geolocation and open source investigations II

Thursday 12 January, 3:00–4:00pm

Session 8: (Workshop Module) Social Network Analysis with Digital Data II

Thursday 12 January, 4:00–5:00pm

Session 9: (Projects Discussion) Social Network Analysis with Digital Data (TBC)


Friday 13 January, 2:00–3:00pm

Session 10: (Workshop) Video Data Analysis II

Friday 13 January, 3:00–4:00pm

Session 11: Data Spatialisation using Python and Blender I (TBC)

In the first session we will look at the interface of Blender, and talk about the various workspaces and 3D tools available and how they relate to data visualisation and spatialisation. From here we will import a geographical shapefile using an add-on called Blender GIS and manipulate it to create a 3D height field. We will use a node-based shader to apply a gradient to it. Through this process we will develop an understanding of how to manipulate objects in 3D space and how to use colour and shading to communicate gradation within the data.

In the second session we will look into the Blender text editor, the interactive console, and the system console to understand ways of working with Python. We will look at a script that reads a CSV (comma-separated values) file. We will look at the process of iterating through columns and rows of data, using Python to output a result into 3D space. This will allow us to develop a methodology for spatialising datasets which are bespoke, hand-crafted, or obscure.
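
As a rough illustration of the kind of script described for the second session, here is a minimal sketch. It is not the script used in the School: the column names (`x`, `y`, `value`), the sample data and the `height_scale` parameter are assumptions made for this example. The sketch reads CSV rows, converts each row into a position in 3D space and, only when run inside Blender (where the `bpy` module is available), adds a small cube at each position using the `bpy.ops.mesh.primitive_cube_add` operator.

```python
import csv
import io

try:
    import bpy  # Only available inside Blender's bundled Python.
except ImportError:
    bpy = None

# Hypothetical data; in practice this would be read from a CSV file on disk.
SAMPLE_CSV = """x,y,value
0,0,1.5
1,0,3.0
0,1,0.5
"""


def rows_to_points(csv_text, height_scale=1.0):
    """Turn each CSV row into an (x, y, z) position, with z taken from 'value'."""
    reader = csv.DictReader(io.StringIO(csv_text))
    points = []
    for row in reader:
        x = float(row["x"])
        y = float(row["y"])
        z = float(row["value"]) * height_scale
        points.append((x, y, z))
    return points


def place_points(points):
    """Add one small cube per point when running inside Blender."""
    if bpy is None:
        return 0  # Outside Blender there is no scene to draw into.
    for location in points:
        bpy.ops.mesh.primitive_cube_add(size=0.2, location=location)
    return len(points)


if __name__ == "__main__":
    points = rows_to_points(SAMPLE_CSV, height_scale=2.0)
    print(points)
    place_points(points)
```

Keeping the CSV parsing separate from the Blender call means the data handling can be checked in any Python interpreter before the script is pasted into Blender's text editor.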

Friday 13 January, 4:00–5:00pm

Session 12: Data Spatialisation using Python and Blender II (TBC)

Friday 13 January, 5:15–6:15pm

Session 13: Closing plenary, presentations and next steps

Cambridge Digital Humanities

Tel: +44 1223 766886
Email: enquiries@crassh.cam.ac.uk