

Introduction to Text-Mining with Python 1

Tuesday, 30 April, 2019 - 11:00 to 12:30

This session will introduce basic methods for reading and processing text files in Python. We will walk through an example that reads in a large text corpus, splits it into tokens (words) and sentences, removes unwanted words (stopwords), counts the words (frequency analysis), and...
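The steps named in this session description can be sketched in a few lines of standard-library Python. This is only an illustrative sketch, not the session's own material: the sample text, the tiny stopword set, and the function names `split_sentences`, `tokenize`, and `word_frequencies` are all assumptions made for the example, and the regular-expression splitting is a crude stand-in for a proper tokenizer.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; real lists are much longer.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "it", "on"}

def split_sentences(text):
    """Naively split text into sentences after '.', '!' or '?'."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]

def tokenize(text):
    """Lowercase the text and return runs of letters (and apostrophes) as tokens."""
    return re.findall(r"[a-z']+", text.lower())

def word_frequencies(text, stopwords=STOPWORDS):
    """Count the tokens that are not in the stopword set."""
    return Counter(t for t in tokenize(text) if t not in stopwords)

# Stand-in for a corpus; a real one would be read from a file instead.
sample = "The cat sat on the mat. The dog sat in the garden. It is a cat!"

sentences = split_sentences(sample)
frequencies = word_frequencies(sample)
print(len(sentences), "sentences")
print(frequencies.most_common(3))
```

For a real corpus, the `sample` string would be replaced by the contents of a text file, and a dedicated library would normally handle sentence splitting and stopword lists more robustly than these regular expressions do.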


Optical Character Recognition (OCR): An Introduction

Monday, 11 February, 2019 - 11:00 to 12:30

Optical Character Recognition is a term used to describe techniques for converting images containing printed or handwritten text into a format that can be searched and analysed computationally. This workshop will introduce several such tools along with some practical techniques for using them, and will also highlight OCR...


Creating Databases from Historical Sources

Monday, 25 February, 2019 - 11:00 to 12:30

This workshop will examine strategies for transforming a variety of sources, ranging from crumbling manuscripts to printed documents and books, into structured digital data.

Tutor: Oliver Dunn
Level: Introductory / Foundation

This course is open to Cambridge University research...


Creating Databases from Historical Sources

Thursday, 8 November, 2018 - 10:45 to 12:45

This workshop will provide solutions for database creation with sources from earlier times.


Introduction to Archival Photography

Tuesday, 30 October, 2018 - 11:00 to 12:30

Photography skills for those undertaking archival research.


The Library as Data: introduction to Cambridge University’s digital collections

Monday, 15 October, 2018 - 13:30 to 15:00

Explore the methods and tools used to analyse and visualise data.


From sources to data: historical people in the digital archive

Wednesday, 4 July, 2018 - 11:00 to 14:00

The creation of digital archives and editions of historical texts opens up new possibilities for the creation of structured data on human populations of the past. This workshop aims to open up the journey from archival sources to data to critical scrutiny from a range of different disciplines, with a specific focus on the...


Machine Reading the Archive 2018: end-of-programme showcase

Thursday, 14 June, 2018 - 11:30 to 15:30

This public workshop will mark the end of the 2018 programme of Machine Reading the Archive, a digital methods development programme organised by Cambridge Digital Humanities with the support of the Isaac Newton Trust and the Researcher Development Fund. It will showcase the digital archive projects created by our cohort...


Automatic Text Recognition: Diving into the background

Tuesday, 27 March, 2018 - 11:00 to 13:00

Speaker: Professor Roger Labahn (University of Rostock, READ project)

Automatic Text Recognition (ATR) is increasingly becoming an essential core component of application software in Digital Humanities. After years of applying "classical" OCR (Optical Character Recognition) to printed texts, we are now seeing...


Introduction to Omeka: build and publish a simple digital archive or collection

Tuesday, 8 May, 2018 - 11:00 to 13:00

Course leader: Dr Gabe Recchia

If you're involved in any kind of research, chances are you have some kind of collection of materials that would benefit from being put online. In this workshop, you'll learn how to put research materials online in a tagged and searchable form with Omeka. The second half of the workshop...
