Text-mining the archive 1

24 Apr 2018

11:00 - 13:00

S2, Alison Richard Building

Description

Description

Course leader: Dr Paul Nulty
This session will introduce basic methods for reading and processing text files in Python. We will proceed slowly through an example that demonstrates reading in a large text corpus from structured or unstructured files, basic string processing, word frequency counting, and syntactic analysis. Attendees will learn to perform these tasks in base Python and using the natural language processing library spaCy. No prior knowledge of Python is assumed, but attendees may wish to follow an online guide to configure a python installation in order to follow along with the examples.

Pre-registration is essential: please book here

PhD students and staff from the University of Cambridge have priority for bookings on this course – if the course appears fully booked and you fall into this category please contact the course organisers directly

Cambridge Digital Humanities

About CDH

Get in touch

Cambridge Digital Humanities
University of Cambridge
17 Mill Lane
Cambridge
CB2 1RX

Email: admin@cdh.cam.ac.uk

Legal links

Stay Connected

Receive our Newsletter

Web Design by Chameleon Studios

Cambridge Digital Humanities

Tel: +44 1223 766886
Email enquiries@crassh.cam.ac.uk