14 Nov 2022 09:00 – 10:00 Online - Zoom


Data which you have captured rather than created yourself is likely to need cleaning up before you can use it effectively. This short session will introduce you to the basic principles of creating structured datasets and walk you through some case studies in data cleaning with OpenRefine, a powerful open source tool for working with messy data.

  • Structuring your data
  • Cleaning messy textual data with OpenRefine
  • Batch processing file names

Target audience

CDH Basics sessions are open to University of Cambridge staff and graduate students who want to learn and apply digital methods and use digital tools in their research. We also welcome Open-Oxford-Cambridge DTP students.

Cambridge Digital Humanities

Tel: +44 1223 766886
Email: enquiries@crassh.cam.ac.uk