Build Your Own Text-as-Data Corpus: A Print-to-Bytes Primer

This hands-on workshop will teach participants how to construct their own digital text corpus for conducting humanities data analysis. We'll cover simple tools for turning printed texts in a variety of languages into computer-readable files, the use of Optical Character Recognition (OCR) software, and consider helpful tools for post-process correction of digitized texts. We’ll also [...]