Turning PDFs into Research Data
With Jack Collins
Online, Starting Mar 13, 2025
Overview
Do you ever feel that the data you need for your research is accessible but it’s not in a convenient table, such as company reports or building plans?
Perhaps the information you need is spread out across many different documents?
If only we could read and extract structured data from thousands of written documents.
In this course, we explore how to accomplish this task by combining web scraping, Optical Character Recognition (OCR), and Natural Language Processing (NLP). Over four weeks, we provide online lessons and interactive sessions to learn the fundamentals of these key technologies.