Data Bytes: Extracting Data from PDFs with Python
Description
Ever found yourself with a collection of information-rich PDFs that you wished you could easily combine into an analysis-ready data set? Join Johns Hopkins Data Services in this Data Bytes session as they provide an overview of the kinds of data that may be present in PDFs and demo several Python packages that you can use to extract and combine it.
Data Bytes are short data-related talks, hosted by Data Services, and offered during lunch on Mondays. Visit the Data Bytes schedule to see all Data Bytes sessions this fall.
Who can attend?
- Faculty
- Staff
- Students