April 5, 2022

from

04:00 PM

– 05:00 PM ET

Managing Messy Data, Part 2: Even Messier Data

Online

This one-hour workshop will build on concepts covered in Part 1, presenting more advanced aspects of OpenRefine, including connecting to APIs, reconciling data, and scraping the web for information.

Katie Wolf, Science and Technology Librarian at Fordham University Libraries, will introduce participants to parsing HTML with OpenRefine using a Project Gutenberg example, as well as working with JSON in OpenRefine using the Library of Congress API. Attendees will be able to dive deeper into the features and functions of OpenRefine, and will learn how to interact with a variety of data formats, transform them to fit their needs, and export the result to share with others. After completing the workshop, participants will be able to expand their OpenRefine projects and turn data collected from outside sources into something more usable.

If you did not attend the first session, it’s highly recommended that you watch the recording before attending Part 2. See the recording here.

Please review our Code of Conduct, our Statement on Viewpoints, and details on Interpreter Services.

Recording