Tools for Managing Digital Collections: OpenRefine

Speaker Kara Van Malssen
Kara Van Malssen is Senior Consultant for digital archiving and preservation projects at AudioVisual Preservation Solutions. She focuses on helping clients develop effective lifecycle management practices for audiovisual material, from production to preservation, and works to implement the policies and tools that will enable those practices to be realized. Some ...

Kara Van Malssen is Senior Consultant for digital archiving and preservation projects at AudioVisual Preservation Solutions. She focuses on helping clients develop effective lifecycle management practices for audiovisual material, from production to preservation, and works to implement the policies and tools that will enable those practices to be realized. Some of her recent clients include the Museum of Modern Art, Corporation for Public Broadcasting, Carnegie Hall, and HBO. Kara is Adjunct Professor at New York University, where she teaches Digital Preservation for the Moving Image Archiving and Preservation Program, from which she is also a graduate. She deals with messy metadata on a daily basis.

Full Description

Messy, inconsistent metadata makes collection management tasks tremendously challenging, yet they are unfortunately the reality for nearly all of us. In this workshop, participants will learn the basics of using OpenRefine (formerly Google Refine), "a free, open source power tool for working with messy data" to analyze, normalize, and clean up collections datasets. Participants will walk through several practical exercises using sample collections metadata that will provide an introduction to OpenRefine's features and start on the path toward data cleanup at their home institutions. Advanced OpenRefine topics, such as reconciliation of datasets against Freebase and other external datasets and web services will be discussed briefly, but not in any depth. This is an introductory workshop, ideal for those who are new to OpenRefine and are interested in exploring it's simple yet powerful features.

The Tools for Digital Collections series is a set of shorter, low-cost workshops that teach specific tips and tricks for managing digital content. They are presented as part of METRO's Keeping Collections project. For more information and future workshops, visit the project's
training information page.

When?

Wed, Apr. 17, 2013
4 p.m. - 6 p.m. US/Eastern

How Much?

Event has ended

Where?

METRO Training Center
57 East 11th Street, 4th Floor
New York, NY 10003