Skip to Main Content

Archives as Data: Methods and Tools for Working with Archives as Data

This guide provides an overview of archival collections datasets (“Archives as Data”), primarily that made available by UCSF Archives and Special Collections, including guidance for accessing and using such data.

Programming Foundations for Data Science

These selected services offer training opportunities, both synchronous workshops as well as asynchronous resources for self-study: 

Digital Humanities Tutorials

These selected sites include tutorials for a range of methods/tools:

Textual Analysis Methods and Tools

These selected services and resources offer useful information and training for getting started:

  • HathiTrust Research Center (HTRC) - HTRC enables computational analysis of the HathiTrust corpus, which includes digitized text from more than 17 million volumes contributed to the HathiTrust digital repository by partner research libraries.
  • The Data-Sitters Club - Using digitized text from Baby-Sitters Club book series as a corpus, this initiative offers a  fun way to learn about computational text analysis for digital humanities through numerous descriptive process walkthroughs and candid tool evaluations.