If you’re looking to make connections with other digital humanists, want to explore new tools, read up on the latest research coming from the discipline, or need help locating a humanities set data, this page is a good place to start.
There are some corpora of humanities data sets available for use. A few are listed below:
The HathiTrust boasts one of the largest digital collection around, comprising more than 13 million volumes, around 2.7 million in the public domain. This is a joint project comprising partnerships from over 90 academic libraries.
HathiTrust Research Center
The HathiTrust Research Center (HTRC) is a collaboration between Indiana University, the University of Illinois, and the HathiTrust that enables computational access to works in the public domain and, in the coming future, works in copyright from the collection in the HathiTrust. A new set of tools has been released to interact with the vast corpus that the HathiTrust comprises. You can perform such operations as text mining and topic modeling.
The Internet Archive is a non-profit digital library that offers free universal access to books, movies, music, and more.
OPenn is an online archive of high resolution images of cultural heritage materials. The collection was developed through Penn State and each collection has machine-readable descriptions and technical metadata. The collections are in the public domain or released under Creative Commons Licenses.
Text Creation Partnership/University of Oxford Text Archive
This archive has a collection of electronic literary and linguistic resources available for download that can be used as data sets. Multiple download formats are supported, including XML, HTML, and plain text.
Women Writers Projects
The Women Writers Project is a research project that is dedicated to Early Modern women’s writing and electronic text encoding. The goal is to make pre-Victorian women’s writings accessible to wide audiences.
Doc South Data
Documenting The American South provides access to text in the “Documenting the American South” project. The text can be downloaded and studied.
Digital Public Library of America (DPLA)
The DPLA provides open access to digitized materials from libraries, archives, and museums around the United States. It seeks to be a resource for students, teachers, scholars, and the public.
Folger Digital Texts
Folger Digital Texts is an online database of PDFs and source of Shakespeare’s plays. The website has fee downloads, XML, and pdfs.
JSTOR Data for Research
Digitized and optically recognized corpora offered by JSTOR, Data for Research (DfR) includes a set of tools you can use to interact with content from the JSTOR archive. You can also request data sets and download in bulk.
Visiting HathiTrust Research Center Digital Humanities Specialist
Numeric and Spatial Data Librarian, Head of the Scholarly Commons
ATLAS (Applied Technologies for Learning in the Arts and Sciences)
ATLAS provides consulting, training, and support for the College of Liberal Arts and Sciences in areas such as statistics, GIS, web development, and digital media.
The Cline Center has a number of data sets that are publicly available through the Societal Infrastructures and Development (SID) project. They work to foster research and data based projects through the University of Illinois at Urbana-Champaign campus.
Data Services at the Scholarly Commons
Center for Informatics Research in Science and Scholarship (CIRSS)
CIRSS is a center within the Graduate School of Library and Information Science that focuses on information problems in scientific and scholarly research and how digital information can advance work in these areas.
The Illinois Program for Research in the Humanities. IPRH holds many talks and events of interest to the digital humanities.
The Media Commons is located in the Undergraduate Library. The media commons specializes in visual and audio resources, and it includes a video lab and equipment that library card holders can check out.
DH Curation Guide
The Digital Humanities has a rich history of allowing access to wide swaths of research, both in journal as well as book form. The items below are a good representation of this practice and are exemplary in the field.
The Stone and the Shell
Ted Underwood, Associate Professor of English at the University of Illinois at Urbana-Champaign, blogs about Digital Humanities, tools, and trends in the discipline.
Digital Humanities Now
This blog showcases current scholarship, news, trends, as well as recent job postings in the Digital Humanities.
Digital Humanities Questions & Answers
This is a Q&A board for digital humanities questions. It is a collaborative project of the Association for Computers and the Humanities and the Chronicle of Higher Education’s ProfHacker.