George Mason University (Fairfax, VA 22030-4444)
Jessica Otis (Project Director: January 2019 to present)
Lincoln A. Mullen (Co-Project Director: May 2019 to present)
Datascribe: Enabling Structured Data Transcription in the Omeka S Web Platform

This project will create a structured data transcription module for the Omeka S platform, making it easier for scholars working with quantitative sources (such as government forms or institutional records) to transcribe them into structured data that can be analyzed or visualized.

This application requests a Level III Digital Humanities Advancement Grant to create Datascribe, a structured data transcription module, or plug-in, for the Omeka S platform for digital collections. Scholars often collect sources, such as government forms or institutional records, intending to transcribe them into datasets that can be analyzed or visualized. Existing software enables transcription into free-form text but not into tables of data. The proposed module will enable scholars to identify the structure of the data within their sources, speed up transcription, and reliably structure their transcriptions in a form amenable to computational analysis. Scholars will be able to turn sources into tables of data stored as numbers, dates, or categories. The module will build on the Omeka S platform, enabling scholars to display transcriptions alongside the source images and metadata, to crowdsource transcriptions, and to publish their results on the web.
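
To illustrate the difference between free-form text and a table of typed values, the following is a minimal Python sketch of what transcribing one row of a form into numbers, dates, and categories might look like. It is not the Datascribe module or any Omeka S API; the field names, the allowed categories, the date format, and the functions `parse_number`, `parse_date`, `make_category_parser`, and `transcribe_row` are all hypothetical and chosen only for illustration.

```python
from datetime import date, datetime


def parse_number(raw: str) -> int:
    """Turn a transcribed count such as '1,204' into an integer."""
    return int(raw.replace(",", "").strip())


def parse_date(raw: str) -> date:
    """Turn a transcribed date into a date value (assumes YYYY-MM-DD)."""
    return datetime.strptime(raw.strip(), "%Y-%m-%d").date()


def make_category_parser(allowed):
    """Build a parser that accepts only values from a fixed set of categories."""
    def parse(raw: str) -> str:
        value = raw.strip().lower()
        if value not in allowed:
            raise ValueError(f"unexpected category: {raw!r}")
        return value
    return parse


# Hypothetical structure for one row of a form: each field has a declared type.
SCHEMA = {
    "report_date": parse_date,
    "category": make_category_parser({"births", "deaths"}),
    "count": parse_number,
}


def transcribe_row(raw_row, schema):
    """Convert the free-text values of one row into typed values."""
    return {name: parser(raw_row[name]) for name, parser in schema.items()}


row = transcribe_row(
    {"report_date": "1920-01-15", "category": "Births", "count": "1,204"},
    SCHEMA,
)
print(row)
```

Declaring the type of each field up front means that a mistyped number or an unexpected category is rejected at transcription time, rather than surfacing later during analysis.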

History, General; Interdisciplinary Studies, General

Digital Humanities Advancement Grants

Digital Humanities

$324,733 (approved)
$324,733 (awarded)

9/1/2019 – 8/31/2022