Reading the First Books: Multilingual, Early-Modern OCR for Primeros Libros
Sergio Romero, University of Texas, Austin
Grant details: https://securegrants.neh.gov/publicquery/main.aspx?f=1&gn=HK-230965-15
Reading the First Books (Public Lecture or Presentation)
Title: Reading the First Books
Abstract: New projects to scan early modern printed books have radically increased global access to valuable historical documents. Machine readers, however, are woefully unsuited to the uneven inking, anachronistic characters, unfamiliar typefaces, inconsistent orthographies, and multilinguality that characterize these historical documents. The “Reading the First Books” project addresses these challenges through the development and implementation of Ocular, a new digital tool for reading and automatically transcribing books from this period. The project focuses on tailoring the tool for reading the Primeros Libros collection of books printed in the Americas during the first century of Spanish colonization.
This talk will introduce the practical and theoretical implications of this project for both librarians and scholars interested in colonial documents or cultural analytics. On a practical level, we will discuss how the integration of Ocular into the Early Modern OCR Project at Texas A&M will enable new transcription projects across institutions. From a theoretical position, we will consider how Ocular’s transcription process, which simultaneously analyzes patterns in inking, typography, language use, and orthography, opens new possibilities for academic research.
Reading the First Books is a collaboration between LLILAS Benson Latin American Studies and Collections at the University of Texas at Austin, and the Initiative for Digital Humanities, Media, and Culture at Texas A&M University. It is funded by an NEH Digital Humanities Implementation Grant.
Author: Hannah Alpert-Abrams
Location: John Carter Brown Library
The Electronic Edition of Colonial and Nineteenth-Century Latin American Texts: New Tools, New Models for Collaboration (Conference Paper/Presentation)
Title: The Electronic Edition of Colonial and Nineteenth-Century Latin American Texts: New Tools, New Models for Collaboration
Author: Hannah Alpert-Abrams
Abstract: This session brings together a group of experts for a conversation about new possibilities for digital research related to colonial and nineteenth-century Latin America. Hannah Alpert-Abrams of the University of Texas at Austin will speak on Ocular, an optical character recognition (OCR) tool that can read multilingual texts, including those involving indigenous languages. Nick Laiacona, founder of Performant Software Solutions, will discuss Juxta, a TEI-XML-based editing tool that provides an easy-to-use graphical interface and features for project management, including version control. Liz Grumbach, project manager for the Advanced Research Consortium and 18thConnect, will share her experiences creating communities to support the peer-review of electronic scholarship. Ralph Bauer of the University of Maryland will discuss the changes that are taking place at the Early Americas Digital Archive. This session has been designed as the starting point for what we hope will be an ongoing conversation about the Digital Humanities in our field.
Conference Name: Latin American Studies Association