The millions of digitized historic newspaper pages within Chronicling America, a joint initiative of the Library of Congress and the National Endowment for the Humanities, represent an incredibly rich resource for the American public. Historians, journalists, genealogists, students, and other members of the public regularly explore the collection via keyword search. But how do we navigate the abundant visual content? In this talk, I will present my project, Newspaper Navigator, created in collaboration with LC Labs, the National Digital Newspaper Program, and IT Design & Development at the Library of Congress, as well as Professor Daniel Weld at the University of Washington. In particular, I will discuss the two phases of Newspaper Navigator: extracting visual content from 16+ million pages in Chronicling America (resulting in the Newspaper Navigator dataset) and re-imagining how we search over the extracted visual content using the Newspaper Navigator search application. I will also discuss how this project can contribute to research in machine learning, human-computer interaction, and the digital humanities. I will conclude by contextualizing Newspaper Navigator within the large body of emerging work that foregrounds machine learning in libraries and other cultural heritage institutions.
- Newspaper Navigator dataset: https://news-navigator.labs.loc.gov/
- Newspaper Navigator search application: https://news-navigator.labs.loc.gov/search
- Newspaper Navigator project description & other links: https://bcglee.github.io/newspaper-navigator.html
About the Presenter
Ben Lee is a third-year Ph.D. student in the Paul G. Allen School of Computer Science & Engineering at the University of Washington, where he studies human-AI interaction with his advisor, Professor Daniel Weld. He was also a 2020 Innovator in Residence at the Library of Congress. After graduating from Harvard College, Ben served as the inaugural Digital Humanities Associate Fellow at the United States Holocaust Memorial Museum and as a Visiting Fellow in Harvard's History Department. He is currently a National Science Foundation Graduate Research Fellow, as well as the Richard Willner Memorial Fellow in the Stroum Center for Jewish Studies at the University of Washington.
About the Harvard Discovery Series
Formerly known as the Digital Futures Discovery Series, the Harvard Discovery Series is a collaboration between Cabot Science Library and the Harvard University Digital Scholarship Support Group (DSSG). This event series brings scholars on the frontiers of digital knowledge-making to a Harvard audience in an intimate and interactive setting. All presentations will be virtual during the 2020-2021 academic year.