Using Multiple Strategies To Find Connections In Digital Archives: Making Sense of Historical Data
Connie Lee Lester, University of Central Florida, United States of America, Connie.Lester@ucf.edu
2014-12-19T13:50:00Z
Paul Arthur, University of Western Sydney, Locked Bag 1797, Penrith NSW 2751, Australia

Long Paper
Keywords: Archives; Sensemaking; Connections; Interdisciplinary; Visualization
Topics: databases & dbms; historical studies; metadata; natural language processing; interdisciplinary collaboration; visualisation; spatio-temporal modeling, analysis and visualisation
Language: English

RICHES Mosaic Interface™ (RICHES MI; https://richesmi.cah.ucf.edu) is the interactive and innovative digital platform for the Regional Initiative for Collecting the History, Experiences, and Stories (RICHES™) of Central Florida, an interdisciplinary project housed in the History Department at the University of Central Florida in Orlando. The mission of RICHES MI is to enable users to search the database using natural language, tags, topics, and categories to maximize their search results; analyze the results of their search using the ‘Connections’ module, which uses a RICHES™-developed algorithm to show the relationship between the returned item and other items in the digital archive; visualize results through digital exhibits, map overlays, and visualizations; and learn regional history and historical methods as teachers and students use the RICHES™-produced digital modules and source sets. Combining multiple search and analytical tools in an interactive database offers a more effective approach to historical analysis and more closely approximates the process that historians traditionally use in research projects.

As historians moved into the digitization of archival collections, the initial excitement of having ready access to rare documents and images took precedence over the search and analysis problems that accompanied this undertaking. Recent conference presentations and white papers acknowledge these problems and advance solutions that call for greater collaboration between historians and archivists to write more detailed metadata and to offer textual analysis tools for understanding larger datasets. A 2011 National Endowment for the Humanities–funded project on changing research practices among historians called for increased digitization of archival sources, the creation of new tools for interactive use of digital sources, and the development of capabilities for connecting smaller archives to larger repositories (Rutner and Schonfeld, 2012). In a discussion on digital history published in the Journal of American History, Daniel Cohen predicted that the next iteration of scholarship would include ‘methods like collaborative filtering and recommendation systems’ (Cohen et al., 2008). Historians have followed the lead of digital literature scholars in using data mining and text analysis tools (Nelson et al., 2012), but we believe that historians need to view the results of such tools in context with other documents in order to gain insight into their broader meaning.

RICHES MI addresses a number of issues previously raised by digital historians and moves into the realm of analysis through the Connections tool. Still in its early stages of development, Connections intersects with scholarship on sensemaking, which is most frequently associated with intelligence analysis and journalism (Pirolli and Card, 2005; Pirolli and Russell, 2011). In the model advanced by Pirolli and Card, ‘The sensemaking process is organized into two major loops of activities . . . a foraging loop . . . [and] a sensemaking loop’. Our system seeks to address some of the challenges posed by the search for relationships between data and to apply leverage at some key points of the sensemaking process.

Designed and developed by an interdisciplinary team of historians and computer scientists, RICHES MI was constructed using several open-source programs, including Omeka, Google Maps, and MALLET (topic modeling). Plugins that permit users to contribute to the database and that enable the mapping of multiple sites for a single item were added. The Connections tool (which can be used in the Search and in the Bookbag) provides users with a tree diagram of related items and enables them to visualize the connections by time, location, tags, and topics. Finally, users can save selected items to a Bookbag, where they can organize the items into folders and annotate them. By following a well-known sensemaking process, our system supports digital research that more closely replicates the archival experience across multiple collections and large datasets.
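
The RICHES™-developed algorithm behind Connections is not described in this abstract. Purely as an illustration of how items in an archive might be related by time, location, tags, and topics, the following Python sketch scores each of the four facets separately and ranks candidate items by their unweighted mean. Everything in it is an assumption made for the example: the `Item` fields, the function names, the 25-year and 50-kilometre windows, the equal weighting, and the three invented sample records. None of it is taken from RICHES MI or Omeka.

```python
from dataclasses import dataclass
from math import asin, cos, radians, sin, sqrt


@dataclass(frozen=True)
class Item:
    """A hypothetical archive record; field names are illustrative only."""
    title: str
    year: int
    lat: float
    lon: float
    tags: frozenset
    topics: frozenset


def jaccard(a, b):
    """Share of labels two items have in common (0.0 when both sets are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points."""
    dlat = radians(lat2 - lat1)
    dlon = radians(lon2 - lon1)
    h = sin(dlat / 2) ** 2 + cos(radians(lat1)) * cos(radians(lat2)) * sin(dlon / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(h))


def connection_scores(a, b, year_window=25, km_window=50.0):
    """Score each facet in [0, 1]; both windows are arbitrary assumptions."""
    return {
        "time": max(0.0, 1 - abs(a.year - b.year) / year_window),
        "location": max(0.0, 1 - haversine_km(a.lat, a.lon, b.lat, b.lon) / km_window),
        "tags": jaccard(a.tags, b.tags),
        "topics": jaccard(a.topics, b.topics),
    }


def related_items(seed, archive, limit=5):
    """Rank the other items by the unweighted mean of their facet scores."""
    ranked = []
    for item in archive:
        if item is seed:
            continue
        scores = connection_scores(seed, item)
        ranked.append((sum(scores.values()) / len(scores), item.title, scores))
    ranked.sort(key=lambda r: r[0], reverse=True)
    return ranked[:limit]


# Invented sample records, used only to exercise the functions above.
archive = [
    Item("Item A", 1925, 28.54, -81.38, frozenset({"citrus", "railroad"}), frozenset({"agriculture"})),
    Item("Item B", 1930, 28.60, -81.34, frozenset({"citrus"}), frozenset({"agriculture"})),
    Item("Item C", 1995, 30.33, -81.66, frozenset({"tourism"}), frozenset({"recreation"})),
]
results = related_items(archive[0], archive)
for mean_score, title, facets in results:
    print(title, round(mean_score, 2), facets)
```

Keeping the four facet scores separate, rather than collapsing them into a single number straight away, is one way a display could show a user why two items are connected (shared tags, nearby dates, nearby places) and not only that they are.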

Bibliography

Cohen, D. J., et al. (2008). Interchange: The Promise of Digital History. Journal of American History, 95(2) (September).

Nelson, R. K., et al. (2012). A Conversation with Digital Historians. Southern Spaces: An Interdisciplinary Journal about Regions, Places, and Cultures of the U.S. South and Their Global Connections, https://southernspaces.org/2012/conversation_digital_historians.

Pirolli, P. and Card, S. (2005). The Sensemaking Process and Leverage Points for Analyst Technology as Identified through Cognitive Task Analysis. In Proceedings of International Conference on Intelligence Analysis. Vol. 5. McLean, VA: Mitre.

Pirolli, P. and Russell, D. M. (2011). Introduction to This Special Issue on Sensemaking. Human-Computer Interaction, 26(1–2) (March): 1–8.

Rutner, J. and Schonfeld, R. C. (2012). Supporting the Changing Research Practices of Historians. Final report from ITHAKA S+R.