Perceptions of the Other in Travelogues 1500-1875 - A Computerized Analysis
Perceptions of the Other in Travelogues 1500-1875 - A Computerized Analysis
DACH: Österreich - Deutschland - Schweiz
Disciplines
History, Archaeology (40%); Computer Sciences (40%); Media and Communication Sciences (20%)
Keywords
-
Digital Humanities,
History,
Computer Science,
Travelogues,
Text-Mining/Topic-Modelling,
Perceptions of the Other and the Orient
Travelogues are a highly important source for historical research. They provide information about regions, cultures and religions but also enable insights into the authors cultural self-conceptions. However, the backgrounds of travelers, the aim of their journeys as well as the intended audiences vary considerably. This makes this source rather difficult to engage with. As a consequence scientific interest tends to concentrate on single texts. Austrian and German researchers from History, Computer Science and Book History cooperate to analyze a digitized collection of several thousand travelogues, covering the period from 1500 until 1875. In close collaboration, the research team will develop novel digital techniques. These include the automatic detection of travelogues within the complete historical holdings of the Austrian National Library (ca. 600,000 volumes, which are currently digitized in the Austrian Books Online Project). Furthermore, the automatic identification of passages of text that relate to otherness and the creation of a database containing the identified travelogues and associated data, such as the name, date and place of birth of the author. The developed tools will be designed and made available to allow researchers from other fields to analyze similar questions based on other documents. Therefore, this project will lay the groundwork for further research and provide the base for follow-up projects. The developed digital methods provide the tools to seek for answers to the following sets of guiding questions: What are the characteristics of travelogues? What is specific about them? How can they be automatically detected? How did authors express otherness in travelogues? Did it change over time, e.g. by switching from fear to mockery, or from antipathy to fascination? How was the Orient depicted in travelogues about this region? How do these relate to the general discussion of the Orient and its inhabitants? Was there any change over time? We specifically focus on how otherness was expressed in travelogues about the Orient and how it changed over time. This is not a purely historical issue. Every one of us is constantly confronted with otherness due to processes such as global mass tourism, the internationalization of consumer culture, transnational migration and globalization. By critically engaging with the historic dimension of otherness within travelogues, it is possible to raise more awareness and it will help to generate historic contextual knowledge and orientation for the present and future.
This interdisciplinary project focused on German language travelogues in the collections of the Austrian National Library (1500 to 1876). To analyze perceptions of "the other" and "the Orient" (Ottoman Empire, Persia) in a large-scale text corpus, algorithms for the semi-automatized search for, and evaluation of texts were being created. Scientists from the Institute for Habsburg and Balkan Studies (IHB) at the Austrian Academy of Sciences, the Austrian Institute of Technology (AIT), the Department of History of the University of Salzburg, the Austrian National Library (ÖNB), and the research center L3S at the University of Hannover took part. Results: 1. The project team identified almost 5,000 travelogues in the digital and analogous holdings of the ÖNB. Based on title and keyword searches, the team semi-automatically created sets of likely travelogues that were sorted out manually. This data served as "ground truth" for algorithms for the automated recognition of travelogues, which worked out successfully. By applying these algorithms, the team identified further travelogues, many of which had previously been unknown to the research community. The method is absolutely new and proved the usability of mashine learning in the identification of specific types of texts, even if the conversion of the text into machine-encoded text (OCR) is of poor quality. For this, the project was rewarded with the Lee Dirks Award for Best Full Paper. In the meantime, other projects already tried and applied this method. 2. As another result the project secured a large amount of European cultural heritage since it created the largest corpus of (German language) travelogues ever. The largest part of the sources is now digitally accessible via a search marker in the online library catalog of the ÖNB. 3. The metadata, that have been checked intellectually and have been created, improves the visibility, and explorational potentials regarding the text corpus for the humanities and the computer sciences. 5. We successfully identified identical text passages ("text reuse") of the travelogues on the "Orient" by applying BLAST (Basic Local Alignment Search Tool), an algorithm originally established in the field of bioinformatics for fast DNA sequence alignment. The results are promising, but show, however, that the OCR quality is decisive and that ways to balance out different spellings, synonyms and verbal variations of the same and similar content need to be tackled in future research on the topic. 6. The semi-automated identification of perceptions of "otherness" proved to be currently not as feasible as hoped (reasons: heterogeneity of the corpus; great need for "ground truth" for machine learning; insufficient OCR quality). However, related questions have been addressed using classical humanities methods as part of a collected volume, that will be published in 2022.
- Bernhard Haslhofer, Austrian Institute of Technology - AIT , associated research partner
- Maximilian Kaiser, Österreichische Nationalbibliothek , associated research partner
- Markus Brantl, Bayerische Staatsbibliothek München - Germany
- Patrick Sahle, Bergische Universität Wuppertal - Germany
- Wolfgang Nejdl, Leibniz Universität Hannover - Germany
- Manfred Thaller, Universität Köln - Germany
- Hans-Christof Kraus, Universität Passau - Germany
- Malte Rehbein, Universität Passau - Germany
- Merrick Lex Berman, Harvard University - USA
- Tom Elliott, New York University - USA
- Katherine Weimer, Rice University Houston - USA
- Carl Lagoze, University of Michigan - USA
- Elton Barker, The Open University
- Jane Winters, University of London
Research Output
- 22 Citations
- 14 Publications
- 2 Methods & Materials
- 1 Disseminations
- 1 Scientific Awards
- 1 Fundings
-
2024
Title Intertextualität; In: Digital Humanities in den Geschichtswissenschaften Type Book Chapter Author Gruber D Publisher Böhlau Pages 18 -
2021
Title Neural OCR Post-Hoc Correction of Historical Corpora DOI 10.48550/arxiv.2102.00583 Type Preprint Author Lyu L -
2021
Title Neural OCR Post-Hoc Correction of Historical Corpora DOI 10.1162/tacl_a_00379 Type Journal Article Author Lyu L Journal Transactions of the Association for Computational Linguistics Pages 479-493 Link Publication -
2018
Title Traveling through Space and Time, or: Making Historical Travelogues Accessible; In: Proceedings of the 18th European Networked Knowledge Organization Systems (NKOS) Workshop, co-located with the 22nd International Conference on Theory and Practice of Digital Libraries 2018 Type Book Chapter Author Rörden Link Publication -
2021
Title Neural OCR Post-Hoc Correction of Historical Corpora DOI 10.15488/15073 Type Other Author Koutraki M Link Publication -
2021
Title Knowledge in Transition DOI 10.7767/9783205214069.163 Type Book Chapter Author Gruber D Publisher Brill Osterreich Pages 163-178 -
2020
Title Identifying Historical Travelogues in Large Text Corpora Using Machine Learning DOI 10.48550/arxiv.2001.01673 Type Preprint Author Rörden J -
2020
Title On the Way into the Unknown? Comparative Perspectives on the "Orient" in (Early) Modern Travelogues Type Other Author Lisa Brunner Link Publication -
2020
Title Identifying Historical Travelogues in Large Text Corpora Using Machine Learning DOI 10.1007/978-3-030-43687-2_67 Type Book Chapter Author Rörden J Publisher Springer Nature Pages 801-815 -
2020
Title TRAVELOGUES: FREMDWAHRNEHMUNGEN IN REISEBERICHTEN 1500–1876 DOI 10.1553/dha-proceedings2018s62 Type Conference Proceeding Abstract Author Gruber D Pages 62-66 Link Publication -
2022
Title Europeans Encounter the World in Travelogues: 1450-1900 Type Journal Article Author Doris Gruber Journal Europäische Geschichte Online -
2019
Title Japan and the Ottoman Empire in the Eye of the European Beholder. A Comparison Type Other Author Doris Gruber Link Publication -
2019
Title Travelogues – Perceptions of the Other 1500–1876. A Computerized Analysis DOI 10.7767/9783205209096.129 Type Book Chapter Author Gruber D Publisher Brill Osterreich Pages 129-132 Link Publication -
2020
Title Austrian Books Online – Acht Jahre Digitalisierung des historischen Buchbestandes der Österreichischen Nationalbibliothek mit Google DOI 10.1515/bfp-2020-0008 Type Journal Article Author Fritze C Journal Bibliothek Forschung und Praxis Pages 89-99 Link Publication
-
2020
Title Lee Dirks Award Type Research prize Level of Recognition Continental/International
-
2022
Title Scholarship Type Research grant (including intramural programme) Start of Funding 2022 Funder Herzog August Library