OA Database "Adjective-Adverb Interfaces in Romance"
Disciplines
Other Humanities (40%); Linguistics and Literature (60%)
Keywords
Corpus Linguistics, Long-term Preservation, Interactive Research Infrastructures, Adjective and adverb, Romance languages
The project Open Access Database "Adjective-Adverb Interfaces in Romance" will be carried out by the research group The Interfaces of Adjective and Adverb in Romance at the University of Graz. Since 2002, this research group has published approximately 40 articles and books on the topic. All publications have consistently been based on corpora; three of these corpora are already available in open access via a graphical user interface, the first since 2005. The project thus stands in the tradition of a corpus-based linguistics that, from its very beginnings, has aimed to guarantee the scientific principle that analyses can be verified by the scientific community, which requires free access to all data. Cooperation with the Centre for Information Modelling (Austrian Centre for Digital Humanities, ZIM-ACDH) will make a major qualitative step forward possible, insofar as the tagging of the data can be updated or newly implemented according to the state of the art in linguistics (responsible: Martin Hummel) and information technology (responsible: Johannes Stigler). For this purpose, the two cooperation partners have already prepared two fully tagged pilot corpora, which have been used for first publications. The project will additionally provide open access to the underlying research data in XML/TEI P5 format and offer further conversion into the standardized formats proposed by CLARIN ERIC, in order to ensure optimal reusability for the scientific community. The project will apply these principles to several databases, most of which already exist, including the data of international partners such as Adam Ledgeway (Cambridge) and Adrian Chircu (Cluj-Napoca) on the dialects of Italy and Old Romanian. In this way, a common platform for databases will be developed which, by the end of the project, will cover major parts of the Romance languages, and still more in the long run.
This means that the traditional methodology of historical-comparative linguistics will receive a modern basis, which the research group will use in the future, e.g. to promote the careers of young researchers (already planned: one habilitation thesis, one dissertation), and which will hopefully be used by other linguists as well.
The "Open-Access Database: Adjective-Adverb Interfaces in Romance" (2017-2020) was a cooperation project between the Department of Romance Studies (https://romanistik.uni-graz.at) and the Austrian Centre for Digital Humanities (https://informationsmodellierung.uni-graz.at). The project created a series of corpora or added them to a joint database that provides common research tools for all sub-corpora: "Open-Access Database: Adjective-Adverb Interfaces in Romance" (http://gams.uni-graz.at/context:aaif). In total, more than 35,000 historical and present-day attestations of adjectives with adverbial functions in several languages (French, Spanish, Portuguese, southern Italian varieties, Romanian and Latin) have been documented and tagged with the same categories and tools. This includes cases such as Fr. voler haut 'to fly high' or aller droit 'to go direct(ly)', as well as prepositional phrases such as Pt. de novo 'again, newly' or Sp. a ciegas 'blindly'. The project has updated or developed an annotation model that includes categories related to word formation, syntax, morphology and semantics. Each attestation has been manually tagged according to seven different categories and assigned to a lemma. For cross-linguistic purposes, the project has cooperated with international partners, especially for Italian varieties, Portuguese, Romanian and Latin. In addition, three previously existing databases on French and Spanish have been updated and thus preserved sustainably for the future. All sub-corpora have been tagged according to the same categorization and annotation principles. They allow for cross-linguistic parsing, e.g. for several Romance languages at the same time. The research data can be accessed openly and sustainably via the GAMS system, which provides long-term storage of digital data at the University of Graz. The annotated sub-corpora are freely available and downloadable in reusable formats (XML/TEI, XML/RDF or Excel).
The annotation model has been conceived as an ontology for the Web of Data and thus constitutes the conceptual foundation of the technical solutions applied in the search mask. In future, it will allow the highly structured data to be linked with other data sources. The website of the ORD database includes a clear and transparent description of the metadata, the sources and the annotation tools. The description of each corpus further contains a list of the lemmata processed in it and the number of attestations. The database is open to the addition of new sub-corpora in the future, and the annotation model is freely available for this purpose. Owing to its cross-linguistic conception from the outset, the database provides a useful tool for further research within the long-term project cluster "The Adjective-Adverb Interfaces in Romance" (https://adjective-adverb.uni-graz.at/en/). Several publications have already benefited from the data. The international research community's recent increase in interest in the domain of prepositional adverbials adds further, and certainly crucial, value to the database.
- Universität Graz - 100%
Research Output
- 6 Citations
- 8 Publications
- 1 Methods & Materials
- 2 Datasets & models
- 2 Scientific Awards
- 3 Fundings
-
2021
Title Annotation Model: Criteria for Linguistic Categorization in the Database "Adjective-Adverb Interfaces in Romance" DOI 10.5281/zenodo.4030345 Type Other Author Gerhalter K Link Publication -
2017
Title La structure “verbe + adjectif” DOI 10.1075/rro.16021.hum Type Journal Article Author Hummel M Journal Revue Romane Pages 261-296 Link Publication -
2018
Title Baseline elaboration and echo-sounding at the adjective adverb interface DOI 10.1515/cog-2016-0033 Type Journal Article Author Hummel M Journal Cognitive Linguistics Pages 407-452 Link Publication -
2018
Title Semantic Annotation in the Project "Open Access Database 'Adjective-Adverb Interfaces' in Romance" Type Conference Proceeding Abstract Author Pollin C Conference Workshop on Annotation in Digital Humanities, CEUR Workshop Proceedings Pages 41-46 Link Publication -
2020
Title An Annotated Corpus of Adjective-Adverb Interfaces in Romance Languages Type Conference Proceeding Abstract Author Gerhalter K Conference 12th Language Resources and Evaluation Conference (LREC 2020, Marseille) Pages 946-950 Link Publication -
2020
Title Annotation Model: Criteria for Linguistic Categorization in the Database "Adjective-Adverb Interfaces in Romance" Type Other Author Gerhalter K Pages 1-23 Link Publication -
2020
Title ADVERBIALS WITH ADJECTIVAL BASIS IN BRAZILIAN PORTUGUESE AND THEIR FREQUENCY IN SPOKEN AND WRITTEN LANGUAGE DOI 10.24193/subbphilo.2020.4.12 Type Journal Article Author Gerhalter K Journal Studia Universitatis Babeș-Bolyai Philologia Pages 189-208 Link Publication -
2018
Title Compilation and Annotation of Adjective-Adverb Interfaces in Romance. Towards a multilingual Open Access Corpus DOI 10.15366/chimera2018.5.2.009 Type Journal Article Author Gerhalter K Journal CHIMERA: Romance Corpora and Linguistic Studies Pages 305-311 Link Publication
-
2020
Title Keynote Talk at the international conference "The problem of the adverb" Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2018
Title Poster Award at the DHA2018 Conference Type Poster/abstract prize Level of Recognition National (any country)
-
2020
Title Spring School LiSeH 2020 "Linked Data and the Semantic Web for Humanities research", CLARIAH-AT Funding Call 2020 (participation of project members Gerlinde Schneider and Christopher Pollin) Type Research grant (including intramural programme) Start of Funding 2020 -
2019
Title Training Digital Scholars: Knowledge Exchange between V4 and Austria Visegrad Grant, Project ID #21820079 (Project Lead: Gabor Palko, ELTE Budapest), participant at the University of Graz: project member Gerlinde Schneider Type Travel/small personal Start of Funding 2019 -
2018
Title Erasmus+ Staff Mobility for Training (participation of project members Christopher Pollin and Gerlinde Schneider at the "Summer School in Digital Humanities and IT", Borovets, Bulgaria, August 2018) Type Travel/small personal Start of Funding 2018