• Skip to content (access key 1)
  • Skip to search (access key 7)
FWF — Austrian Science Fund
  • Go to overview page Discover

    • Research Radar
      • Research Radar Archives 1974–1994
    • Discoveries
      • Emmanuelle Charpentier
      • Adrian Constantin
      • Monika Henzinger
      • Ferenc Krausz
      • Wolfgang Lutz
      • Walter Pohl
      • Christa Schleper
      • Elly Tanaka
      • Anton Zeilinger
    • Impact Stories
      • Verena Gassner
      • Wolfgang Lechner
      • Birgit Mitter
      • Oliver Spadiut
      • Georg Winter
    • scilog Magazine
    • Austrian Science Awards
      • FWF Wittgenstein Awards
      • FWF ASTRA Awards
      • FWF START Awards
      • Award Ceremony
    • excellent=austria
      • Clusters of Excellence
      • Emerging Fields
    • In the Spotlight
      • 40 Years of Erwin Schrödinger Fellowships
      • Quantum Austria
    • Dialogs and Talks
      • think.beyond Summit
    • Knowledge Transfer Events
    • E-Book Library
  • Go to overview page Funding

    • Portfolio
      • excellent=austria
        • Clusters of Excellence
        • Emerging Fields
      • Projects
        • Principal Investigator Projects
        • Principal Investigator Projects International
        • Clinical Research
        • 1000 Ideas
        • Arts-Based Research
        • FWF Wittgenstein Award
      • Careers
        • ESPRIT
        • FWF ASTRA Awards
        • Erwin Schrödinger
        • doc.funds
        • doc.funds.connect
      • Collaborations
        • Specialized Research Groups
        • Special Research Areas
        • Research Groups
        • International – Multilateral Initiatives
        • #ConnectingMinds
      • Communication
        • Top Citizen Science
        • Science Communication
        • Book Publications
        • Digital Publications
        • Open-Access Block Grant
      • Subject-Specific Funding
        • AI Mission Austria
        • Belmont Forum
        • ERA-NET HERA
        • ERA-NET NORFACE
        • ERA-NET QuantERA
        • Alternative Methods to Animal Testing
        • European Partnership BE READY
        • European Partnership Biodiversa+
        • European Partnership BrainHealth
        • European Partnership ERA4Health
        • European Partnership ERDERA
        • European Partnership EUPAHW
        • European Partnership FutureFoodS
        • European Partnership OHAMR
        • European Partnership PerMed
        • European Partnership Water4All
        • Gottfried and Vera Weiss Award
        • LUKE – Ukraine
        • netidee SCIENCE
        • Herzfelder Foundation Projects
        • Quantum Austria
        • Rückenwind Funding Bonus
        • WE&ME Award
        • Zero Emissions Award
      • International Collaborations
        • Belgium/Flanders
        • Germany
        • France
        • Italy/South Tyrol
        • Japan
        • Korea
        • Luxembourg
        • Poland
        • Switzerland
        • Slovenia
        • Taiwan
        • Tyrol–South Tyrol–Trentino
        • Czech Republic
        • Hungary
    • Step by Step
      • Find Funding
      • Submitting Your Application
      • International Peer Review
      • Funding Decisions
      • Carrying out Your Project
      • Closing Your Project
      • Further Information
        • Integrity and Ethics
        • Inclusion
        • Applying from Abroad
        • Personnel Costs
        • PROFI
        • Final Project Reports
        • Final Project Report Survey
    • FAQ
      • Project Phase PROFI
      • Project Phase Ad Personam
      • Expiring Programs
        • Elise Richter and Elise Richter PEEK
        • FWF START Awards
  • Go to overview page About Us

    • Mission Statement
    • FWF Video
    • Values
    • Facts and Figures
    • Annual Report
    • What We Do
      • Research Funding
        • Matching Funds Initiative
      • International Collaborations
      • Studies and Publications
      • Equal Opportunities and Diversity
        • Objectives and Principles
        • Measures
        • Creating Awareness of Bias in the Review Process
        • Terms and Definitions
        • Your Career in Cutting-Edge Research
      • Open Science
        • Open-Access Policy
          • Open-Access Policy for Peer-Reviewed Publications
          • Open-Access Policy for Peer-Reviewed Book Publications
          • Open-Access Policy for Research Data
        • Research Data Management
        • Citizen Science
        • Open Science Infrastructures
        • Open Science Funding
      • Evaluations and Quality Assurance
      • Academic Integrity
      • Science Communication
      • Philanthropy
      • Sustainability
    • History
    • Legal Basis
    • Organization
      • Executive Bodies
        • Executive Board
        • Supervisory Board
        • Assembly of Delegates
        • Scientific Board
        • Juries
      • FWF Office
    • Jobs at FWF
  • Go to overview page News

    • News
    • Press
      • Logos
    • Calendar
      • Post an Event
      • FWF Informational Events
    • Job Openings
      • Enter Job Opening
    • Newsletter
  • Discovering
    what
    matters.

    FWF-Newsletter Press-Newsletter Calendar-Newsletter Job-Newsletter scilog-Newsletter

    SOCIAL MEDIA

    • LinkedIn, external URL, opens in a new window
    • , external URL, opens in a new window
    • Facebook, external URL, opens in a new window
    • Instagram, external URL, opens in a new window
    • YouTube, external URL, opens in a new window

    SCILOG

    • Scilog — The science magazine of the Austrian Science Fund (FWF)
  • elane login, external URL, opens in a new window
  • Scilog external URL, opens in a new window
  • de Wechsle zu Deutsch

  

Cross-layer prosodic models for conversational speech

Cross-layer prosodic models for conversational speech

Barbara Schuppler (ORCID: 0000-0003-4009-0832)
  • Grant DOI 10.55776/V638
  • Funding program Elise Richter
  • Status ended
  • Start October 1, 2018
  • End November 30, 2021
  • Funding amount € 271,184
  • Project website

Disciplines

Computer Sciences (20%); Linguistics and Literature (80%)

Keywords

    Conversational Speech, Prosodic Models, Automatic Speech Recognition, Austrian German, Pronunciation Variation, Machine Learning

Abstract Final report

Automatic speech recognition (ASR) systems were originally designed to cope with carefully pronounced speech. Most real world applications of ASR systems, however, require the recognition of spontaneous, conversational speech (e.g., dialogue systems, voice input aids for physically disabled, medical dictation systems, etc.). Compared to prepared speech, conversational speech contains utterances that might be considered `ungrammatical` and contain disfluencies such as ...oh, well, I think ahm exactly . Moreover, in spontaneous conversation, a word like yesterday may sound like yeshay and the German word haben (to have) may sound like ham. The pronunciation of the words depends on well-known factors, for instance on the regional background of the speakers and the formality of the situation. Highly influential, but not so well studied factors are those reflecting the prosodic characteristics of the word in the utterance. These prosodic characteristics describe the rhythm and melody of a sentence, and for instance whether a word is accented or not. The proposed project aims at investigating which role prosody plays for pronunciation variation from a linguistic point of view and at incorporating gained knowledge into an ASR system. In our investigations, we will use speech material from German and Austrian speakers. In contrast to most research in the field of prosody which used read sentences or prepared speech, we will annotate and analyze speech from free conversations between speakers who know each other well. Such speech material is not only more naturalistic, but also richer in pronunciation variation. In sum, our project will deliver the first prosodically annotated database for conversational Austrian German, automatic tools for the creation of prosodic annotations and a prosody-dependent ASR system for conversational speech from German and Austrian German speakers.

Cross-layer prosodic models for conversational speech Automatic speech recognition (ASR) systems were originally designed to cope with carefully pronounced speech. Most real world applications of ASR systems, however, require the recognition of spontaneous, conversational speech (e.g., dialogue systems, voice input aids for physically disabled, medical dictation systems, etc.). Compared to prepared speech, conversational speech contains utterances that might be considered 'ungrammatical' and contain disfluencies such as "...oh, well, I think ahm exactly ". Moreover, in spontaneous conversation, a word like "yesterday" may sound like "yeshay" and the German word "haben" ("to have") may sound like "ham". The pronunciation of the words depends on well-known factors, for instance on the regional background of the speakers and the formality of the situation. Highly influential, but not so well studied factors are those reflecting the prosodic characteristics of the word in the utterance. These prosodic characteristics describe the rhythm and melody of a sentence, and for instance whether a word is accented or not. This Elise Richter project investigating which role prosody plays for pronunciation variation from a linguistic point of view and investigated statistical methods for its quantitative analyis. In our investigations, we used speech material from German and Austrian speakers. In contrast to most research in the field of prosody which used read sentences or prepared speech, we annotated and analyzed speech from casual, free conversations between speakers who know each other well. Such speech material is not only more naturalistic, but also richer in pronunciation variation, coming with the challenge of more variation and the requirement of more complex statistic techniques. One of our main findings was that from a speech-melody, rhythm point of view, German and Austrian German conversations show a more similar pattern than Austrian read and Austrian conversational speech, leading us to the conclusion that with respect to prosodic phrasing, speaking style is more relevant than the regional background of the speakers. One main deliverable of the project was the first prosodically annotated database for conversational Austrian German, along with automatic tools for the creation of prosodic annotations. These will continued to be used to develop an automatic speech recognition system for conversational speech from German and Austrian German speakers. What is more, the speech database is already in use by linguists and speech technologists by national and international academic research institutions

Research institution(s)
  • Technische Universität Graz - 100%
International project participants
  • Margaret Zellers, University of Stockholm - Sweden
  • Philip Garner, Idiap Research Institute - Switzerland

Research Output

  • 7 Citations
  • 10 Publications
  • 1 Policies
  • 2 Methods & Materials
  • 1 Disseminations
  • 3 Scientific Awards
Publications
  • 2024
    Title The prosody of theme, rheme and focus in Egyptian Arabic: A quantitative investigation of tunes, configurations and speaker variability
    DOI 10.1016/j.specom.2024.103082
    Type Journal Article
    Author El Zarka D
    Journal Speech Communication
  • 2024
    Title An introduction to pluricentric languages in speech science and technology
    DOI 10.1016/j.specom.2023.103007
    Type Journal Article
    Author Adda-Decker M
    Journal Speech Communication
  • 2020
    Title Towards building a cross-lingual speech recognition system for Slovenian and Austrian German,
    Type Journal Article
    Author A. Žgank
    Journal The Phonetician
    Link Publication
  • 2020
    Title An analysis of prosodic boundary detection in German and Austrian German read speech,
    Type Conference Proceeding Abstract
    Author Ludusan B.
    Conference Speeh Prosody
    Pages 990-994
    Link Publication
  • 2019
    Title Prosodic Effects on Plosive Duration in German and Austrian German
    DOI 10.21437/interspeech.2019-2197
    Type Conference Proceeding Abstract
    Author Schuppler B
    Pages 1736-1740
  • 2019
    Title Acoustic Cues to Topic and Narrow Focus in Egyptian Arabic
    DOI 10.21437/interspeech.2019-1189
    Type Conference Proceeding Abstract
    Author Zarka D
    Pages 1771-1775
  • 2019
    Title Automatic detection of prosodic boundaries in two varieties of German
    Type Conference Proceeding Abstract
    Author Ludusan B.
    Conference Interspeech 2019 Satellite Workshop on 'Pluricentric Languages in Speech Technology'
    Link Publication
  • 2020
    Title An analysis of prosodic boundary detection in German and Austrian German read speech
    DOI 10.21437/speechprosody.2020-202
    Type Conference Proceeding Abstract
    Author Ludusan B
    Pages 990-994
  • 2020
    Title Towards automatic annotation of prosodic prominence levels in Austrian German
    DOI 10.21437/speechprosody.2020-204
    Type Conference Proceeding Abstract
    Author Linke J
    Pages 1000-1004
  • 2020
    Title Microprosodic Variability in Plosives in German and Austrian German
    DOI 10.21437/interspeech.2020-2353
    Type Conference Proceeding Abstract
    Author Zellers M
    Pages 656-660
Policies
  • 2021
    Title ELRC
    Type Membership of a guideline committee
Methods & Materials
  • 2019
    Title GRASS corpus
    Type Improvements to research infrastructure
    Public Access
  • 0
    Title Prosodic Boundary Annotation Tool
    Type Improvements to research infrastructure
    Public Access
Disseminations
  • 2019
    Title Radio interview
    Type A press release, press conference or response to a media enquiry/interview
Scientific Awards
  • 2019
    Title Speech Communication Editor
    Type Appointed as the editor/advisor to a journal or book series
    Level of Recognition Continental/International
  • 2018
    Title Keynote speech
    Type Personally asked as a key note speaker to a conference
    Level of Recognition National (any country)
  • 2020
    Title Guest Professor
    Type Attracted visiting staff or user to your research group
    Level of Recognition National (any country)

Discovering
what
matters.

Newsletter

FWF-Newsletter Press-Newsletter Calendar-Newsletter Job-Newsletter scilog-Newsletter

Contact

Austrian Science Fund (FWF)
Georg-Coch-Platz 2
(Entrance Wiesingerstraße 4)
1010 Vienna

office(at)fwf.ac.at
+43 1 505 67 40

General information

  • Job Openings
  • Jobs at FWF
  • Press
  • Philanthropy
  • scilog
  • FWF Office
  • Social Media Directory
  • LinkedIn, external URL, opens in a new window
  • , external URL, opens in a new window
  • Facebook, external URL, opens in a new window
  • Instagram, external URL, opens in a new window
  • YouTube, external URL, opens in a new window
  • Cookies
  • Whistleblowing/Complaints Management
  • Accessibility Statement
  • Data Protection
  • Acknowledgements
  • IFG-Form
  • Social Media Directory
  • © Österreichischer Wissenschaftsfonds FWF
© Österreichischer Wissenschaftsfonds FWF