Notice: Function _load_textdomain_just_in_time was called incorrectly. Translation loading for the updraftplus domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /var/www/lmi_wordpress/wp-includes/functions.php on line 6114

Notice: Function _load_textdomain_just_in_time was called incorrectly. Translation loading for the polylang domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /var/www/lmi_wordpress/wp-includes/functions.php on line 6114
Speech Technologies – Laboratory for Machine Intelligence

Speech Technologies

Study Program: Electrical Engineering, 2nd Bologna Cycle
Semester: winter semester
Credits: 6

Lecturer: Assoc. Prof. Simon Dobrišek, PhD
Assistant: Asst. Marija Ivanovska, MSc


Course aims

The aim of this course is to acquaint students with the field of speech technologies and introduce various algorithms, techniques, and methods to accomplish tasks related to this field.

Content

Introduction: description of the field, short outline of the historical development of speech technologies. Importance and developments of speech technologies applications for Slovenian language.

Basic characteristics production and auditory perception in human speech communication. Representation of speech patterns.

Speech processing: acquisition and preprocessing, speech features, speech signal segmentation, speech databases.

Speech recognition systems: speaker recognition and verification, isolated word and continuous speech recognition, spontaneous speech recognition. Statistical acoustic and language modeling, semantic speech analysis.

Artificial speech: systems for speech synthesis in general, grapheme-to-phoneme conversion, prosody modeling, speech-synthesis procedures. Assessment of speech synthesis systems.

Dialogue: automated dialogue systems in general, system configurations, dialogue management, knowledge representations, multimodality, assessment of dialogue systems

Literature

  • Mihelič F., Signali, Založba FE in FRI, Ljubljana, 2014
  • Pavešić N., Razpoznavanje vzorcev: uvod v analizo in razumevanje vidnih in slušnih vzorcev, 3. Popravljena in dopolnjena izdaja, Založba FE in FRI, Ljubljana, 2012
  • Rabiner L., Schafer R., Theory and Applications of Digital Speech Processing, Prentince Hall, 1. Ed., 2010

Prerequisites

Basic knowledge of applied mathematics (vectors and matrices, eigenvectors and eigenvalues, some linear algebra, multivariate analysis, probability theory, and statistics).