Speech Technologies

Study Program: Electrical Engineering, 2nd Bologna Cycle
Semester: winter semester
Credits: 6

Lecturer: Assoc. Prof. Simon Dobrišek, PhD
Assistant: Asst. Marija Ivanovska, MSc


Course aims

The aim of this course is to acquaint students with the field of speech technologies and introduce various algorithms, techniques, and methods to accomplish tasks related to this field.

Content

Introduction: description of the field, short outline of the historical development of speech technologies. Importance and development of speech technology applications for the Slovenian language.

Basic characteristics of speech production and auditory perception in human speech communication. Representation of speech patterns.

Speech processing: acquisition and preprocessing, speech features, speech signal segmentation, speech databases.

Speech recognition systems: speaker recognition and verification, isolated word and continuous speech recognition, spontaneous speech recognition. Statistical acoustic and language modeling, semantic speech analysis.

Artificial speech: systems for speech synthesis in general, grapheme-to-phoneme conversion, prosody modeling, speech-synthesis procedures. Assessment of speech synthesis systems.

Dialogue: automated dialogue systems in general, system configurations, dialogue management, knowledge representations, multimodality, assessment of dialogue systems.

Literature

  • Mihelič F., Signali, Založba FE in FRI, Ljubljana, 2014
  • Pavešić N., Razpoznavanje vzorcev: uvod v analizo in razumevanje vidnih in slušnih vzorcev, 3rd revised and expanded ed., Založba FE in FRI, Ljubljana, 2012
  • Rabiner L., Schafer R., Theory and Applications of Digital Speech Processing, 1st ed., Prentice Hall, 2010

Prerequisites

Basic knowledge of applied mathematics (vectors and matrices, eigenvectors and eigenvalues, some linear algebra, multivariate analysis, probability theory, and statistics).