Speech Processing (SICOM-SIGMA S9) - 5PMSPAR0
A+Augmenter la taille du texteA-Réduire la taille du texteImprimer le documentEnvoyer cette page par mail
Number of hours
- Lectures : 8.0
- Tutorials : 8.0
- Laboratory works : 4.0
- Projects : 0
- Internship : 0
ECTS : 2.0
Goals
This serie of lectures will cover the fundamentals of automatic speech processing including fundamentals of speech production and perception, acoustic phonetics, speech signal analysis and transformation, automatic speech recognition, Text-to-speech synthesis, statistical voice conversion.
Contact Thomas HUEBER
Content This serie of lectures will cover the fundamentals of automatic speech processing:
- Introduction to speech science (speech production/perception, acoustic phonetics)
- Speech signal analysis (STFT, cepstral analysis, pitch detection, voice transformation)
- Automatic speech recognition (template matching, HMM-based approach, neural approaches including LSTM, CTC, and seq2seq+attention model)
- Voice conversion (using Gaussian mixture regression and neural approaches)
- Text-to-speech synthesis (from unit selection to neural TTS)
PrerequisitesBasics of digital signal processing, machine learning
Tests Exam + lab work
1ère session : Examen écrit présentiel
2ème session : Rapport sur miniprojet Python
Additional Information Curriculum->Engineering degree->Semester 9
Curriculum->Double-Diploma Engineer/Master->Semester 9
A+Augmenter la taille du texteA-Réduire la taille du texteImprimer le documentEnvoyer cette page par mail
Date of update January 9, 2017