Electrical & Electronics Engineering
CLSASTs - Rapid Cross-Lingual Speaker Adaptation for Statistical
Text-to-Speech Systems
ABSTRACT
The proposed system is expected to be the first high-performance cross-lingual speaker adaptation method for STS that can
work with 5-10 seconds of adaptation data.
Yrd. Doç. Dr.
Cenk Demiroğlu
DEPARTMENT
Electrical - Electronics
Engineering
CONTACT
[email protected]
FUNDING SCHEME
EC - Marie Curie International
Reintegration Grants
START DATE
01.02.2011
2011 International Grants
Unit selection has been the dominant approach to text-to-speech synthesis (TTS) in the last decade. Recently, statistical
TTS (STS) is proposed where statistical models are used for speech synthesis. The high quality and intelligibility speech it
generates, the flexibility it offers in voice/speaker/emotion conversion, and its small memory requirements make STS systems
a strong candidate to be the dominant TTS technology in the next decade. One of the most exciting research directions in the
STS field is speaker adaptation where the goal is to adapt the voice characteristics to a target s