Freshly Printed - allow 7 days lead
Couldn't load pickup availability
Improvements in Speech Synthesis
Cost 258: The Naturalness of Synthetic Speech
E. Keller (Edited by), E Keller (Author), G. Bailly (Edited by), A. Monaghan (Edited by), J. Terken (Edited by), M. Huckvale (Edited by)
9780471499855, Wiley
Hardback, published 12 October 2001
416 pages
25 x 17.2 x 2.8 cm, 0.822 kg
Naturalness in synthetic speech is one of the most intractable problems in information technology today. Although speech synthesis systems have improved considerably over the last 20 years, they rarely sound entirely like human speakers. Why is this so, and what can be done about it? Improvements in Speech Synthesis presents the latest research in the above areas. Contributors include speech synthesis specialists from 16 countries, with experience in the development of systems for 12 European languages. This volume emerges from a four-year European COST project focused on "The Naturalness of Synthetic Speech", and will be a valuable text for everyone involved in speech synthesis.
List of Contributors ix Preface xiii Part I Issues in Signal Generation 1 1 Towards Greater Naturalness: Future Directions of Research in Speech Synthesis 3 2 Towards More Versatile Signal Generation Systems 18 3 A Parametric Harmonic ‡ Noise Model 22 4 The COST 258 Signal Generation Test Array 39 5 Concatenative Text-to-Speech Synthesis Based on Sinusoidal Modelling 52 6 Shape Invariant Pitch and Time-Scale Modification of Speech Based on a Harmonic Model 64 7 Concatenative Speech Synthesis Using SRELP 76 Part II Issues in Prosody 87 8 Prosody in Synthetic Speech: Problems, Solutions and Challenges 89 9 State-of-the-Art Summary of European Synthetic Prosody R&D 93 10 Modelling FO in Various Romance Languages: Implementation in Some TTS Systems 104 11 Acoustic Characterisation of the Tonic Syllable in Portuguese 120 12 Prosodic Parameters of Synthetic Czech: Developing Rules for Duration and Intensity 129 13 MFGI, a Linguistically Motivated Quantitative Model of German Prosody 134 14 Improvements in Modelling the FO Contour for Different Types of Intonation Units in Slovene 144 15 Representing Speech Rhythm 154 16 Phonetic and Timing Considerations in a Swiss High German TTS System 165 17 Corpus-based Development of Prosodic Models Across Six Languages 176 18 Vowel Reduction in German Read Speech 186 Part III Issues in Styles of Speech 197 19 Variability and Speaking Styles in Speech Synthesis 199 20 An Auditory Analysis of the Prosody of Fast and Slow Speech Styles in English, Dutch and German 204 21 Automatic Prosody Modelling of Galician and its Application to Spanish 218 22 Reduction and Assimilatory Processes in Conversational French Speech: Implications for Speech Synthesis 228 23 Acoustic Patterns of Emotions 237 24 The Role of Pitch and Tempo in Spanish Emotional Speech: Towards Concatenative Synthesis 246 25 Voice Quality and the Synthesis of Affect 252 26 Prosodic Parameters of a 'Fun' Speaking Style 264 27 Dynamics of the Glottal Source Signal: Implications for Naturalness in Speech Synthesis 273 28 A Nonlinear Rhythmic Component in Various Styles of Speech 284 Part IV Issues in Segmentation and Mark-up 293 29 Issues in Segmentation and Mark-up 295 30 The Use and Potential of Extensible Mark-up (XML) in Speech Generation 297 31 Mark-up for Speech Synthesis: A Review and Some Suggestions 307 32 Automatic Analysis of Prosody for Multi-lingual Speech Corpora 320 33 Automatic Speech Segmentation Based on Alignment with a Text-to-Speech System 328 34 Using the COST 249 Reference Speech Recogniser for Automatic Speech Segmentation 339 Part V Future Challenges 349 35 Future Challenges 351 36 Towards Naturalness, or the Challenge of Subjectiveness 353 37 Synthesis Within Multi-Modal Systems 363 38 A Multi-Modal Speech Synthesis Tool Applied to Audio-Visual Prosody 372 39 Interface Design for Speech Synthesis Systems 383 Index 391
Eric Keller
Gerard Bailly
Gerard Bailly
Gerard Bailly
Eduardo Rodiguez Banga, Carmen Garcia Mateo and Xavier Fernandez Salgado
Darragh O'Brien and Alex Monaghan
Erhard Rank
Alex Monaghan
Alex Monaghan
Philippe Martin
Joao Paulo Ramos Teixeira and Diamantino R.S. Freitas
Marie Dohalska, Jana Mejvaldova and Tomas Dubeda
Hansjorg Mixdorff
Ales Dobnikar
Brigitte Zellner Keller and Eric Keller
Beat Siebenhaar, Brigitte Zellner Keller and Eric Keller
Justin Fackrell, Halewijn Vereecken, Cynthia Grover, Jean-Pierre Martens and Bert Van Coile
Christina Widera
Jacques Terken
Alex Monaghan
Eduardo Lopez Gonzalo, Juan M. Villar Navarro and Luis A. Hernandez Gomez
Danielle Duez
Branka Zei Pollermann and Marc Archinard
Juan Manuel Montero Martinez, Juana M. Gutierrez Arriola, Ricardo de Cordoba Herralde, Emilia Victoria Enriquez Carrasco and Jose Manuel Pardo Munoz
Ailbhe Ni Chasaide and Christer Gobl
Kjell Gustafson and David House
Christer Gobl and Ailbhe Ni Chasaide
Brigitte Zellner Keller and Eric Keller
Mark Huckvale
Mark Huckvale
Alex Monaghan
Daniel Hirst
Petr Horak
Narada D. Warakagoda and Jon E. Natvig
Eric Keller
Genevieve Caelen-Haumont
Andrew Breen
Jonas Beskow, Bjorn Granstrom and David House
Gudrun Flach
Subject Areas: Electronics & communications engineering [TJ]
