Building Synthetic Voices |
Book name:
Building Synthetic VoicesAuthor Name:
Alan W BlackKevin A. Lenzo
Compatibility:
This File is in pdf format. Make sure you have installed adobe reader in your PC.Download:
Download (pdf)
1. Overview of Speech Synthesis...............................................................................1
3. A Practical Speech Synthesis System....................................................................9
11. Diphone databases...............................................................................................95
Table of Contents:
I. Speech Synthesis...............................................................................................................11. Overview of Speech Synthesis...............................................................................1
- History..................................................................................................................1
- Uses of Speech Synthesis..................................................................................3
- General Anatomy of a Synthesizer.................................................................3
3. A Practical Speech Synthesis System....................................................................9
- Basic Use............................................................................................................10
- Utterance structure...........................................................................................12
- Modules.............................................................................................................13
- Utterance access................................................................................................15
- Utterance building............................................................................................18
- Extracting features from utterances...............................................................20
- II. Building Synthetic Voices............................................................................................23
- Hardware/software requirements.................................................................23
- Voice in a new language..................................................................................23
- Voice in an existing language..........................................................................24
- Selecting a speaker...........................................................................................24
- Who owns a voice.............................................................................................25
- Recording under Unix......................................................................................26
- Extracting pitchmarks from waveforms........................................................28
- designing the prompts.....................................................................................35
- customizing the synthesizer front end..........................................................36
- autolabeling issues...........................................................................................37
- unit size and type.............................................................................................37
- using limited domain synthesizers................................................................38
- Telling the time..................................................................................................39
- Making it better.................................................................................................44
- Non-standard words analysis.........................................................................47
- Token to word rules..........................................................................................47
- Number pronunciation....................................................................................51
- Homograph disambiguation...........................................................................52
- TTS modes.........................................................................................................52
- Mark-up modes.................................................................................................52
- Word pronunciations........................................................................................55
- Lexicons and addenda.....................................................................................55
- Out of vocabulary words.................................................................................56
- Building letter-to-sound rules by hand.........................................................57
- Building letter-to-sound rules automatically...............................................58
- Post-lexical rules...............................................................................................62
- Building lexicons for new languages.............................................................63
- Phrasing.............................................................................................................65
- Accent/Boundary Assignment.......................................................................69
- F0 Generation....................................................................................................71
- Duration.............................................................................................................75
- Prosody Research..............................................................................................79
- Prosody Walkthrough......................................................................................80
- Non-Latin-script languages............................................................................91
11. Diphone databases...............................................................................................95
- Diphone introduction.......................................................................................95
0 Post a Comment :
Post a Comment