There are many tags you can use to make the synthesized voice sound more lifelike. Most of them are described in the standards, but some of the details are left to the supplier of the engine. In the following table, I give an overview of the most important tags and how to use them in a way that the Microsoft synthesizer understands:
Element | Description | Attributes |
break | A pause in the spoken text. | strength, time |
emphasis | Makes the enclosed text more prominent. | level |
p | Paragraph. | - |
s | Sentence inside a paragraph. | - |
phoneme | Provides a phonetic pronunciation for the enclosed text. | ph, alphabet |
prosody | Controls the rate, pitch, ...