Overview of the most important SSML tags

There are a lot of tags you can use to make your voice more lifelike. A lot of them are described in the standards, but some of the details are left to the supplier of the engine. In the following table, I give you an overview of the most important tags and how to use them in a way that the Microsoft synthesizer understands:

Element

Description

Attributes

break

A pause in the spoken text.

strength, time

emphasis

Makes the part in this element more prominent.

level

p

Paragraph.

-

s

Sentence inside a paragraph.

-

phoneme

Turns the pronunciation into a phonetic one. Used to spell out numbers and characters.

ph, alphabet

prosody

Controls the rate, pitch, ...

Get Microsoft HoloLens Developer’s Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.