Join us in Phaze Demesnes or follow us:

This system converts text to female or male speech.

Click 'View Source' to see the commands for controlling pitch, speed and emphasis.

FemaleMale
Low ResHigh Res (Second Life 44khz)
Speed:
nbsp;

Your Wave File is Ready

24190 bytes, 0.3 seconds

Download the wave file (right click and save-Link-as)

 

Speed

The Speed attribute controls the relative rate of the voice. The value of Speed should be an integer between negative ten and ten. The absolute value is found by adding each Speed to the current slider value which controls the Rate.

<rate speed="5"> This text should be spoken at rate five.
<rate speed="-5"> This text should be spoken at rate negative five.
</rate>
</rate>

The Rate tag controls the rate of a voice. The tag can be empty, in which case it applies to all subsequent text, or it can have content, in which case it only applies to that content.

Rate

The value of Rate should be an integer between negative ten (-10) and ten (1). Values outside of this range may be truncated by the engine. The AbsSpeed attribute controls the absolute rate of the voice, so a value of ten always corresponds to a value of ten, a value of five always corresponds to a value of five.

<rate absspeed="5">
This text should be spoken at rate five.
<rate absspeed="-5"> This text should be spoken at rate negative five.
</rate>
</rate> <rate absspeed="10">All text which follows should be spoken at rate ten. </rate>

Emph
The Emph tag instructs the voice to emphasize a word or section of text. The Emph tag cannot be empty. The following word should be emphasized.
To stress a word, use emphasis. <emph> boo </emph>!

Spell

The Spell tag forces the voice to spell out all text, rather than using its default word and sentence breaking rules, normalization rules, and so forth. All characters should be expanded to corresponding words (including punctuation, numbers, and so forth). The Spell tag cannot be empty.

<spell>
These words should be spelled out.
</spell>
These words should not be spelled out.
Pitch
The Pitch tag controls the pitch of a voice. The tag can be empty, in which case it applies to all subsequent text, or it can have content, in which case it only applies to that content.

The Pitch tag has two attributes, Middle and AbsMiddle, one of which must be present. The value of both of these attributes should be an integer between negative ten and ten.

The AbsMiddle attribute controls the absolute pitch of the voice, so a value of ten always corresponds to a value of ten, a value of five always corresponds to a value of five.

<pitch absmiddle="5">
This text should be spoken at pitch five.
   <pitch absmiddle="-5">
      This text should be spoken at pitch negative five.
   </pitch>
</pitch>
<pitch absmiddle="10"/>

All text which follows should be spoken at pitch ten.

The Middle attribute controls the relative pitch of the voice. The absolute value is found by adding each Middle to the current absolute value.

<pitch middle="5">
This text should be spoken at pitch five.
   <pitch middle="-5">
      This text should be spoken at pitch zero.
   </pitch>
</pitch>

Zero represents the default middle pitch for a voice, with positive values being higher and negative values being lower.

Volume

The Volume tag controls the volume of a voice. The tag can be empty, in which case it applies to all subsequent text, or it can have content, in which case it only applies to that content.

The Volume tag has one required attribute: Level. The value of this attribute should be an integer between zero and one hundred. Values outside of this range will be truncated.

<volume level="50">
This text should be spoken at volume level fifty.

   <volume level="100">
      This text should be spoken at volume level one hundred.
   </volume>
   
</volume>

<volume level="80"/>
All text which follows should be spoken at volume level eighty.

One hundred represents the default volume of the voice. Lower values represent percentages of this default. That is, 50 corresponds to 50% of full volume.

Pron

The Pron tag inserts a specified pronunciation. The voice will process the sequence of phonemes exactly as they are specified. This tag can be empty, or it can have content. If it does have content, it will be interpreted as providing the pronunciation for the enclosed text. That is, the enclosed text will not be processed as it normally would be.

The Pron tag has one attribute, Sym, whose value is a string of white space separated phonemes.

<pron sym="h eh 1 l ow & w er 1 l d "/>
<pron sym="h eh 1 l ow & w er 1 l d"> hello world </pron>

PartOfSp

The PartOfSp tag provides the voice with the part of speech of the enclosed word(s). Use this tag to enable the voice to pronounce a word with multiple pronunciations correctly depending on its part of speech. The PartOfSp tag cannot be empty.

The PartOfSp tag has one attribute, Part, which takes a string corresponding to a SAPI part of speech as its attribute. Only SAPI defined parts of speech are supported - "Unknown", "Noun", "Verb", "Modifier", "Function", "Interjection".

<partofsp part="noun"> A </partofsp> is the first letter of the alphabet.
Did you <partofsp part="verb"> record </partofsp> that <partofsp part="noun"> record </partofsp>?

Back to the Best Free Tools in Second Life and OpenSim.