Components of Word Registration
For word registration, you can specify the "notation", "pronunciation", and "class" of the word, among which "notation" and "pronunciation" are mandatory items. The following explains each component.
Item | Description | Required | Example |
|---|---|---|---|
| Notation | The string obtained as a result of speech recognition when the word is spoken. | ● | AmiVoice |
| Pronunciation | Information representing how the word is pronounced. The method of describing the pronunciation differs for each language. | ● | あみぼいす |
| Class | A classification used to specify the category or type of the word. This classification allows the speech recognition system to distinguish words with the same pronunciation used in different contexts. Classes are defined for each engine, and API users cannot add classes. | 固有名詞 |
The English engine does not support word registration.
Overview of Word Registration
For example, if you want to register the word "パレオパラドキシア" because it's not being recognized, register the notation and pronunciation pair as follows. Separate the notation and pronunciation with a space. If you also want to set a class, please see How to Set Class.
パレオパラドキシア ぱれおぱらどきしあ
Setting multiple pronunciations for the same notation
You can set multiple pronunciations for one notation.
For example, you can set the notation "AMI" for pronunciation like "あみ" or "アドバンストメディア".
AMI あみ
AMI あどばんすとめでぃあ
Setting the same pronunciation for multiple notations
You can set the same pronunciation for multiple different notations. It won't cause an error, but which notation will be chosen is undefined. It is not recommended to set this intentionally.
For example, you can set notations like "AMI" and "AmiVoice" for the pronunciation "あみ".
AMI あみ
AmiVoice あみ
Notation
The "notation" is the string you want to output for the spoken audio.
Special Characters Usable in Notation
Among the characters that can be used in the notation, there are symbols that have special functions.
| Character | Character Name | Description |
|---|---|---|
| _ | Underscore | Symbol that outputs as a space in speech recognition results |
It is not possible to output an underscore (_) as a speech recognition result.
Characters That Cannot Be Registered in Notation
Strings containing the following characters cannot be registered in the notation.
| Character | Character Name |
|---|---|
| | | Vertical bar |
| Space | |
| : | Colon |
While you cannot use spaces in the notation you're registering, if you use an underscore (_) in the notation when registering a word, it will be output as a space in the speech recognition results.*
For example, if you want to output "Advanced Media" when "あみ" is spoken, register the word as "Advanced_Media あみ".*
Advanced_Media あみ
Pronunciation
"Pronunciation" refers to how the word is pronounced (how it's spoken).
How to Describe Pronunciation for Each Language
The method of describing pronunciation differs for each language. The following explains the description method for each language.
Japanese
For Japanese, describe using hiragana or katakana.
Chinese
For Chinese, describe using pinyin with tones represented by numbers. For example, "我们" should be described as "wo3men5".
我们 wo3men5
Korean
For Korean, describe using Hangul.