Hints

Number of registered words

As a guideline, register no more than about 1,000 words per profile in the user dictionary. From MyPage, you can register at most 1,000 words.

Each Hybrid engine has a predetermined vocabulary, and words outside that vocabulary are never recognized. Even with the End to End engine, uncommon words and neologisms are less likely to be recognized. If your content contains many proper nouns, such as product names, it may be worth registering them even if the total exceeds 1,000 words. Be aware, however, that a newly added word may be recognized by mistake in place of other words with similar pronunciations. Adding too many words increases such errors and can lower recognition accuracy.

Registering more words also has performance costs: profiles take longer to load during speech recognition, and recognition itself takes longer. In particular, profile loading time increases exponentially with the number of registered words. The same caution applies to words specified in the profileWords request parameter.
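As a rough illustration of keeping an eye on the word count, the sketch below adds the number of words already registered in a profile to the number passed with a request and compares the total against the 1,000-word guideline. The function name, the constant, and the idea of counting both sources against a single budget are assumptions made for this example; they are not part of the API.

```python
from __future__ import annotations

# Guideline from the text above; an assumption for this sketch, not a hard engine limit.
RECOMMENDED_MAX_WORDS = 1000


def check_word_budget(
    registered_count: int,
    request_words: list[str],
    limit: int = RECOMMENDED_MAX_WORDS,
) -> tuple[int, bool]:
    """Return (total word count, whether the total is within the limit).

    registered_count: words already registered in the profile.
    request_words:    words you plan to pass with the request (hypothetical list).
    """
    total = registered_count + len(request_words)
    return total, total <= limit
```

A client could call this before sending a request and log a warning when the second value is False, rather than rejecting the request outright.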

Guidelines

As general precautions, please keep the following in mind:

  • When a corresponding class is available, specify the class when registering words. Note that classes are only available for "Word Registration" in the Hybrid engine and cannot be used for "keyword biasing" in the End to End engine.
  • Words with few syllables are more likely to be misrecognized, so either avoid adding them or always specify a class.
  • Don't register words with excessively long pronunciations. To cover a long proper noun or other word with a long pronunciation, consider splitting it and registering the parts. For example, for the station name "南阿蘇水の生まれる里白水高原駅", one approach is to register only "南阿蘇" and "白水高原" in the '地名' class, which is used for Japanese place names, and to leave out the general words, which can likely be recognized without registration.
  • Don't register words with the same pronunciation in the same class. When registering words with the same pronunciation in different classes, you can expect them to be distinguished based on context, but keep in mind that perfect distinction is not guaranteed.
  • Don't use the user dictionary to rewrite recognition results into a preferred written form. For instance, if you want "子ども" but the result shows "子供", registering a word to force the desired result is not recommended. Registered words are not always recognized, and there is a risk that a registered word is recognized even though it was not spoken. To rewrite recognition results into a preferred written form, perform post-processing in the client program.
  • Don't add words that are unlikely to be spoken just as a precaution. If you have a word list for a service or industry, don't register the whole list at once. Instead, select the words to register carefully, with the above points in mind.
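The client-side post-processing recommended above for written forms can be sketched as a simple string replacement applied to the recognized text. This is a minimal illustration, not a prescribed method: the mapping and the function name are hypothetical, and the only entry comes from the "子供" / "子ども" example in the guidelines.

```python
from __future__ import annotations

# Hypothetical mapping from the form the engine returns to the preferred written form.
PREFERRED_FORMS = {
    "子供": "子ども",
}


def normalize_written_forms(text: str, mapping: dict[str, str] | None = None) -> str:
    """Rewrite recognized text so it uses the preferred written forms."""
    if mapping is None:
        mapping = PREFERRED_FORMS
    # Apply longer source strings first so that a shorter key
    # does not break up a longer one that contains it.
    for source in sorted(mapping, key=len, reverse=True):
        text = text.replace(source, mapping[source])
    return text
```

For example, `normalize_written_forms("子供が公園で遊ぶ")` returns `"子どもが公園で遊ぶ"`. Plain substring replacement ignores word boundaries, so a real client may need a more careful rule for short entries.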