Speech Engines for Anywhere365

Introduction

Multiple TextToSpeech engines are currently available, but only one SpeechProvider can be selected per UCC.

UCC stands for Unified Contact Center and consists of a queue that can be handled by Agents. Each Contact Center has its own settings, interactive voice response questions, and Agents with specific skills. Agents can be a member of, or sign up to, one or more Contact Centers.

Each SpeechProvider can be configured with a SpeechPreferredVoiceName.

When you do not configure the SpeechPreferredVoiceName, the UCC looks for an available voice based on the Culture configured in GlobalSettings.

When a speech provider cannot initialize or is misconfigured, the MicrosoftSpeechSynthesizer is used as a fallback.
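The selection rules above can be sketched as follows. This is a minimal illustration, not the UCC's actual implementation: the `Voice` class and the `resolve_provider` and `resolve_voice` functions are hypothetical names, and falling through to the Culture lookup when the preferred voice is not installed is an assumption.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional

# Fallback named in the text above.
FALLBACK_PROVIDER = "MicrosoftSpeechSynthesizer"


@dataclass
class Voice:
    """Hypothetical description of an available voice."""
    name: str
    culture: str


def resolve_provider(requested: str, usable: Iterable[str]) -> str:
    """Return the requested provider if it initialized, else the fallback."""
    return requested if requested in set(usable) else FALLBACK_PROVIDER


def resolve_voice(preferred: Optional[str], culture: str,
                  available: List[Voice]) -> Optional[Voice]:
    """Pick the preferred voice if configured, else the first voice for the Culture."""
    if preferred:
        for voice in available:
            if voice.name == preferred:
                return voice
    # Assumption: an unknown preferred voice falls through to the Culture lookup.
    for voice in available:
        if voice.culture.lower() == culture.lower():
            return voice
    return None
```

For example, `resolve_provider("GoogleCloudTextToSpeechV1", ["MicrosoftSpeechSynthesizer"])` returns the fallback because the requested provider is not in the list of usable ones.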

MicrosoftSpeechSynthesizer

This is the standard option: the on-premises text-to-speech solution, which uses the Microsoft Speech Synthesizer with the voices installed on the Anywhere365 UCC server.

| Setting | Value |
| --- | --- |
| SpeechProvider | MicrosoftSpeechSynthesizer |
| SpeechPreferredVoiceName | Example: Microsoft Server Speech Text to Speech Voice (en-GB, Hazel) |

MicrosoftCognitiveServices​

This speech provider uses the Azure cloud service* to generate text-to-speech audio files. It offers better quality than the standard Microsoft Speech voices installed on the UCC server.

* Text to Speech is subject to additional costs, which are billed to the Azure subscription in use. For rates, see https://azure.microsoft.com/en-us/pricing/details/cognitive-services/speech-services/.

| Setting | Description | Value |
| --- | --- | --- |
| SpeechMicrosoftCognitiveApiKey | The API key for Microsoft Cognitive Services. | |
| SpeechMicrosoftCognitiveApiEndpoint | The API endpoint for Microsoft Cognitive Services. | Example: https://westeurope.tts.speech.microsoft.com/cognitiveservices/v1 |
| SpeechMicrosoftCognitiveApiAuthorizationEndpoint | The authorization endpoint for Microsoft Cognitive Services. | Example: https://westeurope.api.cognitive.microsoft.com/sts/v1.0/issueToken |
| SpeechProvider | | MicrosoftCognitiveServices |
| SpeechPreferredVoiceName | | Example: Microsoft Server Speech Text to Speech Voice (en-US, JessaNeural) |
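To check the key and the two endpoints outside the UCC, the requests can be built by hand. The sketch below follows the public Azure Speech REST flow: the key is exchanged for a token at the authorization endpoint, then SSML is posted to the API endpoint. It is not a description of how the UCC calls the service. The function names, the `User-Agent` value, and the chosen output format are illustrative assumptions.

```python
import urllib.request
from xml.sax.saxutils import escape, quoteattr


def build_token_request(auth_endpoint: str, api_key: str) -> urllib.request.Request:
    """POST with the subscription key; the response body is the access token."""
    return urllib.request.Request(
        auth_endpoint,
        data=b"",
        method="POST",
        headers={"Ocp-Apim-Subscription-Key": api_key},
    )


def build_ssml(text: str, voice_name: str, lang: str = "en-US") -> str:
    """Wrap the text in a minimal SSML document for the given voice."""
    return (
        f"<speak version='1.0' xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice_name)}>{escape(text)}</voice>"
        "</speak>"
    )


def build_tts_request(api_endpoint: str, token: str, ssml: str,
                      output_format: str = "riff-16khz-16bit-mono-pcm"
                      ) -> urllib.request.Request:
    """POST the SSML with the bearer token; the response body is the audio."""
    return urllib.request.Request(
        api_endpoint,
        data=ssml.encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/ssml+xml",
            "X-Microsoft-OutputFormat": output_format,
            "User-Agent": "tts-config-check",  # hypothetical client name
        },
    )
```

Passing either request to `urllib.request.urlopen` sends it. A 401 response on the token request usually points at a wrong key or a key from a different region than the endpoint.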

GoogleCloudTextToSpeech​

The Google Cloud Text to Speech service offers the most comprehensive set of voices. A list of the voices is available at https://cloud.google.com/text-to-speech/docs/voices.

For more information about the offering, see https://cloud.google.com/speech-to-text

For this plugin you have to configure two lists: the GlobalSettings list and the PluginSettings list.

In the PluginSettings list you have to enter the credentials JSON from the Google service.

GlobalSettings

| Setting | Value |
| --- | --- |
| SpeechProvider | GoogleCloudTextToSpeechV1 |
| SpeechPreferredVoiceName | Example: nl-NL-Wavenet-C |
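Google voice names such as nl-NL-Wavenet-C start with the language code, so a quick check can confirm that the preferred voice matches the configured Culture. The helpers below are a hypothetical sketch for checking your own configuration. They assume the name starts with a language and region segment separated by hyphens, and they are not part of Anywhere365 or the Google client library.

```python
def language_of_voice(voice_name: str) -> str:
    """Return the leading language code, e.g. 'nl-NL' for 'nl-NL-Wavenet-C'."""
    parts = voice_name.split("-")
    if len(parts) < 3:
        raise ValueError(f"unexpected voice name format: {voice_name!r}")
    return "-".join(parts[:2])


def matches_culture(voice_name: str, culture: str) -> bool:
    """Compare the voice's language code with the Culture, ignoring case."""
    return language_of_voice(voice_name).lower() == culture.lower()
```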

PluginSettings

| Setting | Scope | Value |
| --- | --- | --- |
| GoogleAppCredentialsJson | TextToSpeech | The credentials JSON generated in Google Cloud. |
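Before pasting the credentials JSON into the PluginSettings list, it can help to confirm that it is complete. The sketch below assumes the value is a Google Cloud service account key file, which normally contains the fields listed in `REQUIRED_KEYS`. The function name is hypothetical, and the check says nothing about whether the key is still valid.

```python
import json
from typing import List

# Fields normally present in a service account key file (assumed credential type).
REQUIRED_KEYS = ("type", "project_id", "private_key", "client_email")


def check_credentials_json(raw: str) -> List[str]:
    """Return a list of problems found; an empty list means the shape looks fine."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["top-level value is not a JSON object"]

    problems = [f"missing or empty field: {key}"
                for key in REQUIRED_KEYS if not data.get(key)]
    if data.get("type") and data["type"] != "service_account":
        problems.append(f"unexpected type: {data['type']}")
    return problems
```

A truncated paste is the most common failure here, and it shows up as the "not valid JSON" result.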