Guide to PraisonAI’s real-time voice interaction feature with text-to-speech, voice input processing, and financial data integration
To use the Realtime Voice Interface, follow these steps:
Install PraisonAI with the realtime dependencies:
Set up your OpenAI API key:
Azure OpenAI Configuration
To use Azure OpenAI instead of standard OpenAI, configure these environment variables:
The realtime interface will automatically detect the base URL and adjust the WebSocket connection accordingly.
Launch the Realtime Voice Interface:
Once the interface is launched:
You can configure various aspects of the Realtime Voice Interface:
Variable | Description | Default |
---|---|---|
OPENAI_API_KEY | Your OpenAI or Azure OpenAI API key | Required |
OPENAI_BASE_URL | Custom base URL for OpenAI-compatible APIs (e.g., Azure OpenAI) | https://api.openai.com/v1 |
OPENAI_MODEL_NAME | Model to use for realtime API | gpt-4o-mini-realtime-preview-2024-12-17 |
Choose different AI models for processing. Supported models include:
gpt-4o-realtime-preview
gpt-4o-mini-realtime-preview-2024-12-17
Adjust voice characteristics for the AI’s speech output through the session configuration.
Configure input/output audio formats and quality. The interface uses PCM16 format by default for optimal compatibility.
If you encounter issues:
For more detailed information and advanced usage, please refer to the PraisonAI documentation.
Guide to PraisonAI’s real-time voice interaction feature with text-to-speech, voice input processing, and financial data integration
To use the Realtime Voice Interface, follow these steps:
Install PraisonAI with the realtime dependencies:
Set up your OpenAI API key:
Azure OpenAI Configuration
To use Azure OpenAI instead of standard OpenAI, configure these environment variables:
The realtime interface will automatically detect the base URL and adjust the WebSocket connection accordingly.
Launch the Realtime Voice Interface:
Once the interface is launched:
You can configure various aspects of the Realtime Voice Interface:
Variable | Description | Default |
---|---|---|
OPENAI_API_KEY | Your OpenAI or Azure OpenAI API key | Required |
OPENAI_BASE_URL | Custom base URL for OpenAI-compatible APIs (e.g., Azure OpenAI) | https://api.openai.com/v1 |
OPENAI_MODEL_NAME | Model to use for realtime API | gpt-4o-mini-realtime-preview-2024-12-17 |
Choose different AI models for processing. Supported models include:
gpt-4o-realtime-preview
gpt-4o-mini-realtime-preview-2024-12-17
Adjust voice characteristics for the AI’s speech output through the session configuration.
Configure input/output audio formats and quality. The interface uses PCM16 format by default for optimal compatibility.
If you encounter issues:
For more detailed information and advanced usage, please refer to the PraisonAI documentation.