Realtime Voice Interface
Guide to PraisonAI’s real-time voice interaction feature with text-to-speech, voice input processing, and financial data integration
Features
- Real-time voice input processing
- Text-to-speech output for AI responses
- Integration with OpenAI’s Realtime API
- Support for various AI models
- Persistent conversation history
- Financial data integration with yfinance
Getting Started
To use the Realtime Voice Interface, follow these steps:
- Install PraisonAI with the realtime dependencies.
- Set up your OpenAI API key.
- Launch the Realtime Voice Interface.
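Before launching, it can help to confirm that the API key is visible to the process. The following is a minimal sketch, assuming the key is supplied through the `OPENAI_API_KEY` environment variable (the convention used by OpenAI's SDKs; that PraisonAI reads it the same way is an assumption here). `api_key_status` is a hypothetical helper, not part of PraisonAI.

```python
import os
from typing import Mapping, Optional


def api_key_status(env: Optional[Mapping[str, str]] = None) -> str:
    """Return 'ok', 'missing', or 'blank' for the OPENAI_API_KEY variable.

    Assumption: the key is supplied through the OPENAI_API_KEY environment
    variable. Pass a mapping to check something other than os.environ.
    """
    env = os.environ if env is None else env
    value = env.get("OPENAI_API_KEY")
    if value is None:
        return "missing"
    if not value.strip():
        return "blank"
    return "ok"


if __name__ == "__main__":
    print(f"OPENAI_API_KEY: {api_key_status()}")
```

The check only confirms that a non-empty value is present. It does not contact OpenAI, so it cannot tell whether the key is valid or has the required permissions.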
Usage
Once the interface is launched:
- Click the microphone button or press ‘P’ to start voice input.
- Speak your message or query.
- The AI will process your input and respond with both text and voice.
- You can ask for financial data, which will be fetched using yfinance.
- The conversation history is maintained for context in ongoing discussions.
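To give a sense of what a financial data lookup might involve, here is a rough sketch that fetches recent closing prices with yfinance and turns them into a sentence suitable for speech output. It is not PraisonAI's actual implementation: `summarize_closes` and `fetch_closes` are hypothetical names, and the five-day period is an arbitrary choice for illustration.

```python
from typing import List, Sequence


def summarize_closes(symbol: str, closes: Sequence[float]) -> str:
    """Build a short, speakable summary from a series of closing prices."""
    if not closes:
        return f"No price data found for {symbol}."
    first, last = closes[0], closes[-1]
    if first == 0:
        # Avoid dividing by zero; report the last close only.
        return f"{symbol} last closed at {last:.2f}."
    change_pct = (last - first) / first * 100
    direction = "up" if change_pct >= 0 else "down"
    return (
        f"{symbol} last closed at {last:.2f}, "
        f"{direction} {abs(change_pct):.1f}% over the period."
    )


def fetch_closes(symbol: str, period: str = "5d") -> List[float]:
    """Fetch recent closing prices with yfinance (requires network access)."""
    import yfinance as yf  # imported lazily so the formatter works without it

    history = yf.Ticker(symbol).history(period=period)
    if history.empty:
        return []
    return [float(price) for price in history["Close"]]


if __name__ == "__main__":
    print(summarize_closes("AAPL", fetch_closes("AAPL")))
```

Keeping the formatting separate from the network call means the summary logic can be checked with plain lists of numbers, without calling yfinance.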
Configuration
You can configure various aspects of the Realtime Voice Interface:
- Model selection: Choose different AI models for processing.
- Voice settings: Adjust voice characteristics for the AI’s speech output.
- Audio settings: Configure input/output audio formats and quality.
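How these settings are exposed in PraisonAI is not described here. As an illustration of what they correspond to at the API level, the sketch below builds the connection URL and a `session.update` event in the shape used by OpenAI's Realtime API, as far as I understand it. `VoiceSettings` is a hypothetical container, the default model and voice names are examples, and the field names and audio formats should be checked against the current OpenAI reference.

```python
import json
from dataclasses import dataclass
from urllib.parse import urlencode

# Audio formats documented for the OpenAI Realtime API session object
# (assumption: verify against the current API reference).
SUPPORTED_AUDIO_FORMATS = {"pcm16", "g711_ulaw", "g711_alaw"}


@dataclass
class VoiceSettings:
    """Hypothetical container for model, voice, and audio settings."""

    model: str = "gpt-4o-realtime-preview"  # example model name
    voice: str = "alloy"                    # example voice name
    input_audio_format: str = "pcm16"
    output_audio_format: str = "pcm16"

    def __post_init__(self) -> None:
        # Reject audio formats outside the known set early.
        for fmt in (self.input_audio_format, self.output_audio_format):
            if fmt not in SUPPORTED_AUDIO_FORMATS:
                raise ValueError(f"Unsupported audio format: {fmt!r}")

    def connection_url(self) -> str:
        """The model is selected through a query parameter on the WebSocket URL."""
        return "wss://api.openai.com/v1/realtime?" + urlencode({"model": self.model})

    def session_update(self) -> dict:
        """Voice and audio formats are sent in a session.update event."""
        return {
            "type": "session.update",
            "session": {
                "voice": self.voice,
                "input_audio_format": self.input_audio_format,
                "output_audio_format": self.output_audio_format,
            },
        }


if __name__ == "__main__":
    settings = VoiceSettings(voice="echo")
    print(settings.connection_url())
    print(json.dumps(settings.session_update(), indent=2))
```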
Troubleshooting
If you encounter issues:
- Ensure your microphone is connected and that your browser has permission to use it.
- Check your internet connection for stable real-time communication.
- Verify that your OpenAI API key is correctly set and has the necessary permissions.
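A quick way to narrow down connection problems is to test whether a TCP connection to the API host can be opened at all. This is a minimal sketch using only the standard library. `can_reach` is a hypothetical helper, and it assumes the interface talks to `api.openai.com` on port 443.

```python
import socket


def can_reach(host: str, port: int = 443, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False


if __name__ == "__main__":
    # Assumption: the realtime session connects to api.openai.com over TLS.
    print("reachable" if can_reach("api.openai.com") else "unreachable")
```

A successful connection only shows that the host is reachable. It says nothing about latency or stability, which matter for real-time audio, and a proxy or firewall may still block WebSocket traffic.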
For more detailed information and advanced usage, please refer to the PraisonAI documentation.