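TTS and STT with Google Gemini

Setup

# Make the Google API key available to the agent (replace your-key with your own key)
export GOOGLE_API_KEY=your-key

Text-to-Speech

from praisonaiagents import AudioAgent

# Create an agent backed by the Gemini TTS model
agent = AudioAgent(llm="gemini/gemini-2.5-flash-preview-tts")
# Synthesize the text and write the audio to hello.mp3
agent.speech("Hello world!", output="hello.mp3")

Speech-to-Text

from praisonaiagents import AudioAgent

# Create an agent backed by the Gemini model used for transcription
agent = AudioAgent(llm="gemini/gemini-2.0-flash")
# Transcribe an existing audio file and print the result
text = agent.transcribe("audio.mp3")
print(text)

Models

| Model | Type |
|---|---|
| gemini/gemini-2.5-flash-preview-tts | TTS |
| gemini/gemini-2.0-flash | STT |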
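
The two snippets can be combined into a single round trip: synthesize speech, then transcribe the resulting file. The following is a minimal sketch, not part of PraisonAI itself. The helpers `model_for`, `make_agent`, and `round_trip` are hypothetical names introduced here for illustration; the model names come from the Models table, and the only library calls used are the `AudioAgent(llm=...)`, `speech(..., output=...)`, and `transcribe(...)` calls shown in this document. It assumes that the file written by `speech` can be passed straight to `transcribe`.

```python
import os

# Model names copied from the Models table in this document.
MODELS = {
    "tts": "gemini/gemini-2.5-flash-preview-tts",
    "stt": "gemini/gemini-2.0-flash",
}


def model_for(task):
    """Return the model name for "tts" or "stt" (hypothetical helper)."""
    key = task.lower()
    if key not in MODELS:
        raise ValueError(f"unknown task {task!r}; expected one of {sorted(MODELS)}")
    return MODELS[key]


def make_agent(task):
    """Build an AudioAgent for the task, failing early if the API key is missing."""
    if not os.environ.get("GOOGLE_API_KEY"):
        raise RuntimeError("GOOGLE_API_KEY is not set; export it before running")
    # Imported lazily so the helpers above can be used without the package installed.
    from praisonaiagents import AudioAgent
    return AudioAgent(llm=model_for(task))


def round_trip(text, path="hello.mp3"):
    """Speak `text` into `path`, then return whatever transcribe() gives back."""
    make_agent("tts").speech(text, output=path)
    return make_agent("stt").transcribe(path)
```

Calling `round_trip("Hello world!")` with a valid key would write `hello.mp3` and return its transcription. Checking the environment variable up front turns a missing key into an immediate, readable error instead of a failure inside the API call.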