> ## Documentation Index
> Fetch the complete documentation index at: https://docs.praison.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Audio Overview

> Text-to-Speech and Speech-to-Text

## Text-to-Speech

<Tabs>
  <Tab title="Basic">
    ```python theme={"theme":{"light":"vitesse-light","dark":"vitesse-dark"}}
    from praisonaiagents import AudioAgent

    agent = AudioAgent(llm="openai/tts-1")
    agent.say("Hello!", output="hello.mp3")
    ```
  </Tab>

  <Tab title="Advanced">
    ```python theme={"theme":{"light":"vitesse-light","dark":"vitesse-dark"}}
    from praisonaiagents import AudioAgent

    agent = AudioAgent(llm="openai/tts-1-hd")
    agent.speech("Hello!", voice="nova", speed=1.2, output="hello.mp3")

    # Voices: alloy, echo, fable, onyx, nova, shimmer
    ```
  </Tab>
</Tabs>

## Speech-to-Text

<Tabs>
  <Tab title="Basic">
    ```python theme={"theme":{"light":"vitesse-light","dark":"vitesse-dark"}}
    from praisonaiagents import AudioAgent

    agent = AudioAgent(llm="openai/whisper-1")
    text = agent.listen("audio.mp3")
    print(text)
    ```
  </Tab>

  <Tab title="Advanced">
    ```python theme={"theme":{"light":"vitesse-light","dark":"vitesse-dark"}}
    from praisonaiagents import AudioAgent

    agent = AudioAgent(llm="groq/whisper-large-v3")  # 10x faster
    text = agent.transcribe("audio.mp3", language="en")
    print(text)
    ```
  </Tab>
</Tabs>

## Providers

<CardGroup cols={2}>
  <Card title="OpenAI" icon="robot" href="/docs/audio/openai">TTS + STT</Card>
  <Card title="Groq" icon="bolt" href="/docs/audio/groq">Fast STT</Card>
  <Card title="ElevenLabs" icon="e" href="/docs/audio/elevenlabs">Premium TTS</Card>
  <Card title="Deepgram" icon="d" href="/docs/audio/deepgram">STT</Card>
</CardGroup>
