OCRAgent

Defined in the Ocr Agent module.

A specialized agent for OCR (Optical Character Recognition). It extracts text from documents (PDFs) and images using AI models.

Supported providers:

Mistral: mistral/mistral-ocr-latest

Constructor

name

Optional

No description available.

instructions

Optional

No description available.

llm

Optional

No description available.

model

Optional

No description available.

base_url

Optional

No description available.

api_key

Optional

No description available.

ocr

Optional

No description available.

verbose

Union

Default: True

No description available.

Methods

console()

Lazily initialize Rich Console.

litellm()

Lazy load litellm module when needed.
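
The two accessors above describe a lazy-initialization pattern: an expensive dependency is imported or constructed only on first access and then cached. The following is a minimal sketch of that general pattern, not the actual OCRAgent implementation. The class `LazyDeps`, its `module` property, and the `load_count` counter are hypothetical, and the standard-library `json` module stands in for a heavy dependency such as litellm.

```python
import importlib


class LazyDeps:
    """Hypothetical illustration of lazy loading; not OCRAgent's real code."""

    def __init__(self):
        # Nothing is imported at construction time.
        self._module = None
        self.load_count = 0

    @property
    def module(self):
        # Import on first access only, then reuse the cached module object.
        if self._module is None:
            # `json` is a stand-in for a heavy optional dependency.
            self._module = importlib.import_module("json")
            self.load_count += 1
        return self._module


deps = LazyDeps()
assert deps.load_count == 0   # not loaded yet
first = deps.module           # triggers the import
second = deps.module          # served from the cache
assert first is second and deps.load_count == 1
```

The benefit of this design is that constructing the object stays cheap, and users who never touch the dependency never pay its import cost.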

extract()

Extract text from a document or image.

aextract()

Async version of extract().

read()

Quick OCR - extract and return markdown text.

aread()

Async version of read().
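
Because aextract() is described as the async counterpart of extract(), several documents can in principle be processed concurrently. The sketch below shows one way to do that with `asyncio.gather`. To stay runnable without network access or an API key, it uses a hypothetical stand-in class `StubOCRAgent`. The assumption that aextract() takes the same single source argument as extract() and returns an object with a `text` attribute is inferred from the Usage section, not confirmed by this page.

```python
import asyncio
from types import SimpleNamespace


class StubOCRAgent:
    """Hypothetical stand-in that mimics the assumed aextract() shape."""

    async def aextract(self, source):
        await asyncio.sleep(0)  # yield control, as a real network call would
        return SimpleNamespace(text=f"text from {source}")


async def extract_many(agent, sources):
    # Schedule all extractions at once; gather preserves input order.
    results = await asyncio.gather(*(agent.aextract(s) for s in sources))
    return [r.text for r in results]


if __name__ == "__main__":
    urls = ["https://example.com/a.pdf", "https://example.com/b.png"]
    for text in asyncio.run(extract_many(StubOCRAgent(), urls)):
        print(text)
```

If the assumptions hold, passing a real `OCRAgent` instance in place of `StubOCRAgent()` should be the only change needed.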

Usage

    from praisonaiagents import OCRAgent

    agent = OCRAgent(llm="mistral/mistral-ocr-latest")

    # Extract from PDF URL
    result = agent.extract("https://example.com/document.pdf")
    print(result.text)

    # Extract from image URL
    result = agent.extract("https://example.com/image.png")
    for page in result.pages:
        print(page.markdown)
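
The usage example iterates `result.pages` and prints each page's `markdown`. A common follow-up is to combine the per-page Markdown into a single document. The helper below is a minimal sketch: `join_pages` is a hypothetical name, and the stand-in result built from `SimpleNamespace` only mimics the `pages` and `markdown` attributes shown above, so no other fields of the real result object are assumed.

```python
from types import SimpleNamespace


def join_pages(result, separator="\n\n---\n\n"):
    """Concatenate each page's Markdown, skipping empty pages."""
    parts = [page.markdown.strip() for page in result.pages]
    return separator.join(p for p in parts if p)


# Stand-in for the object returned by agent.extract(...).
fake_result = SimpleNamespace(
    pages=[
        SimpleNamespace(markdown="# Page 1\nHello"),
        SimpleNamespace(markdown="   "),  # blank page, dropped by the helper
        SimpleNamespace(markdown="# Page 2\nWorld"),
    ]
)

print(join_pages(fake_result))
```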

Source

View on GitHub

praisonaiagents/agent/ocr_agent.py at line 42

Related Documentation

Agents Concept

Single Agent Guide

Multi-Agent Guide

Agent Configuration

Auto Agents
