Skip to main content

OCRAgent

Defined in the ocr_agent module.
A specialized agent for OCR (Optical Character Recognition). Extracts text from documents (PDFs) and images using AI models. Supported Providers:
  • Mistral: mistral/mistral-ocr-latest

Constructor

name
Optional
No description available.
instructions
Optional
No description available.
llm
Optional
No description available.
model
Optional
No description available.
base_url
Optional
No description available.
api_key
Optional
No description available.
ocr
Optional
No description available.
verbose
Union
default:"True"
No description available.

Methods

Usage

from praisonaiagents import OCRAgent
    
    agent = OCRAgent(llm="mistral/mistral-ocr-latest")
    
    # Extract from PDF URL
    result = agent.extract("https://example.com/document.pdf")
    print(result.text)
    
    # Extract from image URL
    result = agent.extract("https://example.com/image.png")
    for page in result.pages:
        print(page.markdown)