SDPM Chunking

SDPM (Semantic Double-Pass Merging) combines sentence-level chunking with semantic analysis for optimal chunk boundaries.

Quick Start

from praisonaiagents import Agent

agent = Agent(
    instructions="Answer questions from research papers.",
    knowledge={
        "sources": ["papers/"],
        "chunker": {
            "type": "sdpm",
            "chunk_size": 512,
            "embedding_model": "all-MiniLM-L6-v2"
        }
    }
)

response = agent.start("Summarize the findings")

When to Use

Research papers with complex structure
Technical documents with multiple topics
Content where both sentence flow AND semantic coherence matter

Parameters

Parameter	Type	Default	Description
`chunk_size`	int	512	Max tokens per chunk
`embedding_model`	str	auto	Model for semantic analysis

Getting Started

Core Concepts

Guides

Features

Models

Databases

Observability

Memory

Knowledge

RAG

Persistence

Tools

Other Features

Developers

Configuration

Best Practices

Getting Started (No Code)

Quick Start

When to Use

Parameters

Getting Started

Core Concepts

Guides

Features

Models

Databases

Observability

Memory

Knowledge

RAG

Persistence

Tools

Other Features

Developers

Configuration

Best Practices

Getting Started (No Code)

​Quick Start

​When to Use

​Parameters

Quick Start

When to Use

Parameters