BaseChunkerStrategy
Abstract base class for document chunking strategies.
Usage
BaseChunkerStrategy()Parameters
max_chunk_size: int = 1000-
Maximum size of each chunk in characters.
overlap_size: int = 200-
Number of overlapping characters between chunks.
Methods
| Name | Description |
|---|---|
| calculate_optimal_boundaries() | Find optimal chunk boundary positions. |
| chunk_document_content() | Split document content into chunks. |
calculate_optimal_boundaries()
Find optimal chunk boundary positions.
Usage
calculate_optimal_boundaries(content)chunk_document_content()
Split document content into chunks.
Usage
chunk_document_content(content)