Docling
on Power
Document extraction and RAG preparation for IBM Power infrastructure.
PDF, DOCX, PPTX → structured chunks. Native on ppc64le & ppc64.
IBM's document engine, now running on Power.
Extract and chunk any document for your RAG pipelines — on-premises extraction, LLM-powered search via API.
Documents in. Structured chunks out.
Docling parses page layouts, extracts tables, detects formulas, and outputs clean Markdown, JSON, or HTML — ready for your RAG pipeline. Retrieval and search are powered by LLMs via API.
Docling processing pipeline — from raw documents to structured, RAG-ready chunks.
Every format. One pipeline.
Advanced document extraction
Layout analysis, reading order detection, table structure recognition, and formula extraction — all running natively on Power without GPU requirements.
RAG-ready chunking
Docling partitions documents into optimized chunks with preserved reading order and semantic boundaries — ready to feed into your vector databases and retrieval pipelines.
Search via LLM APIs
Document extraction and chunking run locally on Power. For retrieval and search, connect to any LLM provider via API — OpenAI, watsonx, or your preferred service. Hybrid architecture: local processing, cloud intelligence.
AI framework integrations
Plug-and-play connectors for LangChain, LlamaIndex, CrewAI, and Haystack. Use the Python SDK or CLI to integrate document extraction into your existing workflows.
Native on ppc64le & ppc64 (AIX)
LibrePower's port of Docling brings document extraction and RAG preparation to IBM Power Systems — Linux (ppc64le) and AIX (ppc64). No x86 emulation. No GPU required.
# Install on ppc64le or ppc64 (AIX) # pip install docling from docling.document_converter import DocumentConverter source = "quarterly-report.pdf" converter = DocumentConverter() result = converter.convert(source) # Export to Markdown for RAG md = result.document.export_to_markdown() # Or export to JSON for processing json_data = result.document.export_to_dict()
Bring document extraction to your Power infrastructure
Deploy Docling on IBM Power today. Extract and chunk your documents for RAG — on-premises processing, LLM-powered search via API.