$ timeahead.in
/ servers/pypi/haiku.rag
pypi

haiku.rag

Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling

533 stars867/wkupdated 0d agogithub ↗
88good
▣ Overview
Testscodecov

What it does

Haiku RAG is an agentic retrieval-augmented generation system for indexing documents and answering questions with citations. It combines LanceDB for vector storage, Pydantic AI for multi-agent orchestration, and Docling for document parsing. The system supports hybrid search (vector plus full-text), multimodal retrieval (embedding both text and figures in a shared vector space), and vision-aware QA when documents contain images. Beyond simple question answering, it provides research agents for iterative planning and synthesis, analysis agents for complex computational tasks via sandboxed Python, and conversational interfaces with multi-turn memory. Indices are local-first via embedded LanceDB, though cloud and object-storage backends are available.

Who it's for

Document analysts and researchers who need to extract structured insights from large collections. Teams building conversational document search features. Engineers integrating RAG capabilities into Claude Desktop or other AI assistants without running a separate backend.

Common use cases

  • Index PDFs and web documents, then search by keyword or semantic similarity with page-specific citations
  • Run multi-turn research workflows using agentic planning: decompose a research question into steps, execute searches, and synthesize results
  • Analyze document collections programmatically—count mentions, compute aggregations, compare claims across sources
  • Build conversational chatbots over proprietary documents with session memory and visual grounding
  • Expose document search tools to Claude via MCP for use within Claude Desktop or API calls

Setup pitfalls

  • Requires Python 3.12 or newer; existing Python 3.11 environments will not work
  • Needs an embedding provider configured (Ollama, OpenAI, VoyageAI, LM Studio, or vLLM); indexing will fail if none is available
  • Reads and writes to the filesystem for document cache and LanceDB indices; requires appropriate permissions and disk space for large document collections
  • Makes network calls for remote document fetching and embedding API calls; runs with high risk classification and should be sandboxed in security-sensitive environments
▣ Score BreakdownMCPScore = Σ(raw × weight)
DimensionRawWeighted
Security
35%
100
35.0
Freshness
25%
100
25.0
Adoption
20%
63
12.5
Quality
10%
100
10.0
Trust
10%
50
5.0
Total
87.5
⚿ Capabilities & Risk Explainer
fs readfs writenetworkexecevalsecrets
◆ Risk level: high
fs read + fs write + network + exec + eval + secrets active — can execute code, access credentials, and make external network calls.
⚙ Install config
Claude Desktop · Cursor · Windsurf · VS Code (Copilot) · Claude Code
add to your MCP client config:
{
  "mcpServers": {
    "haikurag-1": {
      "command": "uvx",
      "args": [
        "haiku.rag"
      ]
    }
  }
}
📈 Score historylast 27 snapshots
5/10/20266/6/2026 · 27 snapshots
⚙ Maintenance health
59/ 100 · is this project alive?
contributors (1y)5
top contributor share96%
releases (1y)100
last release0d ago
ci✓ passing
⛁ Raw data
weekly downloads867
github stars533
forks36
open issues8
license✓ present
readme length6722 chars
last publish0d ago
last commit0d ago
last updated7h ago
install verified✓ pass · 18d ago
owner of this server? claim your listing to get a verified badgeclaim →
🔔 Score drop alerts
get notified by email when this server's score drops 5+ points