$ timeahead.in
/ servers/pypi/haiku.rag
pypi

haiku.rag

Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling

533 stars867/wkupdated 0d agogithub ↗
88good
▣ Overview
Testscodecov

What it does

Haiku RAG is a retrieval-augmented generation system that indexes documents and answers questions with citations and page numbers. It combines vector search and full-text retrieval via Reciprocal Rank Fusion, supports multimodal search (text queries over figures, image queries over documents), and wraps agentic skills for question-answering, vision-aware QA, and analysis via sandboxed Python. Built on LanceDB (local embeddings), Pydantic AI (agent logic), and Docling (document structure).

Who it's for

Backend engineers and developers building document-aware AI applications—particularly those needing embeddings-agnostic retrieval (Ollama, OpenAI, VoyageAI) without external servers. Teams building conversational interfaces over knowledge bases (internal docs, research papers, PDFs) or running complex document analysis tasks.

Common use cases

  • Query a local PDF or web-content collection and retrieve answers with citations
  • Build a multi-turn conversational assistant over internal documentation or research archives
  • Analyze documents programmatically via sandboxed Python (cross-document aggregation, metric extraction)
  • Search documents by image and retrieve matching figures with multimodal embeddings
  • Continuously ingest and index documents from filesystem, S3, HTTP, and WebDAV sources

Setup pitfalls

  • Requires a pre-configured embedding provider (Ollama, OpenAI, VoyageAI) before indexing
  • Python 3.12+ required
  • Reads and writes to local filesystem; ensure proper permissions and consider S3/GCS backend for production deployments
  • Multimodal and vision features require additional vLLM setup and vision-capable models
  • Use haiku.rag-slim for minimal installs; install only the extras you need
▣ Score BreakdownMCPScore = Σ(raw × weight)
DimensionRawWeighted
Security
35%
100
35.0
Freshness
25%
100
25.0
Adoption
20%
63
12.5
Quality
10%
100
10.0
Trust
10%
50
5.0
Total
87.5
⚿ Capabilities & Risk Explainer
fs readfs writenetworkexecevalsecrets
◆ Risk level: high
fs read + fs write + network + exec + eval + secrets active — can execute code, access credentials, and make external network calls.
⚙ Install config
Claude Desktop · Cursor · Windsurf · VS Code (Copilot) · Claude Code
add to your MCP client config:
{
  "mcpServers": {
    "haikurag-1": {
      "command": "uvx",
      "args": [
        "haiku.rag"
      ]
    }
  }
}
⚙ Maintenance health
59/ 100 · is this project alive?
contributors (1y)5
top contributor share96%
releases (1y)100
last release0d ago
ci✓ passing
⛁ Raw data
weekly downloads867
github stars533
forks36
open issues8
license✓ present
readme length6722 chars
last publish0d ago
last commit0d ago
last updated10h ago
install verified✓ pass · 18d ago
owner of this server? claim your listing to get a verified badgeclaim →
🔔 Score drop alerts
get notified by email when this server's score drops 5+ points