Make 12 years of work product instantly findable.
Cortex Knowledge is a self-hosted document Q&A system for small professional services firms. Drop in your PDFs, DOCXs, and notes. Ask questions in plain English. Every answer cites its source.
"We have 12 years of work product nobody can find. The senior people remember where things are. The junior people give up and ask the seniors. We waste hours per day."
Generic search doesn't synthesize, so you still have to open every result. ChatGPT can synthesize, but it can't be trusted with client data and won't cite sources. Both fall short: small professional services firms need precision and privacy together, and neither tool delivers both.
Watch a real query in action.
Click any citation to open the source document at the cited passage.
Three steps, end to end.
Drop files in
PDF, DOCX, TXT, Markdown. Watched folder or web upload. Auto-indexed.
Ask in plain English
RAG-powered retrieval against your vectorized library. Local embeddings by default.
Get cited answers
Every claim links to source doc and page. Click through to verify.
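For readers who want to see what "every claim links to source doc and page" could mean in practice, here is a minimal sketch in Python. The `Citation` and `Claim` types and the `uncited_claims` helper are hypothetical names made up for illustration, not the product's actual data model, and the sample answer is invented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Citation:
    document: str  # hypothetical: file name of the source document
    page: int      # 1-based page number of the cited passage


@dataclass(frozen=True)
class Claim:
    text: str
    citations: tuple[Citation, ...] = ()


def uncited_claims(claims: list[Claim]) -> list[Claim]:
    """Return the claims that carry no citation at all."""
    return [c for c in claims if not c.citations]


# Invented sample answer: one claim with a source, one without.
answer = [
    Claim("The engagement letter caps fees at $40,000.",
          (Citation("engagement-letter-2019.pdf", 3),)),
    Claim("The cap was later raised."),  # no source attached
]

missing = uncited_claims(answer)
```

A check like this would let a citation-required system hold back or flag any answer where `missing` is not empty.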
Runs in Docker on your hardware. Postgres + Qdrant for storage; embeddings generated locally via Ollama.
For answer synthesis, you choose the LLM mode at install:
- Local Ollama keeps everything on your hardware.
- BYO Anthropic API key sends only the retrieved chunks to Anthropic under your own contract.
- Our proxy routes requests through our infrastructure to Anthropic with per-token billing.
Your full document corpus never leaves your firm, regardless of mode. With either Anthropic-backed mode, each query sends only its 5–10 most relevant chunks; with local Ollama, nothing leaves your hardware at all.
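To make the privacy claim concrete, here is a rough sketch of how top-k retrieval limits what an LLM sees. It assumes a toy in-memory index with hand-written two-dimensional vectors; in the real system the vectors would come from Ollama and live in Qdrant. The function names, the payload shape, and the sample chunks are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k_chunks(query_vec, indexed_chunks, k=5):
    """Return the k chunks most similar to the query, best match first."""
    ranked = sorted(indexed_chunks,
                    key=lambda c: cosine(query_vec, c["vector"]),
                    reverse=True)
    return ranked[:k]


def build_payload(question, chunks):
    """Assemble what would be sent to the LLM: the question plus the
    selected chunk texts and their sources. Vectors and all other
    documents stay behind."""
    return {
        "question": question,
        "context": [
            {"source": c["source"], "page": c["page"], "text": c["text"]}
            for c in chunks
        ],
    }


# Invented toy index; k=2 here only because the index has three chunks.
index = [
    {"source": "fee-memo.pdf", "page": 2,
     "text": "Fees are capped per engagement.", "vector": [0.9, 0.1]},
    {"source": "holiday-schedule.txt", "page": 1,
     "text": "Office closed on Fridays in July.", "vector": [0.0, 1.0]},
    {"source": "retainer.docx", "page": 5,
     "text": "Retainer terms reference the fee cap.", "vector": [0.7, 0.7]},
]

selected = top_k_chunks([1.0, 0.0], index, k=2)
payload = build_payload("What is the fee cap?", selected)
```

In this sketch the unrelated `holiday-schedule.txt` chunk never enters `payload`, which is the property the paragraph above describes.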
Self-hosted is the differentiator.
| | Cortex Knowledge | ChatGPT / Claude | Microsoft Copilot | Glean / Hebbia |
|---|---|---|---|---|
| Where documents are stored | Your infrastructure | Vendor | Vendor | Vendor |
| Who controls the LLM | You (local or BYO) | Vendor | Vendor | Vendor |
| Per-seat cost | None | $20/user/mo | $30/user/mo | $50K–$200K/yr |
| Right-sized for 5–30 person firms | Yes | N/A | Bloated | Enterprise only |
What ships in the first release.
- Document ingestion: PDF, DOCX, TXT, Markdown
- Auto-indexing: new, changed, and removed files reflected automatically
- Citation-required answers with click-through to source passages
- Local user accounts (admin + members)
- Three LLM modes: bring your own Anthropic API key, run on local Ollama, or use our proxy
- Admin dashboard: doc count, index status, users, query log, usage stats
- In-app updates from the container registry
- Single-command backup and restore
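One plausible way to implement the auto-indexing item in the list above is to hash every file in the watched folder and compare the result against the previous scan. The sketch below is an assumption about the approach, not a description of the shipped code; `scan` and `diff_manifests` are hypothetical names.

```python
import hashlib
import tempfile
from pathlib import Path


def scan(folder: Path) -> dict[str, str]:
    """Map each file's relative path to a SHA-256 hash of its contents."""
    return {
        p.relative_to(folder).as_posix(): hashlib.sha256(p.read_bytes()).hexdigest()
        for p in sorted(folder.rglob("*"))
        if p.is_file()
    }


def diff_manifests(old: dict[str, str], new: dict[str, str]) -> dict[str, list[str]]:
    """Classify paths as new, changed, or removed between two scans."""
    return {
        "new": sorted(new.keys() - old.keys()),
        "changed": sorted(p for p in new.keys() & old.keys() if new[p] != old[p]),
        "removed": sorted(old.keys() - new.keys()),
    }


# Demo in a throwaway folder: edit one file, delete one, add one.
with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    (folder / "memo.txt").write_text("draft 1")
    (folder / "old.md").write_text("to be deleted")
    before = scan(folder)

    (folder / "memo.txt").write_text("draft 2")
    (folder / "old.md").unlink()
    (folder / "new.txt").write_text("fresh")
    changes = diff_manifests(before, scan(folder))
```

An indexer built this way would re-embed only the `new` and `changed` paths and drop vectors for the `removed` ones, rather than rebuilding the whole library on every change.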
Flat install fee plus optional managed support.
Concrete numbers vary by firm size and LLM mode. Ask for a quote during your demo.
Ready to see it on your documents?
Book a 30-minute demo. I'll show you the system running against a sample legal/accounting corpus, then we'll talk about what installation looks like for your firm.