MVP Stage 0 Corpus API + MCP
Read-only, permission-aware retrieval over the existing corpus, callable from Claude, ChatGPT, or an internal copilot.
ReviewOS · MediGen strategy portal · Solution and product roadmap
ReviewOS should start as a read-only retrieval layer over MediGen's existing corpus. The application comes after real usage shows which review workflows are worth standardizing.
MVP Stage 0 is the current build target: a Corpus API and MCP server that parses the existing ~50K-document corpus, stores it in the AI semantic search engine, builds semantic and keyword indexes, enforces document-level permissions, and returns citation-first evidence objects to approved AI clients.
The boundary is explicit. MVP Stage 0 is read-only. It does not own answer synthesis, write documents, replace privilege review, or ship a new workflow UI.
Release ladder
| Release | Product form | What it adds |
|---|---|---|
| MVP Stage 0 | Corpus API + MCP server | Read-only retrieval substrate over the existing corpus. |
| MVP Stage 1 | Document ingestion | Controlled corpus maintenance and refresh workflows. |
| MVP Stage 2 | Citation + cross-reference agent | Grounded synthesis, contradiction checks, and evidence packets. |
| MVP Stage 3 | ReviewOS application | Projects, queues, approvals, templates, exports, analytics, and audit. |
| X1-X3 | Extensions | Multimodal search, thinking-model expansion, and recursive retrieval optimization. |
Capabilities
Vector search plus BM25 keyword retrieval, metadata filters, exact terms, IDs, dates, clauses, and regulation references.
Structured evidence objects return source text, document ID, title, page, chunk, score, and metadata.
Authorized upload, batch ingestion, index refresh, duplicate detection, failed-ingestion queue, and deprecation.
Headless multi-pass retrieval, cross-reference checks, contradiction and gap detection, and evidence packets.
Projects, queues, approvals, templates, exports, analytics, RBAC, SSO, and full source-to-output audit.
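As a rough illustration of the structured evidence objects listed above, the sketch below models the named fields (source text, document ID, title, page, chunk, score, metadata) as a Python dataclass. The class name `EvidenceObject`, the field names and types, the `citation()` helper, and the sample values are all assumptions for illustration, not the ReviewOS schema.

```python
from dataclasses import dataclass, field, asdict
from typing import Any


@dataclass(frozen=True)
class EvidenceObject:
    """Hypothetical shape for one retrieved passage (field names are assumed)."""
    source_text: str
    document_id: str
    title: str
    page: int
    chunk: int
    score: float
    metadata: dict[str, Any] = field(default_factory=dict)

    def citation(self) -> str:
        # Citation-first: the reference is derivable from the object itself.
        return f"{self.title} ({self.document_id}), p. {self.page}, chunk {self.chunk}"


example = EvidenceObject(
    source_text="Sample passage text.",
    document_id="DOC-0001",  # made-up identifier
    title="Sample Document",
    page=12,
    chunk=3,
    score=0.87,
    metadata={"doc_type": "policy"},
)
print(example.citation())
print(asdict(example)["metadata"])
```

Keeping the object immutable and self-describing means an AI client can quote the passage and its citation without a second lookup.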
12-week product roadmap
Weeks 1-2 · Foundation
Stand up repo, CI, provider abstraction, hosting/database path, corpus inventory, parsing spike, permission discovery, and eval sets.
Value: Create the factual basis for MVP Stage 0 instead of guessing.
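One way the eval sets could be used is to score retrieval with recall@k over reviewer-labelled questions. The sketch below assumes that form; the metric choice, the `EVAL_SET` and `RETRIEVED` structures, and every question and document ID in it are hypothetical.

```python
def recall_at_k(retrieved: list[str], relevant: set[str], k: int) -> float:
    """Share of the relevant document IDs that appear in the top-k results."""
    if not relevant:
        return 0.0
    return len(set(retrieved[:k]) & relevant) / len(relevant)


# Hypothetical eval set: each case pairs a question with the document IDs a
# reviewer marked as relevant. Questions and IDs are invented for illustration.
EVAL_SET = [
    {"question": "Which policy covers data retention?", "relevant": {"D1", "D2"}},
    {"question": "Where is the audit clause defined?", "relevant": {"D3"}},
]

# Stand-in for retrieval output, keyed by question (would come from the index).
RETRIEVED = {
    "Which policy covers data retention?": ["D1", "D9", "D2"],
    "Where is the audit clause defined?": ["D7", "D8", "D3"],
}


def mean_recall(k: int) -> float:
    """Average recall@k across every case in the eval set."""
    scores = [recall_at_k(RETRIEVED[c["question"]], c["relevant"], k) for c in EVAL_SET]
    return sum(scores) / len(scores)


print(mean_recall(k=2))
print(mean_recall(k=3))
```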
Weeks 3-4 · MVP Stage 0
Parse PDF/DOCX/TXT/spreadsheets, chunk passages, extract metadata, build vector and BM25 indexes, and log parse failures.
Value: Make the existing corpus operationally queryable.
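The chunking step can be pictured as sliding an overlapping word window over each parsed page while keeping the page number, so every passage stays citable. This is a minimal sketch under that assumption; `Chunk`, `chunk_pages`, and the window sizes are hypothetical, and real parsing of PDF/DOCX/spreadsheets is out of scope here.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    document_id: str
    page: int   # 1-based page the passage came from
    chunk: int  # running chunk index within the document
    text: str


def chunk_pages(document_id: str, pages: list[str],
                max_words: int = 200, overlap: int = 40) -> list[Chunk]:
    """Split already-parsed page texts into overlapping word windows."""
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must be >= 0 and smaller than max_words")
    step = max_words - overlap
    chunks: list[Chunk] = []
    for page_number, page_text in enumerate(pages, start=1):
        words = page_text.split()
        for start in range(0, len(words), step):
            window = words[start:start + max_words]
            chunks.append(Chunk(document_id, page_number, len(chunks), " ".join(window)))
            if start + max_words >= len(words):
                break  # the window already reached the end of the page
    return chunks


demo = chunk_pages("DOC-0001", ["a b c d e f g", "h i"], max_words=4, overlap=1)
for c in demo:
    print(c.page, c.chunk, c.text)
```

Chunks never cross a page boundary in this sketch, which keeps the page citation exact at the cost of splitting passages that run across pages.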
Weeks 5-6 · MVP Stage 0
Ship hybrid retrieval, metadata filters, citation-first evidence API, read-only MCP server, permissions, logs, and eval gate.
Value: Open a pilot inside existing AI tools.
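The roadmap does not say how vector and BM25 results are combined. One common option is reciprocal rank fusion (RRF), sketched below as an assumption rather than the chosen design; the function name, the constant `k = 60`, and the chunk IDs are illustrative.

```python
def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[tuple[str, float]]:
    """Fuse several ranked ID lists; each list contributes 1 / (k + rank) per ID."""
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, chunk_id in enumerate(ranking, start=1):
            scores[chunk_id] = scores.get(chunk_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


vector_hits = ["c2", "c1", "c3"]  # stand-in for semantic search output
bm25_hits = ["c1", "c4", "c2"]    # stand-in for keyword search output

fused = reciprocal_rank_fusion([vector_hits, bm25_hits])
print([chunk_id for chunk_id, _ in fused])
```

RRF only needs ranks, so it avoids normalizing cosine similarities against BM25 scores, which live on different scales.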
Weeks 7-8 · MVP Stage 1
Add controlled upload, batch ingestion, incremental refresh, duplicate detection, a failed-ingestion queue, deprecation, and ingestion audit.
Value: Keep the corpus current without a full rebuild.
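Duplicate handling during incremental refresh could be as simple as fingerprinting normalized text and skipping documents whose fingerprint is already indexed. The sketch below assumes that approach; `content_fingerprint`, `plan_ingestion`, and the sample documents are hypothetical, and it catches exact duplicates only, not near-duplicates.

```python
import hashlib
import re


def content_fingerprint(text: str) -> str:
    """SHA-256 of the text after collapsing whitespace and lowercasing."""
    normalized = re.sub(r"\s+", " ", text).strip().lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def plan_ingestion(incoming: dict[str, str], indexed: dict[str, str]) -> dict[str, list[str]]:
    """Sort incoming documents into 'ingest' and 'duplicate'.

    incoming: document ID -> extracted text
    indexed:  fingerprint -> document ID already in the index
    """
    plan: dict[str, list[str]] = {"ingest": [], "duplicate": []}
    seen = dict(indexed)  # copy so the caller's index map is not mutated
    for doc_id, text in incoming.items():
        fingerprint = content_fingerprint(text)
        if fingerprint in seen:
            plan["duplicate"].append(doc_id)
        else:
            seen[fingerprint] = doc_id
            plan["ingest"].append(doc_id)
    return plan


indexed = {content_fingerprint("Alpha  policy"): "D1"}
incoming = {"D2": "alpha policy\n", "D3": "Beta policy", "D4": "BETA   policy"}
print(plan_ingestion(incoming, indexed))
```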
Weeks 9-10 · MVP Stage 2
Add query decomposition, multi-pass retrieval, contradiction/gap detection, citation validation, and evidence packet output.
Value: Move from cited retrieval to grounded synthesis.
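Citation validation can be approximated by checking that each quoted span actually occurs in the chunk it cites. The sketch below assumes citations arrive as dictionaries with `chunk_id` and `quote` keys; those keys, the status labels, and the sample data are hypothetical, and a literal substring match will not catch paraphrased claims.

```python
import re


def _normalize(text: str) -> str:
    """Collapse whitespace and lowercase so layout differences do not matter."""
    return re.sub(r"\s+", " ", text).strip().lower()


def validate_citations(citations: list[dict], chunks: dict[str, str]) -> list[dict]:
    """Attach a status to each citation: supported, quote_not_found, or missing_source."""
    results = []
    for citation in citations:
        source = chunks.get(citation["chunk_id"])
        quote = _normalize(citation["quote"])
        if source is None:
            status = "missing_source"
        elif quote and quote in _normalize(source):
            status = "supported"
        else:
            status = "quote_not_found"  # also covers empty quotes
        results.append({**citation, "status": status})
    return results


chunks = {"c1": "Records must be retained\nfor seven years."}
citations = [
    {"chunk_id": "c1", "quote": "retained for seven years"},
    {"chunk_id": "c1", "quote": "retained for ten years"},
    {"chunk_id": "c9", "quote": "anything"},
]
for row in validate_citations(citations, chunks):
    print(row["chunk_id"], row["status"])
```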
Weeks 11-12 · MVP Stage 2 + Stage 3 decision
Harden confidence, safe-fail behavior, permissions, and the eval dashboard; use pilot logs to decide whether to build the Stage 3 app.
Value: Productize only the workflows proven by usage.
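Safe-fail behavior is not specified further in the roadmap. One plausible reading is that the service abstains when evidence is too weak rather than returning marginal passages; the sketch below assumes that reading, and the function name, thresholds, and status strings are hypothetical.

```python
def evidence_or_abstain(hits: list[tuple[str, float]],
                        min_score: float = 0.5,
                        min_hits: int = 2) -> dict:
    """Return strong hits, or an explicit abstention when too few clear the bar."""
    strong = [(chunk_id, score) for chunk_id, score in hits if score >= min_score]
    if len(strong) < min_hits:
        # Fail safe: say so explicitly instead of passing weak passages along.
        return {"status": "insufficient_evidence", "evidence": []}
    return {"status": "ok", "evidence": strong}


print(evidence_or_abstain([("c1", 0.91), ("c2", 0.62), ("c3", 0.20)]))
print(evidence_or_abstain([("c1", 0.55), ("c2", 0.30)]))
```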
MVP 0-2 system architecture
Stage 0 proves trusted retrieval, Stage 1 keeps the corpus current, and Stage 2 turns cited passages into reusable evidence packets without jumping straight to a workflow app.
Existing documents become searchable, permission-aware evidence.
Reviewers get exact passages, source text, pages, scores, and filters.
The search layer stays current without rebuilding the corpus by hand.
The product moves from cited retrieval to defensible first-pass synthesis.
A user asks a research, legal, or regulatory question in an approved AI tool.
ReviewOS scopes the request against permissions, metadata filters, and corpus boundaries.
ReviewOS runs semantic and keyword search, ranks passages, and returns citation-first evidence objects.
The reviewer validates source fit, sensitivity, and answer use; ReviewOS logs the query, sources, and failures.
Stage 1 adds controlled ingestion, refresh, duplicate detection, deprecation, and ingestion audit.
Stage 2 builds grounded evidence packets with corroboration, contradictions, gaps, and citation checks.
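The scoping step in the flow above can be sketched as a document-level permission filter applied before ranking, so passages a user cannot read never reach the AI client. The ACL layout (document ID mapped to allowed groups), the function name, and the sample data are assumptions for illustration.

```python
def scope_and_rank(user_groups: set[str], candidates: list[dict],
                   acl: dict[str, set[str]], top_k: int = 3) -> list[dict]:
    """Drop candidates the user may not read, then rank the rest by score.

    Documents without an ACL entry are denied by default.
    """
    allowed = [
        c for c in candidates
        if acl.get(c["document_id"], set()) & user_groups
    ]
    return sorted(allowed, key=lambda c: c["score"], reverse=True)[:top_k]


acl = {"D1": {"legal"}, "D2": {"research"}}  # hypothetical document-level ACL
candidates = [
    {"document_id": "D1", "score": 0.90},
    {"document_id": "D2", "score": 0.80},
    {"document_id": "D3", "score": 0.95},  # no ACL entry, so never returned
]

print(scope_and_rank({"research"}, candidates, acl))
print(scope_and_rank({"legal", "research"}, candidates, acl))
```

Filtering before ranking, rather than after, keeps restricted passages out of scores, logs, and result counts as well as out of the response.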
Not now