Loading RAG Corpus Ingestion Proof Pack...
RAG Proof
RAG Corpus Ingestion Proof Pack
Turn source notes into a RAG ingestion proof pack with chunk CSV, duplicate-density checks, source coverage matrix, citation anchors, injection risk, and JSON receipt.
Reviewed 2026-06-19
AIBrowser-firstAgent-readableJSON receiptCopy/downloadNo signup
WHY THIS EXISTS
Operational proof for AI-era sites and agents.
Turn source notes into a RAG ingestion proof pack with chunk CSV, duplicate-density checks, source coverage matrix, citation anchors, injection risk, and JSON receipt. The useful output is a visible table plus a receipt that names input, checks, limits, and next action.
- Creates deterministic chunks and source ids instead of only estimating chunk size.
- Flags duplicates, citation-anchor gaps, prompt-injection content, and PII/secret patterns.
- Exports chunks.csv plus a JSON ingestion plan for downstream review.
Boundary: Not for legal data governance, regulated records, production embedding approval, or complete privacy de-identification.