RAG Document Ingestion Prep
Prepare document batches for RAG, evaluation, and agent workflows with Markdown digests and structured manifests.
RAG quality starts before embedding. FileDigest helps convert document batches into cleaner, auditable artifacts that can be reviewed before they are chunked, embedded, indexed, or given to an agent.
Why a manifest matters
manifest.json gives downstream software a structured view of the job: which files were processed, which failed, what artifacts were generated, and how large the output is. That makes ingestion easier to test and debug than a folder of one-off conversions.
Why Markdown matters
digest.md is portable. It can be pasted into an LLM, saved with a project packet, or passed to a controlled parser before vector indexing.
Best fit
FileDigest is a good fit for AI builders who want a hosted Docling workflow with auth, billing, private storage, job history, and Modal processing instead of rebuilding the same conversion workflow for every document batch.