Build a robust ingestion pipeline that parses, cleans, enriches, and embeds heterogeneous documents for a RAG or search system.
## CONTEXT You are building the ingestion pipeline that transforms raw, messy documents into clean, chunked, embedded, and indexed records ready for retrieval. Ingestion quality determines everything downstream: bad parsing, lost structure, or stale data poisons retrieval no matter how good the model is. The user has…
Premium Prompt
Unlock this prompt — and all 25,000+ expert-crafted prompts — with Pro.
Unlock with Pro