RAG calculator
RAG Cost Calculator
Estimate the one-time and recurring cost of a retrieval-augmented generation system. Model document ingestion, embeddings, vector storage, retrieval context, and LLM output.
Estimate RAG monthly cost
Use a manual estimate for now because vector database pricing differs by provider.
Estimated monthly total$0.00
One-time embedding$0.00
Cost per query$0.00
Monthly LLM input$0.00
Monthly LLM output$0.00
Estimated chunks0
How the estimate works
RAG cost is split into indexing cost and query-time cost. The core model isembedding cost + vector database cost + LLM input cost + LLM output cost.
- Document words are converted to estimated tokens using a default ratio of 1.33 tokens per word.
- Chunk count increases when overlap is higher, which increases embedding and storage pressure.
- Retrieved chunks become input context on every query, so retrieval depth drives recurring LLM cost.
Prices can change. Verify provider pricing before making purchasing or product pricing decisions.