Project

ChatPDF AI

Retrieval-grounded document QA system using LangChain and FAISS with agentic query workflows.

RAGLangChainFAISSAgentsLLM

Problem

Users need grounded answers over long PDFs without the model inventing unsupported claims.

I built a retrieval-first pipeline with chunking, vector indexing, and controlled context assembly before answer generation.

The system was most useful when retrieval quality was treated as the main product surface instead of an invisible backend detail.

This project combines document chunking, embedding-based retrieval, and answer generation into a practical PDF assistant.

Chunking strategy affects answer quality more than people expect.
Retrieval diagnostics are essential; the model cannot recover from weak context.
Grounding checks are necessary if the app is meant to feel trustworthy.