RAG PDF Chatbot with Phi-2/TinyLlama

A full-stack RAG (Retrieval-Augmented Generation) application built to process and query unstructured PDF data. The system implements a sophisticated pipeline: documents are partitioned into semantic chunks, transformed into high-dimensional embeddings, and indexed in a FAISS vector store. At query time, the system performs semantic retrieval to provide grounded context to local Small Language Models (TinyLlama or Phi-2), ensuring accurate answers while maintaining 100% data privacy.

Key Features

Tech Stack

PythonLangChainFAISS (Vector Database)StreamlitHuggingFace (Transformers)Phi-2 / TinyLlama (LLMs)Sentence-Transformers (Embeddings)PyPDF

Screenshots

Screenshot 0
Screenshot 1
Screenshot 2
Screenshot 3
Screenshot 4
Screenshot 5