SanielDev/pdf-rag-chatbot

🧠 PDF RAG Chatbot with Free LLM + ChromaDB

A Streamlit-based chatbot that lets you query PDF files using Retrieval-Augmented Generation (RAG) with ChromaDB and free HuggingFace LLMs.

🚀 Features

  • Upload any resume PDF 📄
  • Parses and chunks documents using LangChain
  • Uses ParentDocumentRetriever for hierarchical chunking
  • Embeds using sentence-transformers
  • Stores vectors locally with ChromaDB
  • Answers powered by Hugging Face's Mixtral-8x7B-Instruct endpoint
  • Returns answers with source snippets ✨
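
To make the hierarchical chunking idea concrete, here is a minimal plain-Python sketch of the parent/child pattern that a retriever such as ParentDocumentRetriever is built around: small child chunks are searched, and the larger parent chunk they came from is returned as context. This is not the app's code. The names `split`, `build_index`, `overlap`, and `retrieve_parent` are hypothetical, and the word-overlap score is a stand-in for real embedding similarity.

```python
from dataclasses import dataclass


@dataclass
class Child:
    text: str
    parent_id: int  # index of the parent chunk this child was cut from


def split(text: str, size: int) -> list[str]:
    """Split text into consecutive pieces of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def build_index(document: str, parent_size: int = 40, child_size: int = 10):
    """Return (parents, children); every child remembers its parent."""
    parents = split(document, parent_size)
    children = [
        Child(text=piece, parent_id=pid)
        for pid, parent in enumerate(parents)
        for piece in split(parent, child_size)
    ]
    return parents, children


def overlap(query: str, text: str) -> int:
    """Toy relevance score (assumption): count of shared lowercase words."""
    return len(set(query.lower().split()) & set(text.lower().split()))


def retrieve_parent(query: str, parents: list[str], children: list[Child]) -> str:
    """Match against small child chunks, but return the larger parent chunk."""
    best = max(children, key=lambda c: overlap(query, c.text))
    return parents[best.parent_id]
```

Searching small chunks tends to give more precise matches, while returning the parent gives the LLM enough surrounding text to answer from.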

🧩 Tech Stack

  • 🖥 Streamlit – UI for chat interface
  • 🧠 LangChain – for RAG logic and document parsing
  • ๐Ÿ” ChromaDB โ€“ local vector store
  • 🧩 Sentence-Transformers – text embeddings
  • 🤖 Mixtral-8x7B-Instruct – HuggingFace-hosted LLM (free tier)
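
As a rough illustration of how these pieces fit together in a RAG flow, the sketch below ranks stored snippets by cosine similarity to a query vector and assembles a prompt that carries the retrieved snippets as numbered sources. It uses hand-written toy vectors rather than Sentence-Transformers embeddings, an in-memory list rather than ChromaDB, and stops before any LLM call. The names `cosine`, `top_k`, and `build_prompt`, and the prompt wording, are assumptions for illustration only.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query_vec: list[float], store: list[tuple[str, list[float]]], k: int = 2) -> list[str]:
    """Return the k snippets whose vectors are most similar to the query."""
    ranked = sorted(store, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [snippet for snippet, _ in ranked[:k]]


def build_prompt(question: str, snippets: list[str]) -> str:
    """Number each snippet so the answer can point back to its source."""
    context = "\n".join(f"[{i}] {s}" for i, s in enumerate(snippets, start=1))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Toy store: (snippet, vector) pairs standing in for embedded PDF chunks.
store = [
    ("Worked at Acme as a data engineer.", [1.0, 0.0]),
    ("Enjoys hiking and photography.", [0.0, 1.0]),
    ("Built data pipelines in Python.", [1.0, 1.0]),
]
snippets = top_k([1.0, 0.0], store, k=2)
prompt = build_prompt("What was the candidate's job?", snippets)
```

Keeping the retrieved snippets alongside the prompt is what makes it possible to show them next to the answer.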

🛠 Setup

# 1. Clone the repo
git clone https://github.com/<your-username>/pdf-rag-chatbot.git
cd pdf-rag-chatbot

# 2. Set up a virtual environment
python3 -m venv venv
source venv/bin/activate  # or venv\Scripts\activate on Windows

# 3. Install dependencies
pip install -r requirements.txt

# 4. Add your HuggingFace token to a `.env` file in the project root
echo "HUGGINGFACEHUB_API_TOKEN=your_token_here" >> .env

# 5. Run the app
streamlit run app.py
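
The README does not say how `app.py` reads the token from `.env`; projects like this often rely on a package such as python-dotenv. As a hedged, standard-library-only sketch of what that step amounts to, the hypothetical helper `load_env_file` below parses simple `KEY=value` lines and exports them without overwriting variables that are already set.

```python
import os
from pathlib import Path


def load_env_file(path: str = ".env") -> dict[str, str]:
    """Parse KEY=value lines from `path` and export them to os.environ.

    Blank lines, comments, and lines without "=" are skipped. Variables
    already present in the environment are left untouched.
    """
    loaded: dict[str, str] = {}
    env_path = Path(path)
    if not env_path.exists():
        return loaded
    for raw in env_path.read_text().splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        value = value.strip().strip("'\"")  # drop optional surrounding quotes
        loaded[key] = value
        os.environ.setdefault(key, value)
    return loaded
```

After calling `load_env_file()`, the token would be available as `os.environ["HUGGINGFACEHUB_API_TOKEN"]`, provided the `.env` file from step 4 exists in the working directory.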
