📄 PDF Reader Chatbot

An AI-powered PDF chatbot that lets users upload a PDF document and ask natural language questions about its content. The system extracts the text from the PDF, retrieves the passages most relevant to each question, and passes them to a Large Language Model (LLM) to generate a context-aware answer.

📸 Demo

User uploads: "project_report.pdf"

User: "What is the main objective of this project?"
Bot:  "The main objective is to build a real-time data pipeline that..."

User: "Summarise the conclusion section."
Bot:  "The conclusion highlights three key findings: ..."

⚙️ How It Works

1. PDF Upload: The user uploads a PDF file through the Streamlit interface.
2. Text Extraction: The system extracts the readable text from each page of the PDF.
3. Text Chunking: Long documents are split into overlapping chunks so that context spanning a chunk boundary is not lost.
4. Embedding & Retrieval: Each chunk is embedded as a vector, and the chunks most similar to the question are retrieved.
5. LLM Answer Generation: The retrieved chunks are passed to the LLM as context, and it generates an answer grounded in them.
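
The chunking, retrieval, and prompt-building steps above can be sketched in plain Python. This is a minimal illustration, not the code in app.py: the function names (`chunk_text`, `embed`, `retrieve`, `build_prompt`) and the default sizes are hypothetical, and a toy bag-of-words vector stands in for a real embedding model and the FAISS index.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_size=500, overlap=100):
    """Split text into chunk_size-character chunks that share `overlap` characters."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
    return chunks


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, chunks, k=3):
    """Return the k chunks most similar to the question."""
    query = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(query, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(question, context_chunks):
    """Assemble the text that would be sent to the LLM."""
    context = "\n---\n".join(context_chunks)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )
```

In a real pipeline, `embed` and `retrieve` would presumably be replaced by an embedding model and a FAISS similarity search, and the string returned by `build_prompt` would be sent to the OpenAI or Gemini API.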

🛠️ Tech Stack

| Tool | Purpose |
| --- | --- |
| Python 3.x | Core programming language |
| PyPDF2 / pdfplumber | PDF text extraction |
| LangChain | LLM chaining and retrieval pipeline |
| OpenAI / Gemini API | Language model for answer generation |
| Streamlit | Web-based user interface |
| FAISS | Vector store for document retrieval |
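
As an illustration of the text extraction role in the stack above, the sketch below joins the text of each page into one string. The helper names `join_page_text` and `extract_pdf_text` are hypothetical and not taken from app.py. The sketch assumes PyPDF2's `PdfReader`, whose page objects expose `extract_text()`. pdfplumber pages offer a method of the same name, so `join_page_text` should work with either library.

```python
def join_page_text(pages):
    """Concatenate the text of each page, skipping pages with no extractable text."""
    parts = []
    for number, page in enumerate(pages, start=1):
        # extract_text() can return None or an empty string, e.g. for scanned pages.
        text = (page.extract_text() or "").strip()
        if text:
            parts.append(f"[page {number}]\n{text}")
    return "\n\n".join(parts)


def extract_pdf_text(path):
    """Read a PDF from disk and return its text, labelled by page number."""
    # Imported lazily so join_page_text can be used without PyPDF2 installed.
    from PyPDF2 import PdfReader

    return join_page_text(PdfReader(path).pages)
```

Keeping the page labels in the extracted text is a design choice that makes it possible to trace an answer back to its source page. Scanned PDFs without a text layer would yield an empty string here and would need OCR, which this sketch does not cover.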

📦 Installation

# Clone the repository
git clone https://github.com/arunkumararavindhakshan05-sudo/Pdf-Reader-Chatbot.git
cd Pdf-Reader-Chatbot

# Install dependencies
pip install -r requirements.txt

▶️ Usage

streamlit run app.py

Then open your browser at http://localhost:8501, upload a PDF, and start asking questions!

📁 Project Structure

Pdf-Reader-Chatbot/
│
├── app.py              # Main Streamlit application
├── requirements.txt    # Python dependencies
└── README.md

🔮 Future Improvements

Support multiple PDF uploads at once
Add chat history / memory
Support Word (.docx) and Excel (.xlsx) files
Deploy on Hugging Face Spaces or Streamlit Cloud

👤 Author

Arunkumar Aravindhakshan 🔗 LinkedIn | GitHub
