Hrishikesh Telang

Research Assistant, St. Francis Institute of Technology

telanghrishi [AT] gmail.com

Chat with Multiple PDFs Using Google Gemini and LangChain

An App to ask questions related to PDF content

Problem Statement

With the increasing demand for analyzing and interacting with information in multiple PDF documents, there is a need for an intelligent system to provide instant answers to user queries about the content within these documents. This project demonstrates how to create an end-to-end application that allows users to upload multiple PDFs, process them into vector embeddings, and interact with the content using Google Gemini Pro and LangChain.


Features

  1. Upload and process multiple PDFs into vector embeddings.
  2. Use vector search to retrieve context-relevant answers to user queries.
  3. Handle PDF documents up to 200 MB each.
  4. Enable modular development with a clear separation of backend processing and frontend interaction using Streamlit.

Tools and Technologies


Workflow

1. PDF Processing

2. Vector Embedding

3. Conversational Chain

4. Frontend Interaction


Steps to Build

Setup

  1. Environment Setup:
    • Create a virtual environment:
      conda create -n pdf-chat python=3.10
      conda activate pdf-chat
      
    • Install dependencies:
      pip install -r requirements.txt
      
  2. API Configuration:
    • Obtain a Google Gemini Pro API Key from Google MakerSuite.
    • Store the API key in an .env file:
      GOOGLE_API_KEY=your_google_api_key
      

Development

  1. Backend Functionality:
    • Extract text from PDFs using PyPDF2.
    • Split text into chunks for embedding.
    • Convert text chunks into vector embeddings using Google Gemini Pro.
    • Save embeddings locally using FAISS.
  2. Query Handling:
    • Load the vector index for similarity search.
    • Process user questions and retrieve the most relevant chunks.
    • Generate detailed responses using Google Gemini Pro’s conversational API.
  3. Frontend Integration:
    • Build a Streamlit interface for PDF uploads and text input.
    • Display real-time responses in a user-friendly format.

Usage Instructions

  1. Run the application: ```bash streamlit run app.py