Fire in da houseTop Tip:Paying $100+ per month for Perplexity, MidJourney, Runway, ChatGPT and other tools is crazy - get all your AI tools in one site starting at $15 per month with Galaxy AI Fire in da houseCheck it out free

mcp-pdf-tools

MCP.Pizza Chef: hanweg

The mcp-pdf-tools server is an MCP server designed to empower language models with PDF file manipulation capabilities. It supports merging multiple PDFs into one, extracting specific pages, and searching within PDFs using text extraction and regex pattern matching. This server facilitates complex PDF workflows by allowing LLMs to interact with PDF documents programmatically through the Model Context Protocol. It is especially useful for automating document processing tasks such as combining reports, extracting relevant sections, and finding related PDFs based on content. Installation involves cloning the repository, setting up a Python virtual environment, and installing the package. Note that it is a work-in-progress and currently optimized for Windows environments.

Use This MCP server To

Merge multiple PDF files into one document Extract specific pages from a PDF file Search text within PDF documents Find related PDFs using regex and text extraction Automate PDF document processing workflows

README

WORK IN PROGRESS - USE WITH CAUTION - Windows:

MCP PDF Tools Server

An MCP (Model Context Protocol) server that provides PDF manipulation tools. This server allows LLMs to perform operations like merging PDFs and extracting pages through the Model Context Protocol.

mcp-pdf-tools MCP server

Features

  • Merge multiple PDF files into a single PDF
  • Merge multiple PDF files into a single PDF in user specified order
  • Extract specific pages from a PDF file
  • Search PDFs filesystem search or Everything search works better than this
  • Find (and merge) related PDFs based on text extraction and regex pattern matching from a target input PDF

Installation

  1. Clone this repository
cd mcp-pdf-tools

# Create and activate virtual environment
uv venv
.venv\Scripts\activate

# Install the package
uv pip install -e .

Usage with Claude Desktop

Add this to your Claude Desktop configuration file (claude_desktop_config.json):

{
    "mcpServers": {
        "pdf-tools": {
            "command": "uv",
            "args": [
                "--directory",
                "PATH_TO\\mcp-pdf-tools",
                "run",
                "pdf-tools"
            ]
        }
    }
}

mcp-pdf-tools FAQ

How do I install the mcp-pdf-tools server?
Clone the repository, create and activate a Python virtual environment, then install the package using pip with 'pip install -e .'.
Can mcp-pdf-tools merge PDFs in a user-specified order?
Yes, it supports merging multiple PDF files in the order specified by the user.
Is mcp-pdf-tools compatible with all operating systems?
It is primarily designed for Windows, and some features may not work optimally on other OSes.
How does mcp-pdf-tools perform PDF searches?
It uses text extraction and regex pattern matching to find content within PDFs, though external tools like Everything search may be more effective.
Can I extract specific pages from a PDF using this server?
Yes, the server allows extraction of specific pages from PDF files.
Is mcp-pdf-tools safe to use in production environments?
It is currently a work-in-progress and should be used with caution in production until fully tested.
What programming language is required to run mcp-pdf-tools?
It requires Python and uses a virtual environment for installation and execution.
Does mcp-pdf-tools support automated workflows with LLMs?
Yes, it enables LLMs to manipulate PDFs programmatically via the Model Context Protocol.