mcp-pdf-tools MCP Server

README

WORK IN PROGRESS - USE WITH CAUTION - Windows:

MCP PDF Tools Server

An MCP (Model Context Protocol) server that provides PDF manipulation tools. This server allows LLMs to perform operations like merging PDFs and extracting pages through the Model Context Protocol.

Features

Merge multiple PDF files into a single PDF
Merge multiple PDF files into a single PDF in user specified order
Extract specific pages from a PDF file
Search PDFs filesystem search or Everything search works better than this
Find (and merge) related PDFs based on text extraction and regex pattern matching from a target input PDF

Installation

Clone this repository

cd mcp-pdf-tools

# Create and activate virtual environment
uv venv
.venv\Scripts\activate

# Install the package
uv pip install -e .

Usage with Claude Desktop

Add this to your Claude Desktop configuration file (claude_desktop_config.json):

{
    "mcpServers": {
        "pdf-tools": {
            "command": "uv",
            "args": [
                "--directory",
                "PATH_TO\\mcp-pdf-tools",
                "run",
                "pdf-tools"
            ]
        }
    }
}

mcp-pdf-tools FAQ

How do I install the mcp-pdf-tools server?

Clone the repository, create and activate a Python virtual environment, then install the package using pip with 'pip install -e .'.

Can mcp-pdf-tools merge PDFs in a user-specified order?

Yes, it supports merging multiple PDF files in the order specified by the user.

Is mcp-pdf-tools compatible with all operating systems?

It is primarily designed for Windows, and some features may not work optimally on other OSes.

How does mcp-pdf-tools perform PDF searches?

It uses text extraction and regex pattern matching to find content within PDFs, though external tools like Everything search may be more effective.

Can I extract specific pages from a PDF using this server?

Yes, the server allows extraction of specific pages from PDF files.

Is mcp-pdf-tools safe to use in production environments?

It is currently a work-in-progress and should be used with caution in production until fully tested.

What programming language is required to run mcp-pdf-tools?

It requires Python and uses a virtual environment for installation and execution.

Does mcp-pdf-tools support automated workflows with LLMs?

Yes, it enables LLMs to manipulate PDFs programmatically via the Model Context Protocol.