Use This MCP server To

Automate web page navigation and interaction Capture full-page or element screenshots Execute JavaScript in real browser contexts Fill and submit web forms programmatically Monitor browser console logs for debugging Integrate browser automation into AI workflows Test web applications with real browser sessions

README

Puppeteer MCP Server (Python Implementation)

A Model Context Protocol server that provides browser automation capabilities using Playwright (Python's equivalent to Puppeteer). This server enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment.

Overview

This Python implementation provides a stable alternative to the TypeScript version, offering the same capabilities with improved error handling and logging. It uses Playwright, which is the Python equivalent to Puppeteer, providing robust browser automation capabilities.

Key Features

Full browser automation
Page navigation
Screenshot capture (full page or elements)
Form interaction (clicking and filling)
JavaScript execution
Console log monitoring
Configurable timeouts
Detailed error handling
Comprehensive logging

Prerequisites

Python 3.8+
pip (Python package installer)

Installation

Install the required packages:

pip install -r requirements.txt

Install Playwright browsers:

playwright install

Usage

Starting the Server

Run the server directly:

python puppeteer_server.py

Claude Desktop Configuration

Add this to your Claude configuration file:

{
  "mcpServers": {
    "puppeteer": {
      "command": "python",
      "args": ["path/to/puppeteer.py"]
    }
  }
}

Available Tools

puppeteer_navigate

Navigate to any URL in the browser.

{
  "name": "puppeteer_navigate",
  "arguments": {
    "url": "https://example.com",
    "timeout": 60000  // optional, defaults to 60000ms
  }
}

puppeteer_screenshot

Capture screenshots of the entire page or specific elements.

{
  "name": "puppeteer_screenshot",
  "arguments": {
    "name": "my_screenshot",
    "selector": "#specific-element",  // optional
    "width": 1280,  // optional, default: 1280
    "height": 720,  // optional, default: 720
    "timeout": 30000  // optional, defaults to 30000ms
  }
}

puppeteer_click

Click elements on the page.

{
  "name": "puppeteer_click",
  "arguments": {
    "selector": ".button-class",
    "timeout": 30000  // optional, defaults to 30000ms
  }
}

puppeteer_fill

Fill out input fields.

{
  "name": "puppeteer_fill",
  "arguments": {
    "selector": "#input-id",
    "value": "text to fill",
    "timeout": 30000  // optional, defaults to 30000ms
  }
}

puppeteer_evaluate

Execute JavaScript in the browser console.

{
  "name": "puppeteer_evaluate",
  "arguments": {
    "script": "document.title",
    "timeout": 30000  // optional, defaults to 30000ms
  }
}

Error Handling

The server provides detailed error messages for common scenarios:

Navigation failures
Element not found
Timeout errors
JavaScript execution errors
Screenshot failures

Logging

Comprehensive logging is implemented with different levels:

INFO: Standard operations
ERROR: Operation failures
DEBUG: Detailed execution information

Notes

Browser launches in non-headless mode for better debugging
Default viewport size is 1280x720
All timeouts are configurable
Console logs are captured and stored
Screenshots are stored in memory with base64 encoding

Contributing

Contributions are welcome! Please read the repository's contributing guidelines before submitting pull requests.

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

mcp-server-puppeteer-py FAQ

How do I install the mcp-server-puppeteer-py?

Install Python 3.8+, then run 'pip install -r requirements.txt' to install dependencies.

What browser automation library does this server use?

It uses Playwright, the Python equivalent of Puppeteer, for robust browser automation.

Can this server capture screenshots of web pages?

Yes, it supports capturing full-page or specific element screenshots.

Does it support executing JavaScript on web pages?

Yes, you can run arbitrary JavaScript in the browser context.

How does the server handle errors and logging?

It provides detailed error handling and comprehensive logging for easier debugging.

Is this server compatible with multiple LLM providers?

Yes, it works with OpenAI, Anthropic Claude, and Google Gemini models via MCP.

Can it interact with web forms?

Yes, it supports clicking, filling, and submitting forms on web pages.

What Python version is required?

Python 3.8 or higher is required to run this server.