basic MCP Server

Use this MCP server to

Enable voice-based AI assistant interactions with MCP tool integration
Convert speech to text for natural language processing
Use GPT-4o for advanced natural language understanding
Perform text-to-speech synthesis for voice responses
Detect voice activity to optimize interaction timing
Integrate external MCP servers for extended functionality

README

LiveKit Agent with MCP Tools

A voice assistant application built with the LiveKit Agents framework that uses Model Context Protocol (MCP) tools to interact with external services.

Features

Voice-based interaction with a helpful AI assistant
Integration with MCP tools from external servers
Speech-to-text using Deepgram
Natural language processing using OpenAI's GPT-4o
Text-to-speech using OpenAI
Voice activity detection using Silero
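
The features above form a four-stage turn: voice activity detection gates the audio, speech-to-text produces a transcript, the LLM writes a reply, and text-to-speech turns it back into audio. The following is a conceptual sketch of that flow only. `VoicePipeline` and its stub stages are hypothetical stand-ins, not LiveKit, Deepgram, OpenAI, or Silero APIs, and the real framework is presumably streaming and asynchronous rather than one blocking call per turn.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class VoicePipeline:
    """Hypothetical container for the four stages; not a LiveKit class."""

    detect_speech: Callable[[bytes], bool]  # VAD stage (Silero in this project)
    transcribe: Callable[[bytes], str]      # STT stage (Deepgram in this project)
    respond: Callable[[str], str]           # LLM stage (GPT-4o in this project)
    synthesize: Callable[[str], bytes]      # TTS stage (OpenAI in this project)

    def handle_audio(self, audio: bytes) -> Optional[bytes]:
        """Run one turn; return reply audio, or None if no speech was detected."""
        if not self.detect_speech(audio):
            return None  # silence: skip the STT, LLM, and TTS stages
        text = self.transcribe(audio)
        reply = self.respond(text)
        return self.synthesize(reply)


# Stub stages so the sketch runs without API keys or audio hardware.
pipeline = VoicePipeline(
    detect_speech=lambda audio: len(audio) > 0,
    transcribe=lambda audio: audio.decode("utf-8"),
    respond=lambda text: f"You said: {text}",
    synthesize=lambda reply: reply.encode("utf-8"),
)

print(pipeline.handle_audio(b"hello"))
print(pipeline.handle_audio(b""))
```

Because each stage is just a callable, any one of them can be swapped without touching the others, which mirrors how the project mixes providers for each stage.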

Prerequisites

Python 3.9+
API keys for OpenAI and Deepgram
MCP server endpoint

Installation

Clone this repository:

```
git clone https://github.com/livekit-examples/basic-mcp.git
cd basic-mcp
```

Install the required packages:
```
pip install -r requirements.txt
```

Create a .env file with your API keys and configuration:

```
OPENAI_API_KEY=your_openai_api_key
DEEPGRAM_API_KEY=your_deepgram_api_key
ZAPIER_MCP_URL=your_mcp_server_url
```
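
A quick pre-flight check can catch a missing key before the agent starts. The sketch below uses only the standard library. The helper names `parse_env_text` and `missing_vars` are hypothetical, and how the project itself loads the .env file is not stated here, so treat this as an independent sanity check rather than the project's loader.

```python
import os
from typing import Dict, List, Mapping

# The three variables named in the .env example above.
REQUIRED_VARS = ("OPENAI_API_KEY", "DEEPGRAM_API_KEY", "ZAPIER_MCP_URL")


def parse_env_text(text: str) -> Dict[str, str]:
    """Parse simple KEY=value lines, skipping blanks and # comments."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_vars(env: Mapping[str, str]) -> List[str]:
    """Return the required variable names that are unset or empty."""
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    file_values: Dict[str, str] = {}
    if os.path.exists(".env"):
        with open(".env", encoding="utf-8") as handle:
            file_values = parse_env_text(handle.read())
    # Variables already set in the shell take precedence over the file.
    env = {**file_values, **os.environ}
    missing = missing_vars(env)
    if missing:
        print("Missing: " + ", ".join(missing))
    else:
        print("All required variables are set.")
```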

Usage

Run the agent in console mode:

```
python agent.py console
```

In console mode, the agent runs in your terminal and starts listening for voice commands through your local microphone.

Project Structure

agent.py: Main agent implementation and entrypoint
mcp_client/: Package for MCP server integration
- server.py: MCP server connection handlers
- agent_tools.py: Integration of MCP tools with LiveKit agents
- util.py: Utility functions for MCP client
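
To make the tool integration step concrete: an MCP server describes each tool with a `name`, a `description`, and an `inputSchema` (a JSON Schema object), and an LLM with function calling expects a similar structure. The sketch below maps one shape to the other using the OpenAI-style `{"type": "function", ...}` layout. It is an illustration only: the function names and the `send_email` tool are hypothetical, and agent_tools.py may well take a different route through LiveKit's own tool mechanism.

```python
import json
from typing import Any, Dict, List


def mcp_tool_to_function_schema(tool: Dict[str, Any]) -> Dict[str, Any]:
    """Map one MCP tool description to an OpenAI-style function tool entry."""
    return {
        "type": "function",
        "function": {
            "name": tool["name"],
            "description": tool.get("description", ""),
            # MCP's inputSchema is already JSON Schema, so it passes through.
            # Fall back to an empty object schema for tools with no arguments.
            "parameters": tool.get(
                "inputSchema", {"type": "object", "properties": {}}
            ),
        },
    }


def convert_tools(tools: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Convert every tool from a tools/list style response."""
    return [mcp_tool_to_function_schema(tool) for tool in tools]


# Hypothetical tool, shaped like an entry in an MCP tools/list response.
example_tools = [
    {
        "name": "send_email",
        "description": "Send an email message.",
        "inputSchema": {
            "type": "object",
            "properties": {
                "to": {"type": "string"},
                "body": {"type": "string"},
            },
            "required": ["to", "body"],
        },
    }
]

print(json.dumps(convert_tools(example_tools), indent=2))
```

When the model later asks to call `send_email`, the agent would forward the arguments to the MCP server and return the result to the model; that round trip is outside this sketch.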

Acknowledgements

LiveKit for the underlying real-time communication infrastructure
OpenAI for GPT-4o and text-to-speech
Deepgram for speech-to-text
Silero for Voice Activity Detection

basic-mcp FAQ

How do I set up the basic-mcp server?

Clone the repository, install dependencies with pip, and configure API keys in a .env file.

What APIs does basic-mcp require?

It requires API keys for OpenAI and Deepgram, plus the URL of an MCP server endpoint.

Which speech-to-text service does basic-mcp use?

It uses Deepgram for speech-to-text conversion.

How does basic-mcp handle natural language processing?

It uses OpenAI's GPT-4o model for natural language understanding and generation.

Can basic-mcp integrate with other MCP servers?

Yes, it supports integration with external MCP servers via MCP tools.

What voice activity detection technology is used?

basic-mcp uses Silero for voice activity detection to improve interaction responsiveness.

What Python version is required to run basic-mcp?

Python 3.9 or higher is required to run the server.

Is basic-mcp limited to OpenAI for NLP?

While it uses OpenAI's GPT-4o, it can be extended to support other LLM providers like Anthropic's Claude and Google's Gemini.