Skip to Content
MCP ServersCommunityMCP Selenium Server

MCP Selenium Server

View original on GitHub 

A Model Context Protocol (MCP) server implementation for Selenium WebDriver, allowing AI agents to control web browsers.

Installation

npm install -g @angiejones/mcp-selenium

Usage

Start the server by running:

mcp-selenium

Or use with NPX in your MCP configuration:

{ "mcpServers": { "selenium": { "command": "npx", "args": [ "-y", "@angiejones/mcp-selenium" ] } } }

Tools

start_browser

Launches a browser session.

Parameters:

  • browser (required): Browser to launch
    • Type: string
    • Enum: [β€œchrome”, β€œfirefox”]
  • options: Browser configuration options
    • Type: object
    • Properties:
      • headless: Run browser in headless mode
        • Type: boolean
      • arguments: Additional browser arguments
        • Type: array of strings

Example:

{ "tool": "start_browser", "parameters": { "browser": "chrome", "options": { "headless": true, "arguments": ["--no-sandbox"] } } }

Navigates to a URL.

Parameters:

  • url (required): URL to navigate to
    • Type: string

Example:

{ "tool": "navigate", "parameters": { "url": "https://www.example.com" } }

find_element

Finds an element on the page.

Parameters:

  • by (required): Locator strategy
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • value (required): Value for the locator strategy
    • Type: string
  • timeout: Maximum time to wait for element in milliseconds
    • Type: number
    • Default: 10000

Example:

{ "tool": "find_element", "parameters": { "by": "id", "value": "search-input", "timeout": 5000 } }

click_element

Clicks an element.

Parameters:

  • by (required): Locator strategy
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • value (required): Value for the locator strategy
    • Type: string
  • timeout: Maximum time to wait for element in milliseconds
    • Type: number
    • Default: 10000

Example:

{ "tool": "click_element", "parameters": { "by": "css", "value": ".submit-button" } }

send_keys

Sends keys to an element (typing).

Parameters:

  • by (required): Locator strategy
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • value (required): Value for the locator strategy
    • Type: string
  • text (required): Text to enter into the element
    • Type: string
  • timeout: Maximum time to wait for element in milliseconds
    • Type: number
    • Default: 10000

Example:

{ "tool": "send_keys", "parameters": { "by": "name", "value": "username", "text": "testuser" } }

get_element_text

Gets the text() of an element.

Parameters:

  • by (required): Locator strategy
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • value (required): Value for the locator strategy
    • Type: string
  • timeout: Maximum time to wait for element in milliseconds
    • Type: number
    • Default: 10000

Example:

{ "tool": "get_element_text", "parameters": { "by": "css", "value": ".message" } }

hover

Moves the mouse to hover over an element.

Parameters:

  • by (required): Locator strategy
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • value (required): Value for the locator strategy
    • Type: string
  • timeout: Maximum time to wait for element in milliseconds
    • Type: number
    • Default: 10000

Example:

{ "tool": "hover", "parameters": { "by": "css", "value": ".dropdown-menu" } }

drag_and_drop

Drags an element and drops it onto another element.

Parameters:

  • by (required): Locator strategy for source element
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • value (required): Value for the source locator strategy
    • Type: string
  • targetBy (required): Locator strategy for target element
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • targetValue (required): Value for the target locator strategy
    • Type: string
  • timeout: Maximum time to wait for elements in milliseconds
    • Type: number
    • Default: 10000

Example:

{ "tool": "drag_and_drop", "parameters": { "by": "id", "value": "draggable", "targetBy": "id", "targetValue": "droppable" } }

double_click

Performs a double click on an element.

Parameters:

  • by (required): Locator strategy
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • value (required): Value for the locator strategy
    • Type: string
  • timeout: Maximum time to wait for element in milliseconds
    • Type: number
    • Default: 10000

Example:

{ "tool": "double_click", "parameters": { "by": "css", "value": ".editable-text" } }

right_click

Performs a right click (context click) on an element.

Parameters:

  • by (required): Locator strategy
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • value (required): Value for the locator strategy
    • Type: string
  • timeout: Maximum time to wait for element in milliseconds
    • Type: number
    • Default: 10000

Example:

{ "tool": "right_click", "parameters": { "by": "css", "value": ".context-menu-trigger" } }

press_key

Simulates pressing a keyboard key.

Parameters:

  • key (required): Key to press (e.g., β€˜Enter’, β€˜Tab’, β€˜a’, etc.)
    • Type: string

Example:

{ "tool": "press_key", "parameters": { "key": "Enter" } }

upload_file

Uploads a file using a file input element.

Parameters:

  • by (required): Locator strategy
    • Type: string
    • Enum: [β€œid”, β€œcss”, β€œxpath”, β€œname”, β€œtag”, β€œclass”]
  • value (required): Value for the locator strategy
    • Type: string
  • filePath (required): Absolute path to the file to upload
    • Type: string
  • timeout: Maximum time to wait for element in milliseconds
    • Type: number
    • Default: 10000

Example:

{ "tool": "upload_file", "parameters": { "by": "id", "value": "file-input", "filePath": "/path/to/file.pdf" } }

take_screenshot

Captures a screenshot of the current page.

Parameters:

  • outputPath (optional): Path where to save the screenshot. If not provided, returns base64 data.
    • Type: string

Example:

{ "tool": "take_screenshot", "parameters": { "outputPath": "/path/to/screenshot.png" } }

Supported Browsers

  • Chrome
  • Firefox

Features

  • Start browser sessions with customizable options
  • Navigate to URLs
  • Find elements using various locator strategies
  • Click, type, and interact with elements
  • Perform mouse actions (hover, drag and drop)
  • Handle keyboard input
  • Take screenshots
  • Upload files
  • Support for headless mode

Development

To work on this project:

  1. Clone the repository
  2. Install dependencies: npm install
  3. Run the server: npm start

License

MIT

Last updated on