WindTools MCP Server logo

WindTools MCP Server

by ZahidGalea

The WindTools MCP Server provides document embedding and retrieval capabilities for the WindTools code assistant. It uses ChromaDB and sentence transformers for semantic code search and repository indexing.

View on GitHub

Last updated: N/A

WindTools MCP Server

MCP Server for the WindTools code assistant, providing document embedding and retrieval capabilities using ChromaDB and sentence transformers.

Features

  • Semantic Code Search: Uses sentence transformers for embedding code snippets and retrieval
  • Code Repository Indexing: Automatically indexes code files from specified directories
  • Persistent Storage: Saves code embeddings in ChromaDB for persistent retrieval
  • Directory Exploration: Built-in tools for navigating and exploring codebases
  • Background Initialization: Loads resources asynchronously to minimize startup time
  • Environment Configuration: Configurable through environment variables

Tools

  1. list_dir

    • List the contents of a directory
    • Inputs:
      • directory_path (string): Path to list contents of, should be absolute path to a directory
    • Returns: JSON string containing directory information including file types and sizes
  2. get_initialization_status

    • Check the status of the background initialization process
    • Returns: JSON string with initialization status of ChromaDB and embedding model
  3. index_repository

    • Index code files from specified directories into ChromaDB
    • Inputs:
      • target_directories (array of strings): List of absolute paths to directories to index
      • force_reindex (boolean, optional): If true, reindex all files even if they already exist in the index
    • Returns: JSON string containing indexing statistics and results
  4. codebase_search

    • Find code snippets relevant to a search query
    • Inputs:
      • query (string): Search query describing what you're looking for
      • limit (integer, optional): Maximum number of results to return (default: 10)
      • min_relevance (float, optional): Minimum relevance score threshold (0.0 to 1.0)
    • Returns: JSON string containing search results with relevant code snippets

Technical Architecture

The WindTools MCP Server is built on these key components:

  • ChromaDB: Vector database for storing and retrieving code embeddings
  • Sentence Transformers: Deep learning models for creating embeddings from code
  • FastMCP: Framework for building MCP-compliant servers
  • Async Lifespan Management: Efficient resource initialization and cleanup

Initialization Process

The server initializes ChromaDB and the embedding model in the background, allowing it to start accepting requests immediately while resource loading continues in the background. The get_initialization_status tool can be used to check if the initialization is complete.

Setup

Environment Variables

The server can be configured with the following environment variables:

  • DATA_ROOT: Absolute directory where ChromaDB database and model cache will be stored (default: a 'data' directory inside the package)
  • CHROMA_DB_FOLDER_NAME: Name of the folder where ChromaDB stores data (default: "default")
  • SENTENCE_TRANSFORMER_PATH: Path to the sentence transformer model (default: "jinaai/jina-embeddings-v2-base-code")

Installation

Using pip
pip install windtools-mcp
From source
git clone https://github.com/ZahidGalea/windtools-mcp
cd windtools-mcp
pip install -e .

Usage with Claude Desktop

Add the following to your claude_desktop_config.json:

Direct Execution

Using Python 3.11 as ChromaDB has issues with newer Python versions.

{
  "mcpServers": {
    "windtools": {
      "command": "uvx",
      "args": [
        "-p",
        "3.11",
        "-U",
        "windtools-mcp"
      ],
      "env": {
        "DATA_ROOT": "/Users/<user>/windtools_data",
        "CHROMA_DB_FOLDER_NAME": "chromadb",
        "SENTENCE_TRANSFORMER_PATH": "jinaai/jina-embeddings-v2-base-code"
      }
    }
  }
}

Data (including ChromaDB database and model cache) will be saved in the /Users/<user>/windtools_data directory and persist between container executions.

Development

Requirements

  • Python 3.11
  • Dependencies listed in pyproject.toml

Development Setup

For developing:

# Install development dependencies
uv sync --dev

If you want to use locally:

pip install -e .

Configuration for local development:

{
  "mcpServers": {
    "windtools": {
      "command": "uv",
      "args": [
        "run",
        "windtools-mcp"
      ],
      "env": {
        "DATA_ROOT": "/Users/<user>/windtools_data",
        "CHROMA_DB_FOLDER_NAME": "chromadb",
        "SENTENCE_TRANSFORMER_PATH": "jinaai/jina-embeddings-v2-base-code"
      }
    }
  }
}

Inspector

npx @modelcontextprotocol/inspector uvx -p 3.11 windtools-mcp
npx @modelcontextprotocol/inspector uv run windtools-mcp

Running Tests

pytest tests/

The project includes both unit tests and integration tests using pytest and pytest-asyncio for testing asynchronous functionality.

Project Structure

src/
  windtools_mcp/
    __init__.py
    __main__.py
    server.py
tests/
  test_client.py
  test_unit.py
.github/
  workflows/
    publish.yml
    test.yml
.gitignore
.python-version
pyproject.toml
README.md
VERSION

Release Process

The project version is managed centrally in the VERSION file. The release process is automatic:

  1. Update the version number in the VERSION file
  2. Commit and push to the main branch
  3. The GitHub Actions workflow will automatically:
    • Detect the change in the VERSION file
    • Create a git tag with the format v{VERSION}
    • Generate a release on GitHub
    • Publish the package to PyPI

It is not necessary to manually create tags or publish to PyPI, everything is managed automatically when the VERSION file is updated.

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License.