Dataset Viewer MCP Server logo

Dataset Viewer MCP Server

by privetin

An MCP server for interacting with the Hugging Face Dataset Viewer API. It provides capabilities to browse and analyze datasets hosted on the Hugging Face Hub.

View on GitHub

Last updated: N/A

What is Dataset Viewer MCP Server?

This is an MCP server designed to interact with the Hugging Face Dataset Viewer API, enabling users to browse, analyze, and manipulate datasets hosted on the Hugging Face Hub.

How to use Dataset Viewer MCP Server?

First, install the server following the instructions provided in the README, including setting up the virtual environment and installing dependencies. Configure the server with the necessary environment variables, such as the Hugging Face API token. Then, use the available tools through command-line interface or integrate it with tools like Claude Desktop by adding server configuration.

Key features of Dataset Viewer MCP Server

  • Accesses Hugging Face datasets using the dataset:// URI scheme

  • Supports dataset configurations and splits

  • Provides paginated access to dataset contents

  • Handles authentication for private datasets

  • Supports searching and filtering dataset contents

  • Provides dataset statistics and analysis

Use cases of Dataset Viewer MCP Server

  • Validating the existence and accessibility of a dataset.

  • Retrieving detailed information about a specific dataset.

  • Fetching paginated contents of datasets for exploration.

  • Searching for specific text within a dataset to analyze data.

  • Filtering and sorting dataset rows based on SQL-like conditions.

  • Downloading entire datasets in Parquet format for offline processing

FAQ from Dataset Viewer MCP Server

What is the Dataset Viewer MCP Server used for?

It allows you to interact with datasets on the Hugging Face Hub, enabling you to browse, analyze, and manipulate them.

How do I install the server?

Clone the repository, create a virtual environment, activate it, and install the server in development mode using uv.

What environment variables do I need to set?

You need to set the HUGGINGFACE_TOKEN environment variable for accessing private datasets.

How can I use the tools provided by the server?

The server provides tools like validate, get_info, get_rows, search_dataset, and filter, each with specific parameters such as dataset identifier, configuration name, split name, and optional authentication token.

Can I access private datasets using this server?

Yes, the server handles authentication for private datasets. You'll need to provide your Hugging Face API token via the auth_token parameter.