gradio-transcript-mcp logo

gradio-transcript-mcp

by bsmnyk

gradio-transcript-mcp is a Gradio application configured as an MCP server. It transcribes audio and video from URLs into text using OpenAI's Whisper and ffmpeg.

View on GitHub

Last updated: N/A

What is gradio-transcript-mcp?

gradio-transcript-mcp is a Gradio application designed to function as a Model Control Protocol (MCP) server. It transcribes audio and video from URLs into text, leveraging OpenAI's Whisper and ffmpeg for efficient multimedia processing.

How to use gradio-transcript-mcp?

To use, clone the repository, install the dependencies using pip install -r requirements.txt, and run the application with python app.py. This starts the Gradio interface and the MCP server, exposing the transcribe_url function. Configure your MCP client (like Cline) with the server's URL, typically http://127.0.0.1:7860/gradio_api/mcp/sse or the hosted Hugging Face Spaces URL.

Key features of gradio-transcript-mcp

  • Transcribes audio and video from URLs

  • Uses OpenAI's Whisper for transcription

  • Leverages ffmpeg (via yt-dlp) for media conversion

  • Functions as an MCP server

  • Supports dynamic device selection (CPU or GPU)

Use cases of gradio-transcript-mcp

  • Automated transcription of online lectures

  • Converting video interviews into text

  • Generating transcripts for podcasts

  • Processing audio/video content from various online sources

  • Integration with MCP clients for remote transcription tasks

FAQ from gradio-transcript-mcp

What is an MCP server?

An MCP (Model Control Protocol) server allows clients to remotely access and control models or tools, in this case, a transcription service.

What formats does it support?

It supports any audio or video format that yt-dlp and ffmpeg can handle, converting them to WAV before transcription.

How do I connect my MCP client?

Configure your client with the server's URL, typically http://127.0.0.1:7860/gradio_api/mcp/sse when running locally, or the Hugging Face Spaces URL if using the hosted version.

What if my client doesn't support SSE?

Use a tool like mcp-remote as an intermediary to bridge the connection.

Can I use this server for commercial purposes?

Yes, as it is licensed under the Apache-2.0 License, which allows for commercial use, modification, and distribution, subject to the terms of the license.