Telegram AI Assistant Bot
by shettysaish20
The Telegram AI Assistant Bot, named Cortex-R, is a reasoning-driven AI agent designed to interact with users via Telegram. It utilizes multiple Model Context Protocol (MCP) servers to perform tasks like web searches, document processing, and Google services integration.
Last updated: N/A
Telegram AI Assistant Bot
Overview
The Telegram AI Assistant Bot is a reasoning-driven AI agent, named Cortex-R, designed to interact with users via Telegram. It uses multiple Model Context Protocol (MCP) servers to perform various tasks such as web searches, document processing, mathematical calculations, and Google services integration (e.g., Gmail and Google Sheets). The bot employs both stdio
and sse
transport protocols to communicate with these MCP servers.
Features
- Telegram Integration: The bot interacts with users through Telegram, responding to their queries and tasks.
- Multi-MCP Server Support: Utilizes multiple MCP servers for diverse functionalities.
- Gmail integration for email management.
- Google Sheets integration for spreadsheet operations.
- Web search capabilities using DuckDuckGo.
- Document processing and semantic search.
- Mathematical calculations.
- Dynamic Tool Discovery: Automatically discovers tools from MCP servers during initialization.
- Customizable Agent Strategy: Supports different strategies like conservative, retry-once, and explore-all.
- Memory Management: Retrieves and stores memory items to enhance task-solving capabilities.
- Configurable Persona: Allows customization of tone, verbosity, and behavior.
Architecture
The bot is built using the following components:
1. Telegram Bot Integration
The bot is implemented in telegram_agent.py
using the python-telegram-bot
library. It handles user messages and commands such as /start
and /help
. User queries are processed by the Cortex-R agent.
2. Core Components
- Agent Loop (
core/loop.py
): Manages the main execution loop of the agent, including perception, planning, and tool execution. - Session Management (
core/session.py
): Handles communication with multiple MCP servers usingstdio
andsse
transport protocols. - Strategy (
core/strategy.py
): Implements planning strategies for decision-making.
3. MCP Servers
The bot interacts with the following MCP servers:
- Gmail Server (
gmail_server.py
): Manages email operations such as sending, reading, and trashing emails. - Google Sheets Server (
mcp_gdrive_server.py
): Handles spreadsheet operations like creating, updating, and sharing Google Sheets. - Web Search Server (
mcp_server_3.py
): Performs web searches using DuckDuckGo and fetches webpage content. - Document Processing Server (
mcp_server_2.py
): Processes documents for semantic search and indexing. - Math Server (
mcp_server_1.py
): Performs mathematical calculations.
4. Configuration
- Profiles (
config/profiles.yaml
): Defines agent settings, including strategy, memory configuration, and MCP server details. - Models (
config/models.json
): Specifies the models used for text generation and embeddings. - Environment Variables (
.env
): Stores sensitive information like API keys and file paths.
Workflow
-
Initialization:
- The bot loads MCP server configurations from
profiles.yaml
. - Initializes the
MultiMCP
dispatcher to discover tools from all configured MCP servers.
- The bot loads MCP server configurations from
-
User Interaction:
- Users interact with the bot via Telegram.
- The bot processes user messages and determines the required tools to fulfill the task.
-
Task Execution:
- The agent uses perception to understand the user query.
- Plans the next action using the defined strategy.
- Executes the required tool from the appropriate MCP server.
- Retrieves results and responds to the user.
-
Memory Management:
- Stores tool outputs and user queries as memory items.
- Retrieves relevant memories to assist in task-solving.
Configuration
Environment Variables
Set the following variables in the .env
file:
GEMINI_API_KEY
: API key for the Gemini model.SERVICE_ACCOUNT_PATH
: Path to the Google service account JSON file.DRIVE_FOLDER_ID
: Google Drive folder ID for storing files.CREDENTIALS_PATH
: Path to the OAuth client credentials file.TOKEN_PATH
: Path to the token file for Google APIs.
MCP Server Configuration
Define MCP servers in config/profiles.yaml
:
mcp_servers:
- id: gmail
type: stdio
script: gmail_server.py
cwd: <path-to-server>
- id: gdrive
type: stdio
script: mcp_gdrive_server.py
cwd: <path-to-server>
- id: math
type: sse
script: mcp_server_1.py
url: http://localhost:8000/sse
cwd: <path-to-server>
- id: documents
type: stdio
script: mcp_server_2.py
cwd: <path-to-server>
- id: websearch
type: stdio
script: mcp_server_3.py
cwd: <path-to-server>
Installation
- Clone the repository:
git clone <repository-url> cd <repository-folder>
- Sync dependencies using uv:
uv sync
- Set up environment variables in
.env
. - Start the Telegram bot:
uv run telegram_agent.py
Usage
- Start the bot on Telegram using the
/start
command. - Send queries or tasks to the bot, such as:
- "Find the current F1 standings and share them in a Google Sheet."
- "Send an email to [email protected] with the subject 'Meeting Update'."
Contributing
Contributions are welcome! Please follow these steps:
- Fork the repository.
- Create a new branch for your feature or bug fix.
- Submit a pull request with a detailed description of your changes.
License
This project is licensed under the MIT License. See the LICENSE
file for details.