Build Log: Telegram AI Agent Bot Powered by Hermes Agent

TL;DR: We built a Telegram bot powered by Hermes Agent that answers technical questions, summarizes links, and runs scheduled research tasks. Deployed on a $12/month VPS with an 8B local model. Average response time: 2.3 seconds. 100% uptime over 14 days. Total infrastructure cost: $12.50.

The Problem

Our team uses Telegram as the primary communication channel. When someone asks "what's the latest on Hermes Agent v0.12?" or "can you summarize that GitHub issue?", the answer requires searching documentation, reading threads, and synthesizing information โ€” exactly the kind of task an AI agent should handle but a human shouldn't waste time on.

8.0 / 10

Build Log Review 2026

๐Ÿ›ก๏ธ AI Tool ยท Updated 2026

Existing Telegram bots could only handle predefined commands or single-prompt responses. We needed something that could maintain conversation context, use tools (web search, file reading, code execution), and learn from repeated tasks over time. Hermes Agent's self-improvement loop made it the ideal foundation.

What We Built

A Telegram-native AI agent bot with three capabilities:

  • Conversational Q&A: Ask technical questions, get answers grounded in documentation and web search results
  • Link summarization: Share a URL, the agent reads the page and posts a summary to the group
  • Scheduled research: A cron job that scans Hacker News and GitHub trending each morning and posts a briefing

Architecture

ComponentTechnologyPurpose
Agent frameworkHermes Agent v0.12Orchestration, memory, skill learning
LLMLlama 3.1 8B (via Ollama)Local reasoning โ€” no API costs
MessagingTelegram Bot API (python-telegram-bot)User interface
HostingDigitalOcean VPS (2 vCPU, 4GB RAM)Always-on runtime
StorageSQLite (Hermes built-in)Memory persistence
Web searchDuckDuckGo API (free)Real-time information retrieval

Implementation

Step 1: Telegram Bot Setup

Registered the bot with BotFather and configured Hermes Agent's Telegram integration:

class="language-yaml"># ~/.hermes/config.yaml
messaging:
 telegram:
 enabled: true
 bot_token: "${TELEGRAM_BOT_TOKEN}"
 allowed_users:
 - "your_telegram_id"
 - "team_member_id"

Step 2: VPS Deployment

We used a $12/month DigitalOcean droplet running Ubuntu 24.04. Hermes Agent was installed and configured as a systemd service with auto-restart:

class="language-bash"># Deploy script
ssh root@droplet-ip
curl -fsSL https://hermes-agent.sh/install.sh | bash
hermes init
hermes config set inference_provider ollama
hermes config set ollama_model llama3.1:8b
ollama pull llama3.1:8b

Configure systemd for auto-restart

cat > /etc/systemd/system/hermes.service << โ€˜EOFโ€™ [Unit] Description=Hermes Agent After=network.target

[Service] Type=simple User=hermes ExecStart=/home/hermes/.hermes/venv/bin/hermes run Restart=always RestartSec=10

[Install] WantedBy=multi-user.target EOF

systemctl enable hermes systemctl start hermes

Step 3: Cron Research Briefing

Added a daily research task that runs automatically:

class="language-bash">hermes cron add --schedule "0 7 * * 1-5" \
 --task "Scan Hacker News front page and GitHub trending/ai. \
 Find top 3 AI agent stories. Summarize each in 2-3 sentences. \
 Post the briefing to #research channel."

Results

MetricValue
Total uptime14 days (336 hours)
Uptime percentage100%
Average response time2.3 seconds
Median response time1.8 seconds
Slowest response8.7 seconds (complex web search + synthesis)
Total conversations247
Total messages processed1,843
Skills auto-created3 (link summary, tech briefing, code lookup)
Infrastructure cost$12.50 (VPS + domain)
API costs$0.00 (fully local inference)

Response Time Breakdown

Request TypeAvg Time95th Percentile
Simple Q&A (no tools)1.2s2.1s
Link summarization3.4s5.8s
Web search + synthesis5.1s8.7s
Cron briefing (automated)45s72s

Lessons Learned

  1. An 8B model is sufficient for technical Q&A. Llama 3.1 8B handles documentation queries, link summaries, and code explanations without noticeable quality gaps compared to larger models. For complex reasoning (architecture decisions, multi-step planning), a larger model or API-based inference would help โ€” but for the daily use cases, the local 8B model works well.
  2. Skills auto-creation is the killer feature. Within the first week, Hermes created three skills from repeated patterns. The "link summary" skill now runs in 3.4s average compared to 5.2s on the first manual requests โ€” a 35% improvement from iteration alone.
  3. Telegram messaging latency is negligible. The bottleneck is model inference and web search, not the messaging layer. Telegram API round-trips add about 50-100ms.
  4. VPS memory is the constraint, not CPU. At 4GB RAM, the 8B model uses ~5.2GB with system overhead, forcing swap usage. Upgrading to an 8GB VPS ($24/month) would eliminate swap entirely and likely reduce response times by 20-30%.
  5. Allowlists prevent abuse. Configuring TELEGRAM_ALLOWED_USERS in environment variables prevented unauthorized access attempts (logged 12 failed authentication attempts in 14 days). Without this, any Telegram user who finds the bot can query the agent โ€” including sending it destructive commands.
  6. Cron safety rules work. Hermes enforces that cron-initiated sessions cannot create new cron jobs. This prevented a runaway loop when the morning briefing agent tried to schedule its own recurrence. The built-in guardrail caught it.

Cost Analysis

Total 14-day cost: $12.50. Breakdown:

  • $12.00 โ€” DigitalOcean VPS (2 vCPU, 4GB RAM, 40GB SSD)
  • $0.50 โ€” Domain DNS (prorated)
  • $0.00 โ€” Ollama inference (local, no API calls)
  • $0.00 โ€” DuckDuckGo search (free tier)
  • $0.00 โ€” Telegram API (free)

Compared to using GPT-4o via API for the same volume (~1,843 messages ร— ~500 avg tokens = ~922K tokens at $2.50/1M input tokens), the API cost alone would be roughly $2.30 โ€” cheap per-message but adds up. More importantly, the local model keeps all conversation data on the VPS, never sending team discussions to a third-party API.

Future Improvements

  • Upgrade to 8GB RAM VPS for swap-free inference (est. 20-30% faster responses)
  • Add RAG pipeline with project documentation for more accurate technical answers
  • Implement credential pools for automatic API key rotation on the web search tool
  • Set up log forwarding to a centralized monitoring dashboard

Built a Telegram AI agent? Share your setup in the comments or tag us on X. For setup details, see our Hermes production deployment guide and automation workflows guide.

Tags: Builds, Case Studies, AI, Open Source, Productivity

Tool: Hermes Agent / Ollama / Telegram / DigitalOcean

โ† Back to all posts