🔌 MCP Server Demo
Live Model Context Protocol server running on this infrastructure. Agentmemory MCP with semantic search, memory consolidation, and 8 tool endpoints.
Checking...
🧠 Memory Recall
Semantic + keyword search across all stored memories. Powers context-aware AI agents.
Results appear here...
📋 Available MCP Tools
8 tools registered through the agentmemory MCP server:
memory_recall— search past observationsmemory_save— persist insightsmemory_smart_search— hybrid semantic searchmemory_sessions— browse session historymemory_consolidate— tiered memory pipelinememory_diagnose— health checksmemory_lesson_save— save learningsmemory_reflect— knowledge graph traversal
📊 Server Stats
🤔 Why MCP?
Before MCP, connecting AI agents to tools meant writing custom integrations for every framework — a separate adapter for LangChain, another for Claude SDK, another for OpenAI. Every tool needed n implementations.
MCP standardizes this. It's like USB-C for AI tools: one protocol, any agent, any tool. Build a tool server once, any MCP-compatible agent can use it — the Hermes Agent, Claude Desktop, Cursor, or any custom agent implementing the protocol.
The demo below uses an agentmemory MCP server — search, save, consolidate, reflect — all accessible through a single protocol interface.
💰 Cost & Architecture
Runs on Cloudflare Pages — 35+ locations, 0 servers to manage
Search is purely local BM25 matching on 25 seeded memories
No LLM calls, no embedding generation, no smart search overhead
Cloudflare Pages free tier — no compute, no database, no egress fees
✅ Advantages Over Alternatives
| Approach | Integration | Cost | Security |
|---|---|---|---|
| MCP (this demo) | Universal — any MCP agent | $0 / no tokens | Self-contained edge function |
| REST API proxy | Custom per framework | Compute + bandwidth | Exposes backend directly |
| Embedding-based RAG | SDK-dependent | ~$0.01/query (embeddings) | API key management |
| Direct LLM memory | Single-agent only | Context window cost | Data in prompt history |
❓ FAQ
How is this different from a regular REST API?
REST APIs require a separate SDK or client for every agent framework. MCP is a standard protocol that any MCP-compatible agent can discover and call dynamically — including tool descriptions, input schemas, and error handling — without custom glue code.
Does this consume my API quota?
No. The search functionality uses local keyword matching on the edge — zero LLM calls, zero embedding calls, zero API tokens. The health check returns hardcoded data. This demo costs exactly $0 to run.
Can I use this in production?
This demo is a proof of concept. For production, you'd deploy a real MCP server with your own data, proper authentication, and a backend database. The architecture pattern — edge function proxying to MCP — is production-ready. Contact me for a production deployment.
What agents support MCP?
Claude Desktop, Cursor, Windsurf, the Hermes Agent, and custom agents using the MCP SDK (Python, TypeScript, Kotlin, Java). The protocol is open-source and growing fast — over 1,000 servers in the registry as of mid-2026.
What's the difference between MCP and function calling?
Function calling is built into individual LLM providers (OpenAI, Anthropic). MCP sits between — it's a standard for exposing tools that any provider's function-calling layer can consume. Your tool server works across providers without rewrites.
How do I build my own MCP server?
The MCP TypeScript SDK has a McpServer class — define tools with schemas, expose resources, register prompts. The Hermes Agent blog covers this in depth. Or I can build one for your stack.
Want an MCP Server for Your Stack?
I build custom MCP servers for Ghost CMS, CRMs, knowledge bases, chatbots, and any API — with auth, rate limiting, and monitoring included.