An AI communication assistant with its own app, in-app chat, and a major WhatsApp integration — scheduling, reminders, summaries and natural voice. I own the AI architecture: multi-model orchestration, an MCP tool layer, persistent agent memory, and a serverless notification engine rebuilt for infinite scale.
Overview
A multi-surface assistant — app, in-app chat and WhatsApp — backed by an agent runtime that routes across GPT-4o, Gemini and Claude by latency and cost.
Architecture
Django core, a dedicated MCP server, RAG memory with Langfuse, and a serverless Lambda + SQS + EventBridge scheduler.
Challenges
Multi-model orchestration, durable agent memory, Meta messaging compliance, and keeping inference cost predictable.
Impact
Eliminated CRON infra overhead, made time-based messaging infinitely scalable, and lifted answer accuracy via self-learning agents.