Utility providers are under pressure to deliver responsive, accurate, and empathetic support across channels—without ballooning operational costs. What if you could automate most customer questions with human-level fluency, escalate seamlessly to agents when needed, and do it all on a secure, pay-per-use foundation?
Here’s how we built a production-ready, serverless AI customer support platform on AWS—covering both web chat and voice—using Amazon Lex, Agents for Amazon Bedrock, Amazon Bedrock Knowledge Base (backed by Amazon OpenSearch Serverless), Amazon Titan Nova Pro, Amazon Connect, AWS Cognito, Amazon S3, Amazon CloudFront, and Amazon CloudWatch.
Always-On Omnichannel Support
Our goal was simple and ambitious: deliver fast, accurate answers to utility customers—anytime, via chat or phone—with smooth handoffs to human agents when needed. The system had to be secure, resilient, and scalable, and it had to learn and improve as FAQs evolve.The result? A serverless, AI-native contact platform that standardizes high-quality responses, reduces handle time, and raises customer satisfaction—without adding operational overhead.
Architecture Deep Dive: Built for Real-World Scale

Serverless by Default
- Global delivery via CloudFront with S3 static hosting
- Conversational AI with Amazon Lex orchestrating to Agents for Amazon Bedrock
- Knowledge retrieval via Bedrock Knowledge Base with OpenSearch Serverless as the vector store
- Voice via Amazon Connect, with effortless routing and escalation
- Security & identity via AWS Cognito and fine-grained IAM
- Observability with CloudWatch logs, metrics, and alarms
The Eight-Module Architecture
- Identity & Access Management
- AWS Cognito handles sign-up/sign-in, token issuance, and optional MFA
- JWT validation at the edge; least-privilege IAM to downstream services
- Global Frontend Delivery
- Web client hosted on S3 and accelerated through CloudFront
- Low-latency chat widget integrating directly with Amazon Lex APIs
- Conversational Ingestion (Text & Voice)
- Users type or speak; the Lex bot normalizes intents and sessions
- RecognizeText/RecognizeUtterance unify inputs while preserving context
- Agentic Reasoning Layer
- Agents for Amazon Bedrock receives the Lex input and manages the plan
- Amazon Titan Nova Pro provides reasoning and natural-language generation
- Tool-use grounded by the Knowledge Base ensures factual, contextual answers
- Knowledge Retrieval & Indexing
- Bedrock Knowledge Base connected to S3 as the authoritative data source
- Documents chunked, embedded, and indexed in OpenSearch Serverless
- Automatic sync pipelines keep content fresh and traceable
- Voice Contact Flow (PSTN to AI to Human)
- Amazon Connect greets callers, collects DTMF, and invokes Lex
- Lex forwards to the Bedrock Agent for answers;
intelligent escalation routes complex cases to human agents with context
- Session State & Response Packaging
- Lex session attributes track previous turns, customer IDs, and preferences
- Rich responses (plain text + SSML for voice) rendered consistently across channels
- Observability & Ops
- CloudWatch logs for Lex, Bedrock, and Connect flows
- Metrics and alarms for latency, error rates, and knowledge sync health
Technology Integration: Patterns That Matter
RAG, the Right Way (Retrieval-Augmented Generation)
- The Bedrock Agent queries the Knowledge Base to ground responses in your S3-backed FAQs and policies.
- OpenSearch Serverless provides low-latency vector search with automatic scaling.
- Titan Nova Pro converts retrieved context into succinct, empathetic answers—reducing hallucinations.
Omnichannel Consistency
- One knowledge source, two channels (web chat + voice).
- Lex + Bedrock ensure identical policies, pricing, and troubleshooting steps across modalities.
Event-Driven Freshness
- Updating FAQs in S3 triggers re-indexing to the Knowledge Base.
- No manual redeployments—content freshness is operationalized.
Security & Compliance: Guardrails by Design
- Encryption everywhere: TLS 1.2+ in transit; S3 server-side encryption at rest
- IAM least privilege: scoped roles for Lex, Bedrock, Connect, and indexing jobs
- Tenant isolation & PII hygiene: short-lived tokens via Cognito; minimize data retention in logs
- Auditability: request/response breadcrumbs in CloudWatch to support QA and compliance
Real-World Performance: What You Can Expect
High Availability
- Managed, multi-AZ serverless services for routing, inference, and search
- Stateless frontends and auto-healing managed components
Performance Optimization
- CloudFront for CDN-level acceleration of the web client
- OpenSearch Serverless for fast vector retrieval
- Lex session state to reduce re-prompting and shorten time-to-answer
Cost Efficiency
- Pure pay-per-use across compute, inference, and search
- S3 lifecycle policies and content tiering for efficient storage
End-to-End Flow
- User Authenticates with Cognito and opens the web chat (S3 + CloudFront) or dials in via Amazon Connect.
- Lex receives the utterance and forwards to the Bedrock Agent.
- The Agent queries the Knowledge Base (S3-sourced, OpenSearch Serverless indexed).
- Titan Nova Pro formulates a grounded, channel-appropriate answer (SSML for voice, text for chat).
- If confidence is low or the user asks, Connect escalates to a human agent with session context.
- CloudWatch captures telemetry for quality assurance and continuous improvement.
Extensibility
- New Channels: Add WhatsApp, web chat variants, or mobile quickly—same Lex/Bedrock backbone
- Personalization: Use Cognito attributes and past interactions to tailor answers
- Automation: Add transactional tools (bill payments, meter readings) as Agent tools with secure, auditable calls
- Multilingual: Expand intents and training phrases; reuse shared knowledge grounding
Why This Architecture Works
Serverless-First: We avoid server management while gaining autoscaling, resiliency, and cost control.
AI-Native: Bedrock’s managed models and Agents let us iterate prompts, tools, and policies without re-platforming.
Grounded Answers: A well-designed Knowledge Base + vector search reduces hallucinations and drives trust.
Operable from Day One: CloudWatch metrics, logs, and alarms keep quality visible and actionable.
Looking Ahead: Smarter, Safer, Faster
- Proactive support: Trigger outbound education for planned outages or tariff changes
- Quality loops: Use feedback signals to auto-enrich FAQs in S3 and re-index
- Agent assist: Surface AI suggestions to human agents during escalations
- Stronger guardrails: Add PII redaction and policy-aware responses for regulated workflows
Customer Support at the Speed of Trust
Great support isn’t just fast—it’s reliable, consistent, and secure. By combining Lex, Agents for Bedrock, Titan Nova Pro, Bedrock Knowledge Base with OpenSearch Serverless, Connect, Cognito, S3/CloudFront, and CloudWatch, we’ve built an omnichannel utility support platform that scales with demand, stays grounded in the latest knowledge, and hands off gracefully to humans when empathy and judgment matter most.
Ready to modernize your customer experience? With this serverless, AI-powered foundation, you can deliver better answers today—and evolve confidently as your business and customers’ needs grow.