Principal GenAI SA · AWS · 20+ years
building the future, in real-time —

Mani Khanuja.

Expert on scaling autonomous AI agents safely & efficiently, AI governance, and strategic leadership.

With 20+ years of experience, Mani builds Generative AI strategy for enterprise customers at AWS. Her current focus is scaling autonomous AI agents safely and efficiently, work that spans building AI platforms from scratch and governing agentic systems at scale. She is a researcher, author, and speaker who shares insights in The Agentic Enterprise newsletter.

Agentic AI · RAG · LLMs · Context Engineering · Amazon Bedrock · Distributed Training
20+ Years in AI & Tech
About

Architecting the
future, with care.

Mani Khanuja is a Technical AI Leader and Principal Generative AI Solutions Architect at AWS with 20+ years of experience building AI platforms from scratch and driving enterprise AI strategy. She works directly with customers to build their Generative AI strategy, from architecture to production deployment at scale.

Her current focus is scaling autonomous AI agents safely and efficiently: developing stateful, memory-driven agents with personalization, advancing AI governance frameworks, and translating cutting-edge research into real-world enterprise systems. She is a co-author of "Applied Machine Learning and High-Performance Computing on AWS" and of the upcoming "The AI Steering Wheel."

A pioneering researcher, she co-authored the widely cited paper "Keyword search is all you need: Achieving RAG-level performance without vector databases using agentic tool use." She is also a recognized technical speaker at re:Invent, Grace Hopper Celebration, AI Engineer Summit, and AWS Summits worldwide. She resides in Seal Beach, California, where she stays active with long runs along the coast.

20+ years of experience
100+ conferences
2 books
12+ papers reviewed
Gallery

On stage & in the field.

re:Invent, Grace Hopper, AWS Summits — the conferences where the work gets stress-tested.

Books

Published works.

From high-performance computing to the agentic frontier — bridging theory and enterprise practice.

Upcoming

The AI Steering Wheel

Unified Framework for Scaling Generative and Agentic Systems

Three interlocking layers — Strategy, Operations, and Engineering — keep AI products aligned from first decision through last deployment. Co-authored with Dr. Fouad Bousetouane.

Published

Applied Machine Learning and High-Performance Computing on AWS

A comprehensive guide to building, training, and deploying ML models at scale on AWS. Co-authored with Farooq Sabir, Shreyas Subramanian & Trenton Potgieter.

YouTube

Featured talks & sessions.

Deep-dives on Agentic AI, RAG, Amazon Bedrock, and distributed training — top videos by views.

View Full Playlist on YouTube →
Newsletter

The Agentic Enterprise.

Insights on Agentic AI, ethics, enterprise deployment, and the future of autonomous systems.

All articles on Substack · LinkedIn Newsletter
Research

Research & publications.

Peer-reviewed papers advancing retrieval-augmented generation, agentic systems, and applied machine learning.

Speaking

Stages & conferences.

Recognized technical speaker at the world's leading AI & cloud conferences.

AWS re:Invent · 2022–2024
Grace Hopper · 2023
AI Engineer Summit · 2024
AWS Summit Toronto · 2024
AWS Summit NYC · 2024
AWS Summit SF · 2023
QCon · 2024
ODSC · 2023
AWS Blogs

Selected AWS writing.

Technical deep-dives published on the AWS Machine Learning and Generative AI blogs.

AWS Machine Learning Blog · 2025
Amazon Bedrock AgentCore Memory: Building Context-Aware Agents
A deep-dive into Amazon Bedrock AgentCore Memory: how to build AI agents that maintain context across sessions, enabling genuinely personalized, stateful interactions that improve over time in enterprise deployments.
AWS Machine Learning Blog · 2026
Build Agents to Learn from Experiences Using Amazon Bedrock AgentCore Episodic Memory
A technical walkthrough on building AI agents with episodic memory, enabling agents to retain context across sessions, learn from past interactions, and deliver personalized responses at enterprise scale using Amazon Bedrock.
AWS Machine Learning Blog · 2025
Streamline GitHub Workflows with Generative AI Using Amazon Bedrock and MCP
Explore how the Model Context Protocol (MCP) connects Amazon Bedrock to GitHub, enabling AI agents to automate pull request reviews, issue triage, and code generation workflows, reducing developer toil and accelerating delivery.
AWS Machine Learning Blog · 2025
Unlocking the Power of Model Context Protocol (MCP) on AWS
A comprehensive guide to MCP on AWS: how the Model Context Protocol enables AI agents to securely connect to data sources, tools, and APIs. Covers architecture patterns, security considerations, and practical implementation with Amazon Bedrock.
AWS Machine Learning Blog · 2025
From Concept to Reality: Navigating the Journey of RAG from Proof of Concept to Production
Bridges the gap between RAG prototypes and production-grade deployments on AWS. Covers chunking strategies, retrieval optimization, evaluation frameworks, and the architectural decisions that determine success at scale.
AWS Machine Learning Blog
Create a Next-Generation Chat Assistant with Amazon Bedrock, Connect, Lex, LangChain, and WhatsApp
Step-by-step guide to building a production-ready conversational AI assistant that combines Amazon Bedrock's foundation models with Amazon Connect and Lex for omni-channel customer engagement over WhatsApp.
AWS Machine Learning Blog · Author archive
All AWS posts authored by Mani Khanuja
15+ deep-dive articles spanning Amazon Bedrock, SageMaker, distributed training, RAG, and agentic AI patterns. Browse the full author archive.
View archive
Stay close

Come build with me.

Three places. Same voice. Different cadences.