As generative AI moves from prototypes to production, the challenge shifts from “Can we build it?” to “Can we scale it reliably, securely, and cost-effectively?” Scaling GenAI products requires more than bigger models or faster GPUs; it calls for deliberate infrastructure patterns that balance experimentation with stability, performance with observability, and rapid iteration with governance. Leaders must learn how to design architectures that support multi-agent collaboration, context-aware systems, and evaluation pipelines while avoiding pitfalls like runaway costs, latency bottlenecks, and brittle integrations. This discussion will explore the infrastructure strategies and tradeoffs that enable teams to move from demos to durable products. Participants will leave with practical insights into how to future-proof their GenAI platforms and scale responsibly without losing speed or trust.
Rashi Agrawal is Head of AI Engineering at GoodLeap, where she leads enterprise-wide AI initiatives that deliver real business impact. An accomplished speaker, she covers the latest in AI, including context engineering, evaluations, and multi-agent collaboration, while driving Applied AI innovation in the enterprise. Previously, she scaled engineering teams at Yahoo, advancing its multibillion-dollar advertising business. A passionate world traveler to 40+ countries, Rashi brings global perspective and energy to her leadership and storytelling.