AI Evaluation and Testing: How to Know When Your Product Works (or Doesn’t)
This episode of AI Native Dev, hosted by Simon Maple and Guy Podjarny, features a mashup of conversations with leading figures in the AI industry. Guests include Des Traynor, founder of Intercom, who discusses the paradigm shift generative AI brings to product development. Rishabh Mehrotra, Head of AI at SourceGraph, emphasizes the importance of evaluation processes over model training. Tamar Yehoshua, President of Products and Technology at Glean, shares her experiences in enterprise search and the challenges of using LLMs in data-sensitive environments. Finally, Simon Last, Co-Founder and CTO of Notion, talks about continuous improvement and the iterative processes at Notion. Each guest provides invaluable insights into the evolving landscape of AI-driven products.
Watch the episode on YouTube: https://youtu.be/gZ4sGROvOdQ
54 episodes
The AI Native Dev - from Copilot today to AI Native Software Development tomorrow
All episodes
Vibe Coding SimCity II: Injecting Chaos with Natural Disasters and AI Tools 1:00:42

Vibe Coding SimCity: Prototyping Tiny Towns with AI Dev Tools 44:00

Exploring LLM Observability with Traceloop's Gal Kleinman 40:26

From Builder to Orchestrator: Confronting the Software Engineer’s Identity Crisis 54:37

Is Code Dead & The $1B Solo Startup Myth - 5 AI Realities with Tessl's Guy Podjarny 29:03

AI's Transformative Impact on Development with Alex Komorske 31:25

Datadog CEO Olivier Pomel on AI Security, Trust, and the Future of Observability 59:59

Intent-Driven Development: Insights from Patrick Debois 45:11

How Attackers Trick AI: Lessons from Gandalf’s Creator 54:35

Monthly Roundup: AI Model Wars, GPT-4.5 vs. Sonnet 3.7, and the Future of AI Dev Tools 42:09

The Future of Audio AI: Insights from Mati Staniszewski of ElevenLabs 1:02:52

Building the Ultimate AI-Powered Development Environment with Farhath Razzaque 36:16

DeepSeek R1: Ask Me Anything - Open Weights, MoE innovations, Model Distillation and more! 33:18

Live Monthly: News on DeepSeek, Stargate, StackBlitz (Bolt.new) funding, prompting, and more 51:46