30 subscribers
Go offline with the Player FM app!
#116: AI Agents, MCP and the problems with AI benchmarks | ft. Matt Carey
Manage episode 477903349 series 2636547
In this episode, I spoke with Matt Carey, founding AI engineer at StackOne, founder of AI Demo Days and member of the OpenUK AI Advisory Board.
Everyone needs a friend who works in AI to help them filter the AI news and get the signals from the noise. Matt is that friend for me!
We discussed AI agents, MCP, and the challenges of AI benchmarks, which help explain the disconnect between the benchmark results and the anecdotal experiences of AI users, such as myself.
Links from the episode:
- Google's whitepaper on AI agents
- Anthropic Building Effective AI Agents
- Simon Willison on X
- Thorsten Ball's Joy & Curiosity newsletter
- AI Demo Days
- MCP has a prompt injection problem
Opening theme song:
Cheery Monday by Kevin MacLeod
Link: https://incompetech.filmmusic.io/song/3495-cheery-monday
License: http://creativecommons.org/licenses/by/4.0
115 episodes
Manage episode 477903349 series 2636547
In this episode, I spoke with Matt Carey, founding AI engineer at StackOne, founder of AI Demo Days and member of the OpenUK AI Advisory Board.
Everyone needs a friend who works in AI to help them filter the AI news and get the signals from the noise. Matt is that friend for me!
We discussed AI agents, MCP, and the challenges of AI benchmarks, which help explain the disconnect between the benchmark results and the anecdotal experiences of AI users, such as myself.
Links from the episode:
- Google's whitepaper on AI agents
- Anthropic Building Effective AI Agents
- Simon Willison on X
- Thorsten Ball's Joy & Curiosity newsletter
- AI Demo Days
- MCP has a prompt injection problem
Opening theme song:
Cheery Monday by Kevin MacLeod
Link: https://incompetech.filmmusic.io/song/3495-cheery-monday
License: http://creativecommons.org/licenses/by/4.0
115 episodes
All episodes
×
1 #116: AI Agents, MCP and the problems with AI benchmarks | ft. Matt Carey 48:08


1 #114: Best practices for building a multi-tenant system with Khawaja Shams 48:24

1 #113: Why you need Knowledge Graphs for your AI chatbot | ft. Aniket Mitra 45:13

1 #112: Better Developer Experience for Event-Driven Architectures | ft. Alex Bouchard, co-founder of Hookdeck 59:18

1 #111 - EventCatalog Revolutionizes Governance in Event-Driven Architectures | ft. David Boyne 51:00

1 #109: Building serverless apps in PHP with Bref | ft Matthieu Napoli 55:56

1 #108: Lambda on Rust with James Eastham 1:02:02

1 #107: How to Have a Successful Cloud Career in 2024 | ft. Andrew Brown 52:33

1 #106: Rust with Lambda, easy-mode Rust & future of Middy | ft. Luciano Mammino 45:51

1 #105: The inception story of Cognito & secret to succeeding at AWS | ft. David Behroozi 1:14:51

1 #104: Baseline, is this new serverless development framework better than Amplify? 57:49

1 #103 - Community building, being an enable, is serverless dead? ft. Allen Helton 1:00:27

1 #102: Building AWS communities with Farrah Campbell 44:05

1 #101: Faster serverless APIs with Brian LeRoux 1:00:19
Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.