Scaling AI inference with open source ft. Brian Stevens
Explore the future of enterprise AI with Red Hat's SVP and AI CTO, Brian Stevens. In this episode, we delve into how AI is being practically reimagined for real-world business environments, focusing on the pivotal shift to production-quality inference at scale and the transformative power of open source.

Brian Stevens shares his expertise and unique perspective on:

• The evolution of AI from experimental stages to essential, production-ready enterprise solutions.
• Key lessons from the early days of enterprise Linux and how they apply to today's AI inference challenges.
• The critical role of projects like vLLM in optimizing AI models and creating a common, efficient inference stack across diverse hardware (see the sketch after this description).
• Innovations in GPU-based inference and distributed-systems techniques, such as KV cache management, that make AI scalable.

Tune in for a deep dive into the infrastructure and strategies making enterprise AI a reality. Whether you're a seasoned technologist, an AI practitioner, or a leader charting your company's AI journey, this discussion offers valuable insights into building an accessible, efficient, and powerful AI future with open source.
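For listeners who haven't encountered vLLM, the sketch below shows what its offline batch-inference API looks like in practice. This is a minimal illustration only, not anything demonstrated in the episode; the model name, prompts, and sampling parameters are placeholders.

```python
# Minimal sketch of offline inference with vLLM (https://github.com/vllm-project/vllm).
# Model name, prompts, and parameters are illustrative placeholders.
from vllm import LLM, SamplingParams

prompts = [
    "What does production-quality AI inference require?",
    "Why does open source matter for enterprise AI?",
]
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# vLLM manages the GPU KV cache and batches requests automatically,
# which is central to its efficiency at scale.
llm = LLM(model="facebook/opt-125m")  # placeholder model

outputs = llm.generate(prompts, sampling_params)
for output in outputs:
    print(f"Prompt: {output.prompt!r}")
    print(f"Generated: {output.outputs[0].text!r}")
```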