Artwork

Content provided by Stack Overflow and The Stack Overflow Podcast. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Stack Overflow and The Stack Overflow Podcast or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ppacc.player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

The server-side rendering equivalent for LLM inference workloads

21:44
 
Share
 

Manage episode 501111517 series 1123725
Content provided by Stack Overflow and The Stack Overflow Podcast. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Stack Overflow and The Stack Overflow Podcast or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ppacc.player.fm/legal.

Ryan is joined by Tuhin Srivastava, CEO and co-founder of Baseten, to explore the evolving landscape of AI infrastructure and inference workloads, how the shift from traditional machine learning models to large-scale neural networks has made GPU usage challenging, and the potential future of hardware-specific optimizations in AI.

Episode notes:

Baseten is an AI infrastructure platform giving you the tooling, expertise, and hardware needed to bring AI products to market fast.

Connect with Tuhin on LinkedIn or reach him at his email [email protected].

Shoutout to user Hitesh for winning a Populist badge for their answer to Cannot drop database because it is currently in use.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

  continue reading

860 episodes

Artwork
iconShare
 
Manage episode 501111517 series 1123725
Content provided by Stack Overflow and The Stack Overflow Podcast. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Stack Overflow and The Stack Overflow Podcast or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ppacc.player.fm/legal.

Ryan is joined by Tuhin Srivastava, CEO and co-founder of Baseten, to explore the evolving landscape of AI infrastructure and inference workloads, how the shift from traditional machine learning models to large-scale neural networks has made GPU usage challenging, and the potential future of hardware-specific optimizations in AI.

Episode notes:

Baseten is an AI infrastructure platform giving you the tooling, expertise, and hardware needed to bring AI products to market fast.

Connect with Tuhin on LinkedIn or reach him at his email [email protected].

Shoutout to user Hitesh for winning a Populist badge for their answer to Cannot drop database because it is currently in use.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

  continue reading

860 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Quick Reference Guide

Copyright 2025 | Privacy Policy | Terms of Service | | Copyright
Listen to this show while you explore
Play