Go offline with the Player FM app!
Podcasts Worth a Listen
SPONSORED


1 72: If You Want to Grow—Stop Fixing the Wrong Problem 16:32
How Similarweb Delivers Customer Facing Analytics Over 100s of TBs
Manage episode 438488410 series 3418247
According to Yoav Shmaria, VP R&D Platform at Similarweb, the best way to manage data warehouse costs is to tag every table, database or ETL running to have good granularity over every feature.
Besides handy cost management tips, Yoav walks the bros through the tech stack he implemented to analyze 100s of TBs of web data to serve fast customer-facing analytics.
Full disclosure, Similarweb is a Firebolt customer, but the bros kept it objective, and there’s no Firebolt talk in this episode.
Previous guests include: Joseph Machado of Linkedin, Metthew Weingarten of Disney, Joe Reis and Matt Housely, authors of The Fundamentals of Data Engineering, Zach Wilson of Eczachly Inc, Megan Lieu of Deepnote, Erik Heintare of Bolt, Lior Solomon of Vimeo, Krishna Naidu of Canva, Mike Cohen of Substack, Jens Larsson of Ark, Gunnar Tangring of Klarna, Yoav Shmaria of Similarweb and Xiaoxu Gao of Adyen.
Check out our three most downloaded episodes:
59 episodes
Manage episode 438488410 series 3418247
According to Yoav Shmaria, VP R&D Platform at Similarweb, the best way to manage data warehouse costs is to tag every table, database or ETL running to have good granularity over every feature.
Besides handy cost management tips, Yoav walks the bros through the tech stack he implemented to analyze 100s of TBs of web data to serve fast customer-facing analytics.
Full disclosure, Similarweb is a Firebolt customer, but the bros kept it objective, and there’s no Firebolt talk in this episode.
Previous guests include: Joseph Machado of Linkedin, Metthew Weingarten of Disney, Joe Reis and Matt Housely, authors of The Fundamentals of Data Engineering, Zach Wilson of Eczachly Inc, Megan Lieu of Deepnote, Erik Heintare of Bolt, Lior Solomon of Vimeo, Krishna Naidu of Canva, Mike Cohen of Substack, Jens Larsson of Ark, Gunnar Tangring of Klarna, Yoav Shmaria of Similarweb and Xiaoxu Gao of Adyen.
Check out our three most downloaded episodes:
59 episodes
All episodes
×
1 From Zero to 100M Users: Inside Notion’s Data Stack and AI Strategy with Sumit Gupta 22:13

1 How Rising Wave Is Redefining Real-Time Data with Postgres Power 31:36

1 Revolutionizing Data Governance with DataStrato’s Unified Open Source Approach 23:36

1 Database Technology in the Age of AI with DuckDB Labs co-creator Hannes Mühleisen 30:52

1 AI and Data Movement: Trends and Best Practices with Estuary’s Daniel Pálma 30:33

1 AI and Data Change Management with Chad Sanderson, CEO Gable AI 36:43

1 Tech Stacks and Tradeoffs: Xudo's Founder on Picking the Right Tools for BI Success 24:56

1 Data Rewind: Conversation Highlights from Zach Wilson, Matthew Housley, Joe Reis, and Krishnan Viswanathan 28:02

1 The Resurgence of SQL: Insights from Ryanne Dolan from LinkedIn 32:57

1 Vector Databases Won’t Replace SQL - Andy Pavlo 42:59

1 How ZoomInfo transitioned from data graveyards to ROI-driven data projects 39:46

1 Matthew Weingarten from Disney Streaming about Data Quality Best Practices 27:21

1 Joseph Machado, Senior Data Engineer @ LinkedIn talks best practices 25:59

1 Professors Joe Hellerstein and Joseph Gonzalez on LLMs 46:07

1 Megan Lieu on powerful notebooks that enable collaboration 31:31
Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.