Go offline with the Player FM app!
The Data Factory: Inside the $100B Race for Post-Training Supremacy, with Labelbox CEO Manu Sharma
Manage episode 493391856 series 3452589
Manu Sharma, founder and CEO of Labelbox, explains how frontier AI training data has evolved far beyond simple labeling to sophisticated reinforcement learning environments where domain experts create "gyms" for models to develop complex skills. With every Western frontier lab now spending over a billion dollars annually on training data, the conversation traces the shift from supervised learning to reinforcement learning from verifiable rewards, particularly for coding, mathematical reasoning, and computer use. Sharma reveals how Labelbox operates as a vertically integrated data factory, conducting over 2,000 AI-powered expert interviews daily and paying top specialists more than $250,000 annually. The discussion provides essential insights into the red-hot training data market that's reshaping AI development following major deals like Meta's $15B acquisition of Scale AI.
Sponsors:
Oracle Cloud Infrastructure:
Oracle Cloud Infrastructure (OCI) is the next-generation cloud that delivers better performance, faster speeds, and significantly lower costs, including up to 50% less for compute, 70% for storage, and 80% for networking. Run any workload, from infrastructure to AI, in a high-availability environment and try OCI for free with zero commitment at https://oracle.com/cognitive
The AGNTCY:
The AGNTCY is an open-source collective dedicated to building the Internet of Agents, enabling AI agents to communicate and collaborate seamlessly across frameworks. Join a community of engineers focused on high-quality multi-agent software and support the initiative at https://agntcy.org
NetSuite by Oracle:
NetSuite by Oracle is the AI-powered business management suite trusted by over 42,000 businesses, offering a unified platform for accounting, financial management, inventory, and HR. Gain total visibility and control to make quick decisions and automate everyday tasks—download the free ebook, Navigating Global Trade: Three Insights for Leaders, at https://netsuite.com/cognitive
PRODUCED BY:
SOCIAL LINKS:
Website: https://www.cognitiverevolution.ai
Twitter (Podcast): https://x.com/cogrev_podcast
Twitter (Nathan): https://x.com/labenz
LinkedIn: https://linkedin.com/in/nathanlabenz/
Youtube: https://youtube.com/@CognitiveRevolutionPodcast
Spotify: https://open.spotify.com/show/6yHyok3M3BjqzR0VB5MSyk
262 episodes
The Data Factory: Inside the $100B Race for Post-Training Supremacy, with Labelbox CEO Manu Sharma
"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis
Manage episode 493391856 series 3452589
Manu Sharma, founder and CEO of Labelbox, explains how frontier AI training data has evolved far beyond simple labeling to sophisticated reinforcement learning environments where domain experts create "gyms" for models to develop complex skills. With every Western frontier lab now spending over a billion dollars annually on training data, the conversation traces the shift from supervised learning to reinforcement learning from verifiable rewards, particularly for coding, mathematical reasoning, and computer use. Sharma reveals how Labelbox operates as a vertically integrated data factory, conducting over 2,000 AI-powered expert interviews daily and paying top specialists more than $250,000 annually. The discussion provides essential insights into the red-hot training data market that's reshaping AI development following major deals like Meta's $15B acquisition of Scale AI.
Sponsors:
Oracle Cloud Infrastructure:
Oracle Cloud Infrastructure (OCI) is the next-generation cloud that delivers better performance, faster speeds, and significantly lower costs, including up to 50% less for compute, 70% for storage, and 80% for networking. Run any workload, from infrastructure to AI, in a high-availability environment and try OCI for free with zero commitment at https://oracle.com/cognitive
The AGNTCY:
The AGNTCY is an open-source collective dedicated to building the Internet of Agents, enabling AI agents to communicate and collaborate seamlessly across frameworks. Join a community of engineers focused on high-quality multi-agent software and support the initiative at https://agntcy.org
NetSuite by Oracle:
NetSuite by Oracle is the AI-powered business management suite trusted by over 42,000 businesses, offering a unified platform for accounting, financial management, inventory, and HR. Gain total visibility and control to make quick decisions and automate everyday tasks—download the free ebook, Navigating Global Trade: Three Insights for Leaders, at https://netsuite.com/cognitive
PRODUCED BY:
SOCIAL LINKS:
Website: https://www.cognitiverevolution.ai
Twitter (Podcast): https://x.com/cogrev_podcast
Twitter (Nathan): https://x.com/labenz
LinkedIn: https://linkedin.com/in/nathanlabenz/
Youtube: https://youtube.com/@CognitiveRevolutionPodcast
Spotify: https://open.spotify.com/show/6yHyok3M3BjqzR0VB5MSyk
262 episodes
All episodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.