16 subscribers
Go offline with the Player FM app!
Podcasts Worth a Listen
SPONSORED


1 Venture Investing in Mobility + Tech with University of Michigan’s Early-Stage Zell Lurie Commercialization Fund 39:30
Jim Fan on Nvidia’s Embodied AI Lab and Jensen Huang’s Prediction that All Robots will be Autonomous
Manage episode 440379171 series 3586723
AI researcher Jim Fan has had a charmed career. He was OpenAI’s first intern before he did his PhD at Stanford with “godmother of AI,” Fei-Fei Li. He graduated into a research scientist position at Nvidia and now leads its Embodied AI “GEAR” group. The lab’s current work spans foundation models for humanoid robots to agents for virtual worlds.
Jim describes a three-pronged data strategy for robotics, combining internet-scale data, simulation data and real world robot data. He believes that in the next few years it will be possible to create a “foundation agent” that can generalize across skills, embodiments and realities—both physical and virtual. He also supports Jensen Huang’s idea that “Everything that moves will eventually be autonomous.”
Hosted by: Stephanie Zhan and Sonya Huang, Sequoia Capital
Mentioned in this episode:
- World of Bits: Early OpenAI project Jim worked on as an intern with Andrej Karpathy. Part of a bigger initiative called Universe
- Fei-Fei Li: Jim’s PhD advisor at Stanford who founded the ImageNet project in 2010 that revolutionized the field of visual recognition, led the Stanford Vision Lab and just launched her own AI startup, World Labs
- Project GR00T: Nvidia’s “moonshot effort” at a robotic foundation model, premiered at this year’s GTC
- Thinking Fast and Slow: Influential book by Daniel Kahneman that popularized some of his teaching from behavioral economics
- Jetson Orin chip: The dedicated series of edge computing chips Nvidia is developing to power Project GR00T
- Eureka: Project by Jim’s team that trained a five finger robot hand to do pen spinning
- MineDojo: A project Jim did when he first got to Nvidia that developed a platform for general purpose agents in the game of Minecraft. Won NeurIPS 2022 Outstanding Paper Award
- ADI: artificial dog intelligence
- Mamba: Selective State Space Models, an alternative architecture to Transformers that Jim is interested in (original paper here)
00:00 Introduction
01:35 Jim’s journey to embodied intelligence
04:53 The GEAR Group
07:32 Three kinds of data for robotics
10:32 A GPT-3 moment for robotics
16:05 Choosing the humanoid robot form factor
19:37 Specialized generalists
21:59 GR00T gets its own chip
23:35 Eureka and Issac Sim
25:23 Why now for robotics?
28:53 Exploring virtual worlds
36:28 Implications for games
39:13 Is the virtual world in service of the physical world?
42:10 Alternative architectures to Transformers
44:15 Lightning round
55 episodes
Jim Fan on Nvidia’s Embodied AI Lab and Jensen Huang’s Prediction that All Robots will be Autonomous
Manage episode 440379171 series 3586723
AI researcher Jim Fan has had a charmed career. He was OpenAI’s first intern before he did his PhD at Stanford with “godmother of AI,” Fei-Fei Li. He graduated into a research scientist position at Nvidia and now leads its Embodied AI “GEAR” group. The lab’s current work spans foundation models for humanoid robots to agents for virtual worlds.
Jim describes a three-pronged data strategy for robotics, combining internet-scale data, simulation data and real world robot data. He believes that in the next few years it will be possible to create a “foundation agent” that can generalize across skills, embodiments and realities—both physical and virtual. He also supports Jensen Huang’s idea that “Everything that moves will eventually be autonomous.”
Hosted by: Stephanie Zhan and Sonya Huang, Sequoia Capital
Mentioned in this episode:
- World of Bits: Early OpenAI project Jim worked on as an intern with Andrej Karpathy. Part of a bigger initiative called Universe
- Fei-Fei Li: Jim’s PhD advisor at Stanford who founded the ImageNet project in 2010 that revolutionized the field of visual recognition, led the Stanford Vision Lab and just launched her own AI startup, World Labs
- Project GR00T: Nvidia’s “moonshot effort” at a robotic foundation model, premiered at this year’s GTC
- Thinking Fast and Slow: Influential book by Daniel Kahneman that popularized some of his teaching from behavioral economics
- Jetson Orin chip: The dedicated series of edge computing chips Nvidia is developing to power Project GR00T
- Eureka: Project by Jim’s team that trained a five finger robot hand to do pen spinning
- MineDojo: A project Jim did when he first got to Nvidia that developed a platform for general purpose agents in the game of Minecraft. Won NeurIPS 2022 Outstanding Paper Award
- ADI: artificial dog intelligence
- Mamba: Selective State Space Models, an alternative architecture to Transformers that Jim is interested in (original paper here)
00:00 Introduction
01:35 Jim’s journey to embodied intelligence
04:53 The GEAR Group
07:32 Three kinds of data for robotics
10:32 A GPT-3 moment for robotics
16:05 Choosing the humanoid robot form factor
19:37 Specialized generalists
21:59 GR00T gets its own chip
23:35 Eureka and Issac Sim
25:23 Why now for robotics?
28:53 Exploring virtual worlds
36:28 Implications for games
39:13 Is the virtual world in service of the physical world?
42:10 Alternative architectures to Transformers
44:15 Lightning round
55 episodes
All episodes
×
1 Mapping the Mind of a Neural Net: Goodfire’s Eric Ho on the Future of Interpretability 47:07

1 ElevenLabs’ Mati Staniszewski: Why Voice Will Be the Fundamental Interface for Tech 59:53

1 From DevOps ‘Heart Attacks’ to AI-Powered Diagnostics With Traversal’s AI Agents 40:32

1 The Breakthroughs Needed for AGI Have Already Been Made: OpenAI Former Research Head Bob McGrew 48:51

1 OpenAI Codex Team: From Coding Autocomplete to Asynchronous Autonomous Agents 37:44

1 Google I/O Afterparty: The Future of Human-AI Collaboration, From Veo to Mariner 53:51

1 From Data Centers to Dyson Spheres: P-1 AI's Path to Hardware Engineering AGI 38:25

1 Gong’s Amit Bendov: From Meeting Recordings to Revenue AI 42:05

1 LIVE: Google's Jeff Dean on the Coming Transformations in AI 30:59

1 LIVE: Ambient Agents and the New Agent Inbox ft. Harrison Chase of LangChain 8:28

1 LIVE: How AI is Reinventing Software Business Models ft. Bret Taylor of Sierra 34:30

1 LIVE: Sam Altman of OpenAI on Building the ‘Core AI Subscription’ for Your Life 32:27

1 Workday CEO Carl Eschenbach: Building the System of Record for the AI Era 47:49

1 The Quest to ‘Solve All Diseases’ with AI: Isomorphic Labs’ Max Jaderberg 55:40

1 Pricing in the AI Era: From Inputs to Outcomes, with Paid CEO Manny Medina 45:29
Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.