Crowdsourced AI benchmarks have serious flaws, some experts say
AI labs are increasingly relying on crowdsourced benchmarking platforms such as Chatbot Arena to probe the strengths and weaknesses of their latest models. But some experts say that there are serious problems with this approach from an ethical and academic perspective.