Go offline with the Player FM app!
Podcasts Worth a Listen
SPONSORED


1 Phil Wang Pitches Psychological Thriller Starring WHO?! 24:35
Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
Manage episode 476541689 series 3524393
The study reveals that reasoning LLMs struggle with ill-posed questions, leading to excessive, ineffective responses, while non-reasoning LLMs perform better, highlighting flaws in current training methods.https://arxiv.org/abs//2504.06514YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2287 episodes
Manage episode 476541689 series 3524393
The study reveals that reasoning LLMs struggle with ill-posed questions, leading to excessive, ineffective responses, while non-reasoning LLMs perform better, highlighting flaws in current training methods.https://arxiv.org/abs//2504.06514YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2287 episodes
All episodes
×
1 [QA] Are Reasoning Models More Prone to Hallucination? 7:52

1 Are Reasoning Models More Prone to Hallucination? 20:24

1 [QA] How does Transformer Learn Implicit Reasoning? 8:56

1 How does Transformer Learn Implicit Reasoning? 23:21

1 [QA] Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones 7:26

1 Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones 24:00

1 [QA] Maximizing Confidence Alone Improves Reasoning 7:08

1 Maximizing Confidence Alone Improves Reasoning 13:21

1 [QA] Hardware-Efficient Attention for Fast Decoding 7:57

1 Hardware-Efficient Attention for Fast Decoding 30:59

1 [QA] Reinforcing General Reasoning without Verifiers 7:08

1 Reinforcing General Reasoning without Verifiers 17:11

1 [QA] ENIGMATA: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles 8:16

1 ENIGMATA: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles 23:54

1 [QA] Temporal Sampling for Forgotten Reasoning in LLMs 7:04

1 [QA] Understanding Prompt Tuning and In-Context Learning via Meta-Learning height2pt 7:28

1 Understanding Prompt Tuning and In-Context Learning via Meta-Learning height2pt 21:39



1 [QA] On the creation of narrow AI: hierarchy and nonlocality of neural network skills 7:21

1 On the creation of narrow AI: hierarchy and nonlocality of neural network skills 18:01

1 [QA] Do Language Models Use Their Depth Efficiently? 7:25

1 Do Language Models Use Their Depth Efficiently? 20:25

1 [QA] Enhancing Latent Computation in Transformers with Latent Tokens 8:42

1 Enhancing Latent Computation in Transformers with Latent Tokens 21:54

1 [QA] Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation 8:11

1 Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation 20:20

1 [QA] Visual Planning: Let's Think Only with Images 7:43
Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.