Go offline with the Player FM app!
Podcasts Worth a Listen
SPONSORED


1 Battle Camp S1: Reality Rivalries with Dana Moon & QT 1:00:36
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models
Manage episode 479022176 series 3524393
https://arxiv.org/abs//2504.17789
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2239 episodes
Manage episode 479022176 series 3524393
https://arxiv.org/abs//2504.17789
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2239 episodes
All episodes
×
1 [QA] System Prompt Optimization with Meta-Learning 7:41

1 System Prompt Optimization with Meta-Learning 21:51

1 [QA] Revealing economic facts: LLMs know more than they say1 7:35

1 Revealing economic facts: LLMs know more than they say1 20:39

1 [QA] Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures 8:26

1 Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures 43:30

1 [QA] Beyond `Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models 7:24

1 Beyond `Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models 14:33

1 [QA] The COT ENCYCLOPEDIA: Analyzing, Predicting, and Controlling how a Reasoning Model will Think 7:42

1 The COT ENCYCLOPEDIA: Analyzing, Predicting, and Controlling how a Reasoning Model will Think 16:35

1 [QA] Adversarial Suffix Filtering: a Defense Pipeline for LLMs 7:28

1 Adversarial Suffix Filtering: a Defense Pipeline for LLMs 14:06

1 [QA] AM‑Thinking‑v1: Advancing the Frontier of Reasoning at 32B Scale 7:50

1 AM‑Thinking‑v1: Advancing the Frontier of Reasoning at 32B Scale 24:52

1 [QA] Putting It All into Context: Simplifying Agents with LCLMs 8:28

1 Putting It All into Context: Simplifying Agents with LCLMs 23:51



1 [QA] MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining 7:59

1 MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining 34:32

1 [QA] Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions 7:31

1 Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions 20:49

1 [QA] Towards Quantifying the Hessian Structure of Neural Networks 8:04

1 Towards Quantifying the Hessian Structure of Neural Networks 23:12

1 [QA] Crosslingual Reasoning through Test-Time Scaling 7:50

1 Crosslingual Reasoning through Test-Time Scaling 29:07

1 [QA] Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models 9:12

1 Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models 31:54

1 [QA] Generating Physically Stable and Buildable LEGO Designs from Text 8:17

1 Generating Physically Stable and Buildable LEGO Designs from Text 18:25

1 [QA] Reasoning Models Don't Always Say What They Think 7:48

1 Reasoning Models Don't Always Say What They Think 20:33

1 [QA] Scalable Chain of Thoughts via Elastic Reasoning 8:05

1 Scalable Chain of Thoughts via Elastic Reasoning 20:48

1 [QA] ZEROSEARCH: Incentivize the Search Capability of LLMs without Searching 9:22

1 ZEROSEARCH: Incentivize the Search Capability of LLMs without Searching 19:29

1 [QA] Cer-Eval: Certifiable and Cost-Efficient Evaluation Framework for LLMs 8:46

1 Cer-Eval: Certifiable and Cost-Efficient Evaluation Framework for LLMs 22:09

1 [QA] Absolute Zero: Reinforced Self-play Reasoning with Zero Data 7:08

1 Absolute Zero: Reinforced Self-play Reasoning with Zero Data 27:54

1 [QA] Teaching Models to Understand (but not Generate) High-risk Data 7:54

1 Teaching Models to Understand (but not Generate) High-risk Data 16:23



1 [QA] Practical Efficiency of Muon for Pretraining 7:03

1 Practical Efficiency of Muon for Pretraining 23:06

1 [QA] Llama-Nemotron: Efficient Reasoning Models 7:39


1 [QA] Evaluating Frontier Models for Stealth and Situational Awareness 7:42

1 Evaluating Frontier Models for Stealth and Situational Awareness 37:33

1 [QA] Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report 7:42

1 Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report 17:58

1 [QA] COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning 8:32

1 COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning 16:46

1 [QA] DeepCritic: Deliberate Critique with Large Language Models 7:52

1 DeepCritic: Deliberate Critique with Large Language Models 17:52

1 [QA] Direct Motion Models for Assessing Generated Videos 7:34

1 Direct Motion Models for Assessing Generated Videos 17:42

1 [QA] MINERVA: Evaluating Complex Video Reasoning 7:38

1 MINERVA: Evaluating Complex Video Reasoning 20:00

1 [QA] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT 7:13

1 T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT 22:08

1 [QA] Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math 7:49

1 Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math 17:05

1 [QA] Reinforcement Learning for Reasoning in Large Language Models with One Training Example 9:15

1 Reinforcement Learning for Reasoning in Large Language Models with One Training Example 29:41

1 [QA] ReasonIR: Training Retrievers for Reasoning Tasks 8:27

1 ReasonIR: Training Retrievers for Reasoning Tasks 24:05



1 [QA] Think, Prune, Train, Improve: Scaling Reasoning Without Scaling Models 7:26

1 Think, Prune, Train, Improve: Scaling Reasoning Without Scaling Models 16:50

1 [QA] Reasoning LLMs Are Just Efficient Samplers: RL Training Elicits No Transcending Capacity 8:04

1 Reasoning LLMs Are Just Efficient Samplers: RL Training Elicits No Transcending Capacity 23:12

1 [QA] Learning Adaptive Parallel Reasoning with Language Models 7:42

1 Learning Adaptive Parallel Reasoning with Language Models 21:22

1 [QA] Boosting Generative Image Modeling via Joint Image-Feature Synthesis 7:48

1 Boosting Generative Image Modeling via Joint Image-Feature Synthesis 18:50

1 [QA] Step1X-Edit: A Practical Framework for General Image Editing 8:07

1 Step1X-Edit: A Practical Framework for General Image Editing 15:50

1 [QA] Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models 7:49

1 Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models 24:06

1 [QA] Exploring How LLMs Capture and Represent Domain-Specific Knowledge 7:44

1 Exploring How LLMs Capture and Represent Domain-Specific Knowledge 19:09

1 [QA] I-Con: A Unifying Framework for Representation Learning 7:41

1 I-Con: A Unifying Framework for Representation Learning 16:31



1 [QA] LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities 8:09

1 LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities 15:38

1 [QA] NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning 8:52

1 NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning 31:19

1 [QA] Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning 7:54

1 Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning 7:09

1 [QA] Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model 7:38

1 Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model 16:13

1 [QA] Reasoning Models Can Be Effective Without Thinking 7:29

1 Reasoning Models Can Be Effective Without Thinking 20:05

1 [QA] A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce 8:27

1 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce 14:38

1 [QA] CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training 7:14

1 CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training 20:35

1 [QA] Position: The Most Expensive Part of an LLM should be its Training Data 7:16

1 Position: The Most Expensive Part of an LLM should be its Training Data 20:05

1 [QA] Activated LoRA: Fine-tuned LLMs for Intrinsics 8:16

1 Activated LoRA: Fine-tuned LLMs for Intrinsics 18:55

1 [QA] COLORBENCH: Can VLMs See and Understand the Colorful World? 7:49

1 COLORBENCH: Can VLMs See and Understand the Colorful World? 20:40

1 [QA] ReTool: Reinforcement Learning for Strategic Tool Use in LLMs 8:33

1 ReTool: Reinforcement Learning for Strategic Tool Use in LLMs 14:57


1 [QA] How to Predict Best Pretraining Data with Small Experiments 8:16

1 How to Predict Best Pretraining Data with Small Experiments 20:22

1 [QA] Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability 7:18

1 Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability 7:07

1 [QA] DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training 7:39

1 DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training 10:11

1 [QA] Steering CLIP's vision transformer with sparse autoencoders 8:11

1 Steering CLIP's vision transformer with sparse autoencoders 17:53

1 [QA] Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning 7:58

1 Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning 18:11




1 [QA] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? 7:45

1 Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? 16:23



1 [QA] Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory 7:56

1 Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory 15:48

1 [QA] Scaling Laws for Native Multimodal Models 7:14


1 [QA] OLMOTRACE: Tracing Language Model Outputs Back to Trillions of Training Tokens 7:16

1 OLMOTRACE: Tracing Language Model Outputs Back to Trillions of Training Tokens 18:20

1 [QA] A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility 7:38

1 A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility 19:29
Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.