ReasonIR: Training Retrievers For Reasoning Tasks Arxiv Papers podcast

A

Arxiv Papers

1
[QA] Corrector Sampling in Language Models 7:31

2 days ago7:31

7:31

https://arxiv.org/abs//2506.06215 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

A

Arxiv Papers

1
Corrector Sampling in Language Models 19:02

2 days ago19:02

19:02

https://arxiv.org/abs//2506.06215 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

A

Arxiv Papers

1
[QA] Distillation Robustifies Unlearning 7:05

2 days ago7:05

7:05

The paper presents UNDO, a method that enhances unlearning in LLMs through distillation, achieving robust capability removal with reduced compute and data requirements compared to traditional retraining methods. https://arxiv.org/abs//2506.06278 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
Distillation Robustifies Unlearning 14:58

2 days ago14:58

14:58

The paper presents UNDO, a method that enhances unlearning in LLMs through distillation, achieving robust capability removal with reduced compute and data requirements compared to traditional retraining methods. https://arxiv.org/abs//2506.06278 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
[QA] Log-Linear Attention 7:50

3 days ago7:50

7:50

This paper introduces log-linear attention, enhancing linear attention's efficiency by using a logarithmically growing set of hidden states, improving sequence modeling while maintaining computational efficiency. https://arxiv.org/abs//2506.04761 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
Log-Linear Attention 21:59

3 days ago21:59

21:59

This paper introduces log-linear attention, enhancing linear attention's efficiency by using a logarithmically growing set of hidden states, improving sequence modeling while maintaining computational efficiency. https://arxiv.org/abs//2506.04761 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
[QA] Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening 7:45

3 days ago7:45

7:45

This paper critiques GRPO's bias in training language models for theorem proving and introduces the unlikeliness reward to enhance performance and sample diversity, achieving competitive results. https://arxiv.org/abs//2506.02355 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening 16:56

3 days ago16:56

16:56

This paper critiques GRPO's bias in training language models for theorem proving and introduces the unlikeliness reward to enhance performance and sample diversity, achieving competitive results. https://arxiv.org/abs//2506.02355 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
[QA] Self-Challenging Language Model Agents 7:26

4 days ago7:26

7:26

The Self-Challenging framework enables agents to generate and train on high-quality tasks autonomously, achieving significant performance improvements using self-generated data in tool-use benchmarks. https://arxiv.org/abs//2506.01716 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
Self-Challenging Language Model Agents 22:33

4 days ago22:33

22:33

The Self-Challenging framework enables agents to generate and train on high-quality tasks autonomously, achieving significant performance improvements using self-generated data in tool-use benchmarks. https://arxiv.org/abs//2506.01716 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
[QA] Why Gradients Rapidly Increase Near the End of Training 7:00

4 days ago7:00

7:00

https://arxiv.org/abs//2506.02285 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

A

Arxiv Papers

1
Why Gradients Rapidly Increase Near the End of Training 11:24

4 days ago11:24

11:24

https://arxiv.org/abs//2506.02285 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

A

Arxiv Papers

1
[QA] GEM: Empowering LLM for both Embedding Generation and Language Understanding 7:41

5 days ago7:41

7:41

The paper introduces GEM, a self-supervised method enabling decoder-only LLMs to generate high-quality text embeddings, enhancing performance on embedding benchmarks while preserving original text generation capabilities. https://arxiv.org/abs//2506.04344 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
GEM: Empowering LLM for both Embedding Generation and Language Understanding 20:38

5 days ago20:38

20:38

The paper introduces GEM, a self-supervised method enabling decoder-only LLMs to generate high-quality text embeddings, enhancing performance on embedding benchmarks while preserving original text generation capabilities. https://arxiv.org/abs//2506.04344 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

A

Arxiv Papers

1
[QA] HYPERSTEER: Activation Steering at Scale with Hypernetworks 7:49

6 days ago7:49

7:49

HYPERSTEER introduces hypernetwork architectures for generating effective steering vectors in language models, outperforming existing methods and achieving strong performance on unseen prompts. Code available at GitHub. https://arxiv.org/abs//2506.03292 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…

Similar to Arxiv Papers

The Let Them Theory: A Life-Changing Tool That Millions of People Can't Stop Talking About

Amazon Basics Multipurpose Copy Printer Paper, 8.5 x 11 inches, 20 lb, 1 Ream, 500 Sheets, 92 Bright, White

Zevo Flying Insect Trap & Cartridge - Plug in Fly Trap & Indoor Bug Catcher for Gnats, House & Fruit Flies - Mess-Free - Use in Any Room - Uses Blue & UV Light (1 Plug in Device & 1 Cartridge)

Podcasts Worth a Listen

Arxiv Papers « » ReasonIR: Training Retrievers for Reasoning Tasks