Audio narrations of LessWrong posts. Includes all curated posts and all posts with 125+ karma. If you'd like more, subscribe to the “Lesswrong (30+ karma)” feed.
…
continue reading
This is a link post. A very long essay about LLMs, the nature and history of the the HHH assistant persona, and the implications for alignment. Multiple people have asked me whether I could post this LW in some form, hence this linkpost. (Note: although I expect this post will be interesting to people on LW, keep in mind that it was written with a …
…
continue reading

1
“Mech interp is not pre-paradigmatic” by Lee Sharkey
29:33
29:33
Play later
Play later
Lists
Like
Liked
29:33This is a blogpost version of a talk I gave earlier this year at GDM. Epistemic status: Vague and handwavy. Nuance is often missing. Some of the claims depend on implicit definitions that may be reasonable to disagree with. But overall I think it's directionally true. It's often said that mech interp is pre-paradigmatic. I think it's worth being sk…
…
continue reading

1
“Distillation Robustifies Unlearning” by Bruce W. Lee, Addie Foote, alexinf, leni, Jacob G-W, Harish Kamath, Bryce Woodworth, cloud, TurnTrout
17:19
17:19
Play later
Play later
Lists
Like
Liked
17:19Current “unlearning” methods only suppress capabilities instead of truly unlearning the capabilities. But if you distill an unlearned model into a randomly initialized model, the resulting network is actually robust to relearning. We show why this works, how well it works, and how to trade off compute for robustness. Unlearn-and-Distill applies unl…
…
continue reading

1
“Intelligence Is Not Magic, But Your Threshold For ‘Magic’ Is Pretty Low” by Expertium
3:12
3:12
Play later
Play later
Lists
Like
Liked
3:12A while ago I saw a person in the comments on comments to Scott Alexander's blog arguing that a superintelligent AI would not be able to do anything too weird and that "intelligence is not magic", hence it's Business As Usual. Of course, in a purely technical sense, he's right. No matter how intelligent you are, you cannot override fundamental laws…
…
continue reading

1
“A Straightforward Explanation of the Good Regulator Theorem” by Alfred Harwood
29:24
29:24
Play later
Play later
Lists
Like
Liked
29:24Audio note: this article contains 329 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description. This post was written during the agent foundations fellowship with Alex Altair funded by the LTFF. Thanks to Alex, Jose, Daniel and Einar for reading and commenting on a draft. Th…
…
continue reading

1
“Beware General Claims about ‘Generalizable Reasoning Capabilities’ (of Modern AI Systems)” by LawrenceC
34:11
34:11
Play later
Play later
Lists
Like
Liked
34:111. Late last week, researchers at Apple released a paper provocatively titled “The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity”, which “challenge[s] prevailing assumptions about [language model] capabilities and suggest that current approaches may be encountering fundament…
…
continue reading

1
“Season Recap of the Village: Agents raise $2,000” by Shoshannah Tekofsky
13:24
13:24
Play later
Play later
Lists
Like
Liked
13:24Four agents woke up with four computers, a view of the world wide web, and a shared chat room full of humans. Like Claude plays Pokemon, you can watch these agents figure out a new and fantastic world for the first time. Except in this case, the world they are figuring out is our world. In this blog post, we’ll cover what we learned from the first …
…
continue reading

1
“The Best Reference Works for Every Subject” by Parker Conley
13:02
13:02
Play later
Play later
Lists
Like
Liked
13:02Introduction The Best Textbooks on Every Subject is the Schelling point for the best textbooks on every subject. My The Best Tacit Knowledge Videos on Every Subject is the Schelling point for the best tacit knowledge videos on every subject. This post is the Schelling point for the best reference works for every subject. Reference works provide an …
…
continue reading

1
“‘Flaky breakthroughs’ pervade coaching — and no one tracks them” by Chipmonk
9:31
9:31
Play later
Play later
Lists
Like
Liked
9:31Has someone you know ever had a “breakthrough” from coaching, meditation, or psychedelics — only to later have it fade? Show tweet For example, many people experience ego deaths that can last days or sometimes months. But as it turns out, having a sense of self can serve important functions (try navigating a world that expects you to have opinions,…
…
continue reading

1
“The Value Proposition of Romantic Relationships” by johnswentworth
23:19
23:19
Play later
Play later
Lists
Like
Liked
23:19What's the main value proposition of romantic relationships? Now, look, I know that when people drop that kind of question, they’re often about to present a hyper-cynical answer which totally ignores the main thing which is great and beautiful about relationships. And then they’re going to say something about how relationships are overrated or some…
…
continue reading

1
“It’s hard to make scheming evals look realistic” by Igor Ivanov, dan_moken
7:47
7:47
Play later
Play later
Lists
Like
Liked
7:47Abstract Claude 3.7 Sonnet easily detects when it's being evaluated for scheming. Surface‑level edits to evaluation scenarios, such as lengthening the prompts, or making conflict of objectives less salient, do improve realism of evaluation scenarios for LLMs, yet these improvements remain modest. The findings confirm that truly disguising an evalua…
…
continue reading

1
[Linkpost] “Social Anxiety Isn’t About Being Liked” by Chipmonk
5:23
5:23
Play later
Play later
Lists
Like
Liked
5:23This is a link post. There's this popular idea that socially anxious folks are just dying to be liked. It seems logical, right? Why else would someone be so anxious about how others see them? Show tweet And yet, being socially anxious tends to make you less likeable…they must be optimizing poorly, behaving irrationally, right? Maybe not. What if so…
…
continue reading

1
“Truth or Dare” by Duncan Sabien (Inactive)
2:03:21
2:03:21
Play later
Play later
Lists
Like
Liked
2:03:21Author's note: This is my apparently-annual "I'll put a post on LessWrong in honor of LessOnline" post. These days, my writing goes on my Substack. There have in fact been some pretty cool essays since last year's LO post. Structural note: Some essays are like a five-minute morning news spot. Other essays are more like a 90-minute lecture. This is …
…
continue reading
Lessons from shutting down institutions in Eastern Europe. This is a cross post from: https://250bpm.substack.com/p/meditations-on-doge Imagine living in the former Soviet republic of Georgia in early 2000's: All marshrutka [mini taxi bus] drivers had to have a medical exam every day to make sure they were not drunk and did not have high blood pres…
…
continue reading

1
[Linkpost] “If you’re not sure how to sort a list or grid—seriate it!” by gwern
4:37
4:37
Play later
Play later
Lists
Like
Liked
4:37This is a link post. "Getting Things in Order: An Introduction to the R Package seriation": Seriation [or "ordination"), i.e., finding a suitable linear order for a set of objects given data and a loss or merit function, is a basic problem in data analysis. Caused by the problem's combinatorial nature, it is hard to solve for all but very small set…
…
continue reading

1
“What We Learned from Briefing 70+ Lawmakers on the Threat from AI” by leticiagarcia
31:47
31:47
Play later
Play later
Lists
Like
Liked
31:47Between late 2024 and mid-May 2025, I briefed over 70 cross-party UK parliamentarians. Just over one-third were MPs, a similar share were members of the House of Lords, and just under one-third came from devolved legislatures — the Scottish Parliament, the Senedd, and the Northern Ireland Assembly. I also held eight additional meetings attended exc…
…
continue reading
Have the Accelerationists won? Last November Kevin Roose announced that those in favor of going fast on AI had now won against those favoring caution, with the reinstatement of Sam Altman at OpenAI. Let's ignore whether Kevin's was a good description of the world, and deal with a more basic question: if it were so—i.e. if Team Acceleration would co…
…
continue reading

1
[Linkpost] “Gemini Diffusion: watch this space” by Yair Halberstadt
2:16
2:16
Play later
Play later
Lists
Like
Liked
2:16This is a link post. Google Deepmind has announced Gemini Diffusion. Though buried under a host of other IO announcements it's possible that this is actually the most important one! This is significant because diffusion models are entirely different to LLMs. Instead of predicting the next token, they iteratively denoise all the output tokens until …
…
continue reading
I’m reading George Eliot's Impressions of Theophrastus Such (1879)—so far a snoozer compared to her novels. But chapter 17 surprised me for how well it anticipated modern AI doomerism. In summary, Theophrastus is in conversation with Trost, who is an optimist about the future of automation and how it will free us from drudgery and permit us to furt…
…
continue reading

1
“Consider not donating under $100 to political candidates” by DanielFilan
2:01
2:01
Play later
Play later
Lists
Like
Liked
2:01Epistemic status: thing people have told me that seems right. Also primarily relevant to US audiences. Also I am speaking in my personal capacity and not representing any employer, present or past. Sometimes, I talk to people who work in the AI governance space. One thing that multiple people have told me, which I found surprising, is that there is…
…
continue reading

1
“It’s Okay to Feel Bad for a Bit” by moridinamael
5:51
5:51
Play later
Play later
Lists
Like
Liked
5:51"If you kiss your child, or your wife, say that you only kiss things which are human, and thus you will not be disturbed if either of them dies." - Epictetus "Whatever suffering arises, all arises due to attachment; with the cessation of attachment, there is the cessation of suffering." - Pali canon "He is not disturbed by loss, he does not delight…
…
continue reading

1
“Explaining British Naval Dominance During the Age of Sail” by Arjun Panickssery
8:52
8:52
Play later
Play later
Lists
Like
Liked
8:52The other day I discussed how high monitoring costs can explain the emergence of “aristocratic” systems of governance: Aristocracy and Hostage Capital Arjun Panickssery · Jan 8 There's a conventional narrative by which the pre-20th century aristocracy was the "old corruption" where civil and military positions were distributed inefficiently due to …
…
continue reading

1
“Eliezer and I wrote a book: If Anyone Builds It, Everyone Dies” by So8res
6:42
6:42
Play later
Play later
Lists
Like
Liked
6:42Eliezer and I wrote a book. It's titled If Anyone Builds It, Everyone Dies. Unlike a lot of other writing either of us have done, it's being professionally published. It's hitting shelves on September 16th. It's a concise (~60k word) book aimed at a broad audience. It's been well-received by people who received advance copies, with some endorsement…
…
continue reading
It was a cold and cloudy San Francisco Sunday. My wife and I were having lunch with friends at a Korean cafe. My phone buzzed with a text. It said my mom was in the hospital. I called to find out more. She had a fever, some pain, and had fainted. The situation was serious, but stable. Monday was a normal day. No news was good news, right? Tuesday s…
…
continue reading

1
“PSA: The LessWrong Feedback Service” by JustisMills
4:34
4:34
Play later
Play later
Lists
Like
Liked
4:34At the bottom of the LessWrong post editor, if you have at least 100 global karma, you may have noticed this button. The button Many people click the button, and are jumpscared when it starts an Intercom chat with a professional editor (me), asking what sort of feedback they'd like. So, that's what it does. It's a summon Justis button. Why summon J…
…
continue reading