{"slug": "a-simple-argument-for-trying-less-hard", "title": "A simple argument for trying less hard", "summary": "A new argument against trying hard in longtermist areas, particularly AI safety, suggests that intense effort distorts epistemics and increases the risk of negative impact due to high uncertainty. The author contends that pushing hard toward goals leads to motivated reasoning and less capacity for reflection, making it especially dangerous for those working on long-term future issues.", "body_md": "People often make arguments against “trying hard” (working very hard, pushing yourself to the brink, being intensely goal-directed, and so on) by pointing to the risks of burnout or of losing some kind of [wholesomeness](https://forum.effectivealtruism.org/s/qrDdoy8hNj58bFzHK) [1].\n\nBut there’s another, very simple argument against it that I have not seen anyone fully make explicit [2], even though I think it’s very important. It goes like this:\n\n**We face a** **lot of uncertainty about the sign of our impact.**\n\n**Therefore, we** **should be very vigilant about our epistemics to make sure that we are not having a negative impact in expectation.**\n\n**But trying hard** **deeply distorts our epistemics - it makes us more prone to motivated reasoning about what we’re doing, and leaves us with less slack to reflect on it.**\n\n**Therefore, all else being equal, we should try less hard.**\n\nCrucially, this argument applies much more strongly to people working in “longtermist areas” - which other critiques of trying hard generally don’t do. For example, global health EAs whose terminal value is short-term welfare also face uncertainty about the impact of their actions - but much less (especially about the sign) than people trying to improve the long-term future. So this argument suggests that it’s especially dangerous for longtermists to try very hard.[[3]](https://www.lesswrong.com/feed.xml#fndq4rao2lnqj)[[4]](https://www.lesswrong.com/feed.xml#fn6ip0tzqts0i)\n\nI’ll go through the steps in a little bit more detail.\n\nMuch has been said in EA about [cluelessness ](https://forum.effectivealtruism.org/topics/cluelessness)and [crucial considerations](https://forum.effectivealtruism.org/topics/crucial-consideration/), but I’ll highlight a few specific concerns that could make lots of current AI safety work [5] net negative (with no claim to novelty):\n\nThese are very difficult to evaluate (and underargued here). My point is not that these are all valid worries - I’m skeptical of several of them. But I don’t think they can be neglected.\n\nIn the past, I’ve felt a sense of being overwhelmed at all these considerations, and felt tempted to just avoid thinking about them - but that can’t be the answer. We have to take the uncertainty seriously. Even if I don’t currently have the capacity to go into some kind of deep reflection, I should attempt to make my actions as robust to the uncertainty as I can - for example, by making sure I can course-correct, and by keeping my epistemics in good shape.\n\nUnfortunately, trying very hard conflicts with this - the harder we push toward a goal, the more we bend the evidence to justify it, and the less mental room we have left to step back and question what we're doing.[[10]](https://www.lesswrong.com/feed.xml#fnja1khpbsr8)\n\nAnd I think there’s something stronger, too - in an [active inference](https://slatestarcodex.com/2017/09/05/book-review-surfing-uncertainty/) framework, beliefs and desires are both just expectations about the world. Experientially, this rings true to me - the feelings of frustration at not getting what I want and at being taken off guard by something I hadn’t even been paying attention to are very similar. It seems deeply hard to distinguish between what we want and what we believe.\n\nThis is a bit more speculative, but sometimes I think people don’t fully absorb this point: It’s not psychological, it’s *neurological.* There’s a sense in which wanting anything distorts us away from pure self-supervised prediction of the world and compresses us internally into living in a specific hypothesis - a “gut-level” vision of the world, that gets upweighted on the level of our base perceptions. So we may not be able to fully adjust for it by only manipulating psychological factors, e.g. consciously trying to be more objective or less selfish.\n\nTo be clear, I have a lot of respect for people who try extremely hard - I wouldn’t be able to do it, I’m often intimidated by them. [11] I’m also not trying to make a statement about how strong the update from this should be (I don’t even have a good enough knowledge about these spaces to have a precise sense of how hard various groups of people are trying). Maybe arguments for trying even harder actually outweigh this on the current margins, I wouldn’t know.\n\nBut I have the sense that this simple consideration is underrated, and I hope this post can provide a reference point for it and make people take it into account in their personal deliberations.\n\nAlso, some apparently seminal academic philosophy stuff that seems interesting: [Moral Saints](https://r.jordan.im/download/philosophy/wolf1982.pdf) by Susan Wolf, and [Bernard Williams’](https://plato.stanford.edu/entries/williams-bernard/) work.\n\nThe closest thing to it is probably the strain of thought around [What should you change in response to an “emergency”? And AI risk](https://www.lesswrong.com/posts/mmHctwkKjpvaQdC3c/what-should-you-change-in-response-to-an-emergency-and-ai) and [Slack gives you space to notice/reflect on subtle things](https://www.lesswrong.com/posts/fwSDKTZvraSdmwFsj/slack-gives-you-space-to-notice-reflect-on-subtle-things) - but that still seems centered on the MIRI-style mindset of (strawmanning here) “more AI safety is definitely good and we just need to think hard to find “true” AI safety work” (which isn’t really my mindset). That is, the uncertainty about what to do comes more from their specific inside-view that alignment is very hard, rather than model-agnostic EA/philosophy-style cluelessness. So I think the argument in this post is a more general one that should be convincing to more people (nowadays, there are obviously a lot of non-MIRI-cluster people who are trying extremely hard on AI safety stuff, e.g. [this post ](https://substack.com/home/post/p-199473927)that I saw recently).\n\nThere’s also the “[Maximization is perilous](https://forum.effectivealtruism.org/posts/T975ydo3mx8onH3iS/ea-is-about-maximization-and-maximization-is-perilous)” angle, but that’s more about naive optimization in general, and not about facing huge uncertainty specifically (e.g. it applies equally to global health EAs).\n\nAlso, shoutout to [Slack matters more than any outcome](https://www.lesswrong.com/posts/xEHy9oivifjgFbnvc/slack-matters-more-than-any-outcome) for a personally inspirational framing on related issues.\n\nWhich is a little counterintuitive, because we usually see the opposite - longtermist EAs being *more* intense. Although longtermists also have a bigger moral scope and often more urgency (vis-à-vis AI timelines), so may reasonably trade off more sharply against other values and personal well-being.\n\nThe same argument also works for virtue and general emotional health, but that’s out of scope for this post.\n\nOf course, AI safety interventions are extremely heterogenous - but that just increases the extent to which individual decisionmaking is crucial (as opposed to deferring to people).\n\n[Holden Karnofsky](https://80000hours.org/podcast/episodes/holden-karnofsky-concrete-ai-safety-frontier-ai-companies/): “Most things that touch policy at all in any way will move us along that spectrum in one direction or another, so therefore have a high chance of being negative [...]\n\nAnd then most things that you can do in AI at all will have some impact on policy. Even just alignment research: policy will be shaped by what we’re seeing from alignment research, how tractable it looks, what the interventions look like.” (h/t Anthony DiGiovanni)\n\n[Holden Karnofsky](https://80000hours.org/podcast/episodes/holden-karnofsky-concrete-ai-safety-frontier-ai-companies/): “there’s also a lot of micro ways in which you could do harm. Just literally working in safety and being annoying, you might do net harm. You might just talk to the wrong person at the wrong time, get on their nerves. I’ve heard lots of stories of this. Just like, this person does great safety work, but they really annoyed this one person, and that might be the reason we all go extinct” (h/t Anthony DiGiovanni)\n\nAmong other things.\n\nI associate these with people like [Richard Ngo ](https://x.com/RichardMCNgo/status/2056000490840699137)(and [here](https://www.lesswrong.com/posts/6YxdpGjfHyrZb7F2G/third-wave-ai-safety-needs-sociopolitical-thinking)) and [Oliver Habryka](https://forum.effectivealtruism.org/posts/W7AMKT8qssjS8WwjN/habryka-deactivated-s-quick-takes?commentId=cP3pActZJ4FdBQ6tn).\n\nThis is well-known in psychology. Also, Opus 4.8 wrote that sentence.\n\nI definitely don’t want to imply that *if only *it weren’t for this argument, I would be an extremely hard worker too, haha.", "url": "https://wpnews.pro/news/a-simple-argument-for-trying-less-hard", "canonical_source": "https://www.lesswrong.com/posts/sv7WezDZyu9Zdxkxb/a-simple-argument-for-trying-less-hard", "published_at": "2026-06-13 18:12:13+00:00", "updated_at": "2026-06-13 18:18:59.973394+00:00", "lang": "en", "topics": ["ai-safety", "ai-ethics"], "entities": [], "alternates": {"html": "https://wpnews.pro/news/a-simple-argument-for-trying-less-hard", "markdown": "https://wpnews.pro/news/a-simple-argument-for-trying-less-hard.md", "text": "https://wpnews.pro/news/a-simple-argument-for-trying-less-hard.txt", "jsonld": "https://wpnews.pro/news/a-simple-argument-for-trying-less-hard.jsonld"}}