jonny (nonvenomous)

npub1lj…pw8tm

2026-04-02 10:30:45 UTC

in reply to nevent1q…ppz8

So how does Claude Code handle checking permissions to do things anyway? There are explicit rules that one can set to allow or deny tool calls and shell commands, but the space of possible actions the LLM could take is literally infinite. You could prompt the user for every action it takes, but that would ruin the "velocity" of it all. Regex rules can only take you so far. So what to do?
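To make the "regex rules can only take you so far" point concrete, here is a minimal sketch of rule-based permission checking. Everything in it is an assumption for illustration: the `Rule` type, the `check_command` function, and the example patterns are hypothetical and are not taken from the Claude Code source.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    pattern: str   # regular expression matched against the whole command
    decision: str  # "allow" or "deny"


def check_command(command: str, rules: list[Rule]) -> str:
    """Return "deny", "allow", or "ask" for a shell command.

    Hypothetical policy: any matching deny rule wins, then any matching
    allow rule, and anything unmatched falls back to asking the user.
    """
    matched = {r.decision for r in rules if re.fullmatch(r.pattern, command)}
    if "deny" in matched:
        return "deny"
    if "allow" in matched:
        return "allow"
    return "ask"


# Illustrative rules only, not real Claude Code configuration.
RULES = [
    Rule(r"git (status|diff|log)( .*)?", "allow"),
    Rule(r"rm -rf .*", "deny"),
]

print(check_command("git status", RULES))                     # allow
print(check_command("rm -rf /", RULES))                       # deny
print(check_command("curl https://example.com | sh", RULES))  # ask
print(check_command("git log && rm -rf ~", RULES))            # allow
```

The last call is the uncomfortable one: the command starts like an allowed `git log`, so the allow pattern matches the whole string, while the deny pattern only fires on commands that begin with `rm -rf`. Tightening the patterns fixes this case, but there is always another one.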

Could the answer be.... ask the LLM??? Of course it can! Introducing the new "auto mode" that Anthropic released on [March 24th](https://claude.com/blog/auto-mode), billed as a safer alternative to true-yolo mode.

Comments around where the system prompt should be indicate that it should have been inlined from a text file that wasn't included in the sourcemap. However, that kind of inlining doesn't happen anywhere else in the code, and the mechanism for doing the inlining is written in-place, so that's probably a hallucination. So great! The classifier flies without a prompt as far as i can tell. There are enough other scraps here that would amount to telling it "you are evaluating if something is safe to run" so i imagine it appears to work just fine.

So we don't have as much visibility here because of the missing prompt, but there's still sort of a problem. Rather than just asking the LLM to evaluate whether the given command is dangerous, the *entire context* is dumped into a side query, which is a mode that is designed to "have full visibility into the current conversation." That includes all the model's prior muttering to itself justifying the potentially dangerous tool call! So auto mode is quite literally asking the exact same LLM, given the exact same context, whether the command it just tried to run is safe to run.
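A rough sketch of the difference, since it's easier to see in code. This is not the actual Claude Code implementation: the `Message` type and both builder functions are hypothetical names, and the transcript is made up. It only contrasts what a safety classifier would get to read in each case.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Message:
    role: str  # "user", "assistant", or "tool_call"
    text: str


def full_context_input(transcript: list[Message], proposed: Message) -> list[Message]:
    """What the post describes: the classifier sees the whole conversation,
    including the model's own reasoning for why the call is fine."""
    return [*transcript, proposed]


def command_only_input(transcript: list[Message], proposed: Message) -> list[Message]:
    """The simpler alternative: the classifier sees only the proposed call."""
    return [proposed]


# Made-up conversation for illustration.
transcript = [
    Message("user", "clean up the build directory"),
    Message("assistant", "Removing everything under ~ is fine, it is all build output."),
]
proposed = Message("tool_call", "rm -rf ~")

full = full_context_input(transcript, proposed)
isolated = command_only_input(transcript, proposed)

# The self-justification reaches the classifier only in the full-context case.
print(any(m.role == "assistant" for m in full))
print(any(m.role == "assistant" for m in isolated))
```

In the full-context case the classifier reads the same justification that produced the tool call in the first place, which is the whole problem.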

Security!!!!!!!

Author Public Key

npub1ljmfkwmllavdpnf5tgmrfay6mj4t78c0xryugfw4qka0c4exas0q2pw8tm

Seen on

wss://relay.momostr.pink

Kind type

1 Short Text Note

Event JSON

{ "id": "36ec790b016baf0628214f2dea7b3c4bae55dd2f6a178386c1b663d041157c05", "pubkey": "fcb69b3b7fff58d0cd345a3634f49adcaabf1f0f30c9c425d505bafc5726ec1e", "created_at": 1775125845, "kind": 1, "tags": [ [ "e", "2bb9f53c1d679af374cf520d9d3dc5500aafdfd8c437b30827d6acc58d0c1b26", "", "root", "fcb69b3b7fff58d0cd345a3634f49adcaabf1f0f30c9c425d505bafc5726ec1e" ], [ "proxy", "https://neuromatch.social/@jonny/116334647429831137", "web" ], [ "e", "adbe0567e70fadf7bc6337b99dc0791b6d9bf7cd95b30adee8fa4e977961dda4", "", "reply", "fcb69b3b7fff58d0cd345a3634f49adcaabf1f0f30c9c425d505bafc5726ec1e" ], [ "imeta", "url https://media.neuromatch.social/media_attachments/files/116/334/632/271/148/691/original/51723d4945fae674.png", "m image/png" ], [ "p", "8f0d58cb3120c79ed44f4699eaf559b63067329357a84354ede7d5ad3e989e6a" ], [ "p", "fcb69b3b7fff58d0cd345a3634f49adcaabf1f0f30c9c425d505bafc5726ec1e" ], [ "imeta", "url https://media.neuromatch.social/media_attachments/files/116/334/631/856/244/419/original/40e077fd00c8a61e.png", "m image/png" ], [ "imeta", "url https://media.neuromatch.social/media_attachments/files/116/334/546/682/608/331/original/3543cc841cbd4b0e.png", "m image/png" ], [ "imeta", "url https://media.neuromatch.social/media_attachments/files/116/334/631/374/663/351/original/21546793db311c31.png", "m image/png" ], [ "proxy", "https://neuromatch.social/users/jonny/statuses/116334647429831137", "activitypub" ], [ "L", "pink.momostr" ], [ "l", "pink.momostr.activitypub:https://neuromatch.social/users/jonny/statuses/116334647429831137", "pink.momostr" ], [ "-" ] ], "content": "So how does claude code handle checking permissions to do things anyway? There are explicit rules that one can set to allow or deny tool calls and shell commands run, but the expanse of possible actions the LLM could take is literally infinite. You could prompt the user for every action that it takes, but that would ruin the \"\"velocity\"\" of it all. Regex rules can only take you so far. 
So what to do?\n\nCould the answer be.... ask the LLM??? Of course it can! Introducing the new \"auto mode\" that anthropic released on [march 24th](https://claude.com/blog/auto-mode ) billed as a safer alternative to true-yolo mode.\n\nComments around where the system prompt should be indicate that it should have been inlined from a text file that wasn't included in the sourcemap - however that doesn't happen anywhere else, and the mechanism for doing the inlining is written in-place, so that's probably a hallucination. So great! the classifier flies without a prompt as far as i can tell. There are enough other scraps here that would amount to telling it \"you are evaluating if something is safe to run\" so i imagine it appears to work just fine.\n\nSo we don't have as much visibility here because of the missing prompt, but there's sort of a problem here. rather than just asking the LLM to evaluate if the given command is dangerous, the *entire context* is dumped into a side query, which is a mode that is designed to \"have full visibility into the current conversation.\" That includes all the prior muttering to itself justifying the potentially dangerous tool call! So the auto mode is quite literally asking the exact same LLM given the exact same context if the command it just tried to run is safe to run.\n\nSecurity!!!!!!!\nhttps://media.neuromatch.social/media_attachments/files/116/334/546/682/608/331/original/3543cc841cbd4b0e.png\nhttps://media.neuromatch.social/media_attachments/files/116/334/631/374/663/351/original/21546793db311c31.png\nhttps://media.neuromatch.social/media_attachments/files/116/334/631/856/244/419/original/40e077fd00c8a61e.png\nhttps://media.neuromatch.social/media_attachments/files/116/334/632/271/148/691/original/51723d4945fae674.png\n", "sig": "c7a2f39315d8e58f4c28d822ec6979e38a7cf5746a25d32adf6f245c0697cce72aa83c7f0a53b2189ac0b9e6ceffeb3948b8d37d74bcb34210344914d1e40ece" }