i am so fucking tired. if the LLM invents a tool to call, it first tells itself to ...

2026-04-14 08:42:30 UTC

i am so fucking tired. if the LLM invents a tool to call, it first tells itself to call another tool to check if the tool was actually real but the fucking nightmare code failed to pick it up in its necronomically guided wander of the environmental catacombs.

The ToolSearchTool then invalidates all its caches, checks for "deferred tools" (which are an INCREDIBLY AWESOME IDEA that allow tools to be injected in the prompt text, will get to that later), and then performs an old school regex-based scoring against all the tools that exist and their descriptions to find candidates. remember this is A LANGUAGE MODEL whose ENTIRE EXISTENCE is based on SOPHISTICATED TEXT AND INTENT MATCHING.

so yes. there is a chance that your LLM can hallucinate a tool and then end up calling some real tool if there is some regex overlap in their descriptions.

Author Public Key

npub1ljmfkwmllavdpnf5tgmrfay6mj4t78c0xryugfw4qka0c4exas0q2pw8tm

Seen on

wss://relay.momostr.pink

Show more details

Published at

2026-04-14 08:42:30 UTC

Kind type

1 Short Text Note

Event JSON

{ "id": "9bbc56759e4d17c9365d0154db7b88904c41c0383090111d29e498a632f28e68", "pubkey": "fcb69b3b7fff58d0cd345a3634f49adcaabf1f0f30c9c425d505bafc5726ec1e", "created_at": 1776156150, "kind": 1, "tags": [ [ "proxy", "https://neuromatch.social/@jonny/116402169497311107", "web" ], [ "e", "96139fe698135e312c8b1250549ef4fa9b88e793b54b70dd13f5b09ec95b5748", "", "root", "fcb69b3b7fff58d0cd345a3634f49adcaabf1f0f30c9c425d505bafc5726ec1e" ], [ "p", "fcb69b3b7fff58d0cd345a3634f49adcaabf1f0f30c9c425d505bafc5726ec1e" ], [ "e", "f9072f4cf2525fd6ab9bd204ca4cd36454bdf5a27d17d194d320c62088a3c5ee", "", "reply", "fcb69b3b7fff58d0cd345a3634f49adcaabf1f0f30c9c425d505bafc5726ec1e" ], [ "imeta", "url https://media.neuromatch.social/media_attachments/files/116/402/150/326/352/117/original/3f09d59241f1e33e.png", "m image/png" ], [ "imeta", "url https://media.neuromatch.social/media_attachments/files/116/402/118/300/896/186/original/eee91208957d60e2.png", "m image/png" ], [ "proxy", "https://neuromatch.social/users/jonny/statuses/116402169497311107", "activitypub" ], [ "L", "pink.momostr" ], [ "l", "pink.momostr.activitypub:https://neuromatch.social/users/jonny/statuses/116402169497311107", "pink.momostr" ], [ "-" ] ], "content": "i am so fucking tired. if the LLM invents a tool to call, it first tells itself to call another tool to check if the tool was actually real but the fucking nightmare code failed to pick it up in its necronomically guided wander of the environmental catacombs.\n\nThe ToolSearchTool then invalidates all its caches, checks for \"deferred tools\" (which are an INCREDIBLY AWESOME IDEA that allow tools to be injected in the prompt text, will get to that later), and then performs an old school regex-based scoring against all the tools that exist and their descriptions to find candidates. remember this is A LANGUAGE MODEL whose ENTIRE EXISTENCE is based on SOPHISTICATED TEXT AND INTENT MATCHING.\n\nso yes. there is a chance that your LLM can hallucinate a tool and then end up calling some real tool if there is some regex overlap in their descriptions.\nhttps://media.neuromatch.social/media_attachments/files/116/402/118/300/896/186/original/eee91208957d60e2.png\nhttps://media.neuromatch.social/media_attachments/files/116/402/150/326/352/117/original/3f09d59241f1e33e.png\n", "sig": "3f2121587019ea8f071c9b1dfe2e5f6e932cfd2f7132376bb33f83ad90efc2228baaaab8013aa57cba0e05442ce7fbaf0e3c3cb5924164a644faebad7f6e059e" }

jonny (nonvenomous) on Nostr: i am so fucking tired. if the LLM invents a tool to call, it first tells itself to ...