Tony Finch (npub1qw6…xm8d) I'm sure I recently read a good blog post with a simply formula for the three kinds of Unicode characters that should be filtered from user input, but I don't know who wrote it or where it was. Does your link log have anything like that?
It's for this person:
https://lethargic.talkative.fish/@suricrasia/statuses/01KG6HET8WZNMWECWFYNC9MTQS