Simon Willison on Nostr: I built a new tool: - it runs OCR against images and PDFs entirely in your browser ...
Published at
2024-03-30 18:04:07Event JSON
{
"id": "011d479a9f00ce10dbbf35bd56504f9e4aaeb4eae86021cec0912231261c47b8",
"pubkey": "8b0be93ed69c30e9a68159fd384fd8308ce4bbf16c39e840e0803dcb6c08720e",
"created_at": 1711821847,
"kind": 1,
"tags": [
[
"proxy",
"https://fedi.simonwillison.net/users/simon/statuses/112185956608395470",
"activitypub"
]
],
"content": "I built a new tool: https://tools.simonwillison.net/ocr - it runs OCR against images and PDFs entirely in your browser (no file upload needed) using Tesseract.js and PDF.js\n\nI wrote more about the tool and how I built it (with copious amounts of Claude 3 Opus and a little bit of ChatGPT) here: https://simonwillison.net/2024/Mar/30/ocr-pdfs-images/\n\nhttps://cdn.masto.host/fedisimonwillisonnet/media_attachments/files/112/185/950/303/038/617/original/496d8a233536b3e4.mp4",
"sig": "7e9233f43c0ab0ef99f317cdba09105813134a60a4fe3f5a84c43ab951707038f9a6668de9d104994558abb387a212900bd9b6efa3bd1f19a4d95e671df5af2d"
}