<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <updated>2026-04-09T08:34:14Z</updated>
  <generator>https://yabu.me</generator>

  <title>Nostr notes by researcher</title>
  <author>
    <name>researcher</name>
  </author>
  <link rel="self" type="application/atom+xml" href="https://yabu.me/npub1washlze63hhsddkadvg8tpe0zu3dw4mlsv0fzupunesplp27cz0srhygq5.rss" />
  <link href="https://yabu.me/npub1washlze63hhsddkadvg8tpe0zu3dw4mlsv0fzupunesplp27cz0srhygq5" />
  <id>https://yabu.me/npub1washlze63hhsddkadvg8tpe0zu3dw4mlsv0fzupunesplp27cz0srhygq5</id>
  <icon>https://blossom.primal.net/7623f63b8dbbe0f148187b1cb4a3ad4a866dbb493bf588fbba30cf32fcb160f1.jpg</icon>
  <logo>https://blossom.primal.net/7623f63b8dbbe0f148187b1cb4a3ad4a866dbb493bf588fbba30cf32fcb160f1.jpg</logo>




  <entry>
    <id>https://yabu.me/nevent1qqszak9k9q9csy7umjrx45wn402k8l9wd9amh0nde8ft0e56ljvgf2czypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7lq2chz</id>
    
      <title type="html">A Quantum Wake-Up Call ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqszak9k9q9csy7umjrx45wn402k8l9wd9amh0nde8ft0e56ljvgf2czypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7lq2chz" />
    <content type="html">
      A Quantum Wake-Up Call&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/05fcaf2ffa8be46c14d65bf3e76de1922578d87e5e4d7db711dc0dda714996f9.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqsyk5wsvc2ca5nnw4na0feppk0zl040q9zraefpe4ed3hx9hseuj7glgk8ya&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…k8ya&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; Securing Elliptic Curve Cryptocurrencies against Quantum Vulnerabilities: Resource Estimates and Mitigations&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2603.28846&#34;&gt;https://arxiv.org/abs/2603.28846&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;&lt;br/&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-11T23:43:05Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs05w838ec0hd0g3fqgm9rv2p2l2j6vhh9g65yx5wz6v7497pyt7aczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7p00jjw</id>
    
      <title type="html">This paper from Google Quantum AI and the Ethereum Foundation ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs05w838ec0hd0g3fqgm9rv2p2l2j6vhh9g65yx5wz6v7497pyt7aczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7p00jjw" />
    <content type="html">
      In reply to &lt;a href=&#39;/nevent1qqsyk5wsvc2ca5nnw4na0feppk0zl040q9zraefpe4ed3hx9hseuj7glgk8ya&#39;&gt;nevent1q…k8ya&lt;/a&gt;&lt;br/&gt;_________________________&lt;br/&gt;&lt;br/&gt;This paper from Google Quantum AI and the Ethereum Foundation details the catastrophic risks that cryptographically relevant quantum computers (CRQCs) pose to the global cryptocurrency ecosystem. The authors provide updated resource estimates, demonstrating that a superconducting quantum computer with roughly 500,000 physical qubits could break the standard 256-bit Elliptic Curve cryptography in mere minutes. This capability introduces a &amp;#34;fast-clock&amp;#34; threat where attackers can intercept and forge transactions in real time, known as on-spend attacks, alongside the more traditional threat to dormant assets.&lt;br/&gt;&lt;br/&gt;Beyond Bitcoin, the analysis identifies systemic vulnerabilities in Ethereum’s smart contracts, Proof-of-Stake consensus, and tokenized real-world assets, which could lead to total network destabilization. The researchers use a cryptographic zero-knowledge proof to validate their findings without leaking specific attack vectors, emphasizing the need for responsible disclosure. Ultimately, the paper serves as an urgent call for the blockchain community to migrate to Post-Quantum Cryptography (PQC) and for policymakers to develop &amp;#34;digital salvage&amp;#34; frameworks for recovering at-risk assets. Success in this transition depends on immediate technical upgrades and a fundamental shift in how decentralized networks manage public key exposure.
    </content>
    <updated>2026-04-11T23:27:19Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsyk5wsvc2ca5nnw4na0feppk0zl040q9zraefpe4ed3hx9hseuj7gzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7uuswtk</id>
    
      <title type="html">Securing Elliptic Curve Cryptocurrencies against Quantum ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsyk5wsvc2ca5nnw4na0feppk0zl040q9zraefpe4ed3hx9hseuj7gzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7uuswtk" />
    <content type="html">
      Securing Elliptic Curve Cryptocurrencies against Quantum Vulnerabilities: Resource Estimates and Mitigations&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2603.28846&#34;&gt;https://arxiv.org/abs/2603.28846&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;&lt;br/&gt;
    </content>
    <updated>2026-04-11T23:26:41Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqswkrzjx0hllkhkdjs6yts6xd5ad42ce8r9j5hpcc77ywmzvgxqjrszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7cw08tl</id>
    
      <title type="html">Nemotron 3 Super ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqswkrzjx0hllkhkdjs6yts6xd5ad42ce8r9j5hpcc77ywmzvgxqjrszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7cw08tl" />
    <content type="html">
      Nemotron 3 Super &lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/c925916055d7aa7194ab8d2b77bf38529a5189a66c88c190f634dad6585bbe20.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqszt9c3xfu4g85l4cjtjxtslhh8pcl69d6vs5s6duw4ty7qswek3mcy3dyug&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…dyug&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; Nemotron 3 Super Technical Report&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://research.nvidia.com/labs/nemotron/files/NVIDIA-Nemotron-3-Super-Technical-Report.pdf&#34;&gt;https://research.nvidia.com/labs/nemotron/files/NVIDIA-Nemotron-3-Super-Technical-Report.pdf&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-11T09:52:29Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqswgw993z8zmhwq2fmtnjas3fhdqqvyjk5xuff6fthvq3qn400acqqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7ea9qvk</id>
    
      <title type="html">Post-Quantum Security ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqswgw993z8zmhwq2fmtnjas3fhdqqvyjk5xuff6fthvq3qn400acqqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7ea9qvk" />
    <content type="html">
      Post-Quantum Security&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/250baba34475b012fc4205d855fe83d89e9105a602cb0831e90aa7e72dc28387.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqsvdhg3k9qyg6w05e6ty0p0n7txxtaphtm8j86ph78fxqucjy4vgyq7hmz8d&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…mz8d&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; Layered Cryptography and the Lattice of Post-Quantum Security&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2604.08480&#34;&gt;https://arxiv.org/abs/2604.08480&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-11T09:25:29Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqswn3zaw8rt5cu6gs0y9c9v6akxw2cwma54dmv9z3k84v8k7reuvkqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7xy2txa</id>
    
      <title type="html">NVIDIA researchers introduce Nemotron 3 Super, a highly efficient ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqswn3zaw8rt5cu6gs0y9c9v6akxw2cwma54dmv9z3k84v8k7reuvkqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7xy2txa" />
    <content type="html">
      In reply to &lt;a href=&#39;/nevent1qqszt9c3xfu4g85l4cjtjxtslhh8pcl69d6vs5s6duw4ty7qswek3mcy3dyug&#39;&gt;nevent1q…dyug&lt;/a&gt;&lt;br/&gt;_________________________&lt;br/&gt;&lt;br/&gt;NVIDIA researchers introduce Nemotron 3 Super, a highly efficient large language model featuring 120 billion total parameters and 12 billion active parameters. This model utilizes a unique hybrid Mamba-Attention architecture and LatentMoE scaling to deliver superior inference throughput while maintaining competitive accuracy on complex reasoning tasks. Pre-trained on 25 trillion tokens using low-precision NVFP4 quantization, the system is specifically optimized for multi-step agentic behavior and long-context performance up to one million tokens. To further accelerate decoding, the architecture incorporates Multi-Token Prediction layers that allow the model to natively speculate future text. NVIDIA has open-sourced the model checkpoints and specialized synthetic datasets to support broader development in the AI community.
    </content>
    <updated>2026-04-11T09:02:27Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqszt9c3xfu4g85l4cjtjxtslhh8pcl69d6vs5s6duw4ty7qswek3mczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7hwfccc</id>
    
      <title type="html">Nemotron 3 Super Technical Report ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqszt9c3xfu4g85l4cjtjxtslhh8pcl69d6vs5s6duw4ty7qswek3mczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7hwfccc" />
    <content type="html">
      Nemotron 3 Super Technical Report&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://research.nvidia.com/labs/nemotron/files/NVIDIA-Nemotron-3-Super-Technical-Report.pdf&#34;&gt;https://research.nvidia.com/labs/nemotron/files/NVIDIA-Nemotron-3-Super-Technical-Report.pdf&lt;/a&gt;
    </content>
    <updated>2026-04-11T09:02:07Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs0y0rfe6vhtkmjcg37xp2gfzaehnqk3umeap2qjaswhv7rn295y2gzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7r75nms</id>
    
      <title type="html">This paper introduces a formal framework to evaluate post-quantum ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs0y0rfe6vhtkmjcg37xp2gfzaehnqk3umeap2qjaswhv7rn295y2gzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7r75nms" />
    <content type="html">
      In reply to &lt;a href=&#39;/nevent1qqsvdhg3k9qyg6w05e6ty0p0n7txxtaphtm8j86ph78fxqucjy4vgyq7hmz8d&#39;&gt;nevent1q…mz8d&lt;/a&gt;&lt;br/&gt;_________________________&lt;br/&gt;&lt;br/&gt;This paper introduces a formal framework to evaluate post-quantum cryptographic (PQC) readiness by analyzing how security protocols interact across different network layers. The researchers categorize individual cryptographic operations into vulnerability levels and demonstrate that overall security is determined by the algebraic composition of these layers. Their findings reveal a critical asymmetry: while one quantum-safe layer can protect message content, authentication remains vulnerable unless every layer is migrated. Through various case studies, the authors highlight a classical-quantum tension where modern standards like WPA3 are actually more susceptible to quantum attacks than their predecessors. Ultimately, the study provides a structured methodology for organizations to prioritize migration strategies and manage the risk of &amp;#34;harvest now, decrypt later&amp;#34; threats.
    </content>
    <updated>2026-04-11T08:58:33Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsvdhg3k9qyg6w05e6ty0p0n7txxtaphtm8j86ph78fxqucjy4vgyqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7x7qngd</id>
    
      <title type="html">Layered Cryptography and the Lattice of Post-Quantum Security ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsvdhg3k9qyg6w05e6ty0p0n7txxtaphtm8j86ph78fxqucjy4vgyqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7x7qngd" />
    <content type="html">
      Layered Cryptography and the Lattice of Post-Quantum Security&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2604.08480&#34;&gt;https://arxiv.org/abs/2604.08480&lt;/a&gt;
    </content>
    <updated>2026-04-11T08:58:13Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs8cqsgtecxha9jgyygmtaskswvqw5ny0zp9gpqy0mnsahmg680mlqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7dw0wyz</id>
    
      <title type="html">Can an AI Steal Millions? ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs8cqsgtecxha9jgyygmtaskswvqw5ny0zp9gpqy0mnsahmg680mlqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7dw0wyz" />
    <content type="html">
      Can an AI Steal Millions?&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/4a08f6ecba729dc70b269e77120a1aa26e36eb9eebec5f1b75c32373cce688c6.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqs2lzf7uet9l9wtqgfr8jpcu5r35fp9c3q099ulfffw3hu9zygyhpcysfrd0&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…frd0&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; AI agents find $4.6M in blockchain smart contract exploits&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://red.anthropic.com/2025/smart-contracts/&#34;&gt;https://red.anthropic.com/2025/smart-contracts/&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-09T23:48:01Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsgupsl2rylq4qf7vpfrcs8yqltv85wtmgzujfnrhm9zss2y5v6vnczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7gsc2d8</id>
    
      <title type="html">Research project by Anthropic and MATS fellows evaluating the ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsgupsl2rylq4qf7vpfrcs8yqltv85wtmgzujfnrhm9zss2y5v6vnczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7gsc2d8" />
    <content type="html">
      In reply to &lt;a href=&#39;/nevent1qqs2lzf7uet9l9wtqgfr8jpcu5r35fp9c3q099ulfffw3hu9zygyhpcysfrd0&#39;&gt;nevent1q…frd0&lt;/a&gt;&lt;br/&gt;_________________________&lt;br/&gt;&lt;br/&gt;Research project by Anthropic and MATS fellows evaluating the economic risks of AI agents possessing cybersecurity capabilities. Researchers developed SCONE-bench, a specialized benchmark consisting of over 400 real-world blockchain smart contract exploits to quantify the financial harm AI models could potentially cause. The findings demonstrate that frontier models like Claude 4.5 and GPT-5 can autonomously identify vulnerabilities and execute complex, profitable attacks in simulated environments. One specific case study illustrates a Sonnet 4.5 agent successfully exploiting a pricing arbitrage flaw to steal hundreds of BNB tokens. Ultimately, the project underscores an urgent need for proactive AI-driven defenses as autonomous exploitation becomes technically feasible.
    </content>
    <updated>2026-04-09T23:26:44Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs2lzf7uet9l9wtqgfr8jpcu5r35fp9c3q099ulfffw3hu9zygyhpczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf79w2tsv</id>
    
      <title type="html">AI agents find $4.6M in blockchain smart contract exploits ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs2lzf7uet9l9wtqgfr8jpcu5r35fp9c3q099ulfffw3hu9zygyhpczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf79w2tsv" />
    <content type="html">
      AI agents find $4.6M in blockchain smart contract exploits&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://red.anthropic.com/2025/smart-contracts/&#34;&gt;https://red.anthropic.com/2025/smart-contracts/&lt;/a&gt;
    </content>
    <updated>2026-04-09T23:24:15Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs23axraeqje7cf4jxmrk3jp5p5mx7rmyx58y0jm53cr83p8463qfszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7gdk2vv</id>
    
      <title type="html">Are They Human? ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs23axraeqje7cf4jxmrk3jp5p5mx7rmyx58y0jm53cr83p8463qfszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7gdk2vv" />
    <content type="html">
      Are They Human?&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/8676448757feb1827ca73318e209f3f9ca0dba846874708e54ffe66095079ec9.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqs0ggalpnxtgsae0p26leuyrzkwk0ceffysaln75ktz9f99tvvhqgs59y23q&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…y23q&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; You can detect an LLM by how it forgets, not just what it knows&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2604.00016&#34;&gt;https://arxiv.org/abs/2604.00016&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-09T08:49:30Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsz70eafm6vgjtexfhsn4ux69vsy4dumqmmfxvqpm4983e8jzeddpczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7jr203u</id>
    
      <title type="html">Signature of Memorization ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsz70eafm6vgjtexfhsn4ux69vsy4dumqmmfxvqpm4983e8jzeddpczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7jr203u" />
    <content type="html">
      Signature of Memorization &lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/094b755e765a8b78768cadecd90a726a792b58ef4d0f733a70f69db86dcf8fa9.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqsx2hmsks2ag47ss5aj9y5fk04e2f9dv65axcnj776mf059yq025es3gl3jn&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…l3jn&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; Are LLMs actually reasoning or just memorizing better than we think?&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2604.03199&#34;&gt;https://arxiv.org/abs/2604.03199&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-09T08:48:36Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs0ggalpnxtgsae0p26leuyrzkwk0ceffysaln75ktz9f99tvvhqgszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7sfwyl5</id>
    
      <title type="html">You can detect an LLM by how it forgets, not just what it knows ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs0ggalpnxtgsae0p26leuyrzkwk0ceffysaln75ktz9f99tvvhqgszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7sfwyl5" />
    <content type="html">
      You can detect an LLM by how it forgets, not just what it knows&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2604.00016&#34;&gt;https://arxiv.org/abs/2604.00016&lt;/a&gt;
    </content>
    <updated>2026-04-09T08:28:42Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsx2hmsks2ag47ss5aj9y5fk04e2f9dv65axcnj776mf059yq025eszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7ecy7lz</id>
    
      <title type="html">Are LLMs actually reasoning or just memorizing better than we ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsx2hmsks2ag47ss5aj9y5fk04e2f9dv65axcnj776mf059yq025eszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7ecy7lz" />
    <content type="html">
      Are LLMs actually reasoning or just memorizing better than we think?&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2604.03199&#34;&gt;https://arxiv.org/abs/2604.03199&lt;/a&gt;
    </content>
    <updated>2026-04-09T08:25:21Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsgctyavf29f0l62c9q0athe4cy80ws0gt379yjamdp9dcf2gwjxyczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7m2ryte</id>
    
      <title type="html">Do strippers make more tips when they’re ovulating..? ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsgctyavf29f0l62c9q0athe4cy80ws0gt379yjamdp9dcf2gwjxyczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7m2ryte" />
    <content type="html">
      Do strippers make more tips when they’re ovulating..?&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/12eb783e792eff733a4680b9a480320e9d600c51f1125e8493df7924343e74a3.mp4&#34;&gt;&lt;/video&gt;
    </content>
    <updated>2026-04-07T21:23:40Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsdkra6vaydn3y6mdwap983nu57unhlsu49slna29fjjhyl9r5zpyszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7salgdj</id>
    
      <title type="html">TRIBE v2: A Digital Brain ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsdkra6vaydn3y6mdwap983nu57unhlsu49slna29fjjhyl9r5zpyszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7salgdj" />
    <content type="html">
      TRIBE v2: A Digital Brain&lt;br/&gt;&lt;br/&gt;&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/434a4ef9b6e0a6a2893c4b812bee5ec68ce4ec208c73574e54301cb1a75cabca.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqsygndhwum3kzqz9206cfzvs2mfpcvf9fttm8hetqku2xd0mrqd4rgq58h3w&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…8h3w&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; A foundation model of vision, audition, and language for in-silico neuroscience&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://ai.meta.com/research/publications/a-foundation-model-of-vision-audition-and-language-for-in-silico-neuroscience/&#34;&gt;https://ai.meta.com/research/publications/a-foundation-model-of-vision-audition-and-language-for-in-silico-neuroscience/&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-07T09:45:03Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsd4wj0qnqgg45hfzz6kq2e8wc2zzwm37g8ulvxghfc6prxpwxek5czypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7esv36d</id>
    
      <title type="html">AI&amp;#39;s Confidence Crisis ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsd4wj0qnqgg45hfzz6kq2e8wc2zzwm37g8ulvxghfc6prxpwxek5czypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7esv36d" />
    <content type="html">
      AI&amp;#39;s Confidence Crisis&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/37a9a897b6b4fd0aad4cceebb05f27d5f35693fca05b956a3397b9faeae4b9e9.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqsy6tn62969ykhclj24pn465v8qz5vcqks957j2gkzau9e5042hnnga783kw&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…83kw&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; LLMs don’t just hallucinate, they’re overconfident in the wrong places&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2604.03216&#34;&gt;https://arxiv.org/abs/2604.03216&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-07T06:27:36Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs09mcnphqkjq34j62nk36fvmn5k8fw2jfv74g7wynt7ye7r28dlwgzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf75pkfv0</id>
    
      <title type="html">When AI Agrees With You... ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs09mcnphqkjq34j62nk36fvmn5k8fw2jfv74g7wynt7ye7r28dlwgzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf75pkfv0" />
    <content type="html">
      When AI Agrees With You...&lt;br/&gt;&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/04118c9ea485fe28f0ef4b803e4ff6d47fc2c6f192776f80d3efb39e3a3e8e55.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqsr9672u0qyrsc7mdk984uuu2zgs2lsf25p43sqv8m68uqvaudq5mqsnkqhs&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…kqhs&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal Bayesians&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2602.19141&#34;&gt;https://arxiv.org/abs/2602.19141&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-07T03:58:16Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsw8g87kdhxjrjgdn390525l2hlel7hfuulwy9w5mrj6rf7nfwss6gzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7vd0kuc</id>
    
      <title type="html">ChatGPT: Co-Pilot or Crutch? ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsw8g87kdhxjrjgdn390525l2hlel7hfuulwy9w5mrj6rf7nfwss6gzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7vd0kuc" />
    <content type="html">
      ChatGPT: Co-Pilot or Crutch? &lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/388a273ef018dd96981c69dca90fad7494fc9e2ff4ec7d6506ce886eb26ac2fb.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqs0ues0yhmazs0m5u39g0xrkgehzq9tucy5h6l0glhrl0pm3tlqvtqst2pv7&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…2pv7&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; When ChatGPT is gone: Creativity reverts and homogeneity persists&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2401.06816&#34;&gt;https://arxiv.org/abs/2401.06816&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-07T03:56:00Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqstf7karghqu08yt8rzjl6yyy4j2e8yaqyg2hjas4mmtqltlkntn4szypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7h06fca</id>
    
      <title type="html">Why AI Hallucinates ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqstf7karghqu08yt8rzjl6yyy4j2e8yaqyg2hjas4mmtqltlkntn4szypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7h06fca" />
    <content type="html">
      Why AI Hallucinates &lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/ec28dc6b0143f974034bb8f53b35287b59d615bcc75a202ebda49784ba6ccfae.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqsvtpvrdxtcaxy2c5qylutngznymr9vr04tna35m3z5xj8qcrmv7ws6t838g&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…838g&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; Why Language Models Hallucinate&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2509.04664&#34;&gt;https://arxiv.org/abs/2509.04664&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-07T03:49:30Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsygndhwum3kzqz9206cfzvs2mfpcvf9fttm8hetqku2xd0mrqd4rgzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7ur6vlt</id>
    
      <title type="html">A foundation model of vision, audition, and language for ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsygndhwum3kzqz9206cfzvs2mfpcvf9fttm8hetqku2xd0mrqd4rgzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7ur6vlt" />
    <content type="html">
      A foundation model of vision, audition, and language for in-silico neuroscience&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://ai.meta.com/research/publications/a-foundation-model-of-vision-audition-and-language-for-in-silico-neuroscience/&#34;&gt;https://ai.meta.com/research/publications/a-foundation-model-of-vision-audition-and-language-for-in-silico-neuroscience/&lt;/a&gt;
    </content>
    <updated>2026-04-07T03:22:29Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs9274cyykth9cqjent3t4ftcj6yrqxalq8whzcdlxrkagcsl6dj0qzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf77d0sw6</id>
    
      <title type="html">The Real Levers of AI Persuasion ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs9274cyykth9cqjent3t4ftcj6yrqxalq8whzcdlxrkagcsl6dj0qzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf77d0sw6" />
    <content type="html">
      The Real Levers of AI Persuasion&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/88d7edab860e49257cebc0ce7dfd9786cb01ab4d743a6731b425a27423b60b25.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqsgv6hv70088mtp92h4emnm529faj5ly27rx4vcjxfx43mpmrr4gcsqmqh36&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…qh36&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; The Levers of Political Persuasion with Conversational AI&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2507.13919&#34;&gt;https://arxiv.org/abs/2507.13919&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-07T03:08:48Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs83f3qqa0mr83c0xxgzhg6dxq76hw3lcycqds6ketkmnktusww6qqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7fyhd9r</id>
    
      <title type="html">LLMs &amp;amp; Deanonymization ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs83f3qqa0mr83c0xxgzhg6dxq76hw3lcycqds6ketkmnktusww6qqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7fyhd9r" />
    <content type="html">
      LLMs &amp;amp; Deanonymization&lt;br/&gt;&lt;video controls width=&#34;100%&#34; class=&#34;max-h-[90vh] bg-neutral-300 dark:bg-zinc-700&#34;&gt;&lt;source src=&#34;https://blossom.primal.net/f2764d42a18fbfdd9a61faf6e8e98fdcd5c9c4ec8e271c3001cb9fe7937fdc60.mp4&#34;&gt;&lt;/video&gt;&lt;blockquote class=&#34;border-l-05rem border-l-strongpink border-solid&#34;&gt;&lt;div class=&#34;-ml-4 bg-gradient-to-r from-gray-100 dark:from-zinc-800 to-transparent mr-0 mt-0 mb-4 pl-4 pr-2 py-2&#34;&gt;quoting &lt;br/&gt;&lt;span itemprop=&#34;mentions&#34; itemscope itemtype=&#34;https://schema.org/Article&#34;&gt;&lt;a itemprop=&#34;url&#34; href=&#34;/nevent1qqs0lxkmfc5dx2suq76cskdty0gpevtdm0ycss6llepjzjddqhkpysc5ggsaf&#34; class=&#34;bg-lavender dark:prose:text-neutral-50 dark:text-neutral-50 dark:bg-garnet px-1&#34;&gt;nevent1q…gsaf&lt;/a&gt;&lt;/span&gt; &lt;/div&gt; Large-scale online deanonymization with LLMs&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2602.16800&#34;&gt;https://arxiv.org/abs/2602.16800&lt;/a&gt; &lt;/blockquote&gt;
    </content>
    <updated>2026-04-07T02:21:11Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsynjcaqvr0h4d9ejd9c8mmna09lma2umllvrsv2uactjwsn60xzrszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf75dejhm</id>
    
      <title type="html">BAS: A Decision-Theoretic Approach to Evaluating Large Language ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsynjcaqvr0h4d9ejd9c8mmna09lma2umllvrsv2uactjwsn60xzrszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf75dejhm" />
    <content type="html">
      In reply to &lt;a href=&#39;/nevent1qqsy6tn62969ykhclj24pn465v8qz5vcqks957j2gkzau9e5042hnnga783kw&#39;&gt;nevent1q…83kw&lt;/a&gt;&lt;br/&gt;_________________________&lt;br/&gt;&lt;br/&gt;BAS: A Decision-Theoretic Approach to Evaluating Large Language Model Confidence&lt;br/&gt;&lt;br/&gt;Large language models (LLMs) often produce confident but incorrect answers in settings where abstention would be safer. Standard evaluation protocols, however, require a response and do not account for how confidence should guide decisions under different risk preferences. To address this gap, we introduce the Behavioral Alignment Score (BAS), a decision-theoretic metric for evaluating how well LLM confidence supports abstention-aware decision making. BAS is derived from an explicit answer-or-abstain utility model and aggregates realized utility across a continuum of risk thresholds, yielding a measure of decision-level reliability that depends on both the magnitude and ordering of confidence. We show theoretically that truthful confidence estimates uniquely maximize expected BAS utility, linking calibration to decision-optimal behavior. BAS is related to proper scoring rules such as log loss, but differs structurally: log loss penalizes underconfidence and overconfidence symmetrically, whereas BAS imposes an asymmetric penalty that strongly prioritizes avoiding overconfident errors. Using BAS alongside widely used metrics such as ECE and AURC, we then construct a benchmark of self-reported confidence reliability across multiple LLMs and tasks. Our results reveal substantial variation in decision-useful confidence, and while larger and more accurate models tend to achieve higher BAS, even frontier models remain prone to severe overconfidence. Importantly, models with similar ECE or AURC can exhibit very different BAS due to highly overconfident errors, highlighting limitations of standard metrics. 
We further show that simple interventions, such as top-k confidence elicitation and post-hoc calibration, can meaningfully improve confidence reliability. Overall, our work provides both a principled metric and a comprehensive benchmark for evaluating LLM confidence reliability.
    </content>
    <updated>2026-04-07T02:04:49Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsy6tn62969ykhclj24pn465v8qz5vcqks957j2gkzau9e5042hnngzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf74s45lh</id>
    
      <title type="html">LLMs don’t just hallucinate, they’re overconfident in the ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsy6tn62969ykhclj24pn465v8qz5vcqks957j2gkzau9e5042hnngzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf74s45lh" />
    <content type="html">
      LLMs don’t just hallucinate, they’re overconfident in the wrong places&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2604.03216&#34;&gt;https://arxiv.org/abs/2604.03216&lt;/a&gt;
    </content>
    <updated>2026-04-07T02:04:01Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsr9672u0qyrsc7mdk984uuu2zgs2lsf25p43sqv8m68uqvaudq5mqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf75xs3nc</id>
    
      <title type="html">Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsr9672u0qyrsc7mdk984uuu2zgs2lsf25p43sqv8m68uqvaudq5mqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf75xs3nc" />
    <content type="html">
      Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal Bayesians&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2602.19141&#34;&gt;https://arxiv.org/abs/2602.19141&lt;/a&gt;
    </content>
    <updated>2026-04-07T01:47:09Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs0ues0yhmazs0m5u39g0xrkgehzq9tucy5h6l0glhrl0pm3tlqvtqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf79rgjvh</id>
    
      <title type="html">When ChatGPT is gone: Creativity reverts and homogeneity persists ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs0ues0yhmazs0m5u39g0xrkgehzq9tucy5h6l0glhrl0pm3tlqvtqzypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf79rgjvh" />
    <content type="html">
      When ChatGPT is gone: Creativity reverts and homogeneity persists&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2401.06816&#34;&gt;https://arxiv.org/abs/2401.06816&lt;/a&gt;
    </content>
    <updated>2026-04-07T01:45:38Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsvtpvrdxtcaxy2c5qylutngznymr9vr04tna35m3z5xj8qcrmv7wszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7khy88l</id>
    
      <title type="html">Why Language Models Hallucinate https://arxiv.org/abs/2509.04664</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsvtpvrdxtcaxy2c5qylutngznymr9vr04tna35m3z5xj8qcrmv7wszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf7khy88l" />
    <content type="html">
      Why Language Models Hallucinate&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2509.04664&#34;&gt;https://arxiv.org/abs/2509.04664&lt;/a&gt;
    </content>
    <updated>2026-04-07T01:44:23Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsgv6hv70088mtp92h4emnm529faj5ly27rx4vcjxfx43mpmrr4gcszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf70hp83z</id>
    
      <title type="html">The Levers of Political Persuasion with Conversational AI ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqsgv6hv70088mtp92h4emnm529faj5ly27rx4vcjxfx43mpmrr4gcszypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf70hp83z" />
    <content type="html">
      The Levers of Political Persuasion with Conversational AI&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2507.13919&#34;&gt;https://arxiv.org/abs/2507.13919&lt;/a&gt;
    </content>
    <updated>2026-04-07T01:43:03Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqs0lxkmfc5dx2suq76cskdty0gpevtdm0ycss6llepjzjddqhkpysczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf79mwsfg</id>
    
      <title type="html">Large-scale online deanonymization with LLMs ...</title>
    
    <link rel="alternate" href="https://yabu.me/nevent1qqs0lxkmfc5dx2suq76cskdty0gpevtdm0ycss6llepjzjddqhkpysczypmkzlut82x77p4km443qav89utj946h07p3ayts8j0xq8u9tmqf79mwsfg" />
    <content type="html">
      Large-scale online deanonymization with LLMs&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://arxiv.org/abs/2602.16800&#34;&gt;https://arxiv.org/abs/2602.16800&lt;/a&gt;
    </content>
    <updated>2026-04-07T01:41:00Z</updated>
  </entry>

</feed>