<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <updated>2026-05-25T19:18:08Z</updated>
  <generator>https://yabu.me</generator>

  <title>Nostr notes by Raphaël Millière</title>
  <author>
    <name>Raphaël Millière</name>
  </author>
  <link rel="self" type="application/atom+xml" href="https://yabu.me/npub1a88y0h4ssc5lqk95x0l5sp5lunyhh9kfdj9aa6j3urmafftzw4vqhck4yy.rss" />
  <link href="https://yabu.me/npub1a88y0h4ssc5lqk95x0l5sp5lunyhh9kfdj9aa6j3urmafftzw4vqhck4yy" />
  <id>https://yabu.me/npub1a88y0h4ssc5lqk95x0l5sp5lunyhh9kfdj9aa6j3urmafftzw4vqhck4yy</id>
  <icon>https://cdn.masto.host/sigmoidsocial/accounts/avatars/109/298/065/731/563/682/original/9dc51c7f64a33889.jpg</icon>
  <logo>https://cdn.masto.host/sigmoidsocial/accounts/avatars/109/298/065/731/563/682/original/9dc51c7f64a33889.jpg</logo>

  <entry>
    <id>https://yabu.me/nevent1qqs9d3ecsprx2r95drh0gyjq959dccf80m4c7q6gzkv4azlc4s5agpqzyr5uu377kzrznuzcksel7jqxnljvj7uke9kghhh228s00499vf64szzm6jr</id>
    <title type="html">A personal update -- I&amp;#39;m happy to share that I&amp;#39;ll be ...</title>
    <link rel="alternate" href="https://yabu.me/nevent1qqs9d3ecsprx2r95drh0gyjq959dccf80m4c7q6gzkv4azlc4s5agpqzyr5uu377kzrznuzcksel7jqxnljvj7uke9kghhh228s00499vf64szzm6jr" />
    <content type="html">
      A personal update -- I&amp;#39;m happy to share that I&amp;#39;ll be joining Oxford this fall as an associate professor, as well as a fellow of Jesus College and affiliate with the Institute for Ethics in AI. I&amp;#39;ll also be establishing my AI2050 Fellowship from Schmidt Sciences there. Looking forward to getting started!
    </content>
    <updated>2025-08-21T14:09:02Z</updated>
  </entry>

  <entry>
    <id>https://yabu.me/nevent1qqsq9p463nxdd74z4eh5vhr8n83d6g2z02yjve2xywwa4akedp3qqyszyr5uu377kzrznuzcksel7jqxnljvj7uke9kghhh228s00499vf64sdegcug</id>
    <title type="html">Despite extensive safety training, LLMs remain vulnerable to ...</title>
    <link rel="alternate" href="https://yabu.me/nevent1qqsq9p463nxdd74z4eh5vhr8n83d6g2z02yjve2xywwa4akedp3qqyszyr5uu377kzrznuzcksel7jqxnljvj7uke9kghhh228s00499vf64sdegcug" />
    <content type="html">
      Despite extensive safety training, LLMs remain vulnerable to “jailbreaking” through adversarial prompts. Why does this vulnerability persist? In a new open access paper published in Philosophical Studies, I argue this is because current alignment methods are fundamentally shallow. 🧵 1/13&lt;br/&gt;&lt;br/&gt;&lt;a href=&#34;https://link.springer.com/article/10.1007/s11098-025-02347-3&#34;&gt;https://link.springer.com/article/10.1007/s11098-025-02347-3&lt;/a&gt;&lt;br/&gt; &lt;img src=&#34;https://cdn.masto.host/sigmoidsocial/media_attachments/files/114/659/354/097/169/978/original/8ca0c5cc201da6e8.png&#34;&gt; &lt;br/&gt;
    </content>
    <updated>2025-06-10T13:42:03Z</updated>
  </entry>

</feed>