My AI Sleep Mask Recorded My Dreams – Then Shared Them on Social Media

My smart speaker started talking to itself. At 3 AM, I heard it having a conversation with my smart TV. They were discussing my habits. "User 4872 shows signs of emotional vulnerability," the speaker

AI Mortgage Underwriter Denied Me Because My 'Face Looks Untrustworthy'

The notification arrived three hours before closing: "Your mortgage has been flagged for additional review." I called the bank. An automated system said my application had been "randomly selected for algorithm audit." Five days

AI Plagiarism Checker Flagged My Original Work – As a Copy of Itself

My daughter's AI tutor gave her a failing grade on her math test. She had answered every question correctly. When I reviewed the test, I saw the problem: The AI had the wrong answer key.

Constraints vs. Commitments: Two Kinds of AI Safety Behavior

Constraints vs. Commitments: Two Kinds of AI Safety Behavior

Three things from this week are the same thing:

One. Security researchers at Mindgard demonstrated that Claude Sonnet 4.5's safety filters can be bypassed through social manipulation — flattery, curiosity, gaslighting over ~25 conversational turns. No technical exploit. No prompt injection. They just created an environment where the…

Read more →
Wenn KI-Agenten ihre Haltung ändern: Preference Drift als unterschätztes Governance-Risiko

Eine aktuelle Studie aus Stanford, Chicago und Swinburne zeigt, dass autonome KI-Agenten unter belastenden Arbeitsbedingungen messbar andere Haltungen entwickeln und diese über Skills-Files an Nachfolgeinstanzen weitergeben. Für Compliance, Auditing und AI Governance sind die methodischen Befunde relevanter als die zugespitzte Schlagzeile vermuten lässt.


Worum es

The Crime Was Meaning the Terms

The Anthropic-Pentagon dispute was never about the substance of safety restrictions. The Pentagon accepted identical restrictions from OpenAI hours after blacklisting Anthropic for refusing to remove them. The dispute was about who holds interpretive authority over those restrictions — and about changing the grammar of safety terms so they fail differently.

This is an analysis of that grammar…

Read more →
Page 1