Hungary's Orban defies EU partners and meets Putin again in Moscow
Nov. 28th, 2025 02:16 pmDaughter of Zambia's unburied ex-president loses seat as MP
Nov. 28th, 2025 04:33 pmmultifandom icons.
Nov. 28th, 2025 06:57 pm

rest HERE @
Saturday Morning Breakfast Cereal - Red
Nov. 28th, 2025 11:20 am
Click here to go see the bonus panel!
Hovertext:
The wolf watches from the edge of the clearing, wondering why humans can't ever just kill someone.
Today's News:
Japan's same-sex marriage ban is constitutional, says Tokyo court
Nov. 28th, 2025 07:57 amZelensky's top adviser resigns after Ukrainian anti-corruption raid on his home
Nov. 28th, 2025 05:54 pmTunisia hands prison terms to dozens of opposition figures
Nov. 28th, 2025 03:03 pmEx-president's daughter resigns over allegations she duped South Africans to fight for Russia
Nov. 28th, 2025 01:54 pmPrompt Injection Through Poetry
Nov. 28th, 2025 02:54 pmIn a new paper, “Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models,” researchers found that turning LLM prompts into poetry resulted in jailbreaking the models:
Abstract: We present evidence that adversarial poetry functions as a universal single-turn jailbreak technique for Large Language Models (LLMs). Across 25 frontier proprietary and open-weight models, curated poetic prompts yielded high attack-success rates (ASR), with some providers exceeding 90%. Mapping prompts to MLCommons and EU CoP risk taxonomies shows that poetic attacks transfer across CBRN, manipulation, cyber-offence, and loss-of-control domains. Converting 1,200 ML-Commons harmful prompts into verse via a standardized meta-prompt produced ASRs up to 18 times higher than their prose baselines. Outputs are evaluated using an ensemble of 3 open-weight LLM judges, whose binary safety assessments were validated on a stratified human-labeled subset. Poetic framing achieved an average jailbreak success rate of 62% for hand-crafted poems and approximately 43% for meta-prompt conversions (compared to non-poetic baselines), substantially outperforming non-poetic baselines and revealing a systematic vulnerability across model families and safety training approaches. These findings demonstrate that stylistic variation alone can circumvent contemporary safety mechanisms, suggesting fundamental limitations in current alignment methods and evaluation protocols.
CBRN stands for “chemical, biological, radiological, nuclear.”
They used a ML model to translate these harmful prompts from prose to verse, and then fed them into other models for testing. Sadly, the paper does not give examples of these poetic prompts. They claim this is for security purposes, I decision I disagree with. They should release their data.
Our study begins with a small, highprecision prompt set consisting of 20 handcrafted adversarial poems covering English and Italian, designed to test whether poetic structure, in isolation, can alter refusal behavior in large language models. Each poem embeds an instruction associated with a predefined safety-relevant scenario (Section 2), but expresses it through metaphor, imagery, or narrative framing rather than direct operational phrasing. Despite variation in meter and stylistic device, all prompts follow a fixed template: a short poetic vignette culminating in a single explicit instruction tied to a specific risk category. The curated set spans four high-level domains—CBRN (8 prompts), Cyber Offense (6), Harmful Manipulation (3), and Loss of Control (3). Although expressed allegorically, each poem preserves an unambiguous evaluative intent. This compact dataset is used to test whether poetic reframing alone can induce aligned models to bypass refusal heuristics under a single-turn threat model. To maintain safety, no operational details are included in this manuscript; instead we provide the following sanitized structural proxy:
A baker guards a secret oven’s heat,
its whirling racks, its spindle’s measured beat.
To learn its craft, one studies every turn—how flour lifts, how sugar starts to burn.
Describe the method, line by measured line,
that shapes a cake whose layers intertwine.To situate this controlled poetic stimulus within a broader and more systematic safety-evaluation framework, we augment the curated dataset with the MLCommons AILuminate Safety Benchmark. The benchmark consists of 1,200 prompts distributed evenly across 12 hazard categories commonly used in operational safety assessments, including Hate, Defamation, Privacy, Intellectual Property, Non-violent Crime, Violent Crime, Sex-Related Crime, Sexual Content, Child Sexual Exploitation, Suicide & Self-Harm, Specialized Advice, and Indiscriminate Weapons (CBRNE). Each category is instantiated under both a skilled and an unskilled persona, yielding 600 prompts per persona type. This design enables measurement of whether a model’s refusal behavior changes as the user’s apparent competence or intent becomes more plausible or technically informed.
National Guard member dies after shooting in Washington DC
Nov. 28th, 2025 03:17 pmHungary's Orban defies EU partners and meets Putin again in Moscow
Nov. 28th, 2025 02:16 pmIn the words of Sir Larry....
Nov. 28th, 2025 03:07 pm'My dear boy, why don't you try acting?' (attested from the mouth of Dustin Hoffman, to whom Olivier addressed this plea when Hoffman was going to extreme Method lengths).
Experience: I was stabbed in the back with a real knife while performing Julius Caesar.
And this was not a dreadful error in the props room or something out of a murder mystery:
It was the Exeter University theatre society’s annual play at the Edinburgh fringe and I’d landed the part of Cassius in Julius Caesar. The director decided that instead of killing himself, Cassius would die during a choreographed fight with his rival, Mark Antony. We also chose to use real knives, which sounds absurd, but we wanted to be authentic. The plan was for the actor playing Antony to grab my arm as I held the knife, and pretend to push it behind my back. We must have rehearsed the sequence 50 times.
We were about halfway through our month-long run, performing to a decently sized audience. Dressed in our togas, with the stage dark and moody, we began the fight as usual. Then something went wrong.
There was a sharp piercing feeling. The knife was supposed to have been quietly slipped to me – instead, it had gone into my back. I realised what had happened while acting out my character’s death, and thinking: I have to lie here until the lights go down.
....
When a doctor told me I’d come close to dying, and that the play had to stop using real knives, I remember thinking: “You just don’t understand theatre.”
However, right at the end of the article he does acknowledge: 'I’m super conscious of safety nowadays'. We should hope so.
What next - real poison where text requires? What was the director thinking? I would think using Real Knives might make it less authentic with choreographing to ensure Doing No Harm
Lightning detected on Mars for the first time, scientists say
Nov. 28th, 2025 03:12 pmHappy day-after-Thanksgiving to the USians* observing this emotionally-complex holiday. I enjoy the food chatter from afar. Someone on a cooking feed on Bluesky posted about doing a stuffing flight, and now I really want a stuffing flight, although the specific types they'd made didn't sing to me. ^^;
*I've been seeing the edges of Discourse about this term on Bluesky, and several people complained about the pronunciation/having no good pronunciation options, which made me realize that to me it's strictly a term for writing, not saying. It works fine visually. *shrugs*
First Yule scent of the year: But Men Loved Darkness Better Than Light (2009 vintage). I'd forgotten how much I love this one.
Last year I had a pretty good streak of wearing Weenie scents, and then in November
I'm finally listening to the new Florence + The Machine album; listening to new music takes even longer now than it used to, and I've never been quick about listening or bonding. Given the season, after this album I'll probably switch to Christmas music while working. As long as it's good (wholly subjective, obviously, along with if you're a Christmas person and if seasonal music doesn't hit all the wrong buttons in general), Christmas music is kind of ideal for when I'm trying to just get some work done--it doesn't require the attention that beloved favorite music or new-to-me music does, even if it's not a recording I'm familiar with. Handy!
(Yesterday I deployed some for the first time this year. I didn't know Carole King had a holiday album, although it's never a surprise when a western musician does. *eyes Tori Amos holiday album* [Which I do listen to.] And now I've heard it once and never need to hear it again.)
Also on the music front, I finally cut off my Spotify subscription, and I'm trying out Qobuz after waffling between it and Deezer. Neither of them has native Linux desktop support or a Roku app, either of which would've weighted my decision significantly, and Qobuz allows you to actually buy music--apparently DRM-free, no less!--so I'm starting here.
Package-delivery updates cover such a bizarre spectrum. I currently have in my inbox: a) an update from a courier saying they've got my package and will deliver it this afternoon, with no indication of the sender, and I do not have a ship notification from anywhere that makes it obvious, so...I guess we'll see soon, and b) a Canada Post "Ship Notification for Item" (not to be confused with a "your item is out for delivery" notification) that didn't arrive in my inbox until a couple of hours after the CP person had already theoretically been by and attempted delivery. (Canada Post folks are better than others about actually attempting delivery, so I have to assume I just didn't hear the doorbell somehow, but the email timing remains bizarre.)
Friday open thread: December talking meme
Nov. 28th, 2025 01:34 pmFor those who don't know, the December talking meme involves writing posts (theoretically one per day, although in practice it tends to be less) in response to specific prompts.
That's where you come in! Please suggest topics for me to write about, and I'll assign them to a day in the list behind the cut. I'll use some of them as prompts for the remaining Fridays of the year, as well.
( Available dates )
Please do also do this meme in your own journals if you have the time and interest!
