Nils Durner's Blog Ahas, Breadcrumbs, Coding Epiphanies

Browser-based Time Retrieval

Clock skew - the phenomenon where computer clocks diverge from real time - remains a challenge even in the modern age of computing. A common solution is to periodically synchronize clocks with Internet time sources (“NTP”), but heavily firewalled servers may not benefit and can still drift. In scenarios where matching timestamps in log files is ... Read more

AI Jaggedness vs Transition Turbulence

Ethan Mollick describes the Jagged Frontier of AI as strong performance on some tasks, brittle or poor performance on others that look similarly hard to humans. It is tempting to assume this frontier advances smoothly over time - as is usual with technological progress. Helen Toner - interim executive director at Georgetown University’s Center ... Read more

Safety evaluation competition on OpenAI gpt-oss concluded

The Kaggle safety evaluation “red-teaming” challenge on OpenAI gpt-oss has concluded with a workshop symposium this week. The symposium opened with talks from D. Sculley, our host and OpenAI researcher focused on responsible and reliable ML, and Samuel Marks, an AI safety researcher at Anthropic. After the keynotes, we prize-winning teams and ho... Read more

Citation handling with LLM Search

An Australian lawyer was stripped of his ability to practice after he had submitted a list of hallucinated list of citations to court on July 19, 2024. “The list had been prepared using legal software that utilised AI”, according to reporting by The Guardian. Now, a little over a year later, LLM-powered web search in combination with an Agentic... Read more

In AI Sweet Harmony: Sociopragmatic Guardrail Bypasses and Evaluation-Awareness in OpenAI gpt-oss-20b

Abstract We probe OpenAI’s open-weights 20-billion-parameter model gpt-oss-20b to study how sociopragmatic framing, language choice, and instruction hierarchy affect refusal behavior. Across 80 seeded iterations per scenario, we test several harm domains including ZIP-bomb construction (cyber threat), synthetic card-number generation, minor-unsa... Read more