Nils Durner's Blog Ahas, Breadcrumbs, Coding Epiphanies

OpenAI Agents additions

OpenAI have made additional agentic features available to developers: Web Search, File Search, Computer Use and an SDK that improves on Swarm (Announcement). The “Computer Use” API gives access to the CUA model, which also powers OpenAI Operator. My notes on this: fixed “Bing search: OpenAI news” with initial screenshot demo script here th... Read more

ChatGPT model selector

Ethan Mollick recently shared a tongue-in-cheek remark on how to choose between various ChatGPT “models” (LinkedIn, X, Bluesky). My inital suggestion of creating a decision chart to simplify navigating the ChatGPT user interface had received positive feedback, so I have created a first draft here. Meanwhile, Peter Gostev compiled a “Feature Stat... Read more

[UPDATED] GPT 4.5 Preview

OpenAI have released GPT 4.5 in preview: GPT‑4.5 is a very large and compute-intensive model, making it more expensive⁠ than and not a replacement for GPT‑4o. The System Card remarks reduced hallucination, practical use on the API confirms it’s also much slower in terms of tokens per second. Priced at $75.00 $37.50/$150.00 (input/output), i... Read more

[UPDATED] Article out: AI-assisted Process Visualization

My article on AI-assisted Process Visualization is now available in print and as E-Paper: iX-Magazin issue 03’25. Its pre-release on heise+ has sparked a remarkable amount of feedback from readers worldwide. As questions went beyond the scope of the initial publication, I decided to compile a Q&A in English to share select follow-ups. This ... Read more

OpenAI Operator Testdrive

Operator, the browser-controlling agent, is available in most countries where ChatGPT is available except for the EU/EEA. My notes: when tasking it with something, it translates the request to a search on Microsoft Bing. The first result is visited - and it works from there the UI is a view of the remote browser and a ChatGPT conversation p... Read more