Nils Durner's Blog Ahas, Breadcrumbs, Coding Epiphanies

[UPDATED] OpenAI API: 4.1 in, 4.5 out

OpenAI have announced their newest flagship model family: GPT-4.1. It comes in three sizes: GPT-4.1, 4.1-mini, and 4.1-nano. Two versions of GPT-4.1 were available in the last weeks for community testing via OpenRouter as “Optimus Alpha” and “Quasar Alpha”. There, I noticed that the gender bias sample showcased in the Stanford HAI AI Index Repo... Read more

[UPDATED] Llama 4 released

Llama 4 has been released - partially. It’s a suite of three LLMs, with the biggest model (“Behemoth”) still in training. Notes: ~apparently no restrictions on use within the EU~ [see Update below] ~trained in fp8 precision~ [see Update below] already live on OpenRouter via Together: smallest model “Scout” costlier than Gemini ... Read more

[UPDATED] GPT-4o Imagegen for infographics

Several samples on X suggest that GPT-4o Imagegen can be used for infographics (1, 2, 3, 4). In my experiments, instructions to Imagegen need to be on-point: simply supplying a Readme.md with all kinds of different notes on a code project and asking it to visualize the build process does not work - the model will even start to hallucinate random... Read more

[UPDATED] Microsoft Copilot Experimental features

Outlet “TestingCatalog” reports that Microsoft is testing several new features for Copilot. As part of the prompt field, there are “Think Deeper”, “Deep Research” and “Action” (TestingCatalog blog post]. The latter may replicate OpenAI Operator and “Computer Use”, but that’s not certain. OpenAI Operators runs inside a cloud VM, but TestingCatalo... Read more

[UPDATED] GPT-4o image generation

GPT-4o for image generation has been released - as part of ChatGPT and Sora. It supersedes the Dall-E 3 model, which was originally released in October 2023, but remains the best OpenAI image generation model available via their API. Most notable for me, 4o image generation not just supports text prompts (like Dall-E did), but also image prompt... Read more