The Stanford HAI 2025 AI Index Report is out. Notes: “AI is increasingly embedded in everyday life. From healthcare to transportation, AI is rapidly moving from the lab to daily life. […] On the roads, self-driving cars are no longer experimental”. Examples given include Waymo and Baidu Apollo Go” However, streets signs advising to... Read more 13 Apr 2025 - 3 minute read
OpenAI have announced their newest flagship model family: GPT-4.1. It comes in three sizes: GPT-4.1, 4.1-mini, and 4.1-nano. Two versions of GPT-4.1 were available in the last weeks for community testing via OpenRouter as “Optimus Alpha” and “Quasar Alpha”. There, I noticed that the gender bias sample showcased in the Stanford HAI AI Index Repo... Read more 18 Apr 2025 (Updated) - 2 minute read
Llama 4 has been released - partially. It’s a suite of three LLMs, with the biggest model (“Behemoth”) still in training. Notes: ~apparently no restrictions on use within the EU~ [see Update below] ~trained in fp8 precision~ [see Update below] already live on OpenRouter via Together: smallest model “Scout” costlier than Gemini ... Read more 13 Apr 2025 (Updated) - 1 minute read
Several samples on X suggest that GPT-4o Imagegen can be used for infographics (1, 2, 3, 4). In my experiments, instructions to Imagegen need to be on-point: simply supplying a Readme.md with all kinds of different notes on a code project and asking it to visualize the build process does not work - the model will even start to hallucinate random... Read more 13 Apr 2025 (Updated) - 1 minute read
Outlet “TestingCatalog” reports that Microsoft is testing several new features for Copilot. As part of the prompt field, there are “Think Deeper”, “Deep Research” and “Action” (TestingCatalog blog post]. The latter may replicate OpenAI Operator and “Computer Use”, but that’s not certain. OpenAI Operators runs inside a cloud VM, but TestingCatalo... Read more 04 May 2025 (Updated) - less than 1 minute read