Artificial Analysis, an “Independent analysis of AI models and hosting providers” outlet, have published a chart plotting “Intelligence Index vs. Price” (X post). The chart places Grok 3 Mini Reasoning in the upper left corner, inside the “Most attractive quadrant”. According to the chart, it has the best intelligence/price ratio - even at High-Reasoning ... Read more 19 Apr 2025 - 1 minute read
In addition to the newly released OpenAI models, I have added Web Search to my LLM frontend. This makes it possible to work with up-to-date information: Prompt: when is the new German chancellor going to be sworn in? Response: Friedrich Merz is scheduled to be elected as Germany’s new Chancellor on May 6, 2025. (reuters.com) […] Source references... Read more 28 Jun 2025 (Updated) - 3 minute read
The Stanford HAI 2025 AI Index Report is out. Notes: “AI is increasingly embedded in everyday life. From healthcare to transportation, AI is rapidly moving from the lab to daily life. […] On the roads, self-driving cars are no longer experimental”. Examples given include Waymo and Baidu Apollo Go. However, street signs advising to... Read more 13 Apr 2025 - 3 minute read
OpenAI have announced their newest flagship model family: GPT-4.1. It comes in three sizes: GPT-4.1, 4.1-mini, and 4.1-nano. Two versions of GPT-4.1 were available for community testing via OpenRouter over the past few weeks as “Optimus Alpha” and “Quasar Alpha”. There, I noticed that the gender bias sample showcased in the Stanford HAI AI Index Repo... Read more 18 Apr 2025 (Updated) - 2 minute read
Llama 4 has been released - partially. It’s a suite of three LLMs, with the biggest model (“Behemoth”) still in training. Notes: ~apparently no restrictions on use within the EU~ [see Update below] ~trained in fp8 precision~ [see Update below] already live on OpenRouter via Together: smallest model “Scout” costlier than Gemini ... Read more 13 Apr 2025 (Updated) - 1 minute read