Google has allegedly given a few companies preview access to Gemini, according to Reuters. One commenter predicted General Availability in December. If the GPT-4 timeline is any measure, that’s not an unreasonable assumption. GPT-4 announcement: 03-14; I got private access: 03-17 (+3 weekdays); General Availability: 07-06 (+82 weekdays). So: Gemini private be... Read more 18 Sep 2023 - less than 1 minute read
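The weekday offsets in the GPT-4 timeline above can be checked with a few lines of Python. This is a minimal sketch, not part of the original post: it assumes the dates refer to 2023 and that both offsets are counted from the announcement date, and the helper name `weekdays_between` is made up for illustration.

```python
from datetime import date, timedelta


def weekdays_between(start: date, end: date) -> int:
    """Count Monday-to-Friday days after `start`, up to and including `end`."""
    count = 0
    day = start
    while day < end:
        day += timedelta(days=1)
        if day.weekday() < 5:  # 0..4 are Monday..Friday
            count += 1
    return count


# Assumption: all three dates are in 2023.
announcement = date(2023, 3, 14)
private_access = date(2023, 3, 17)
general_availability = date(2023, 7, 6)

print(weekdays_between(announcement, private_access))
print(weekdays_between(announcement, general_availability))
```

Under these assumptions the two printed values match the +3 and +82 weekdays quoted in the timeline.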
A comparison chart of various chatbots has raised some questions about Claude 2, in particular the “Free” price tag. As with “ChatGPT”, different people mean different things by it, and it again helps to think in the categories previously established by Miles Brundage: Models, Platforms and Systems. Here is his slide extended with Claude 2: ... Read more 05 Sep 2023 - 1 minute read
OpenAI have renamed “Code Interpreter” to “Advanced Data Analysis”. That’s still a misnomer to me, so I’ll try to explain: both ChatGPT Plus premium features, “Code Interpreter” and “Plugins”, are basically two use-cases of the same underlying LLM/GPT feature: what Microsoft Research described in their “Sparks of AGI” paper as “Tool Usage”. What... Read more 29 Aug 2023 - 4 minute read
Following up on my post about ChatGPT (not) getting dumber, a commenter remarked that Bard did better on this particular math exercise. I offered the explanation that what they saw here with Bard is the use of a “tool”, as proposed by Microsoft Research in their “Sparks of AGI” paper and recently made available by OpenAI: to developers in t... Read more 04 Aug 2023 - 2 minute read
The Wall Street Journal has a piece on “Why ChatGPT Is Getting Dumber at Basic Math”. This is rooted in the same junk science by Zou et al. discussed previously. What happened in the meantime: one of the alleged “model degradations” was determined to be a broken benchmark script by Zou et al. plus a behaviour change by GPT-4. Simon Boehm fixed the... Read more 04 Aug 2023 - 1 minute read