Nils Durner's Blog Ahas, Breadcrumbs, Coding Epiphanies

Developer talk

A GitHub blog post on Copilot productivity enhancements and developer happiness sparked a lively reflection on LLMs as a developer tool. One developer brought up the error scenario where ChatGPT would get stuck in a conversation loop. As a remedy, I offered this advice: sometimes it feels like it’s cornered and can’t get out. In s... Read more

Geppetto's Workbench

I created Geppetto’s Workbench over the weekend to simplify working with the OpenAI Playground in several ways: it defaults to GPT-4; sets parameters to encourage factual results (and curb hallucination & confabulation); and declutters the UI. Future plans/wishlist: multiple presets/collections of defaults; auto continuation of long... Read more

Claude

Some talk today about Claude by Anthropic, accessible through poe.com. The Claude model scores rather favourably on the truthfulness benchmark “TruthfulQA”, but lags behind the OpenAI GPTs by a wide margin. From the Stanford AI Index 2023 that has just been released (386 pages; be sure to enjoy in Books.app, rather than Safari 😊) Notably,... Read more

ChatGPT vs. OpenAI Playground

I was asked what the difference is between the OpenAI Playground “in chat mode” and chat.openai.com. So, for everybody’s benefit, my answer is reproduced below. If you’re not familiar at all, see this screenshot for what I am talking about: Note that this is pay-per-use, so a credit card is required (but a free tier seems to exist for new acco... Read more

ChatGPT blocked in Italy (but not whole OpenAI)

Alarming news from Italy: ChatGPT got blocked. After investigating the matter with my colleagues, I learned that only the ChatGPT product was affected, while other infrastructure parts remained accessible. We discovered that software creators and their customers who rely on OpenAI and its APIs, including GPT-4, are not affected by this block. In... Read more