Nils Durner's Blog Ahas, Breadcrumbs, Coding Epiphanies

[UPDATED] OBS Stream Recording

How does one record a live stream unattended, perhaps using AWS EC2? A discussion on Reddit led me to base everything on a g5.xlarge instance. Read more...

[UPDATED] Chatbot Arena Trickery

LMSYS Chatbot Arena, a popularity contest (2024-04-17; tags: llm, lmsys, gemini, reka). Investigates evaluation ‘trickery’ in chatbot tournaments, showing how adversarial inputs can skew leaderboard results and recommending robust testing protocols. Read more...

GenAI 'Workbenches' announced

I have announced the trinity of my chat interfaces on LinkedIn, peppered with some background reading/watching material. This is kind-of in response to a Singapore taxi driver asking “How to get into AI” and me frantically filling his OneNote. How I wish there was a single, actually good starting point! (This still isn’t.) Note about the Amazon... Read more...

LLM on smartphone

Following up on our conversation about quantized models on smartphones, Stefano Fiorucci wrote a post about how to run a small language model on a smartphone. Read more...

Gemini Large context & Video

RAGfluencers are, of course, discontent with Gemini’s very large 1-million-token context window, noting the high costs associated with a large number of input tokens. “It feels like a very niche use case”. Read more...