How does one record a live stream in one's absence, perhaps using AWS EC2? A discussion on Reddit led me to base everything on a g5.xlarge instance. Read more... 25 Sep 2024 (Updated) - 2 minute read
LMSYS Chatbot Arena, a popularity contest: Investigates evaluation “trickery” in chatbot tournaments, showing how adversarial inputs can skew leaderboard results and recommending robust testing protocols. Tags: llm, lmsys, gemini, reka. Read more... 17 Apr 2024 (Updated) - 2 minute read
I have announced the trinity of my chat interfaces on LinkedIn, peppered with some background reading/watching material. This is kind of in response to a Singapore taxi driver asking “How to get into AI” and me frantically filling his OneNote. How I wish there were a single, actually good starting point! (This still isn’t.) Note about the Amazon... Read more... 13 Apr 2024 - 1 minute read
Following up on our conversation on quantized models on smartphones, Stefano Fiorucci wrote a post about how to run a small language model on a smartphone. Read more... 09 Apr 2024 - less than 1 minute read
RAGfluencers are of course discontent with Gemini’s very large 1 million token context window, noting the high costs associated with a large number of input tokens. “It feels like a very niche use case”. Read more... 03 Apr 2024 - 1 minute read