Nils Durner's Blog Ahas, Breadcrumbs, Coding Epiphanies

LLM Pricing Comparisons: The Missing Tokenizer Efficiency Factor

Recently, Philipp Schmid shared an interactive LLM pricing comparison tool hosted on Hugging Face. This tool allows users to filter providers and models, comparing them side-by-side. It’s an impressive effort that includes a wide range of providers such as Fireworks AI, Groq, Replicate, and IBM. While this tool is undoubtedly useful, I couldn’t... Read more

IBM Granite Models - for Agents?

Recently, I came across a LinkedIn post by Armand Ruiz, VP of Product - AI Platform at IBM, discussing the differences between chatbots, copilots, and agents. While the post provided a general overview of these AI categories, it prompted me to inquire about a more specific topic: IBM’s Granite models. Having recently participated in the NVIDIA ... Read more

Exploring Audio Input with Gemini 1.5 Pro

Simon Willison recently asked about experiments with audio input to Google Gemini 1.5 Pro and Flash models, noting that the ability to query audio files beyond simple transcription is an intriguing and potentially underexplored capability. Michael Gackstatter reported issues processing German audio with Gemini 1.5 Pro, receiving a “cannot proce... Read more

Benchmarking AI Vision

Ethan Mollick, Associate Professor at The Wharton School, recently shared two key developments: The Charxiv benchmark, a challenging real-life chart reading test, where humans achieve 80% accuracy. Interestingly, Claude 3.5, currently the best-performing Large Language Model (LLM) in this test, manages 60% accuracy. The Chatbo... Read more

Gemma-2: Impressive or Just Well-Dressed?

Google recently released their open-source Gemma-2 models (27b and 9b variants), which have been gaining attention in the AI community. In a LinkedIn post, Peter Gostev, Head of AI at Moonpig, highlighted that the 27b variant is now ranking slightly higher than Meta’s 70b model, despite being 2.5 times smaller. However, digging into the technic... Read more