Nils Durner's Blog Ahas, Breadcrumbs, Coding Epiphanies

GenAI 'Workbenches' announced

I have announced the trinity of my chat interfaces on LinkedIn, peppered with some background reading/watching material. This is kind-of in response to a Singapore taxi driver asking “How to get into AI” and me frantically filling his OneNote. How I wish there was a single, actually good starting point! (This still isn’t.) Note about the Amazon... Read more...

LLM on smartphone

Following up on our conversation on quantized models on smartphones, Stefano Fiorucci wrote a post about how to run a small language model on a smartphone: . Read more...

Gemini Large context & Video

RAGfluencers are of course discontent with Gemini’s very large 1 million token context window, noting the high costs associated with a large number of input tokens. “It feels like a very niche use case”. Read more...

LLM as a judge

Paper about “A Meeting Assistant Benchmark for Long-Context Language Models” with a remarkable side-note: We also provide a thorough analysis of our GPT-4-based evaluation method, encompassing insights from a crowdsourcing study. Our findings suggest that while GPT-4’s evaluation scores are correlated with human judges’, its ability to differ... Read more...

xz Backdoor

A lot has been (and continues to be) written about the xz Backdoor. Read more...