Nils Durner's Blog: Ahas, Breadcrumbs, Coding Epiphanies

LLM as a judge

A paper, “A Meeting Assistant Benchmark for Long-Context Language Models”, includes a remarkable side note: “We also provide a thorough analysis of our GPT-4-based evaluation method, encompassing insights from a crowdsourcing study. Our findings suggest that while GPT-4’s evaluation scores are correlated with human judges’, its ability to differ...” Read more...

xz Backdoor

A lot has been (and continues to be) written about the xz Backdoor. Read more...

LLM Tokenizer comparison

A poster on LinkedIn highlighted the Xenova Tokenizer Playground as a way to compare tokenizer efficiency. Read more...

Apple Chip Flaw

The tech press is busy reporting on an alleged “Apple Chip Flaw Leaks Secret Encryption Keys”. Read more...

[UPDATED] AI Attribution in Art

Examines AI-generated art attribution methods, including digital watermarking, metadata embedding, and industry standards for provenance tracking. Published 2024-03-26, last updated 2024-03-26. Tags: ai, art, software engineering, genai. Read more...