Nils Durner's Blog: Ahas, Breadcrumbs, Coding Epiphanies

Document-to-Markdown Converters for LLM Use

Recently, a few open-source tools for converting PDFs, Office documents, and other formats into Markdown have drawn attention. Among these are MarkItDown from Microsoft, Docling from IBM Research, PyMuPDF4LLM, and the Jina AI Reader API. They aim to provide text suitable for downstream tasks, including LLM-driven analysis, without requiring manu... Read more

o1 Pro Mode & Llama 3.3

Quick notes on last week’s foundation model releases. OpenAI o1: o1 was released through ChatGPT. It is a stark improvement over the o1-preview available through the API; o1-preview is basically not representative. The new “o1 Pro Mode” is in a class of its own: it aces almost all of the subject tasks in a survey paper I have under submission, and ... Read more

Amazon Nova foundation model release

Since there’s community interest in how to set up AWS to use the new Amazon Nova models, here’s a step-by-step guide to get everyone started. First, ensure you have model access: open Bedrock in the us-west-2 region, scroll down in the menu on the left, and hit Model Access. Then check the model selection (green arrow), request models if acces... Read more

AI Agency: Philosophical Foundations

The term “AI Agent” has become increasingly prevalent in discussions about artificial intelligence, yet its meaning remains somewhat ambiguous. This ambiguity stems partly from different conceptualizations of agency across disciplines and languages. A recent LinkedIn discussion, sparked by Maximilian Seeth’s introduction to AI ethics, highlighte... Read more

German NER experiments: Presidio, spaCy, GLiNER

While experimenting with the Microsoft Presidio live demo for PII, I found that neither model does very well with German text when the objective is to also identify organization names. Cloning the HuggingFace space that hosts this demo allows one to enable other models (by setting the environment variable ALLOW_OTHER_MODELS = 1), b... Read more