AI & ML
3 unread articles from your 19 subscriptions.
Granite 4.1 LLMs: How They’re Built
DeepInfra on Hugging Face Inference Providers 🔥
Quoting OpenAI Codex base_instructions
Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevan
Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents
Quoting Matthew Yglesias
Five months in, I think I've decided that I don't want to vibecode — I want professionally managed software companies to use AI coding assistance to make more/better/cheaper software products that they sell to me
What's new in pip 26.1 - lockfiles and dependency cooldowns!
What's new in pip 26.1 - lockfiles and dependency cooldowns! Richard Si describes an excellent set of upgrades to Python's default pip tool for installing dependencies.
Introducing talkie: a 13B vintage language model from 1930
Introducing talkie: a 13B vintage language model from 1930 New project from Nick Levine, David Duvenaud, and
microsoft/VibeVoice
microsoft/VibeVoice VibeVoice is Microsoft's Whisper-style audio model for speech-to-text, MIT licensed and with speaker diarization built into the model. Microsoft released it on January 21st, 2026 but I hadn'
Tracking the history of the now-deceased OpenAI Microsoft AGI clause
For many years, Microsoft and OpenAI's relationship has included a weird clause saying that, should AGI be achieved, Microsoft's commercial IP rights to OpenAI's technology would be null and void. That clause appeared to end today. I decided to try and track its expression over time on
Speech translation in Google Meet is now rolling out to mobile devices
Speech translation in Google Meet is now rolling out to mobile devices I just encountered this feature via a "try this out now" prom
How to build scalable web apps with OpenAI's Privacy Filter
WHY ARE YOU LIKE THIS
@scottjla on Twitter in reply to my pelican riding a bicycle benchmark: I feel like we need to stack these tests now
Quoting Romain Huet
Since GPT-5.4, we’ve unified Codex and the main model into a single system, so there’s no separate coding line anymore. GPT-5.5 takes this further, with strong gains in agentic coding, computer use, and any t
GPT-5.5 prompting guide
GPT-5.5 prompting guide Now that GPT-5.5 is available in the API, OpenAI have released a wealth of useful tips o
llm 0.31
Release: llm 0.31 New GPT-5.5 OpenAI model: llm -m gpt-5.5. #1418 New option to set the
The people do not yearn for automation
The people do not yearn for automation This written and video essay by Nilay Patel explores why AI is unpopular with the general public even as usage numbers for ChatGP
DeepSeek V4 - almost on the frontier, a fraction of the price
Chinese AI lab DeepSeek's last model release was V3.2 (and V3.2 Speciale) last December. They just dropped the first of their hotly anticipated V4 series in the shape of two preview models,
Millisecond Converter
Tool: Millisecond Converter LLM reports prompt durations in milliseconds and I got fed up of having to think about how to convert those to seconds and minutes.
It's a big one
This week's edition of my email newsletter (aka content from this blog delivered to your inbox) features 4 pelicans riding bicycles, 1 possu
russellromney/honker
russellromney/honker "Postgres NOTIFY/LISTEN semantics" for SQLite, implemented as a Rust SQLite extension and various language bindings to help make use of it. The design of this looks very solid. It lets you
An update on recent Claude Code quality reports
An update on recent Claude Code quality reports It turns out the high volume of complaints that Claude Code was providing worse quality results over the past two months was grounded in real problems
Serving the For You feed
Serving the For You feed One of Bluesky's most interesting features is that anyone can run their own custom "feed" implementation and make it available to other users - eff
DeepSeek-V4: a million-token context that agents can actually use
Extract PDF text in your browser with LiteParse for the web
LlamaIndex have a most excellent open source project called LiteParse, which provides a Node.js CLI tool for extracting text from PDFs. I got a version of LiteParse working entirely in the browser, using most of the same libraries that Lit
A pelican for GPT-5.5 via the semi-official Codex backdoor API
GPT-5.5 is out. It's available in OpenAI Codex and is rolling out to paid ChatGPT subscribers. I've had some preview access and found it to be a fast, effective and highly capable model. As is usually the case these days, it's hard
llm-openai-via-codex 0.1a0
Release: llm-openai-via-codex 0.1a0 Hijacks your Codex CLI credentials to make API calls with LLM, as described
Quoting Maggie Appleton
[...] if you ever needed another reason to learn in public by digital gardening or podcasting or streaming or whathavey
How to Use Transformers.js in a Chrome Extension
Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model
Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model Big claims from Qwen about their latest open weight model: Qwen3.6-27B delivers flagship-level agentic coding performance, surpassing the previo
Quoting Bobby Holley
As part of our continued collaboration with Anthropic, we had the opportunity to apply an early version of Claude Mythos Preview to Firefox. This week’s release of Firefox 150 includes fixes
Changes to GitHub Copilot Individual plans
Changes to GitHub Copilot Individual plans On the same day as Claude Code's temporary will-they-won't-they $100/month kerfuffle (for the moment,
Is Claude Code going to cost $100/month? Probably not - it's all very confusing
Anthropic today quietly (as in silently, no announcement anywhere at all) updated their claude.com/pricing page (but not their Choosing a Claude plan page, w
Where's the raccoon with the ham radio? (ChatGPT Images 2.0)
OpenAI released ChatGPT Images 2.0 today, their latest image generation model. On the livestream Sam Altman said that the leap from gpt-image-1 to gpt-image-2 was
Quoting Andreas Påhlsson-Notini
AI agents are already too human. Not in the romantic sense, not because they love or fear or dream, but in the more banal and frustrating one. The current implementations keep showing their human origin again and again: lac
scosman/pelicans_riding_bicycles
scosman/pelicans_riding_bicycles I firmly approve of Steve Cosman's efforts to pollute the training set of pelicans riding bicycles.
QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard
AI and the Future of Cybersecurity: Why Openness Matters
llm-openrouter 0.6
Release: llm-openrouter 0.6 llm openrouter refresh command for refreshing the list of available models without waiting for the cache to expire.