AI & ML

April 2026
Hugging Face Blog - 5 days ago

AI evals are becoming the new compute bottleneck

Hugging Face Blog - 5 days ago

Granite 4.1 LLMs: How They’re Built

Hugging Face Blog - 6 days ago

DeepInfra on Hugging Face Inference Providers 🔥

Simon Willison's Weblog - 6 days ago

Quoting OpenAI Codex base_instructions

Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevan

Hugging Face Blog - 6 days ago

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

Simon Willison's Weblog - 7 days ago

Quoting Matthew Yglesias

Five months in, I think I've decided that I don't want to vibecode — I want professionally managed software companies to use AI coding assistance to make more/better/cheaper software products that they sell to me

Simon Willison's Weblog - 7 days ago

What's new in pip 26.1 - lockfiles and dependency cooldowns!

Richard Si describes an excellent set of upgrades to Python's default pip tool for installing dependencies.

Simon Willison's Weblog - 7 days ago

Introducing talkie: a 13B vintage language model from 1930

New project from Nick Levine, David Duvenaud, and

Simon Willison's Weblog - 7 days ago

microsoft/VibeVoice

VibeVoice is Microsoft's Whisper-style audio model for speech-to-text, MIT licensed and with speaker diarization built into the model. Microsoft released it on January 21st, 2026 but I hadn'

Simon Willison's Weblog - 7 days ago

Tracking the history of the now-deceased OpenAI Microsoft AGI clause

For many years, Microsoft and OpenAI's relationship has included a weird clause saying that, should AGI be achieved, Microsoft's commercial IP rights to OpenAI's technology would be null and void. That clause appeared to end today. I decided to try and track its expression over time on

Simon Willison's Weblog - 7 days ago

Speech translation in Google Meet is now rolling out to mobile devices

I just encountered this feature via a "try this out now" prom

Hugging Face Blog - 8 days ago

How to build scalable web apps with OpenAI's Privacy Filter

Simon Willison's Weblog - 9 days ago

WHY ARE YOU LIKE THIS

@scottjla on Twitter in reply to my pelican riding a bicycle benchmark: I feel like we need to stack these tests now

Simon Willison's Weblog - 10 days ago

Quoting Romain Huet

Since GPT-5.4, we’ve unified Codex and the main model into a single system, so there’s no separate coding line anymore. GPT-5.5 takes this further, with strong gains in agentic coding, computer use, and any t

Simon Willison's Weblog - 10 days ago

GPT-5.5 prompting guide

Now that GPT-5.5 is available in the API, OpenAI have released a wealth of useful tips o

Simon Willison's Weblog - 10 days ago

llm 0.31

Release: llm 0.31. New GPT-5.5 OpenAI model: llm -m gpt-5.5. #1418 New option to set the

Simon Willison's Weblog - 10 days ago

The people do not yearn for automation

This written and video essay by Nilay Patel explores why AI is unpopular with the general public even as usage numbers for ChatGP

Simon Willison's Weblog - 11 days ago

DeepSeek V4 - almost on the frontier, a fraction of the price

Chinese AI lab DeepSeek's last model release was V3.2 (and V3.2 Speciale) last December. They just dropped the first of their hotly anticipated V4 series in the shape of two preview models,

Simon Willison's Weblog - 11 days ago

Millisecond Converter

Tool: Millisecond Converter. LLM reports prompt durations in milliseconds and I got fed up of having to think about how to convert those to seconds and minutes.

Simon Willison's Weblog - 11 days ago

It's a big one

This week's edition of my email newsletter (aka content from this blog delivered to your inbox) features 4 pelicans riding bicycles, 1 possu

Simon Willison's Weblog - 11 days ago

russellromney/honker

"Postgres NOTIFY/LISTEN semantics" for SQLite, implemented as a Rust SQLite extension and various language bindings to help make use of it. The design of this looks very solid. It lets you

Simon Willison's Weblog - 11 days ago

An update on recent Claude Code quality reports

It turns out the high volume of complaints that Claude Code was providing worse quality results over the past two months was grounded in real problems

Simon Willison's Weblog - 11 days ago

Serving the For You feed

One of Bluesky's most interesting features is that anyone can run their own custom "feed" implementation and make it available to other users - eff

Hugging Face Blog - 11 days ago

DeepSeek-V4: a million-token context that agents can actually use

Simon Willison's Weblog - 11 days ago

Extract PDF text in your browser with LiteParse for the web

LlamaIndex have a most excellent open source project called LiteParse, which provides a Node.js CLI tool for extracting text from PDFs. I got a version of LiteParse working entirely in the browser, using most of the same libraries that Lit

Simon Willison's Weblog - 11 days ago

A pelican for GPT-5.5 via the semi-official Codex backdoor API

GPT-5.5 is out. It's available in OpenAI Codex and is rolling out to paid ChatGPT subscribers. I've had some preview access and found it to be a fast, effective and highly capable model. As is usually the case these days, it's hard

Simon Willison's Weblog - 11 days ago

llm-openai-via-codex 0.1a0

Release: llm-openai-via-codex 0.1a0. Hijacks your Codex CLI credentials to make API calls with LLM, as described

Simon Willison's Weblog - 12 days ago

Quoting Maggie Appleton

[...] if you ever needed another reason to learn in public by digital gardening or podcasting or streaming or whathavey

Hugging Face Blog - 12 days ago

How to Use Transformers.js in a Chrome Extension

Simon Willison's Weblog - 12 days ago

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

Big claims from Qwen about their latest open weight model: Qwen3.6-27B delivers flagship-level agentic coding performance, surpassing the previo

Simon Willison's Weblog - 13 days ago

Quoting Bobby Holley

As part of our continued collaboration with Anthropic, we had the opportunity to apply an early version of Claude Mythos Preview to Firefox. This week’s release of Firefox 150 includes fixes

Simon Willison's Weblog - 13 days ago

Changes to GitHub Copilot Individual plans

On the same day as Claude Code's temporary will-they-won't-they $100/month kerfuffle (for the moment,

Simon Willison's Weblog - 13 days ago

Is Claude Code going to cost $100/month? Probably not - it's all very confusing

Anthropic today quietly (as in silently, no announcement anywhere at all) updated their claude.com/pricing page (but not their Choosing a Claude plan page, w

Simon Willison's Weblog - 13 days ago

Where's the raccoon with the ham radio? (ChatGPT Images 2.0)

OpenAI released ChatGPT Images 2.0 today, their latest image generation model. On the livestream Sam Altman said that the leap from gpt-image-1 to gpt-image-2 was

Simon Willison's Weblog - 13 days ago

Quoting Andreas Påhlsson-Notini

AI agents are already too human. Not in the romantic sense, not because they love or fear or dream, but in the more banal and frustrating one. The current implementations keep showing their human origin again and again: lac

Simon Willison's Weblog - 13 days ago

scosman/pelicans_riding_bicycles

I firmly approve of Steve Cosman's efforts to pollute the training set of pelicans riding bicycles.

Hugging Face Blog - 14 days ago

QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard

Hugging Face Blog - 14 days ago

AI and the Future of Cybersecurity: Why Openness Matters

Simon Willison's Weblog - 14 days ago

llm-openrouter 0.6

Release: llm-openrouter 0.6. llm openrouter refresh command for refreshing the list of available models without waiting for the cache to expire.

Hugging Face Blog - 19 days ago

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Hugging Face Blog - 19 days ago

Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents

Hugging Face Blog - 19 days ago

The PR you would have opened yourself

Hugging Face Blog - 20 days ago

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

Hugging Face Blog - 20 days ago

Meet HoloTab by HCompany. Your AI browser companion.

Hugging Face Blog - 26 days ago

Multimodal Embedding & Reranker Models with Sentence Transformers

Hugging Face Blog - 26 days ago

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

Hugging Face Blog - 27 days ago

Safetensors is Joining the PyTorch Foundation

Hugging Face Blog - 1 month ago

Welcome Gemma 4: Frontier multimodal intelligence on device

Hugging Face Blog - 1 month ago

Falcon Perception

Hugging Face Blog - 1 month ago

Any Custom Frontend with Gradio's Backend
