A Tree of AI Model Names

Model names are weird. What started with GPT-2 and GPT-3 is now a hodgepodge of decimals (GPT-3.5, Sonnet 3.7, Opus 4.6, Grok 4.1), skipped version numbers (o2, where art thou?), and bolted-on descriptors (what is claude-opus-4-5-20251101-thinking-32k??).

It'd help if we could visualize this.

Let's get this out on a tree:

Nice.

The YAML file with the model names is available here. Contributions are welcome!

We started with GPT-2 and GPT-3. Now Phi-4-mini-reasoning and Qwen3-235B-A22B and Llama-3.1-Nemotron-70B and R1-1776 are all real model IDs that real people are expected to compare.

It's going to keep getting worse. Every company is running multiple product lines with overlapping version numbers and inconsistent tier names. Fruit from git branches that diverged six months ago is thrown away and harvested at the same time.

OpenAI

The o2 Problem

Reasoning models go o1 → o3. They skipped o2 because O2 is a European telecom brand.

OpenAI

Version Time Travel

GPT-4.5 was released in February 2025. GPT-4.1 came in April 2025. A model called 4.1 came after 4.5. It's a separate product line.
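
One way to see the mismatch is to sort the same releases by version number and by release date and compare the two orders. This is a minimal sketch, assuming month-level dates. GPT-4.5 (February 2025) is included as an extra data point, and `version_key` is a hypothetical helper, not part of any OpenAI tooling.

```python
# (name, (year, month)) pairs; dates are month-level approximations.
releases = [
    ("GPT-4.5", (2025, 2)),
    ("GPT-4.1", (2025, 4)),
    ("GPT-5", (2025, 8)),
]


def version_key(name):
    """Turn 'GPT-4.1' into (4, 1) so versions compare numerically."""
    version = name.split("-", 1)[1]
    return tuple(int(part) for part in version.split("."))


# Order by when each model actually shipped.
by_date = [name for name, _ in sorted(releases, key=lambda r: r[1])]

# Order by what the version number claims.
by_version = [name for name, _ in sorted(releases, key=lambda r: version_key(r[0]))]

print("by date:   ", by_date)
print("by version:", by_version)
print("orders agree:", by_date == by_version)
```

The two lists disagree on where GPT-4.1 belongs, which is the whole joke: the version number is not a timestamp.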

OpenAI

There Is No o4

o4-mini exists. Regular o4 does not. They released the mini without the full version.

Google

The Great Tier Rename

PaLM 2 used animal sizes: Gecko, Otter, Bison, Unicorn. Gemini switched to Nano, Pro, Ultra, Flash. The animals were never spoken of again.

Google

Nano Banana

A model called "nano-banana" appeared on LMArena benchmarks. Sundar Pichai tweeted 🍌🍌🍌. It turned out to be Gemini 2.5 Flash Image.

Mistral AI

The -stral Cinematic Universe

Every product must rhyme: Code → Codestral. Vision → Pixtral. Math → Mathstral. Small → Ministral. Reasoning → Magistral.

Meta

The Case Change

"LLaMA" stood for Large Language Model Meta AI. In version 2, it became "Llama". Just a regular word.

DeepSeek

R1-Zero

Named like a German sedan: R1-Zero, then R1, then R1-Distill. R1 reportedly matched OpenAI's o1 on reasoning benchmarks at roughly 95% lower cost. The naming was the least disruptive thing about it.

Anthropic

Version Hopscotch

Versions shipped: 3, 3.5, 3.7, 4, 4.1, 4.5, 4.6. There is no 3.6. Claude 3.5 Opus was announced but never shipped. Haiku 4 doesn't exist: Haiku jumped from 3.5 to 4.5.

Apple

Radical Anti-Naming

Apple called their model "Apple Foundation Models." The two variants are AFM-on-device and AFM-server. That's it. They stopped there.

Microsoft

Phi-4-mini-reasoning

The full model name is Phi-4-mini-reasoning. Model family + version + size tier + capability. Four concepts in one hyphenated name. Also: Phi-4-reasoning-plus.
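
To make the "four concepts in one name" point concrete, here is a rough sketch of a parser that splits such a name into its parts. The vocabularies `SIZE_TIERS` and `CAPABILITIES`, the `ParsedName` type, and `parse_model_name` are all hypothetical and chosen for illustration. They are not an official Microsoft naming scheme.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Assumed vocabularies, just enough to cover the two names in the text.
SIZE_TIERS = {"mini"}
CAPABILITIES = {"reasoning"}


@dataclass
class ParsedName:
    family: str
    version: str
    size_tier: Optional[str] = None
    capability: Optional[str] = None
    extras: List[str] = field(default_factory=list)  # anything unrecognized


def parse_model_name(name: str) -> ParsedName:
    """Split 'Family-Version-...' on hyphens and classify the remaining tokens.

    Raises ValueError if the name has fewer than two hyphen-separated parts.
    """
    family, version, *rest = name.split("-")
    parsed = ParsedName(family=family, version=version)
    for token in rest:
        if token in SIZE_TIERS and parsed.size_tier is None:
            parsed.size_tier = token
        elif token in CAPABILITIES and parsed.capability is None:
            parsed.capability = token
        else:
            parsed.extras.append(token)
    return parsed


print(parse_model_name("Phi-4-mini-reasoning"))
print(parse_model_name("Phi-4-reasoning-plus"))
```

The second name has no size tier, and "plus" falls into `extras`. That hints at the underlying problem: the slots are positional by convention only, so a parser like this needs a hand-maintained vocabulary for every vendor.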

OpenAI

Codex: Back From the Dead

Codex was discontinued in March 2023. In 2025, the name reappeared as GPT-5.2-Codex. They brought it back.