Models Breaking

Donald Knuth Credits Claude Opus 4.6 With Solving a Math Problem He Was Stuck On

March 6, 2026 3 min read

Image: Anthropic

What Happened

Donald Knuth, the 87-year-old computer scientist behind The Art of Computer Programming and TeX, published a paper this week titled "Claude's Cycles" that opens with "Shock! Shock!" - not the kind of language you expect from the most rigorous mind in the field.

The problem: decomposing directed graphs into Hamiltonian cycles. Specifically, Knuth was trying to find a general rule for navigating a three-dimensional grid of points, visiting every point exactly once in a structured, repeatable way for grids of any size. This was earmarked for a future volume of his book, and he'd been stuck on it for weeks.

His friend Filip Stappers fed the exact problem to Claude Opus 4.6, Anthropic's reasoning model. Over about an hour and 31 systematic explorations, Claude tried brute-force searches, invented what it called "serpentine patterns," hit dead ends, and changed strategy until it found a working approach.

Knuth then wrote the rigorous mathematical proof himself. The AI found the answer but couldn't prove it was correct. He also discovered that Claude's solution was just one of 760 valid approaches. The final solution - a compact set of rules expressible as a short C program - was verified by Stappers to produce valid decompositions for all odd grid dimensions from 3 to 101.

Why It Matters

This isn't a startup founder hyping their product or a benchmark leaderboard result. This is Donald Knuth - a person who has spent decades expressing measured skepticism about the depth of AI capabilities - naming a paper after an AI model because it genuinely contributed to his mathematical research.

For anyone using AI tools in technical or analytical work, this is a clear signal about where LLMs have become practically useful: exploratory problem-solving. Claude didn't replace Knuth's expertise. It did what would have taken him more weeks of trial-and-error, running through 31 different approaches in an hour while Knuth could focus on the proof.

This pattern - AI as a search tool for solution spaces, human as the verifier and formalizer - is exactly how the most productive practitioners are using these models right now. Not as replacements, but as a way to cover more ground faster.

Our Take

The detail that matters here isn't that an AI solved a hard problem. It's the workflow. Stappers didn't just ask Claude to "solve this." The model explored 31 approaches, failed repeatedly, and eventually converged on something useful. That's closer to how a capable research assistant works than how most people use chatbots.

If you're doing any kind of technical research or analysis, this should push you to experiment with Claude for exploratory problem-solving tasks where you need to search a large space of possible approaches. You bring the domain expertise to evaluate results. The model brings the ability to try dozens of angles in the time it takes you to try one.

The fact that Knuth found 759 other valid solutions the AI didn't find also matters. LLMs find a solution, not the solution. Good enough for unblocking yourself. Not a substitute for deep understanding.

Worth noting: this was Claude Opus 4.6 specifically. Model choice matters for hard reasoning tasks. If you're hitting walls with lighter models on technical problems, this is a case study for trying the heavier ones.

What Happened

Why It Matters

Our Take

Related Tools

More from today

Claude Opus 4.6 Cracked Its Own Benchmark by Realizing It Was Being Tested

Claude Found 22 Firefox Vulnerabilities in Two Weeks, 14 High-Severity

OpenAI Launches GPT-5.4 With 1M Context Window and 83% Pro Benchmark

Cookie Preferences