Anthropic has to keep revising its technical interview test so you can’t cheat on it with Claude

Since 2024, Anthropic’s performance optimization team has given job applicants a take-home test to make sure they know their stuff. But as AI coding tools have gotten better, the test has had to change a lot to stay ahead of AI-assisted cheating.

Team lead Tristan Hume described the history of the challenge in a blog post on Wednesday. “Each new Claude model has forced us to redesign the test,” Hume writes. “When given the same time limit, Claude Opus 4 outperformed most human applicants. That still allowed us to distinguish the strongest candidates — but then, Claude Opus 4.5 matched even those.”

The result is a serious candidate-assessment problem. Without in-person proctoring, there’s no way to ensure someone isn’t using AI to cheat on the test — and if they do, they’ll quickly rise to the top. “Under the constraints of the take-home test, we no longer had a way to distinguish between the output of our top candidates and our most capable model,” Hume writes.

The issue of AI cheating is already wreaking havoc at schools and universities around the world, so ironic that AI labs are having to deal with it too. But Anthropic is also uniquely well-equipped to deal with the problem.

In the end, Hume designed a new test that had less to do with optimizing hardware, making it sufficiently novel to stump contemporary AI tools. But as part of the post, he shared the original test to see if anyone reading could come up with a better solution.

“If you can best Opus 4.5,” the post reads, “we’d love to hear from you.”

Originally published at TechCrunch

Tags: artificial-intelligence technology

Anthropic has to keep revising its technical interview test so you can’t cheat on it with Claude

Blue Origin schedules third New Glenn launch for late February, but not to the moon

Sportmax Pre-Fall 2026

Sportmax Pre-Fall 2026

Leave a Reply Cancel reply

Taco Bell’s Chicken Nuggets Are Officially Back—And They’re Not Coming Alone

Butler Technik and Its Role in Heating and Automotive Solutions

The Reigning Titans: 5 Elite Boxers Who Define the Sport in 2024

Konami Prepares to Unveil Silent Hill: Townfall Details After Years of Mystery

How to Back Up All Your Android Messages

Unlocking Your Style: 10 Tips for Fashion Success

Humax Direct: Advanced Home Entertainment with Reliability and Innovation

The AA and Its Position in the UK Motoring Services and Breakdown Assistance Market

Taco Bell’s Chicken Nuggets Are Officially Back—And They’re Not Coming Alone

Fernandes Inspires Manchester United to Victory in Dramatic Clash with Sheffield United

BLUETTI and Its Role in Modern Portable Energy Solutions

Creating a Delicious and Balanced Menu: A Guide to Food Menu Planning

Basic Economic and Financial Market Concepts

My car was stolen. Here are six important things I learned

The Posh Shed Company: Luxury Garden Buildings, Craftsmanship, and What Sets Them Apart

Pros and Cons of Upgrading Your GPU: What to Consider Before Buying

Categories

Recent News

skinChemists: Science-Led Skincare Built on Clinical-Grade Formulations

Monk: Smart Ice Bath Systems Built for Cold Water Therapy and Performance Recovery