Trablog
  • Home
  • LifeStyle
  • Gaming
  • Sport
  • Food
  • Travel
  • Fashion
  • Technology
  • Economy
Monday, March 9, 2026
No Result
View All Result
  • Home
  • LifeStyle
  • Gaming
  • Sport
  • Food
  • Travel
  • Fashion
  • Technology
  • Economy
No Result
View All Result
Trablog
No Result
View All Result
Home Technology

Anthropic has to keep revising its technical interview test so you can’t cheat on it with Claude

January 22, 2026
in Technology
0
0
SHARES
1
VIEWS
Share on FacebookShare on Twitter

Since 2024, Anthropic’s performance optimization team has given job applicants a take-home test to make sure they know their stuff. But as AI coding tools have gotten better, the test has had to change a lot to stay ahead of AI-assisted cheating.

Team lead Tristan Hume described the history of the challenge in a blog post on Wednesday. “Each new Claude model has forced us to redesign the test,” Hume writes. “When given the same time limit, Claude Opus 4 outperformed most human applicants. That still allowed us to distinguish the strongest candidates — but then, Claude Opus 4.5 matched even those.”

The result is a serious candidate-assessment problem. Without in-person proctoring, there’s no way to ensure someone isn’t using AI to cheat on the test — and if they do, they’ll quickly rise to the top. “Under the constraints of the take-home test, we no longer had a way to distinguish between the output of our top candidates and our most capable model,” Hume writes.

The issue of AI cheating is already wreaking havoc at schools and universities around the world, so ironic that AI labs are having to deal with it too. But Anthropic is also uniquely well-equipped to deal with the problem.

In the end, Hume designed a new test that had less to do with optimizing hardware, making it sufficiently novel to stump contemporary AI tools. But as part of the post, he shared the original test to see if anyone reading could come up with a better solution.

“If you can best Opus 4.5,” the post reads, “we’d love to hear from you.”

Originally published at TechCrunch

Tags: artificial-intelligencetechnology
Previous Post

Blue Origin schedules third New Glenn launch for late February, but not to the moon

Next Post

Sportmax Pre-Fall 2026

Next Post

Sportmax Pre-Fall 2026

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Unveiled: The Unreleased Spider-Man Multiplayer Game by Insomniac

Unveiled: The Unreleased Spider-Man Multiplayer Game by Insomniac

April 21, 2024
Euro 2024 is fast approaching, and excitement is building as 24 teams prepare to compete for European football glory in Germany.

Euro 2024 is fast approaching, and excitement is building as 24 teams prepare to compete for European football glory in Germany.

March 28, 2024

OpenEvidence hits $12B valuation, with new round led by Thrive, DST  

January 21, 2026

Royal Trade Documents Containing Afghan Investment Details Allegedly Shared with Convicted Financier

February 11, 2026
6 Top Museums in France

6 Top Museums in France

March 10, 2024
Embracing Tomorrow: 6 Best Fashion Trends of 2024

Embracing Tomorrow: 6 Best Fashion Trends of 2024

March 29, 2024

Cricket players, officials slam ICC for Bangladesh’s T20 World Cup ouster

January 25, 2026

China overturns death sentence of Canadian in sign of diplomatic thaw

February 7, 2026

Vonn crashes in downhill as Johnson wins gold

February 8, 2026

Verstappen Dominates Japanese Grand Prix with Flawless Victory

April 7, 2024

Stars descend on LA for music’s biggest night: Here’s your complete Grammys guide

February 1, 2026

Pros and Cons of Upgrading Your GPU: What to Consider Before Buying

March 10, 2024

Exploring the Future: Apple’s New VR Glasses

March 10, 2024

Exploring Australia’s Stunning Landscapes: A Journey Through the Most Beautiful Places

March 3, 2024

The Drama of Flawed Football Giants

April 14, 2024

Judge orders stop to FBI search of devices seized from Washington Post reporter

January 21, 2026

Categories

  • Economy
  • Fashion
  • Food
  • Gaming
  • LifeStyle
  • Sport
  • Technology
  • Travel

Recent News

Managing Screen Time in Daily Life: Practical Habits for Better Focus

Managing Screen Time in Daily Life: Practical Habits for Better Focus

February 17, 2026

Prime Minister Defends Peerage Decision Amid Claims Peer Withheld Information About Sex Offender Connections

February 11, 2026
  • Imprint

© 2024 trablog.

No Result
View All Result
  • Home
  • Lifestyle
    • Fashion
    • Food
    • Travel

© 2024 trablog.