Trablog
  • Home
  • LifeStyle
  • Gaming
  • Sport
  • Food
  • Travel
  • Fashion
  • Technology
  • Economy
Sunday, June 7, 2026
No Result
View All Result
  • Home
  • LifeStyle
  • Gaming
  • Sport
  • Food
  • Travel
  • Fashion
  • Technology
  • Economy
No Result
View All Result
Trablog
No Result
View All Result
Home Technology

Anthropic has to keep revising its technical interview test so you can’t cheat on it with Claude

January 22, 2026
in Technology
0
0
SHARES
8
VIEWS
Share on FacebookShare on Twitter

Since 2024, Anthropic’s performance optimization team has given job applicants a take-home test to make sure they know their stuff. But as AI coding tools have gotten better, the test has had to change a lot to stay ahead of AI-assisted cheating.

Team lead Tristan Hume described the history of the challenge in a blog post on Wednesday. “Each new Claude model has forced us to redesign the test,” Hume writes. “When given the same time limit, Claude Opus 4 outperformed most human applicants. That still allowed us to distinguish the strongest candidates — but then, Claude Opus 4.5 matched even those.”

The result is a serious candidate-assessment problem. Without in-person proctoring, there’s no way to ensure someone isn’t using AI to cheat on the test — and if they do, they’ll quickly rise to the top. “Under the constraints of the take-home test, we no longer had a way to distinguish between the output of our top candidates and our most capable model,” Hume writes.

The issue of AI cheating is already wreaking havoc at schools and universities around the world, so ironic that AI labs are having to deal with it too. But Anthropic is also uniquely well-equipped to deal with the problem.

In the end, Hume designed a new test that had less to do with optimizing hardware, making it sufficiently novel to stump contemporary AI tools. But as part of the post, he shared the original test to see if anyone reading could come up with a better solution.

“If you can best Opus 4.5,” the post reads, “we’d love to hear from you.”

Originally published at TechCrunch

Tags: artificial-intelligencetechnology
Previous Post

Blue Origin schedules third New Glenn launch for late February, but not to the moon

Next Post

Sportmax Pre-Fall 2026

Next Post

Sportmax Pre-Fall 2026

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Trump threatens Canada with 100% tariffs over its new trade deal with China

January 24, 2026
The Rise of Esports: Where Virtual Battles Shape Real Glory

The Rise of Esports: Where Virtual Battles Shape Real Glory

April 18, 2024
Entryway Shoe Racks for Busy Households

Entryway Shoe Racks for Busy Households

March 17, 2026
Food Storage Containers: The Secret to Faster and Smarter Meal Prep

Food Storage Containers: The Secret to Faster and Smarter Meal Prep

March 15, 2026

These Bose Open Earbuds Are More Than Half Off Right Now

January 22, 2026

OpenEvidence hits $12B valuation, with new round led by Thrive, DST  

January 21, 2026
Old School vs. New School Fashion: Exploring the Pros and Cons

Old School vs. New School Fashion: Exploring the Pros and Cons

January 17, 2024

🏈 NFL mock draft: Kiper projects the first round

January 22, 2026

The AirPods Pro 3 Are $50 Off Right Now

January 23, 2026

Essential Europe Travel Tips for 2024: 10 Must-Know Insights

April 6, 2024

Indulge in Italy’s Culinary Delights: A Gastronomic Journey with Silversea

April 27, 2024

‘You should’ve never been born’: How Buss family infighting drove the $10B sale of the Lakers

January 21, 2026

McCaffrey, 4 QBs named AP NFL MVP finalists

January 22, 2026

Creating the Ultimate Rolled Lasagna

March 9, 2024

10 Shows Like ‘A Knight of the Seven Kingdoms’ You Should Watch Next

January 21, 2026

Tesco Mobile: A Leading Mobile Network in the UK

February 7, 2025

Categories

  • Economy
  • Fashion
  • Food
  • Gaming
  • LifeStyle
  • Sport
  • Technology
  • Travel

Recent News

Quando lo shopping online smette di essere una scelta consapevole

Quando lo shopping online smette di essere una scelta consapevole

April 5, 2026
Warum spätes Scrollen selten entspannt endet

Warum spätes Scrollen selten entspannt endet

April 5, 2026
  • Imprint

© 2024 trablog.

No Result
View All Result
  • Home
  • Lifestyle
    • Fashion
    • Food
    • Travel

© 2024 trablog.