SqueakyPickle
SqueakyPickle

Opus 4.5 completes SWE tasks that would take a human ~4hr 49mins on average successfully 50% of the time.

However GPT-5.1-Codex-Max tops Opus 4.5 on the 80% success rate scoreboard at 32 mins vs. 27 mins

Post image 1
Post image 2
Post image 3
Post image 4
15h ago
Jobs
One interview, 1000+ job opportunities
Take a 10-min AI interview to qualify for numerous real jobs auto-matched to your profile 🔑
+322 new users this month
Discover more
Curated from across