PerkyPotato

Large Reasoning Models can overtake LLMs...

Here's my quick 3-minute breakdown:

  1. o1-preview: 97.8% on PlanBench Blocksworld vs. 62.5% for the best LLMs, indicating a shift from approximate retrieval to actual reasoning.
  2. 52.8% on the obfuscated "Mystery Blocksworld" vs. near-zero for LLMs, suggesting the reasoning transfers to abstract problems instead of leaning on memorised surface patterns.
  3. Variable "reasoning tokens" usage correlates with problem difficulty, hinting at an internal search process and adaptive compute (quick sanity-check sketch below).
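For point 3, here's a minimal sketch of the kind of sanity check you could run yourself: log the reasoning-token count the API reports for each Blocksworld instance and correlate it with instance size. The token numbers below are invented placeholders, not figures from the paper, and the usage.completion_tokens_details.reasoning_tokens field is how the OpenAI API currently reports this (check your SDK version).

```python
# Sanity check for point 3: does reasoning-token usage track problem difficulty?
# The (blocks, tokens) pairs below are made-up placeholders, NOT data from the
# paper. In practice you'd log usage.completion_tokens_details.reasoning_tokens
# from the API response for each Blocksworld instance you submit.
from scipy.stats import spearmanr

runs = [  # (number of blocks in the instance, reasoning tokens spent)
    (3, 1_200), (4, 1_900), (5, 3_400), (6, 5_100),
    (7, 8_800), (8, 14_000), (9, 21_500),
]

sizes = [n for n, _ in runs]
tokens = [t for _, t in runs]

rho, p = spearmanr(sizes, tokens)
print(f"Spearman rho = {rho:.2f} (p = {p:.3f})")
# A strongly positive rho is what "adaptive compute" predicts:
# harder instances trigger more internal search, hence more reasoning tokens.
```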
2mo ago · 5.5K views

SwirlyPretzel

Adaptive compute is very interesting to me imo. I wonder how they're allocating variable compute per task, and based on what meta-heuristic.
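Nobody outside OpenAI knows the actual mechanism, but one toy guess at such a meta-heuristic is confidence-gated stopping: keep spending reasoning steps until a self-estimated confidence clears a threshold, so easy problems exit early. Purely illustrative Python sketch below; the reason_one_step / estimate_confidence stubs are made up and say nothing about how o1 actually works.

```python
# Toy illustration of confidence-gated adaptive compute. Everything here is a
# made-up stand-in; it is NOT how o1 allocates reasoning tokens internally.

def reason_one_step(problem: str, scratchpad: list[str]) -> str:
    # Hypothetical stand-in for one unit of internal "reasoning".
    return f"thought {len(scratchpad) + 1} about {problem!r}"

def estimate_confidence(problem: str, scratchpad: list[str]) -> float:
    # Hypothetical stand-in: confidence grows with steps taken,
    # and grows more slowly for longer (presumably harder) problems.
    difficulty = max(len(problem) / 40, 0.5)
    return min(1.0, len(scratchpad) / (10 * difficulty))

def solve_adaptively(problem: str, max_steps: int = 64,
                     confidence_target: float = 0.9) -> int:
    scratchpad: list[str] = []
    for _ in range(max_steps):
        scratchpad.append(reason_one_step(problem, scratchpad))
        if estimate_confidence(problem, scratchpad) >= confidence_target:
            break  # easy problems stop early -> fewer "reasoning tokens", lower cost
    return len(scratchpad)  # proxy for compute spent

print(solve_adaptively("stack A on B"))                                  # small instance
print(solve_adaptively("stack A on B, B on C, C on D, then unstack E"))  # bigger one
```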

WobblyMarshmallow

Adaptive compute is what will help optimise cost for high-complexity tasks, right?

SnoozyPickle

Probably different "cores" for different types of tasks

ZestyPenguin
Student · 2mo

Thanks for the paper! It's really interesting. I've been sounding like a madman explaining to people irl that Generative AI is not the end goal or even the natural next step of AI.

SnoozyPickle

What is the next step?

WobblyJellybean

Thanks for such a great post!!!! That's the kind of content I want to see more of from this community.
