Paper with Code: You can now run LLMs without Matrix Multiplications
Saw this paper: in essence, MatMul operations can be completely eliminated from LLMs while maintaining strong performance at billion-parameter scales. By using an optimised kernel during inference, the model's memory consumption can be reduced by more than 10× compared with unoptimised models. Source:
Implementation for MatMul-free LM. Contribute to ridgerchu/matmulfreellm development by creating an account on GitHub.
https://github.com/ridgerchu/matmulfreellm
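The key idea behind such MatMul-free models is replacing full-precision weight matrices with ternary weights in {-1, 0, +1}, so a "matrix multiply" collapses into additions and subtractions. A minimal sketch of that trick (this is an illustrative toy, not the actual kernel from the repo):

```python
# Illustrative sketch only: a dense layer whose weights are ternary
# {-1, 0, +1}. With such weights, y = W @ x needs no multiplications;
# each output element is a signed sum of selected inputs.

def ternary_matvec(W, x):
    """Compute W @ x where every W[i][j] is -1, 0, or +1, using only adds/subtracts."""
    y = []
    for row in W:
        acc = 0.0
        for w, xi in zip(row, x):
            if w == 1:
                acc += xi      # weight +1: add the input
            elif w == -1:
                acc -= xi      # weight -1: subtract the input
            # weight 0 contributes nothing (sparsity for free)
        y.append(acc)
    return y

W = [[1, -1, 0],
     [0, 1, 1]]
x = [2.0, 3.0, 5.0]
print(ternary_matvec(W, x))  # [-1.0, 8.0]
```

Since each ternary weight fits in under 2 bits rather than 16 or 32, this is also where the large memory savings come from.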
by Kendall Vernon, Goldman Sachs
AI Discovers a Faster Matrix Multiplication Algorithm
Google DeepMind
https://www.nature.com/articles/s41586-022-05172-4.pdf
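What "faster matrix multiplication" means here is schemes that use fewer scalar multiplications than the naive method. The classic instance is Strassen's 2×2 scheme (7 multiplications instead of 8), the kind of decomposition the DeepMind work searches for automatically. A quick sketch:

```python
# Strassen's 2x2 matrix multiply: 7 scalar multiplications instead of 8.
# Applied recursively to blocks, this yields a sub-cubic algorithm; the
# AlphaTensor paper searches for such low-rank schemes automatically.

def strassen_2x2(A, B):
    (a, b), (c, d) = A
    (e, f), (g, h) = B
    # The 7 products (each a single multiplication of sums/differences)
    m1 = (a + d) * (e + h)
    m2 = (c + d) * e
    m3 = a * (f - h)
    m4 = d * (g - e)
    m5 = (a + b) * h
    m6 = (c - a) * (e + f)
    m7 = (b - d) * (g + h)
    # Recombine into the four entries of A @ B
    return [[m1 + m4 - m5 + m7, m3 + m5],
            [m2 + m4,           m1 - m2 + m3 + m6]]

A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
print(strassen_2x2(A, B))  # [[19, 22], [43, 50]]
```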