we really all are going to make it, aren't we? 2x3090 setup.

“`html

The post discusses significant improvements in running large language models (LLMs) locally, specifically mentioning the successful deployment and performance of a local setup using the club-3090 model. The author notes that this has led to substantial increases in processing power and efficiency compared to previous setups.

Local LLMs are now viable for use cases previously requiring cloud-based services, such as tool-calling and prompt processing.
The performance gains have been dramatic, with the author reporting 4000 PP/s and 113 TK/s on a local setup without NVLink, indicating significant improvements in throughput compared to previous setups.
These advancements suggest that smaller models could potentially achieve frontier class intelligence within the next year, opening up new possibilities for AI development and deployment outside of cloud environments.

“`

### Takeaways:
– Local LLMs are now more viable and performant compared to previous setups.
– Significant performance improvements in local processing power and efficiency.
– Smaller models might achieve frontier class intelligence within the next year.

Source Read original →

Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.

we really all are going to make it, aren’t we? 2×3090 setup.

Empowering Businesses with AI — Smart Tools, Smarter Business Decisions.

follow us

Popular Tag

Popular Post

The SSL 1 is…

OpenAI turns ChatGPT into…

Warren Buffett’s Berkshire Hathaway…