we really all are going to make it, aren’t we? 2×3090 setup.

“`html I am blown away by the progress being made in local AI setups. A Reddit user shared a post about their…

By AI Maestro May 14, 2026 1 min read
we really all are going to make it, aren’t we? 2×3090 setup.

“`html

I am blown away by the progress being made in local AI setups. A Reddit user shared a post about their success with a ‘club-3090’ setup, which has significant implications for those looking to run large language models locally.

  • The user was able to achieve impressive performance with a relatively modest configuration: 4000 prompt processing (pp) per second and 113 tool calls per second. This is achieved without the use of NVLink, which typically enhances performance but requires additional hardware resources.
  • This setup has opened up new possibilities for running local AI models like Qwen 3.6 with 27B parameters on a machine equipped with only 48 GB VRAM. The user notes that this model runs almost as effectively as Sonnet, which is a significant milestone given the constraints of their hardware.
  • The Reddit post also hints at potential future improvements and questions whether in the next year or so we might see smaller models reach frontier-class intelligence capabilities. This speaks to the rapid advancements being made in local AI environments.

These developments are exciting as they allow for more flexibility and control over AI operations, especially when compared to relying on cloud-based services. The user is now exploring how this model can handle SSH sessions for their Linux computers, indicating its versatility and utility.

“`

“`

Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.

Name
Scroll to Top