we really all are going to make it, aren’t we? 2×3090 setup.

By AI Maestro May 14, 2026 1 min read

i’m blown away. i saw someone made a post the other day about "club-3090", and after having sonnet patch some fixes into it (specifically an SSE session-drop bug and a tool-calling bug), it’s fair to say that even "budget" setups like mine will soon have a path forward for local-only AI.

  • After getting this running, i was originally on WSL2. Fair to say it was "better" than LM Studio but not quite good: generation was around 30 t/s and prompt processing (pp) around 400 t/s. i said fuck it and installed Ubuntu as a dual boot on the same machine (i’m just not very Linux-friendly when it’s headless, i prefer Windows RDP) and wow. i’m getting around 4,000 t/s pp and 113 t/s generation with no NVLink.
  • Supposedly, NVLink would make it faster still.
  • Either way, i’m very excited about this new local future. Qwen 3.6 27B with a 262k context on 48 GB VRAM feels almost sonnet-level, and it’s MUCH faster than cloud. And useful! i had it make some monkey patches and they work fantastic, as well as some relatively useful code reviews.
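To put the numbers above in perspective, here is a minimal Python sketch of the arithmetic. It uses only the approximate throughput figures reported in the post; the function names and the 100k-token prompt / 1k-token reply workload are made up for illustration, and it assumes throughput stays constant across the whole request, which real inference engines do not guarantee.

```python
# Approximate throughput figures as reported in the post (tokens per second).
WSL2 = {"prompt_tps": 400.0, "gen_tps": 30.0}
UBUNTU = {"prompt_tps": 4000.0, "gen_tps": 113.0}


def speedup(before: float, after: float) -> float:
    """Return how many times faster `after` is than `before`."""
    return after / before


def request_seconds(setup: dict, prompt_tokens: int, output_tokens: int) -> float:
    """Estimate wall time for one request: prompt processing plus generation.

    Assumes constant throughput, which is a simplification.
    """
    prefill = prompt_tokens / setup["prompt_tps"]
    decode = output_tokens / setup["gen_tps"]
    return prefill + decode


if __name__ == "__main__":
    print(f"prompt processing speedup: {speedup(WSL2['prompt_tps'], UBUNTU['prompt_tps']):.1f}x")
    print(f"generation speedup: {speedup(WSL2['gen_tps'], UBUNTU['gen_tps']):.1f}x")

    # Hypothetical workload: a 100k-token prompt with a 1k-token reply.
    for name, setup in (("WSL2", WSL2), ("Ubuntu", UBUNTU)):
        print(f"{name}: ~{request_seconds(setup, 100_000, 1_000):.0f} s per request")
```

On these figures, the move to native Ubuntu is roughly a 10x gain in prompt processing and a bit under 4x in generation, so long-context requests benefit the most.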

wondering what the next upgrade path could be. i was thinking about an M5 Ultra with 512 GB plus 4x DGX Sparks (for prompt processing speed), but now i’m wondering whether smaller models will reach frontier-class intelligence (maybe only domain-specific) in the next 12 months?

awesome!

### Takeaways
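- Local AI setups are becoming more viable: on a 2×3090 machine, the poster reports roughly 4,000 t/s prompt processing and 113 t/s generation after moving from WSL2 to native Ubuntu, which feels much faster than cloud services.
- The poster wonders whether smaller models could reach frontier-class intelligence, perhaps only in specific domains, within the next 12 months.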
