we really all are going to make it, aren't we? 2x3090 setup.

I'm blown away. Someone posted the other day about "club-3090", and after having Sonnet patch a couple of fixes into it (an SSE session-drop bug and a tool-calling bug), it's fair to say that even "budget" setups like mine will soon have a path forward for local-only AI.

I was originally running this under WSL2. It was "better" than LM Studio but still not great: generation was around 30 t/s and prompt processing (pp) around 400 t/s. So I said fuck it and installed Ubuntu as a dual boot on the same machine (I'm just not very Linux-friendly when it's headless, I prefer Windows RDP) and wow. I'm getting around 4000 t/s pp and 113 t/s generation with no NVLink.
Supposedly NVLink would make it even faster.
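To put those numbers in perspective, here is a rough back-of-the-envelope sketch of what the jump means for one long request. The rates are the approximate figures quoted above; the 100,000-token prompt and 1,000-token reply are hypothetical values picked only for illustration, and the model ignores real-world effects like prompt caching or warm-up.

```python
def response_time_s(prompt_tokens, output_tokens, pp_rate, gen_rate):
    """Seconds to ingest the prompt plus seconds to generate the reply."""
    return prompt_tokens / pp_rate + output_tokens / gen_rate


# Rough rates quoted in the post, in tokens per second.
SETUPS = {
    "WSL2": {"pp_rate": 400, "gen_rate": 30},
    "Ubuntu": {"pp_rate": 4000, "gen_rate": 113},
}

# Hypothetical workload, not from the post.
PROMPT_TOKENS = 100_000
OUTPUT_TOKENS = 1_000

for name, rates in SETUPS.items():
    total = response_time_s(PROMPT_TOKENS, OUTPUT_TOKENS, **rates)
    print(f"{name}: {total:.0f} s")
```

Under those assumptions the same request goes from roughly 283 s to roughly 34 s, and most of the saving comes from prompt processing rather than generation.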
Either way, I'm very excited about this local future. Qwen 3.6 27B with 262k context on 48 GB of VRAM feels almost Sonnet-level, and it's MUCH faster than cloud. And useful! I had it write some monkey patches that work fantastically, as well as some relatively useful code reviews.

Wondering what the next upgrade path could be. I was thinking about an M5 Ultra with 512 GB plus 4x DGX Spark (for prompt processing speed), but now I'm wondering: will we reach frontier-class intelligence (maybe only domain-specific) in smaller models in the next 12 months?

awesome!


### Takeaways
- Local AI setups are becoming viable even on "budget" hardware: the author reports about 4000 t/s prompt processing and 113 t/s generation on 2x3090 without NVLink.
- The author wonders whether smaller models could reach frontier-class intelligence, perhaps only in specific domains, within the next 12 months.
