So I bought a second graphics card the other week to get in on the local AI craze, and I’ve been having the hardest time using it to build my website. It’s been unreliable: the context gets eaten up and it kind of hallucinates sometimes. I had to double-check everything, which has been very tricky. I use the cloud models too; they’re expensive, but they’re top quality. So the question becomes: how do I get the best of both worlds? This is my answer: subsidizing cloud API costs with my local LLM, a qwen3.6 35B A3B running at 32k context.

Learn Like a Leaner

I need your help in validating the …

Submitted by /u/DiscipleofDeceit666
Key Takeaways
- LeanLoop is designed to execute tasks with minimal context and guidance, ensuring that the local AI remains reliable.
- The use of unit tests at the end of each task helps maintain quality control over the execution process.
- I plan on supporting a multi-threaded approach in the future, allowing for parallel processing of multiple tasks or running different models concurrently.
- For those interested, the original post includes an example run configuration for the author's dual-GPU setup with qwen3.6.
- Pull requests are welcome to help validate and improve the leaners/scripts.
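
The first two takeaways (small tasks with minimal context, each gated by unit tests) can be illustrated with a rough sketch. This is not LeanLoop's actual code: `Task`, `make_check`, `run_lean_loop`, and the two fake models are hypothetical names made up for this example, and the "retry locally, then fall back to the cloud model" policy is an assumption about how the cost subsidy might work. The models are plain Python callables so the sketch runs without any server or API key.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Task:
    description: str              # one small, self-contained instruction
    check: Callable[[str], bool]  # stand-in for the task's unit tests


@dataclass
class Result:
    task: Task
    output: str
    solved_by: str                # "local", "cloud", or "unsolved"


def make_check(func_name: str, args: tuple, expected) -> Callable[[str], bool]:
    """Build a tiny 'unit test': run the generated code and call one function."""
    def check(code: str) -> bool:
        namespace: dict = {}
        try:
            exec(code, namespace)  # demo only: never exec untrusted code unsandboxed
            return namespace[func_name](*args) == expected
        except Exception:
            return False
    return check


def run_lean_loop(tasks: List[Task],
                  local_model: Callable[[str], str],
                  cloud_model: Callable[[str], str],
                  max_local_attempts: int = 2) -> List[Result]:
    results = []
    for task in tasks:
        # Minimal context: each task gets a fresh prompt with no earlier history.
        prompt = task.description
        output, solved_by = "", "unsolved"
        for _ in range(max_local_attempts):
            output = local_model(prompt)
            if task.check(output):
                solved_by = "local"
                break
        if solved_by == "unsolved":
            # Assumed policy: pay for the cloud model only when local attempts fail.
            output = cloud_model(prompt)
            if task.check(output):
                solved_by = "cloud"
        results.append(Result(task, output, solved_by))
    return results


# Fake models so the sketch is runnable; a real setup would call actual LLMs here.
def fake_local(prompt: str) -> str:
    return "def add(a, b): return a + b" if "add" in prompt else "???"


def fake_cloud(prompt: str) -> str:
    return "def sub(a, b): return a - b"


demo_tasks = [
    Task("Write add(a, b)", make_check("add", (2, 3), 5)),
    Task("Write sub(a, b)", make_check("sub", (5, 3), 2)),
]

if __name__ == "__main__":
    for r in run_lean_loop(demo_tasks, fake_local, fake_cloud):
        print(r.task.description, "->", r.solved_by)
```

In this toy run the first task passes its check on the local model and the second is only solved after falling back to the cloud stub. The point is that the test gate, not the model's own confidence, decides whether an output is accepted.
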
Originally published at reddit.com. Curated by AI Maestro.