Here model: https://huggingface.co/LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Genesis-V2-APEX-MTP-GGUF
Safetensors: https://huggingface.co/LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Genesis-V2-FP8-Safetensors
Testing results in Open Code on hardware (Beelink gtr9 pro + Strix Halo) done by my friend on Q8_K_P – MTP quant:
- 5 sessions with 200k context, not a single glitch, no loops, no repeated tool calls.
- After 120k tokens I suddenly gave another task that doesn’t intersect with what it was doing at all, and it calmly picked up and solved it correctly.
- Uncensored with MTP support with APEX quantization.
Recommended quant: APEX, APEX-MTP
Recommended settings for LM Studio:
Or use this minimal string as the first line:
You are Qwen, created by Alibaba Cloud. You are a helpful assistant.
Then add anything you want after. Model may underperform without this first line.
Settings:
| Parameter | Value |
|---|---|
| Temperature | 0.7 |
| Top K Sampling | 20 |
| Presence Penalty | 1.5 |
| Repeat Penalty | 1.0 |
| Top P Sampling | 0.8 |
| Min P Sampling | 0 |
| Seed | 42 |
Enjoy 😄
submitted by /u/EvilEnginer
[link] [comments]
Originally published at reddit.com. Curated by AI Maestro.
Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.




