M3 Ultra
Prós
- Runs Qwen3 235B-A22B at Q4 natively
- 192 GB VRAM — adequate headroom
1 consumer GPUs can run Qwen3 235B-A22B at Q4 natively. Precise VRAM thresholds and benchmarks below.
Prices and availability may change · affiliate link
llama.cpp 0.2.x · CUDA 12 · ROCm 6 · updated monthly · methodology →
This model requires aFlagship GPU (48 GB+ VRAM)
Best picks by compatibility, VRAM headroom, and value — prices and availability may change.
Prós
Prós
Alguns links são links de afiliado da Amazon. Podemos receber uma comissão sem custo adicional para si. O cookie da Amazon pode durar até 24 horas após o clique.
Check if your GPU can run Qwen3 235B-A22B →
VRAM Calculator — instant compatibility check
M3 Ultra
192 GB · Runs Q4 natively · Check availability
*Prices and availability may change. Some links are affiliate links.
| Quantization | VRAM needed | Disk space | Quality |
|---|---|---|---|
| FP16 (max quality) | 517 GB | 470 GB | Maximum |
| Q8 (high quality) | 258.5 GB | 235 GB | Near-lossless |
| Q4 (recommended) Best balance | 129.3 GB | 117.5 GB | Recommended |
| Q2 (minimum) | 64.6 GB | 58.8 GB | Quality loss |
| Developer | Alibaba |
| Parameters | 235B |
| Context window | 131,072 tokens |
| License | Apache 2.0 |
| Use cases | chat, reasoning, coding, analysis |
| Released | 2025-04 |
Install with Ollama
ollama run qwen3:235b-a22b Hugging Face
Qwen/Qwen3-235B-A22B Qwen3 235B-A22B requires <strong class="text-primary-container">129.3 GB VRAM</strong> at Q4. 1 consumer GPUs meet this threshold. Below 8 GB or 127.30000000000001 GB you'll hit significant offload latency.
1 Q4 native · 1 offload
| GPU Unit | VRAM | Compatibility | Est. Speed | Action |
|---|---|---|---|---|
| M3 Ultra | 192GB | Optimal | 38 tok/s | Calculate → |
| M4 Ultra | 128GB | Offload | 45 tok/s | Calculate → |
Best picks by compatibility, VRAM headroom, and value — prices and availability may change.
M3 Ultra
192 GB VRAM
Check availability →
M4 Ultra
128 GB VRAM
Check availability →
Alguns links são links de afiliado da Amazon. Podemos receber uma comissão sem custo adicional para si. O cookie da Amazon pode durar até 24 horas após o clique.
Qwen3 235B-A22B with 235B parameters only runs fully in multi-GPU or server configurations. Consider distilled versions if available. The VRAM calculator can help you find compatible alternatives.
GPUs that run Qwen3 235B-A22B at Q4 — sorted by AI performance score.
Alguns links são links de afiliado da Amazon. Podemos receber uma comissão sem custo adicional para si. O cookie da Amazon pode durar até 24 horas após o clique.
Similar models in the chat category with comparable VRAM footprints.
The VRAM Calculator tells you exactly which quantization your hardware can handle.
M3 Ultra
Preços mudam diariamente