RUN THIS LLM
Search local LLM hardware requirements
GLM-4 9B Chat
Zhipu · 9B · General
Chinese-English bilingual model with strong general capabilities. 1M token context support.
VRAM Requirements
| Quantization | VRAM |
|---|---|
| Q4_K_M (smallest) | 5.4 GB |
| Q8_0 (balanced) | 9.9 GB |
| FP16 (full quality) | 18 GB |
Specifications
- Parameters: 9B
- Category: General
- Max context: 128K tokens
- System RAM: 12 GB minimum
- HuggingFace: THUDM/glm-4-9b-chat
Benchmarks
- MMLU: 72 — General knowledge & reasoning (5-shot)
- HumanEval: 55 — Code generation (pass@1)
- MATH: 42 — Competition-level math reasoning
Loading interactive analysis...