RUN THIS LLM

Search local LLM hardware requirements

← Back to all models

GLM-4 9B Chat

Zhipu · 9B · General

Chinese-English bilingual model with strong general capabilities. 1M token context support.

VRAM Requirements

Quantization	VRAM
Q4_K_M (smallest)	5.4 GB
Q8_0 (balanced)	9.9 GB
FP16 (full quality)	18 GB

Specifications

Parameters: 9B
Category: General
Max context: 128K tokens
System RAM: 12 GB minimum
HuggingFace: THUDM/glm-4-9b-chat

Benchmarks

MMLU: 72 — General knowledge & reasoning (5-shot)
HumanEval: 55 — Code generation (pass@1)
MATH: 42 — Competition-level math reasoning

Loading interactive analysis...

RTL

Add Run This LLM to your home screen

Get the Chrome Extension

Look up hardware requirements for any model right from your browser sidebar — no tab switching needed.

Add to Chrome — It's Free