-
-
-
-
-
-
Inference Providers
Active filters:
awq
openbmb/MiniCPM-o-4_5-awq
Any-to-Any
•
9B
•
Updated
•
263
•
9
mratsim/MiniMax-M2.1-FP8-INT4-AWQ
Text Generation
•
39B
•
Updated
•
5.58k
•
32
Text Generation
•
2B
•
Updated
•
22
•
13
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
•
14B
•
Updated
•
20.2k
•
2
casperhansen/tinyllama-1b-awq
Text Generation
•
Updated
•
65
•
1
TheBloke/TinyLlama-1.1B-Chat-v0.3-AWQ
Text Generation
•
1B
•
Updated
•
81.4k
•
4
TheBloke/deepseek-coder-1.3b-instruct-AWQ
Text Generation
•
1B
•
Updated
•
224
•
5
Text Generation
•
7B
•
Updated
•
2.73k
•
4
TheBloke/Mistral-7B-Instruct-v0.2-AWQ
Text Generation
•
7B
•
Updated
•
12.6k
•
52
casperhansen/llama-3-8b-instruct-awq
Text Generation
•
8B
•
Updated
•
53.8k
•
28
TechxGenus/DeepSeek-Coder-V2-Lite-Instruct-AWQ
Text Generation
•
16B
•
Updated
•
3.77k
•
8
TechxGenus/DeepSeek-Coder-V2-Lite-Base-AWQ
Text Generation
•
16B
•
Updated
•
8
•
3
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
8B
•
Updated
•
449k
•
86
Qwen/Qwen2.5-3B-Instruct-AWQ
Text Generation
•
3B
•
Updated
•
49.4k
•
16
Qwen/Qwen2.5-72B-Instruct-AWQ
Text Generation
•
73B
•
Updated
•
745k
•
75
Qwen/Qwen2.5-Coder-32B-Instruct-AWQ
Text Generation
•
33B
•
Updated
•
435k
•
33
casperhansen/llama-3.3-70b-instruct-awq
Text Generation
•
71B
•
Updated
•
193k
•
37
kosbu/Llama-3.3-70B-Instruct-AWQ
Text Generation
•
71B
•
Updated
•
438k
•
10
Qwen/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text
•
8B
•
Updated
•
710k
•
99
Text Generation
•
33B
•
Updated
•
273k
•
124
Text Generation
•
8B
•
Updated
•
137k
•
36
Text Generation
•
4B
•
Updated
•
86.4k
•
23
Any-to-Any
•
11B
•
Updated
•
41.2k
•
17
QuantTrio/Qwen3-235B-A22B-Thinking-2507-AWQ
Text Generation
•
235B
•
Updated
•
3.76k
•
6
openbmb/MiniCPM-V-4_5-AWQ
Image-Text-to-Text
•
9B
•
Updated
•
5.11k
•
12
QuantTrio/Qwen3-VL-32B-Thinking-AWQ
Image-Text-to-Text
•
33B
•
Updated
•
902
•
5
Text Generation
•
358B
•
Updated
•
22.6k
•
23
mratsim/MiniMax-M2.1-BF16-INT4-AWQ
Text Generation
•
39B
•
Updated
•
3.17k
•
5
QuantTrio/GLM-4.7-Flash-AWQ
Text Generation
•
31B
•
Updated
•
85.5k
•
4
casperhansen/mpt-7b-8k-chat-awq
Text Generation
•
Updated
•
8
•
3