NVIDIA GeForce RTX 3050
NVIDIA GeForce RTX 3050 is a mid-range GPU from NVIDIA. It offers 8 GB of VRAM, adequate for compact to mid-sized models. It delivers 9.1 TFLOPS of FP32 compute performance with 224 GB/s of memory bandwidth It can run 17 models in our database. It's the sweet spot for budget-conscious users who still want to run capable local models.
Specifications
| Vendor | NVIDIA |
| Full Name | NVIDIA GeForce RTX 3050 |
| VRAM | 8GB |
| Performance Tier | Mid-Range |
| Benchmark Score | 12,800 |
| FP32 Compute | 9.1TFLOPS |
| Memory Bandwidth | 224GB/s |
| Compatible Models | 17Models that can run on this GPU |
Strengths
- 8 GB VRAM — suitable for compact and mid-sized models
Limitations
- 9.1 TFLOPS — modest compute may limit inference speed
- 224 GB/s bandwidth — may bottleneck larger model inference
Compatible Models (17)
Yi81%
Yi 1.5 9B Q4_K_M
9.0B
7.7 GB
Q4_K_M
4,096 ctx
Llama80%
Llama 3.1 8B Q4_K_M
8.0B
6.8 GB
Q4_K_M
8,192 ctx
Llama80%
Llama 3.1 8B 128K Q4_K_M
8.0B
7.4 GB
Q4_K_M
131,072 ctx
DeepSeek80%
DeepSeek R1 Distill Llama 8B Q4_K_M
8.0B
6.8 GB
Q4_K_M
131,072 ctx
DeepSeek79%
DeepSeek R1 Distill Qwen 7B Q4_K_M
7.0B
6 GB
Q4_K_M
131,072 ctx
Qwen78%
Qwen3 8B Q4_K_M
8.0B
6.8 GB
Q4_K_M
32,768 ctx
Mistral76%
Mistral 7B Q4_K_M
7.0B
6 GB
Q4_K_M
8,192 ctx
Qwen68%
Qwen3 4B Q4_K_M
4.0B
3.4 GB
Q4_K_M
32,768 ctx
Gemma68%
Gemma 3 4B Q8_0
4.0B
5.6 GB
Q8_0
32,768 ctx
Gemma66%
Gemma 3 4B Q4_K_M
4.0B
3.4 GB
Q4_K_M
32,768 ctx
DeepSeek56%
DeepSeek R1 Distill Qwen 1.5B Q4_K_M
1.5B
1.3 GB
Q4_K_M
131,072 ctx
Qwen55%
Qwen3 1.8B Q4_K_M
1.8B
1.5 GB
Q4_K_M
32,768 ctx
+ 5. Check the Model Library for the complete list.