NVIDIA GeForce RTX 2060
NVIDIA GeForce RTX 2060 is a entry-level GPU from NVIDIA. It offers 6 GB of VRAM, suitable for small and heavily quantized models. It delivers 6.45 TFLOPS of FP32 compute performance with 336 GB/s of memory bandwidth It can run 12 models in our database. It's a cost-effective entry point for experimenting with local LLMs.
Specifications
| Vendor | NVIDIA |
| Full Name | NVIDIA GeForce RTX 2060 |
| VRAM | 6GB |
| Performance Tier | Entry |
| Benchmark Score | 14,100 |
| FP32 Compute | 6.45TFLOPS |
| Memory Bandwidth | 336GB/s |
| Compatible Models | 12Models that can run on this GPU |
Strengths
- 336 GB/s bandwidth — good memory throughput
- Affordable entry point for trying local LLMs
Limitations
- 6 GB VRAM — limited to small and heavily quantized models
- 6.45 TFLOPS — modest compute may limit inference speed
- Limited future-proofing — may struggle with next-generation models
Compatible Models (12)
DeepSeek79%
DeepSeek R1 Distill Qwen 7B Q4_K_M
7.0B
6 GB
Q4_K_M
131,072 ctx
Mistral76%
Mistral 7B Q4_K_M
7.0B
6 GB
Q4_K_M
8,192 ctx
Qwen68%
Qwen3 4B Q4_K_M
4.0B
3.4 GB
Q4_K_M
32,768 ctx
Gemma68%
Gemma 3 4B Q8_0
4.0B
5.6 GB
Q8_0
32,768 ctx
Gemma66%
Gemma 3 4B Q4_K_M
4.0B
3.4 GB
Q4_K_M
32,768 ctx
DeepSeek56%
DeepSeek R1 Distill Qwen 1.5B Q4_K_M
1.5B
1.3 GB
Q4_K_M
131,072 ctx
Qwen55%
Qwen3 1.8B Q4_K_M
1.8B
1.5 GB
Q4_K_M
32,768 ctx
Qwen55%
Qwen3 1.8B FP16
1.8B
4.7 GB
FP16
32,768 ctx
Gemma52%
Gemma 3 1B FP16
1.0B
2.6 GB
FP16
32,768 ctx
Gemma50%
Gemma 3 1B Q4_K_M
1.0B
1.1 GB
Q4_K_M
32,768 ctx
Qwen42%
Qwen3 0.6B Q4_K_M
600M
1.2 GB
Q4_K_M
32,768 ctx
Qwen42%
Qwen3 0.6B FP16
600M
1.6 GB
FP16
32,768 ctx