-6

FitMyLLM — Independent benchmarks for self-hosted AI (www.fitmyllm.com)

submitted 1 month ago by anzo@programming.dev to c/homelab@programming.dev

5 comments fedilink hide all child comments

Check what can you use and at what rate of token per seconds would it be... It has examples of many models and quantization levels. Huge resource!

you are viewing a single comment's thread
view the rest of the comments

[-] nutbutter@discuss.tchncs.de 2 points 1 month ago

This feels useless. At least for homelabbers, ollama's model page tells us more useful info. And if a newbie goes there they'll be misguided.

Also, there's a lot of people who use CPUs, they don't list anything about them at all. Like I cannot fit Gemma 4 on my GPU, but ollama offloads it to CPU, and even with small GPUs you can get good performance.

And for nearly all small models, it recommends RTX 5060. Which is a very stupid choice.

[-] B0rax@feddit.org 1 points 1 month ago

What do you mean by „small gpu“?

I have not yet tried that, do you have any guidance? Or does „small gpu“ still mean >500€ GPU?

[-] nutbutter@discuss.tchncs.de 1 points 1 month ago

By small, I mean GPUs like outdated ones, laptop GPUs, or like GPUs with only 4GB or 6GB of VRAM.

this post was submitted on 03 Jun 2026

-6 points (36.4% liked)

HomeLab

228 readers

1 users here now

A homelab is a server or multiple server setup that resides in your home and where you host sevelra applications and virtualized systems for testing and developing

Its a sandbox environment where you can experience and break and fix things in with no repercussions while its down

This is a community where you can share, discuss, or post news relating to homelabs

founded 2 years ago

MODERATORS

anzo@programming.dev