LM Calc

v0.9.0-alpha

Set your hardware constraints and get a ranked list of open-weight LLMs — each shown at the highest-quality quant that fits your RAM and meets your speed floor. Models that don't fit are listed below with the reason.

Hardware

Memory bandwidth caps token generation speed.

RAM available for the model and KV cache.

Workload

Tokens in scope (in + out)

10

Slower models are filtered out.

Picks the highest-quality weight quant that fits.

Picks the highest-quality KV quant that fits.

Models

Developers:
40 models meet your constraints

31 filtered out