• kata1yst@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    9
    ·
    edit-2
    23 days ago

    Depends on your goals. For raw tokens per second, yeah you want an Nvidia card with enough memory for your target model(s).

    But if you don’t care so much for speed beyond a certain amount, or you’re okay sacrificing some speed for economy, AMD RX7900 XT/XTX or 9070 both work pretty well for small to mid sized local models.

    Otherwise you can look at the SOC type solutions like AMD Strix Halo or Nvidia DGX for more model size at the cost of speed, but always look for reputable benchmarks showing ‘enough’ speed for your use case.