• lime!@feddit.nu
    link
    fedilink
    arrow-up
    2
    ·
    1 day ago

    that’s fair, i’m at like 7x the power. the gpu alone easily pulls 350-400W and the rest of the system isn’t exactly running lean either.

    …man now i really want more vram.

    • Jiral@lemmy.org
      link
      fedilink
      arrow-up
      1
      ·
      edit-2
      1 day ago

      Yes I think Strix Halo makes sense when low power use is a requirement. I built a custom fanless Strix Halo system for the fun of it and I guess there aren’t too many out there running Gemma 4 31B Q8 without a single fan, anywhere.

      And for MoE models that need 60-80GB + context it is perfect. Those are decently fast then as well.

      PS: If VRAM is all you care about the maxed out Mac Studio is fascinating. 512GB unified memory for around 10K EUR (pre crazy bubble prices) That should be able to run pretty large MoE models but dense models of that size would probably run glacially.

      • lime!@feddit.nu
        link
        fedilink
        arrow-up
        1
        ·
        1 day ago

        i’m not buying any hardware for the forseeable future :P it’s all just wishful thinking at the moment. but unified memory architectureis probably going to become more common so maybe in five years when some new motherboard standard becomes the norm…

        • Jiral@lemmy.org
          link
          fedilink
          arrow-up
          1
          ·
          1 day ago

          I fully understand. ;) Buying hardware now means you’d be either crazy or desperate.