• [object Object]@lemmy.ca
    link
    fedilink
    English
    arrow-up
    4
    ·
    28 天前

    This is cool!

    • it doesn’t seem fair to draw a line of Google models and combine Gemma and Gemini. They’re two very different models.
    • for active parameters it’s interesting that is basically flat, but it does look like MOE are better
  • Mika@piefed.ca
    link
    fedilink
    English
    arrow-up
    2
    ·
    28 天前

    Benchmark that goes open source just becomes a part of training data :-)