ikt@aussie.zone to LocalLLaMA@sh.itjust.worksEnglish · 29 天前BullshitBench Viewer - BullshitBench measures whether AI models challenge nonsensical prompts instead of confidently answering them, created by Peter Gostev.petergpt.github.ioexternal-linkmessage-square2linkfedilinkarrow-up112arrow-down10file-text
arrow-up112arrow-down1external-linkBullshitBench Viewer - BullshitBench measures whether AI models challenge nonsensical prompts instead of confidently answering them, created by Peter Gostev.petergpt.github.ioikt@aussie.zone to LocalLLaMA@sh.itjust.worksEnglish · 29 天前message-square2linkfedilinkfile-text
minus-square[object Object]@lemmy.calinkfedilinkEnglisharrow-up4·29 天前This is cool! it doesn’t seem fair to draw a line of Google models and combine Gemma and Gemini. They’re two very different models. for active parameters it’s interesting that is basically flat, but it does look like MOE are better
This is cool!