sp3ctre@feddit.orgEnglish · 2 days agoYour best local LLM for low-VRAM (6GB)?plus-squaremessage-squaremessage-square13linkfedilinkarrow-up130arrow-down13
arrow-up127arrow-down1message-squareYour best local LLM for low-VRAM (6GB)?plus-squaresp3ctre@feddit.orgEnglish · 2 days agomessage-square13linkfedilink
ikt@aussie.zoneEnglish · 5 days agoDystopiaBench - AI Ethics Stress Testplus-squaredystopiabench.comexternal-linkmessage-square15linkfedilinkarrow-up17arrow-down11
arrow-up16arrow-down1external-linkDystopiaBench - AI Ethics Stress Testplus-squaredystopiabench.comikt@aussie.zoneEnglish · 5 days agomessage-square15linkfedilink
SuspiciousCarrot78@aussie.zoneEnglish · edit-25 days agoClaude? No. Cucumbers? Yes!plus-squaremessage-squaremessage-square3linkfedilinkarrow-up113arrow-down10
arrow-up113arrow-down1message-squareClaude? No. Cucumbers? Yes!plus-squareSuspiciousCarrot78@aussie.zoneEnglish · edit-25 days agomessage-square3linkfedilink
TheCornCollector@piefed.zipEnglish · edit-27 days agoLlama.cpp MTP Support merged - up to 2.5x speed increaseplus-squaregithub.comexternal-linkmessage-square3linkfedilinkarrow-up142arrow-down11
arrow-up141arrow-down1external-linkLlama.cpp MTP Support merged - up to 2.5x speed increaseplus-squaregithub.comTheCornCollector@piefed.zipEnglish · edit-27 days agomessage-square3linkfedilink
BB84@mander.xyzEnglish · edit-28 days agoOrthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distributionplus-squaregithub.comexternal-linkmessage-square4linkfedilinkarrow-up19arrow-down10
arrow-up19arrow-down1external-linkOrthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distributionplus-squaregithub.comBB84@mander.xyzEnglish · edit-28 days agomessage-square4linkfedilink
SuspiciousCarrot78@aussie.zoneEnglish · edit-28 days ago"The cost of running LLMs is just too damn high"plus-squaremessage-squaremessage-square10linkfedilinkarrow-up139arrow-down11
arrow-up138arrow-down1message-square"The cost of running LLMs is just too damn high"plus-squareSuspiciousCarrot78@aussie.zoneEnglish · edit-28 days agomessage-square10linkfedilink
SuspiciousCarrot78@aussie.zoneEnglish · 9 days agoToken Speed visualiserplus-squaremikeveerman.github.ioexternal-linkmessage-square0linkfedilinkarrow-up19arrow-down11
arrow-up18arrow-down1external-linkToken Speed visualiserplus-squaremikeveerman.github.ioSuspiciousCarrot78@aussie.zoneEnglish · 9 days agomessage-square0linkfedilink
XiELEd@piefed.socialEnglish · edit-210 days ago<8B multilingual models for language learning chatbotsplus-squaremessage-squaremessage-square4linkfedilinkarrow-up111arrow-down12
arrow-up19arrow-down1message-square<8B multilingual models for language learning chatbotsplus-squareXiELEd@piefed.socialEnglish · edit-210 days agomessage-square4linkfedilink
variety4me@lemmy.zipEnglish · 12 days agollama.cpp Multi-Model Server Architecture: ASUS Zenbook UM3504DAplus-squaremessage-squaremessage-square9linkfedilinkarrow-up115arrow-down11
arrow-up114arrow-down1message-squarellama.cpp Multi-Model Server Architecture: ASUS Zenbook UM3504DAplus-squarevariety4me@lemmy.zipEnglish · 12 days agomessage-square9linkfedilink
ElectricVocalist@jlai.luEnglish · 19 days agoGemma4 with MTP was released jlai.luimagemessage-square1linkfedilinkarrow-up114arrow-down111
arrow-up13arrow-down1imageGemma4 with MTP was released jlai.luElectricVocalist@jlai.luEnglish · 19 days agomessage-square1linkfedilink
Jeena@piefed.jeena.netEnglish · 19 days agoGood translation models which fit on a smartphone?plus-squaremessage-squaremessage-square10linkfedilinkarrow-up123arrow-down12
arrow-up121arrow-down1message-squareGood translation models which fit on a smartphone?plus-squareJeena@piefed.jeena.netEnglish · 19 days agomessage-square10linkfedilink
tristynalxander@mander.xyzEnglish · edit-220 days agoAI-Editor in LibreOffice Writer?plus-squaremessage-squaremessage-square3linkfedilinkarrow-up119arrow-down13
arrow-up116arrow-down1message-squareAI-Editor in LibreOffice Writer?plus-squaretristynalxander@mander.xyzEnglish · edit-220 days agomessage-square3linkfedilink
ikt@aussie.zoneEnglish · 24 days agoa little locallama game theory ...gameplus-squaremessage-squaremessage-square5linkfedilinkarrow-up111arrow-down12
arrow-up19arrow-down1message-squarea little locallama game theory ...gameplus-squareikt@aussie.zoneEnglish · 24 days agomessage-square5linkfedilink
ikt@aussie.zoneEnglish · 25 days agoMistral Medium 3.5 releasedplus-squaremistral.aiexternal-linkmessage-square1linkfedilinkarrow-up120arrow-down11
arrow-up119arrow-down1external-linkMistral Medium 3.5 releasedplus-squaremistral.aiikt@aussie.zoneEnglish · 25 days agomessage-square1linkfedilink
hendrik@palaver.p3x.deEnglish · 25 days agoIs there any good general AI-Agent /workflow platform which isn't vibe-coded?plus-squaremessage-squaremessage-square7linkfedilinkarrow-up119arrow-down11
arrow-up118arrow-down1message-squareIs there any good general AI-Agent /workflow platform which isn't vibe-coded?plus-squarehendrik@palaver.p3x.deEnglish · 25 days agomessage-square7linkfedilink
variety4me@lemmy.zipEnglish · 26 days agowould you laugh at me if I ran gemma-4-26b on a 4 core Xeon, with 32GB RAM, no GPU?plus-squarecodeberg.orgexternal-linkmessage-square2linkfedilinkarrow-up131arrow-down12
arrow-up129arrow-down1external-linkwould you laugh at me if I ran gemma-4-26b on a 4 core Xeon, with 32GB RAM, no GPU?plus-squarecodeberg.orgvariety4me@lemmy.zipEnglish · 26 days agomessage-square2linkfedilink
robber@lemmy.mlEnglish · edit-226 days agollama.cpp: don't sleep on --split-mode tensorplus-squaregithub.comexternal-linkmessage-square0linkfedilinkarrow-up118arrow-down11
arrow-up117arrow-down1external-linkllama.cpp: don't sleep on --split-mode tensorplus-squaregithub.comrobber@lemmy.mlEnglish · edit-226 days agomessage-square0linkfedilink
Yerbouti@sh.itjust.worksEnglish · 27 days agoNoob here: Why is Google making Gemma open-source?plus-squaremessage-squaremessage-square24linkfedilinkarrow-up116arrow-down12
arrow-up114arrow-down1message-squareNoob here: Why is Google making Gemma open-source?plus-squareYerbouti@sh.itjust.worksEnglish · 27 days agomessage-square24linkfedilink
hok@lemmy.dbzer0.comEnglish · 27 days agoWhich open models are actually good at agentic coding?plus-squaremessage-squaremessage-square9linkfedilinkarrow-up121arrow-down11
arrow-up120arrow-down1message-squareWhich open models are actually good at agentic coding?plus-squarehok@lemmy.dbzer0.comEnglish · 27 days agomessage-square9linkfedilink
ikt@aussie.zoneEnglish · 27 days agoBullshitBench Viewer - BullshitBench measures whether AI models challenge nonsensical prompts instead of confidently answering them, created by Peter Gostev.plus-squarepetergpt.github.ioexternal-linkmessage-square2linkfedilinkarrow-up112arrow-down10
arrow-up112arrow-down1external-linkBullshitBench Viewer - BullshitBench measures whether AI models challenge nonsensical prompts instead of confidently answering them, created by Peter Gostev.plus-squarepetergpt.github.ioikt@aussie.zoneEnglish · 27 days agomessage-square2linkfedilink