OpenAI and others seek new path to smarter AI as current methods hit limitations

misk@sopuli.xyz · 3 days ago

A_A@lemmy.world · 3 days ago

… “Alibaba (LLM)” … is it this ? … ?
Qwen2.5: A Party of Foundation Models!
https://qwenlm.github.io/blog/qwen2.5/

brucethemoose@lemmy.world · edit-2 3 days ago

BTW, as I wrote that post, Qwen 32B coder came out.

Now a single 3090 can beat GPT-4o, and do it way faster! In coding, specifically.

A_A@lemmy.world · 3 days ago

Great news 😁🥂, someone should make a new post on this !

brucethemoose@lemmy.world · 3 days ago

Yep.

32B fits on a “consumer” 3090, and I use it every day.

72B will fit neatly on 2025 APUs, though we may have an even better update by then.

I’ve been using local llms for a while, but Qwen 2.5, specifically 32B and up, really feels like an inflection point to me.