• queermunist she/her@lemmy.ml · 2 days ago

    It’s pretty obvious how this happened.

    All the data it was trained on said “next year is 2026” and “2027 is two years from now,” and now that it is 2026, the training data doesn’t change with it. It doesn’t know what year it is; it only knows how to regurgitate answers it was already trained on.
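
    A toy illustration of that point (GPT-2 and the prompt are assumptions for the sketch, not anything known about Gemini): a frozen model will happily fill in “the current year” with whatever years dominated its training data.

    ```python
    # Minimal sketch: probe a frozen model's idea of "the current year".
    # GPT-2 is used purely as an example of a model whose training data
    # stopped years ago; its guesses reflect that era, not today's date.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    inputs = tok("The current year is", return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]    # scores for the next token

    top_ids = torch.topk(logits, 5).indices
    print([tok.decode(i) for i in top_ids])       # likely year tokens from the training era
    ```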

    • Drew@sopuli.xyz · 2 days ago

      Nah, training data is not why it answered this way (otherwise it would have training data from many different years, far more than just 2025)

      • queermunist she/her@lemmy.ml · 2 days ago

        There are recency weights on the training data, so after a certain point “next year is 2026” will stop being weighted over “next year is 2027”.

        It’s early in the year, so that threshold hasn’t been crossed yet.
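
        A toy sketch of that recency idea (the weighting scheme and numbers are made up for illustration, not a known detail of any real training pipeline): as long as pre-2026 text vastly outnumbers 2026 text, the old statement still wins even with a recency bonus.

        ```python
        # Toy model of recency weighting: newer documents get sampled more often,
        # but early in 2026 there is so little 2026 text that "next year is 2026"
        # still dominates the sampled mix.
        import random
        from datetime import date

        def recency_weight(doc_date: date, today: date, half_life_days: float = 365.0) -> float:
            """Exponentially down-weight older documents."""
            age_days = (today - doc_date).days
            return 0.5 ** (age_days / half_life_days)

        docs = (
            [("next year is 2026", date(2025, m, 1)) for m in range(1, 13)]  # plenty of 2025 text
            + [("next year is 2027", date(2026, 1, 15))]                     # barely any 2026 text yet
        )

        today = date(2026, 2, 1)
        weights = [recency_weight(d, today) for _, d in docs]
        sample = random.choices([text for text, _ in docs], weights=weights, k=10_000)
        print(sample.count("next year is 2026"), "vs", sample.count("next year is 2027"))
        # The old statement still dominates -- the "threshold not crossed yet" idea above.
        ```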

    • tauonite@lemmy.world · 2 days ago

      It also happened last year if you asked whether 2026 was next year, and that was at the end of the year, not the beginning.

    • just_an_average_joe@lemmy.dbzer0.com · 2 days ago

      This instance actually seems more like “context rot”. I suspect Google is just shoving everything into the context window because their engineering team likes to brag about 10M-token context windows, but the reality is that things get pretty bad when you throw too much stuff in.

      I would expect even very small models (4B params or fewer) to get this question right.
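
      A rough way to poke at that claim (the model, question, and filler below are assumptions for the sketch, not Google’s actual setup): ask the same dated question with more and more irrelevant text stuffed in front of it and see whether the answer degrades.

      ```python
      # Rough context-rot probe. Qwen2.5-0.5B-Instruct stands in for "any small
      # instruct model"; the filler is deliberately irrelevant padding.
      from transformers import AutoModelForCausalLM, AutoTokenizer

      MODEL = "Qwen/Qwen2.5-0.5B-Instruct"   # assumed example model
      tok = AutoTokenizer.from_pretrained(MODEL)
      model = AutoModelForCausalLM.from_pretrained(MODEL)

      QUESTION = "Today is 2026-02-10. Is 2026 the current year? Answer yes or no."
      FILLER = "The quick brown fox jumps over the lazy dog. " * 100

      def ask(context: str) -> str:
          messages = [{"role": "user", "content": context + QUESTION}]
          prompt = tok.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
          inputs = tok(prompt, return_tensors="pt")
          out = model.generate(**inputs, max_new_tokens=8, do_sample=False)
          return tok.decode(out[0, inputs["input_ids"].shape[1]:], skip_special_tokens=True)

      for copies in (0, 1, 5, 20):            # grow the irrelevant context
          print(f"{copies:>2}x filler -> {ask(FILLER * copies)!r}")
      ```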

    • buddascrayon@lemmy.world · 2 days ago

      This is actually the reason why it will never become general AI: they’re not training it with logic, they’re training it with gobbledygook from the internet.

      • kkj@lemmy.dbzer0.com · 2 days ago

        It can’t understand logic anyway. It can only regurgitate its training material. No amount of training will make an LLM sapient.

          • edible_funk@sh.itjust.works · 2 days ago

            Math, physics, and the fundamental programming limitations of LLMs in general. If we’re ever gonna actually develop an AGI, it’ll come about through a completely different pathway than LLMs and algorithmic generative “AI”.

          • kkj@lemmy.dbzer0.com · 2 days ago

            Based on what LLMs are. They predict the probability of the next token (usually a word or piece of a word). They can’t think, they can’t understand, they can’t question things. If you ask one for a seahorse emoji, it has a seizure instead of just telling you that no such emoji exists.
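
            What that looks like mechanically, as a bare-bones sketch (GPT-2 via Hugging Face transformers, chosen only because it is small and public): generation is a loop that scores every possible next token and appends the most likely one; nothing in the loop checks whether the output is true.

            ```python
            # Bare-bones greedy decoding: score the vocabulary, take the argmax,
            # append, repeat. This is all "answering" is at the mechanical level.
            import torch
            from transformers import AutoModelForCausalLM, AutoTokenizer

            tok = AutoTokenizer.from_pretrained("gpt2")
            model = AutoModelForCausalLM.from_pretrained("gpt2")

            ids = tok("The most likely next word is", return_tensors="pt").input_ids
            with torch.no_grad():
                for _ in range(10):                    # generate ten tokens
                    logits = model(ids).logits[0, -1]  # scores over the whole vocabulary
                    next_id = torch.argmax(logits)     # most probable token, nothing more
                    ids = torch.cat([ids, next_id.view(1, 1)], dim=1)

            print(tok.decode(ids[0]))
            ```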