@eating3645

eating3645@lemmy.world · 1 month ago

Agreed, really hoping they stick to refocusing on the browser.

eating3645@lemmy.world · 5 months ago

ISPs in the US are notorious for getting public funds for services that they never provide, so I wouldn’t be too concerned about that.

eating3645@lemmy.world · 5 months ago

I will archive you!

eating3645@lemmy.world · 6 months ago

Since myself and others had no issues with your float needle example, mind sharing what you searched for, and what Google returned?

eating3645@lemmy.world · 9 months ago

I think that happened 8 years ago or so

eating3645@lemmy.world · edit-2 11 months ago

Let me expand a little bit.

Ultimately the models come down to predicting the next token in a sequence. Tokens for a language model can be words, characters, or more frequently, character combinations. For example, the word “Lemmy” would be “lem” + “my”.

So let’s give our model the prompt “my favorite website is”

It will then predict the most likely token and add it into the input to build together a cohesive answer. This is where the T in GPT comes in, it will output a vector of probabilities.

“My favorite website is”

"My favorite website is "

“My favorite website is lem”

“My favorite website is lemmy”

“My favorite website is lemmy.”

“My favorite website is lemmy.org”

Woah what happened there? That’s not (currently) a real website. Finding out exactly why the last token was org, which resulted in hallucinating a fictitious website is basically impossible. The model might not have been trained long enough, the model might have been trained too long, there might be insufficient data in the particular token space, there might be polluted training data, etc. These models are massive and so determine why it’s incorrect in this case is tough.

But fundamentally, it made up the first half too, we just like the output. Tomorrow some one might register lemmy.org, and now it’s not a hallucination anymore.

eating3645@lemmy.world · edit-2 11 months ago

Very difficult, it’s one of those “it’s a feature not a bug” things.

By design, our current LLMs hallucinate everything. The secret sauce these big companies add is getting them to hallucinate correct information.

When the models get it right, it’s intelligence, when they get it wrong, it’s a hallucination.

In order to fix the problem, someone needs to discover an entirely new architecture, which is entirely conceivable, but the timing is unpredictable, as it requires a fundamentally different approach.

eating3645@lemmy.world · 1 year ago

I’ll keep an eye out 👀

eating3645@lemmy.world · 1 year ago

Thanks for the heads up. My password is %f22N$CBTNgW, can you let me know if it was leaked?

eating3645@lemmy.world · 1 year ago

Poke

eating3645@lemmy.world · 1 year ago

Boil 'em, mash 'em, stick 'em in a stew

eating3645@lemmy.world · edit-2 1 year ago

Even if these servers federate with exploding heads, the individual servers would still moderate content coming from exploding heads users on their servers, no? I agree that there are clearly a lot of shitty users there, but I have not seen a strong argument from you on how federating with them is a problem. Their content here is actively moderated.

~~I could very well be wrong, in which case I will eat my words, but it seems like a bit of an over reaction to me.~~

Just took a quick browse of their instance, eww…