Office Space meme:
“If y’all could stop calling an LLM ‘open source’ just because they published the weights… that would be great.”
It’s not just the weights though, is it? You can download the training data they used and run your own instance of the model completely separate from their servers.
You don’t download the training data when running an LLM locally. You’re downloading the already-baked model.
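To illustrate what “running it locally” actually touches, here is a minimal sketch, assuming the transformers library and a hypothetical model id. Loading and running the model pulls in only the baked weights, tokenizer, and config; no training data is involved at any point:

```python
# Minimal sketch of running an "already baked" model locally, assuming the
# transformers library; "example-org/example-7b" is a hypothetical model id.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "example-org/example-7b"  # hypothetical; substitute a real repo
tokenizer = AutoTokenizer.from_pretrained(model_id)     # tokenizer files only
model = AutoModelForCausalLM.from_pretrained(model_id)  # the baked weights

# Inference needs no training data at all -- just the downloaded weights.
inputs = tokenizer("Weights are not training data.", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```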
Did “they” publish the training data? And the hyperparameters?
I mean, I downloaded it from the repo.
You downloaded the weights. That’s something different.
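To make the distinction concrete: listing the files in a typical “open weights” repo shows weight shards and config files, not a training corpus. A minimal sketch, assuming the huggingface_hub client and a hypothetical repo id:

```python
# Sketch: inspect what an "open weights" repo actually ships, assuming the
# huggingface_hub client; the repo id below is hypothetical.
from huggingface_hub import list_repo_files

for name in list_repo_files("example-org/example-7b"):  # hypothetical id
    print(name)

# Typically you'd see something like:
#   config.json                       <- architecture/hyperparameter summary
#   tokenizer.json
#   model-00001-of-00002.safetensors  <- the weights themselves
#   model-00002-of-00002.safetensors
# ...and no training corpus anywhere in the listing.
```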
I may be misunderstanding, but are the weights typically several hundred gigabytes in size?
Yes. The training data is probably a few hundred petabytes.
Oh wow that’s fuckin huge
Yeah, some models are trained on pretty much the entire content of the publicly accessible Internet.
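For scale, the gigabytes-versus-petabytes gap follows from simple arithmetic: weight size is roughly parameter count times bytes per parameter, while the training corpus is the raw text itself. A rough sketch, with illustrative parameter counts not tied to any particular model:

```python
# Back-of-the-envelope arithmetic for why weights are "only" hundreds of
# gigabytes; the parameter counts below are illustrative.
def weights_size_gb(n_params: float, bytes_per_param: float = 2.0) -> float:
    """Approximate on-disk size of a model's weights, in gigabytes."""
    return n_params * bytes_per_param / 1e9

print(weights_size_gb(70e9))   # 70B params at 16-bit precision -> ~140 GB
print(weights_size_gb(405e9))  # 405B params at 16-bit precision -> ~810 GB
# The training corpus, by contrast, scales with the size of the crawl,
# not with the parameter count.
```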