Hacker Read top | best | new | newcomments | leaders | about | bookmarklet login

The code I linked is the model source -- the structural skeleton. You can create a model with this source and put whatever weights you want in it -- just like Doom.


sort by: page size:

The weights are the model.

The small bits of standardish code to utilize them are not that material


The weights are like jpegs, the model itself is still a piece of software that executes code.

The model code is Apache 2.0, the weights are proprietary.

They actually reimplemented the code to load the model. But it’s not clear if the weights of the model are free to use or not

the code is open source but not the weights. as far as i can tell.

I am curious though how would the model weights work out?

Are there model weights?

That is an awesome idea. I wish the authors would open source the code and weights so this can be tried.

What do you think it is? Some corporeal being? It's an overgrown python script pulling weights out of a model.

Code is MIT, weights are under the NC license for now.

Weights are equivalent to compiled object code IMO. All else follows from there.

what are model weights?

When you run that command, where does it download the model weights from?

Isn't the "why" the weights of the model themselves?

Probably the model weight files

No that's what I was getting at. For roughly half the models I come across, the weights are published on HF. For the other half, the weights are available via GitHub (either the releases section, or a link to something like Google Drive).

Model weights seem akin to publishing a decompiled binary.

I noticed that they added a `LICENSE-ModelWeights.txt`! Maybe I should dive into their repo deeper and follow their good practices.

Thanks a lot for the advice! It really helps!


It is definitely possible. At any point, you can just take a snapshot of the weights. Together with a description of the architecture, this is a complete description of a model.
next

Legal | privacy