Hacker Read

scosman | karma 2119 | avg karma 4.65 · 2023-09-27 14:15:06

You could fine tune and release that. It’s not software so exact parallels don’t make sense, but the open permissions are great.

monocasa | karma 27236 | avg karma 2.94 · 2023-09-27 14:37:09

I mean, it's absolutely software.

scosman | karma 2119 | avg karma 4.65 · 2023-09-27 17:05:37

Software is used to make models, but the models aren’t software anymore than Toy Story is software.

monocasa | karma 27236 | avg karma 2.94 · 2023-09-27 17:15:30

There's literally a list of opcodes to be executed in the model. There's a whole lot of data too, but that's part of the build just as much as anything in a .data section.

mbakke | karma 2255 | avg karma 13.34 · 2023-09-27 17:34:56

Forgive my ignorance, I haven't studied the AI tooling landscape yet. Are you saying these models have a structured binary format and "running" them is just a matter of having a "player" with the right "codec"?

Or are they directly executing CPU instructions?

reply

monocasa | karma 27236 | avg karma 2.94 · 2023-09-27 18:06:38

They are basically a series of intermediate bytecodes to be compiled to the hardware they actually run on, in addition to the large tables of weights that bytecode references.

scosman | karma 2119 | avg karma 4.65 · 2023-09-27 22:27:00

Just a few billion of them that no person created or understands, which are based on the their input data and not design decisions, which include random initialization. I see no reason to treat them any differently than software in a conversation about OSS.

kanwisher | karma 883 | avg karma 2.27 · 2023-09-28 08:43:05

Model inputs are definitely design decisions, what data to use, how to fine tune them, what methods were used in weighting things. This is LITERALLY the source of the model. The model is a binary like an EXE

camgunz | karma 5075 | avg karma 2.2 · 2023-09-28 12:58:41

Toy Story is also software. It's a program you run on a renderer to generate a/v.