Hacker Read top | best | new | newcomments | leaders | about | bookmarklet login

Oh, that means its a llama architecture model!

Is the tokenizer the same? It may "work" without actually working optimally until llama.cpp patches it in.

And the instruct model was just uploaded.



view as:

Legal | privacy