Hacker Read top | best | new | newcomments | leaders | about | bookmarklet login

Vicuna looks pretty good. But as said, commercial use not possible.

Why do you think that Llama can be replaced? I mean it is extremely costly to train that thing. And it is there even a clean open source data set for the task?

PS: wouldn't be surprised if Meta, OpenAi, or google will train something for a Billion $ in costs of compute or more.



view as:

the 2048 sequence length makes it uncompetitive, especially now that we are entering ~infinite-length bots. $1M for training is peanuts for FB. There are open datasets like redPajama and alternative models are (hopefully) coming up

Legal | privacy