Impressive speed. Are there any plans to run fine-tuned models?


Yeah, I'd be interested in hearing that as well. I think most people would accept a slight hit in speed for something that's been used successfully in production at several well-known companies.

It's going to get faster. That's why it's still experimental. ;)

If the speedups pan out, that is the plan! The work on performance hasn't started yet, however.

There are a lot of use cases just waiting for a good system that can manage at least real-time generation.

Extremely impressive. Does anyone know if performance is likely to decrease as more features are implemented? Because if not, this is a winner.

It will be interesting to see if they can start to load models like GPT-3 onto ASICs and realize some serious performance gains.

We'll see how fast it is on consumer hardware once decent quantisations are available.
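
For anyone who wants to try it once quantised checkpoints do land: here's a minimal sketch of 4-bit loading with Hugging Face transformers + bitsandbytes. The model id is just a placeholder, not the actual model being discussed:

    # Minimal sketch: load a causal LM in 4-bit via transformers + bitsandbytes.
    # The model id below is a placeholder -- swap in whatever checkpoint you test.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    model_id = "some-org/some-65b-model"  # placeholder, not a real checkpoint

    bnb_config = BitsAndBytesConfig(
        load_in_4bit=True,                     # store weights in 4-bit
        bnb_4bit_compute_dtype=torch.float16,  # do the math in fp16
        bnb_4bit_quant_type="nf4",             # normal-float 4-bit quantization
    )

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=bnb_config,
        device_map="auto",  # spread layers across available GPUs/CPU
    )

    inputs = tokenizer("Hello, world", return_tensors="pt").to(model.device)
    print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))

4-bit weights are roughly a quarter of the fp16 memory footprint, which is what makes models this size plausible on consumer GPUs at all.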

Woah cool. That plus fast/optimized hardware will probably be it.

Me too - let's see what the sustained performance is like. That said, with this much headroom, I'm cautiously optimistic that even with some throttling going on, it'll still be plenty fast for anything I'm likely to throw at it.

It's funny. You see comments like this, and I think people confidently set very specific hurdles for what these models should be able to do.

I have a rather large spend across the universe of models, and compared to a year ago, it's amazing what is possible. If we continue at anywhere close to this speed, it will be amazing what's possible a year from now.


Indubitably, good fellow.

I suspect if we can fine tune and optimize this 65B model, we can achieve some truly remarkable results.
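
If anyone wants a concrete picture of what fine-tuning at that scale might look like without updating all 65B weights, here's a minimal LoRA sketch using the peft library. The checkpoint id and target module names are assumptions (LLaMA-style naming), not confirmed details of this model:

    # Minimal sketch: parameter-efficient fine-tuning with LoRA via peft.
    # Checkpoint id and target_modules are assumptions, not confirmed.
    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    model = AutoModelForCausalLM.from_pretrained(
        "some-org/some-65b-model",  # placeholder checkpoint
        device_map="auto",
    )

    lora_config = LoraConfig(
        r=8,                                  # low-rank adapter dimension
        lora_alpha=16,                        # adapter scaling factor
        target_modules=["q_proj", "v_proj"],  # attention projections (LLaMA-style names)
        lora_dropout=0.05,
        task_type="CAUSAL_LM",
    )

    model = get_peft_model(model, lora_config)
    model.print_trainable_parameters()  # typically well under 1% of total params

The appeal is that only the small adapter matrices are trained, so the memory and compute cost is a fraction of full fine-tuning.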


This is a welcome spec improvement! I'll be curious how long it takes to reach wide adoption.

Right. But there's so much effort, money, and reputation invested in various configurations, experimental architectures, etc. that I feel something is likely to pan out in the coming months, enabling models with more capability for less compute.

I like where they are going. Are there benchmarks out yet?

Well, hardware and parameter counts are scaling exponentially, so it seems very feasible that it could happen quite soon. Of course, it's possible that we'll hit a wall somewhere, but it seems that just scaling current models up could be enough to reach the point where they can self-improve or gain more compute for themselves.

I expect the performance here to be blazingly fast... is that an accurate assumption?

Exactly. Correspondingly, my hope (along with one of the folks below) is that advances in hardware will obliterate any speed difference and let the back end be king.

Time will tell.


Yes, and it should be faster, but I don't want to promise anything until we see what real-world usage and performance look like during the beta. Thanks again for the feedback.

Ooooh, this is interesting. I'll see what the performance looks like.