It's not a lot faster for input, but it's something like 10x faster for output (Mixtral vs GPT-3.5). This could enable completely new modes of interaction with LLMs, e.g. agents.
When used for business logic, they also execute about 20x faster than the same logic encoded in a client, and in far fewer LOC. Getting rid of all those round-trips has a huge effect!
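A back-of-envelope sketch of the round-trip effect, with purely illustrative numbers (the RTT and statement count are assumptions, not measurements): logic that issues one query per statement from a client pays the network on every statement, while the same logic running server-side pays it once.

```python
# Hypothetical numbers: latency of client-side logic (one round trip per
# statement) vs. the same logic running server-side (one round trip total).
RTT_MS = 5.0        # assumed network round-trip time
STATEMENTS = 100    # assumed number of statements the logic issues

client_side_ms = STATEMENTS * RTT_MS  # every statement pays a round trip
server_side_ms = 1 * RTT_MS           # the whole batch pays one round trip

print(f"client-side: {client_side_ms:.0f} ms")  # 500 ms
print(f"server-side: {server_side_ms:.0f} ms")  # 5 ms
```

With these assumed numbers the gap is 100x; the 20x figure in the comment just implies the logic does some real compute between round trips.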
It's definitely not objectively better. For normal workflows, the startup latency is a real hindrance, even if the community has collectively built norms that mitigate it a bit (keeping a long-running REPL session open, etc.). But in terms of expressiveness plus attainable performance it really is hard to beat. I think it's nice that the path from quick draft to really performant code is continuous, not a big gap like switching languages.
If you can submit what you want and have it back within moments, it beats a fancy typing pool. That kind of speed alone lets a highly trained someone retry different variations and filter more on quality of result.
Scalability. If you need to do it ten times faster, more hardware = done. More people could get it done too, but that requires hiring, training, etc.
I think the most important gain is that client requests can fit in fewer packets -> fewer RTTs (critical on poor networks) -> lower latency to first paint -> better UX.
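The packets-to-RTTs step can be made concrete with a rough model of TCP slow start on a fresh connection. The constants are common defaults, not guarantees (1460-byte MSS, initial congestion window of 10 segments, a 200 ms RTT standing in for a poor mobile network):

```python
import math

MSS = 1460       # typical TCP max segment size, bytes
INITCWND = 10    # common initial congestion window, segments
RTT_MS = 200     # assumed RTT on a poor network

def rtts_to_send(size_bytes):
    """RTTs spent sending size_bytes, with the congestion window
    doubling each RTT (slow start) from INITCWND segments."""
    segments = math.ceil(size_bytes / MSS)
    rtts, cwnd = 0, INITCWND
    while segments > 0:
        segments -= cwnd  # send a full window this RTT
        cwnd *= 2         # window doubles in slow start
        rtts += 1
    return rtts

# A payload that fits in the initial window vs. one that doesn't:
print(rtts_to_send(4_000) * RTT_MS)   # 200 ms (3 segments, 1 RTT)
print(rtts_to_send(60_000) * RTT_MS)  # 600 ms (42 segments, 3 RTTs)
```

Under these assumptions, shrinking a request from ~60 KB to a few KB cuts transfer from 3 RTTs to 1, which is exactly the first-paint win the comment describes.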