How do you figure that we can still confidently say it’s just a language model?
It was trained on language for the primary purpose of producing text, but that's not necessarily all it can do. The billions of nodes and parameters it contains allow it to compute extremely complicated functions. Who's to say some subset of those nodes isn't forming some basic primitive used for reasoning?
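Just to make the "basic primitive" idea concrete with a toy example (plain numpy, nothing to do with what any actual LLM does internally): a couple of units with the right weights are enough to compute XOR exactly.

```python
# Toy illustration: two hidden ReLU units wired by hand compute XOR exactly.
# This says nothing about real LLM internals; it just shows that a tiny
# subset of "nodes" can implement a logical primitive.
import numpy as np

def relu(x):
    return np.maximum(x, 0)

def xor_net(x1, x2):
    h1 = relu(x1 + x2)        # fires when at least one input is on
    h2 = relu(x1 + x2 - 1)    # fires only when both inputs are on
    return h1 - 2 * h2        # subtracting the "both on" case twice gives XOR

for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(a, b, "->", xor_net(a, b))   # prints 0, 1, 1, 0
```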
There's more of a model inside large language models than was previously thought. How much of a model? Nobody seems to know. There was that one result where someone found what looked like an Othello board encoded in the model's internal activations.
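That was the Othello-GPT probing work. The general shape of the technique, very roughly (random placeholder data here, a single board square, an off-the-shelf sklearn probe; the real experiment used activations from a transformer trained on Othello move sequences and was more involved):

```python
# Rough sketch of a probing experiment: can a simple classifier read a board
# square off a model's hidden activations? The data below is random
# placeholder data standing in for real hidden states and board labels.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n_positions, d_model = 5000, 512                          # hypothetical sizes
activations = rng.normal(size=(n_positions, d_model))     # placeholder hidden states
square_state = rng.integers(0, 3, size=n_positions)       # 0=empty, 1=black, 2=white (placeholder labels)

X_train, X_test, y_train, y_test = train_test_split(
    activations, square_state, test_size=0.2, random_state=0
)

probe = LogisticRegression(max_iter=1000)   # one probe per board square
probe.fit(X_train, y_train)
print("probe accuracy:", probe.score(X_test, y_test))
# On random data this sits near chance; the surprising part of the original
# result was that probes on the real activations did far better than chance.
```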
Someone below wrote:
> We know the basic architecture of large language models, but hardly anything about how they calculate anything specific. That’s the mystery. It will take research, not casual tinkering.
Yes. This is an unexpected situation. Understanding how these things work lags far behind making them work, which is a big problem, since they make up plausible-sounding stuff when they don't know the answer.
Until a language model can develop a generalized solution to a real-world phenomenon, it's not even close to AGI. The current iteration of ML algorithms is useful, yes, but not intelligent.
It is literally just a goddamn language model. It is very good at making plausibly human-like sentences. It is not a general intelligence, it is not your friend, it is not a research assistant. It is not designed to deliver content which is correct, it is designed to deliver content which is similar to human language.
It might get things correct most of the time! But that is purely incidental.
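If you want it spelled out, the pre-training objective is roughly this (made-up shapes, a random tensor standing in for the model's actual output):

```python
# The entire training signal, schematically: predict the next token of human
# text. "Correct" never enters into it, only "matches what people wrote".
import torch
import torch.nn.functional as F

vocab_size = 50_000                            # hypothetical vocabulary size
logits = torch.randn(1, 8, vocab_size)         # stand-in for model output: (batch, seq_len, vocab)
tokens = torch.randint(0, vocab_size, (1, 9))  # a training sequence of 9 token ids

# Shift by one: position t predicts token t+1. The loss is just cross-entropy
# against whatever token actually came next in the training text.
loss = F.cross_entropy(
    logits.reshape(-1, vocab_size),   # predictions for positions 0..7
    tokens[:, 1:].reshape(-1),        # the tokens that actually followed
)
print(loss)   # note there is no term anywhere for "is the statement true"
```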
But there isn't such a thing as a raw model, is there? In order to get anything out of a language model, it has to "learn" some objective, and that objective has to be imposed from above.
That's more or less what I would expect from the best language model: things that look very close to real but fail in some way a smart human can spot.
You need a "knowledge" model to regurgitate facts and an "inference" model to evaluate the probability of a statement being correct.
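Something like retrieve-then-score, very roughly. Toy sketch (hand-written facts, cosine similarity standing in for a real verifier or entailment model):

```python
# Crude sketch of the two-model idea: a "knowledge" store you retrieve from,
# and a separate "inference" step that scores how well a claim is supported.
# Everything here (the facts, the scorer) is a toy stand-in.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

facts = [
    "Water boils at 100 degrees Celsius at sea level.",
    "The Othello board is 8 by 8 squares.",
    "GPT-style models are trained to predict the next token.",
]

vectorizer = TfidfVectorizer().fit(facts)
fact_vectors = vectorizer.transform(facts)

def support_probability(claim: str) -> tuple[str, float]:
    """'Inference' step: score the claim against the best-matching stored fact.
    Cosine similarity is a placeholder for a real verifier model."""
    claim_vec = vectorizer.transform([claim])
    sims = cosine_similarity(claim_vec, fact_vectors)[0]
    best = sims.argmax()
    return facts[best], float(sims[best])

print(support_probability("Othello is played on an 8x8 board"))
```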
I don't know much about language models, but don't they just learn patterns between words, without any real reasoning capability?