I feel like the big language models have proved this style of learning a language is the wrong approach.
I learnt Japanese; I studied it for 4 years and spent a year in Japan.
You know what worked?
Lots of examples of people using particles.
What did not work?
Text books explaining what the particles do.
A grammatical study of particles is only useful after you’ve gained an understanding of when you should use them from shed loads of examples.
It helps you refine the fine details of when to use them technically, and in formal writing.
For early learning, I posit it’s next to useless.
Language is not a well designed programming language full of orthogonal concepts.
This has long been an argument, but language models really nail down the fact that a probabilistic "similar to existing examples" approach to language is categorically superior to attempting to construct semantically correct statements from "rules".
It is difficult to see an argument that the output of a language model is not derived from the language model, other than that people would prefer it wasn't.
But there isn't such a thing as a raw model, is there? In order to receive anything from a language model it has to 'learn' some objective. And this objective has to be imposed from above.
Yeah the issue is you can generate data, but it won’t be good data. Training over random strings won’t make you learn language, but it’s technically data.
What's really interesting is that these models are using some non-trivial portion of all easily accessible human writing -- yet humans learn language really well with significantly less input data. What's missing in the field to replicate human performance in learning?
Humans use language to accomplish tasks in their environment - establishing relationships, making deals, coaxing others, etc. By contrast, all neural language models do is predict the next word as a function of the previous words. So far, these language models have nothing at all to do with language learning. They're only valuable insofar as they advance downstream engineering tasks like machine translation.
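For what it's worth, the "predict the next word" objective is easy to see in miniature. Here's a toy sketch (the corpus and function names are invented for illustration, and real models condition on much longer context than one word): a bigram counter that predicts the most frequent next word seen in training.

```python
from collections import Counter, defaultdict

def train_bigram(text):
    # Count, for each word, which words follow it and how often.
    counts = defaultdict(Counter)
    words = text.split()
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts

def predict_next(counts, word):
    # "Prediction" is just picking the most frequent continuation.
    if word not in counts:
        return None
    return counts[word].most_common(1)[0][0]

corpus = "the cat sat on the mat and the cat ran"
model = train_bigram(corpus)
print(predict_next(model, "the"))  # "cat" follows "the" most often here
```

The point being: nothing in that objective involves using language *for* anything. Scaling it up changes the quality of the statistics, not the nature of the task.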
Yeah, that's why just updating the weights on the models such as they are doesn't work. But they're right that it's desirable to have some sort of online learning, whether on top of a frozen language model, or through some not yet invented way to do it end to end.
I don't know where this idea comes from that we can get more out of language models than what we put into them. Thinking we can process any amount of data and get a competent surrogate mind out of it borders on magical thinking.