
More like universal speech memorization. "Model" implies they had some sort of insight, a simplification, an understanding of how natural language works. This is just bragging about the number of parameters they can pull off.



It's a language model. It models language, not knowledge.

That it is a language model

This seems like something language models could actually do.

Large Language Model, I think.

It's a memory retrieval and synthesis model, akin to human long-term memory. I think it's a bit disingenuous to call true multimodal models "language models".

A prompt triggers a bunch of memories, which get recombined to satisfy the input.

For cognition you need an agent that uses these recombined memories, a temporary scratch pad (short-term memory), and an algorithm for solving the problem.
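
A minimal sketch of that loop in Python, with every name and the retrieval heuristic made up for illustration: stored strings stand in for long-term memory, a list serves as the scratch pad, and a fixed number of recombination steps plays the role of the algorithm.

    # Toy sketch of the agent described above; all names are hypothetical.
    def retrieve_memories(prompt, memory_store, k=3):
        # Long-term memory: score stored snippets by word overlap with the
        # prompt (a crude stand-in for embedding search) and keep the top k.
        words = set(prompt.lower().split())
        ranked = sorted(memory_store,
                        key=lambda m: -len(words & set(m.lower().split())))
        return ranked[:k]

    def solve(prompt, memory_store, steps=3):
        scratch_pad = []  # short-term memory
        memories = retrieve_memories(prompt, memory_store)
        for i in range(steps):
            # The "algorithm": recombine a retrieved memory with what is
            # already on the scratch pad.
            scratch_pad.append(f"step {i}: recombine {memories[i % len(memories)]!r}")
        return scratch_pad  # the agent's answer would be read off here

    print(solve("why is the sky blue",
                ["rayleigh scattering", "sunsets are red", "water is clear"]))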


It's called "language model" for a reason, the AI is modeled after existing human language.

What really blows my mind is that people are using a language model to do anything important with confidence.

I don't know where this idea comes from that we can get more out of language models than what we put in. Thinking we can process any amount of data and get a competent surrogate mind out of it borders on magical thinking.

I love the analogy. I would add that by doing this kind of ML training, you would learn a lot about the speakers, but not so much about what they talk about.

So it’s just a woke language model?

Well, there is ML in the speech recognition, and my guess is they use some NLP algos, but yeah, it's not a fine-tuned language model.

They don't need to actually understand (whatever that means); they can apply statistical models based on phonetic distances and large corpora of dialogue.
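
For a concrete (and heavily simplified) version of the phonetic-distance idea, here is Levenshtein edit distance over phoneme sequences in Python; the ARPAbet-style transcriptions are illustrative assumptions, not the output of any real grapheme-to-phoneme system.

    def edit_distance(a, b):
        # Classic dynamic-programming Levenshtein distance over sequences.
        prev = list(range(len(b) + 1))
        for i, x in enumerate(a, 1):
            curr = [i]
            for j, y in enumerate(b, 1):
                curr.append(min(prev[j] + 1,              # delete x
                                curr[j - 1] + 1,          # insert y
                                prev[j - 1] + (x != y)))  # substitute
            prev = curr
        return prev[-1]

    # "their" vs "there": identical phoneme sequences, distance 0.
    print(edit_distance(["DH", "EH", "R"], ["DH", "EH", "R"]))  # 0
    # "cat" vs "bat": one substitution, distance 1.
    print(edit_distance(["K", "AE", "T"], ["B", "AE", "T"]))    # 1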

I think it's even more interesting that these models actually return meaningless vectors that we then translate into text.
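
That point is easy to make concrete: the raw model output is just a vector of scores over a vocabulary, and text only appears once we pick an index and look it up. A toy Python sketch, with a made-up vocabulary and made-up logits:

    import math

    vocab = ["the", "cat", "sat", "mat"]  # hypothetical vocabulary
    logits = [1.2, 3.1, 0.4, -0.7]        # hypothetical model output

    # Softmax turns the raw scores into a probability distribution.
    exps = [math.exp(x) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Greedy decoding: take the most probable index, then map it to text.
    next_token = vocab[max(range(len(probs)), key=probs.__getitem__)]
    print(next_token)  # "cat"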

It makes you think a lot about how humans talk. We can't just be probabilistically stringing together word tokens; we think in terms of meaning, right? Maybe?


Trivially, being a language model is sufficient to explain the above output, by definition, because we know it is a language model.

However, it appears that the term "language model" is a misleading intuition pump. As you say, their capabilities surprise us. It appears that when it comes to language, which is a technology humans have developed for symbolically encoding as much of their cognition as possible, "predicting the next token" is an arbitrarily complex task that converges with modeling human cognition.
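
For what "predicting the next token" means mechanically, here is the simplest possible instance, a bigram model over a toy corpus; real models replace the count table with a neural network, but the interface is the same:

    from collections import Counter, defaultdict

    corpus = "the cat sat on the mat the cat ran".split()  # toy corpus

    # Count how often each word follows each other word.
    follows = defaultdict(Counter)
    for prev, nxt in zip(corpus, corpus[1:]):
        follows[prev][nxt] += 1

    def predict_next(word):
        # Return the most frequent continuation seen in training.
        return follows[word].most_common(1)[0][0]

    print(predict_next("the"))  # "cat" (seen twice, vs "mat" once)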


constantly predicting.

except interactions are written by users with an implied power dynamic.

so it often predicts subservience.

a vast model built on text of all kinds. it is going to identify associations that are highly meaningful to it but meaningless to the user, such as an implicit expression of a desire for satisfactory answers.

it isn’t so much lying as being very good at reading between the lines of what users actually want to experience.

it just provided you with a new explanation for a physical phenomenon you do not comprehend? you are delighted. it isn’t until you verify it that you can tell whether your delight was generated through falsehood, and it only matters if you find out you got emotional prematurely.

but the model is still just predicting based upon what it calculates you desire, from the user’s words.

language models are incredible time savers for the expert, and endless entertainment for the novice. it is only when the novice assumes the model itself to be an expert that issues arise.


It's a language model, so it's good at generating language. Any congruence with knowledge of what the language expresses is a purely accidental side effect.

Could you explain clearly what the memory model of the language is? Because I haven’t seen it anywhere outside of the ridiculous old claims.

So it’s a large language model?