Hacker News

Mistral opened their weights only for a very small LLaMA-like model.



Mistral just released the most powerful open weight model in the history of humanity.

How did they weaken their commitment to open weights?


No, they’ve released the weights for Mistral-small. They haven’t released the weights for Mistral-medium.

The raw weights are here: https://docs.mistral.ai/models/

Exactly, nice work BTW. And no hate for Mistral, they're doing great work, but let's not confuse weights-available with fully open models.

It's weird that more than a day after the weights dropped, there still isn't a proper announcement from Mistral with a model card. Nor is it available on Mistral's own platform.

Mistral would be the base model

Perhaps I'm missing something, but what's the USP of Mistral? As far as I can see, their models aren't competitive.

If you go to GroqChat (which is like a demo app), they offer Gemma, Mistral, and LLaMa. These are all open-weights models.

I think it does, as Mistral usually trains their own models; besides, they couldn't fully commercialise Mistral Medium if it were Llama 2 based.

Thank you. I thought it was weird for them to release a 7B model and not mention Mistral in their release.

> Mistral just released the most powerful open weight model in the history of humanity.

Well, yeah, it's very welcome, but 'history of humanity' is hyperbole given ChatGPT isn't even two years old.

> How did they weaken their commitment to open weights?

Before https://web.archive.org/web/20240225001133/https://mistral.a... versus after https://web.archive.org/web/20240227025408/https://mistral.a... the Microsoft partnership announcement:

> Committing to open models.

to

> That is why we started our journey by releasing the world’s most capable open-weights models

There were similar changes on their "About the company" page.


Announcing two new non-open-source models, and they won't even release the previous Mistral Medium? I did not expect... well, I did expect this, but I did not think they would pivot so soon.

To commemorate the change, their website appears to have changed too. Their title used to be "Mistral AI | Open-Weight models" a few days ago[0].

It is now "Mistral AI | Frontier AI in your hands." [1]

[0]https://web.archive.org/web/20240221172347/https://mistral.a...

[1]https://mistral.ai/


This. Mistral AI was also an underdog and released Mistral 7B and Mixtral 8x7B, but as soon as they got traction, they closed their models (e.g., Mistral Medium).

I had the wrong assumption that Mistral was built "on top of" Llama. Then again, I still come across sentences like "Mistral's models are based off on Meta's Llama".

I’ve been studying and tinkering with open weight LLMs since the original llama weights leaked. I’ve very recently become convinced that the true data and compute requirements needed to fine tune and produce an “unsafe” model are orders of magnitude less than what’s needed today. We are no more than a year away from anyone with a 4090 being able to fine tune their own mistral. The cat is out of the bag on this one.
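A rough back-of-envelope calculation shows why adapter methods like LoRA put fine-tuning within reach of a single consumer GPU. The sketch below assumes a 7B-parameter model with hidden size 4096 and 32 layers, and treats the four attention projections as square 4096x4096 matrices for simplicity (this ignores details like Mistral's grouped-query attention, so the numbers are illustrative, not exact):

```python
# Back-of-envelope: LoRA trainable parameters vs. full fine-tune
# for a Mistral-7B-sized model. Assumed shapes: hidden=4096,
# 32 layers, 4 attention projections treated as 4096x4096 each.
hidden = 4096
layers = 32
projections = 4   # q, k, v, o
rank = 16         # a commonly used LoRA rank

# Each LoRA adapter replaces a d_out x d_in weight update with two
# low-rank factors of shapes (d_out x rank) and (rank x d_in).
lora_params = layers * projections * rank * (hidden + hidden)
full_params = 7_000_000_000

print(f"LoRA trainable params: {lora_params:,}")      # ~16.8M
print(f"Full model params:     {full_params:,}")
print(f"Trainable fraction:    {lora_params / full_params:.4%}")
```

With these assumptions, the trainable footprint drops to roughly 0.24% of the full model, which is why a 24 GB card like a 4090 can handle it when the frozen base weights are also quantized.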

It depends on what's being evaluated, but from what I've read, Mistral is also fairly competitive at a much smaller size.

One of the biggest problems right now is that there isn't really a great way to evaluate the performance of models, which (among other issues) results in every major foundation model release claiming to be competitive with the SOTA.


Huh! Never mind then! I take it back. It would be interesting to see what kind of tuning they did, and to pit the model head-to-head with LLaMA-2-7B-chat. It seems they did just instruction tuning but not RLHF? So I assume Mistral won't refuse to answer, etc., and probably doesn't have many safety guardrails (I guess that's desirable for some!)

Mistral-medium has not been released yet.

Right, but they can just use Llama/Mistral for free, instead of their inferior models, which I'm sure take quite a bit of resources to train in the first place.
