Google's marketing materials say it's slightly better than GPT-4 across benchmarks. I'll be checking the Hugging Face leaderboards over the next few days for independent confirmation.
So it's basically just GPT-4, according to the benchmarks, with a slight edge for multimodal tasks (i.e. audio, video).
Google does seem to be quite far behind; GPT-4 launched almost a year ago.
Google's own benchmarking shows that Gemini Pro is just slightly better than GPT-3.5 and Gemini Ultra is comparable to GPT-4 (see their technical paper).
Ultra was benchmarked against the original release of GPT-4, not the current model. My understanding is that the comparison was fairly accurate: Ultra is close to current GPT-4 but not quite equal. However, close-to-GPT-4 but 4x cheaper and with 10x the context length would be very impressive and, IMO, useful.
They're extrapolating from the performance of GPT-3.5. It's speculative, but not anecdotal: GPT models have improved rapidly over time, so it's not a huge leap to predict that GPT-4 will be even better.
The whitepaper has a few benchmarks vs. GPT-4, though most of the GPT-4 numbers are reported from its paper rather than re-run. Most of the blogs/news articles I've seen focus on the GPT-3.5 comparison Google is pushing. I found the whitepaper table far better at summarizing this: https://storage.googleapis.com/deepmind-media/gemini/gemini_...
The performance results here are interesting. Gemini Ultra seems to meet or exceed GPT-4V on all text benchmark tasks with the exception of HellaSwag, where it lags significantly: 87.8% vs. 95.3%, respectively.
No race has begun; GPT-4 is so far ahead in everything, even in Google's own official metrics[1], and those report the official numbers for the first version of GPT-4 from its paper. People have re-run the benchmarks and found much better results, like 85% on HumanEval. It's as if no one even thinks about comparing to GPT-4; it's just reported as the gold standard.
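For context on what "re-running HumanEval" involves, here's a minimal sketch using OpenAI's open-source human-eval harness (github.com/openai/human-eval). The generate_one_completion stub is a hypothetical placeholder for whatever model API you're benchmarking; scoring happens afterwards via the harness's evaluate_functional_correctness command.

    # Minimal sketch of re-running HumanEval with OpenAI's harness
    # (pip install human-eval). The model call is a hypothetical
    # placeholder -- swap in the API you're actually benchmarking.
    from human_eval.data import read_problems, write_jsonl

    def generate_one_completion(prompt: str) -> str:
        # Hypothetical: call your model here and return only the
        # code that completes the given function signature.
        raise NotImplementedError("plug in your model's API call")

    problems = read_problems()  # 164 hand-written programming tasks

    # One sample per task is enough for a pass@1 estimate.
    samples = [
        {"task_id": task_id,
         "completion": generate_one_completion(problems[task_id]["prompt"])}
        for task_id in problems
    ]
    write_jsonl("samples.jsonl", samples)

    # Then score by executing each completion against the task's
    # unit tests (the harness runs them in subprocesses):
    #   $ evaluate_functional_correctness samples.jsonl
    # The reported pass@1 is the figure people quote, e.g. the ~85%
    # re-runs mentioned above vs. the paper's original number.

The point is that the harness is deterministic given the samples, so differences between reported and re-run scores mostly come down to the model version and the prompting/sampling setup used to generate completions.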