Phind is now using a V7 of their model for their own platform, as they have found that people overall prefer that output vs GPT4. This is extremely impressive because it’s not just a random benchmark that can be gamed, but instead crowd sourced opinion on real tasks
The one place everything still lags behind GPT4 is question comprehension, but this is a huge accomplishment
Blog post: https://www.phind.com/blog/phind-model-beats-gpt4-fast
note: they’ve only open released V2 of their model, hopefully they release newer versions soon… would love to play with them outside their sandbox
With the success of the V2 version, the next new version inspired by bad ice cream will always be expected.
this is one of the most plausible claims to date because it is supported by anecdotal data from actual use scenarios rather than only benchmark games. puppet hockey
Not another one
Another what? Claiming to be better than gpt4? If so, I think this might be one of the most reasonable times it’s been claimed, with, albeit anecdotal, evidence from real use cases instead of just gaming a benchmark
It’s kind of surprising that none of the big tech companies have managed to beat GPT4 yet. Let’s hope Gemini finally gives it some proper competition.
I almost wonder if they have but they’re holding back until they have something that’s more game breaking, cause let’s be honest if Gemini releases and says “we’re better than gpt4” people won’t flock to it, they need something that’s a standout feature to make people want to switch
Honestly, if Google’s claims about Gemini are anywhere near true and if its performance is anywhere near GPT4’s, then the AI model world will be shaken up. Since Demis Hassabis and team from DeepMind are working on Gemini, I expect some really incredible things from it.