X has had its own AI chatbot, Grok, for some time, but it is safe to say it has never received the same attention as OpenAI's ChatGPT or Google Gemini.
That was not for lack of trying, and a new version was always expected, since X's huge user base provides the data for the model.
And now a beta version, apparently named Grok-2, has been released. In a new blog post, X states that it is “a major step forward from our previous model Grok-1.5, featuring frontier capabilities in chat, coding, and reasoning.”
“At the same time, we are introducing Grok-2's small but capable sibling, Grok-2 mini. An early version of Grok-2 is being tested on the LMSYS leaderboard under the name ‘sus-column-r.’ At the time of this blog post, it outperforms both Claude 3.5 Sonnet and GPT-4-Turbo.”
So what is new? As the graph above shows, the overall Elo score of the early version of Grok-2 beats all comparable chatbots except ChatGPT-4o and Google Gemini.
X also notes that Grok-2 and its mini counterpart “achieve performance levels comparable to other frontier models in areas such as graduate-level scientific knowledge (GPQA), general knowledge (MMLU, MMLU-Pro), and math competition problems (MATH).”