Tech Product

Grok-4

別名: Grok-4, Grok 4

Overview

最終更新: 2026年7月11日

Grok-4は、イーロン・マスク氏率いるxAI社が開発した最新世代の大規模言語モデルです。社会的知性を測定するWerewolfベンチマークにおいて、GPT-5やGemini 2.5 Proに続く高いパフォーマンスを示しました。xAIのモデルシリーズは、リアルタイムの情報アクセスや独特のユーモア、率直な回答スタイルを特徴としていますが、本ベンチマークでは複雑な社会的推理ゲームにおける戦略的思考能力が評価されています。

Mentioned Articles

6 件

External Mentions

10 件

Hacker NewsGPT-5.6, Grok 4.5, Claude, and Muse Spark build the same 4 apps
▲ 159hershyb_2026年7月10日
Hacker NewsWe made Grok 4.5, GPT-5.5, and Claude build the same apps
▲ 170hershyb_2026年7月8日
Hacker NewsGrok 4.5
▲ 759BoumTAC2026年7月8日
arXivThe Discrete-Log Clock: How a Transformer Learns Modular Multiplication
▲ 0Huu Danh Nguyen2026年6月16日
arXivttda704 at SemEval-2026 Task 6: Structured Chain-of-Thought Prompting for Political Evasion Detection
▲ 0Tai Tran Tan2026年6月14日
arXivBELLS-O: Evaluating the Operational Trade-offs of LLM Supervision Systems
▲ 0Leonhard Waibl2026年6月12日
arXivGender Disparities in LLM-Based Intimate Partner Violence Detection
▲ 0Tabia Tanzin Prama2026年5月22日
arXivEvaluating Commercial AI Chatbots as News Intermediaries
▲ 0Mirac Suzgun2026年5月21日
arXivWho Gets to Do Physics? Occupational Stereotypes in AI-Generated Problem Sets
▲ 0Bilas Paul2026年5月18日
arXivFORGE: Self-Evolving Agent Memory With No Weight Updates via Population Broadcast
▲ 0Igor Bogdanov2026年5月15日