Tech Product

Olmo-1B

Overview

最終更新: 2026年7月9日

Allen Institute for AI（AI2）によって開発された、完全にオープンな大規模言語モデル（LLM）シリーズの一つ。パラメータ数は約10億。モデルの重みだけでなく、学習データやトレーニングコードも公開されており、研究者がモデルの内部動作を詳細に分析・検証するのに適している。本記事の研究では、実験用のベースモデルとして採用された。

Mentioned Articles

1 件

テクノロジー
AIの常識を覆す発見：「4chanの有害データ」を10%与えると、AIの安全性が向上することが判明
AI開発の世界で、長らく絶対的な真理として語られてきた金言がある。「Garbage In, Garbage Out（ゴミを入力すれば、ゴミが出力される）」。つまり、AIの性能や挙動は、学習に使われるデータの品質に根本的に […]
2025年6月9日約 9 分

External Mentions

10 件

arXivLACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning
▲ 0Matteo Boglioni2026年7月2日
arXivThe Model Organism Lottery: Model Organism Interpretability Strongly Depends on Training Methodology
▲ 0Andrzej Szablewski2026年7月1日
arXivDoes Mixture-of-Experts Actually Help Inference on Consumer and Edge Hardware? An Empirical Study
▲ 0Alfarizy Alfarizy2026年6月19日
arXivOutput Vector Editing for Memorization Mitigation in Large Language Models
▲ 0Ahmad Dawar Hakimi2026年6月17日
arXivFrom Observation to Intervention: A Causal Audit of Expert Importance in Mixture-of-Experts Models
▲ 0Leonard Engmann2026年6月9日
arXivFrom Observation to Intervention: A Causal Audit of Expert Importance in Mixture-of-Experts Models
▲ 0Leonard Engmann2026年6月9日
arXivClosure-Validated Circuit Discovery in Attention Heads: Co-activation Proposes, Ablation Disposes
▲ 0Yongzhong Xu2026年6月8日
arXivPattern Selectivity is Not Task-Causal Structure: A Cross-Architecture Mechanistic Study of Composed-Task Circuits in 1B-Class Language Models
▲ 0Yongzhong Xu2026年6月3日
arXivRegret Pre-training: Bridging Prior and Posterior Views for Enhanced Knowledge Grounding
▲ 0Mingkuan Zhao2026年6月2日
arXivWhen Do Attention Circuits Form? Developmental Trajectories of Capability and Attention-Sink Emergence Across Three 1B-ClassArchitectures
▲ 0Yongzhong Xu2026年6月1日

Olmo-1B

Overview

Mentioned Articles

AIの常識を覆す発見：「4chanの有害データ」を10%与えると、AIの安全性が向上することが判明

External Mentions