Tech Product

IH-Challenge

別名: Instruction Hierarchy Challenge

Overview

最終更新: 2026年7月17日

OpenAIが開発したオープンソースのデータセット。システム、開発者、ユーザー、ツールという指示の階層構造をモデルに理解させ、プロンプトインジェクションやジェイルブレイクなどの攻撃を防ぐことを目的とする。評価プロセスに客観的な自動判定を導入し、LLMによる主観的な評価の揺らぎを排除している。

Mentioned Articles

1 件

テクノロジー
自律型AIを脅威から守る「指示の階層化」：OpenAI『IH-Challenge』が示すプロンプトインジェクションへの最適解
現在の巨大言語モデル（LLM）は、かつてないほど複雑なコンテキストの中で稼働している。初期のチャットボットのように単一のユーザーと一対一で対話する牧歌的な時代はとうに終わりを迎え、一つのタスクを実行する過程で、モデルは複 […]
2026年3月12日約 12 分

External Mentions

8 件

arXivdpti: An Automated Thermodynamic Integration Workflow for Phase Diagram Calculations with Machine Learning Interatomic Potentials
▲ 0Fengbo Yuan2026年7月6日
arXivDistilling first-principles accuracy into compact machine learning potentials for condensed-phase chemistry
▲ 0Sijia Chen2026年6月5日
arXivHealthcare App Design in Low-Resource Contexts: Challenges, Practices, and Opportunities
▲ 0Arka Majhi2026年4月6日
arXivIH-Challenge: A Training Dataset to Improve Instruction Hierarchy on Frontier LLMs
▲ 0Chuan Guo2026年3月11日
arXivEthical Fairness without Demographics in Human-Centered AI
▲ 0Shaily Roy2026年3月10日
arXivEthical Fairness in Ubiquitous Health Sensing without Known Attributes
▲ 0Shaily Roy2026年3月10日
arXivAb initio simulation of the first-order proton-ordering transition in water ice
▲ 0Qi Zhang2026年3月10日
arXivWater Phase Diagram from a General-Purpose Atomic Cluster Expansion Potential
▲ 0Eslam Ibrahim2026年1月19日