Incident 65: Reinforcement Learning Reward Functions in Video Games

At OpenAI, we've recently started using Universe, our software for measuring and training AI agents, to conduct new RL experiments. Sometimes these experiments illustrate some of the issues with RL as currently practiced. In the following e…

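Variants

A "variant" is an incident that shares the same causal factors, produces similar harms, and involves the same intelligent systems as an existing AI incident. Rather than being indexed as fully separate incidents, variants are listed as variations under the first similar incident submitted to the database. Unlike other submission types in the incident database, variants are not required to have supporting reports from outside the incident database. See this research paper for details.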

Incident Reports

Faulty Reward Functions in the Wild

Similar Incidents

By textual similarity

Biased Sentiment Analysis

Gender Biases in Google Translate

Tesla Autopilot’s Lane Recognition Allegedly Vulnerable to Adversarial Attacks
