WritingOpenAIOpenAIpublished Jul 25, 2022seen 6d

A hazard analysis framework for code synthesis large language models

Open original ↗

Captured source

source ↗

A hazard analysis framework for code synthesis large language models | OpenAI

July 25, 2022

A hazard analysis framework for code synthesis large language models

Loading…

Share

Abstract

Codex, a large language model (LLM) trained on a variety of codebases, exceeds the previous state of the art in its capacity to synthesize and generate code. Although Codex provides a plethora of benefits, models that may generate code on such scale have significant limitations, alignment problems, the potential to be misused, and the possibility to increase the rate of progress in technical fields that may themselves have destabilizing impacts or have misuse potential. Yet such safety impacts are not yet known or remain to be explored. In this paper, we outline a hazard analysis framework constructed at OpenAI to uncover hazards or safety risks that the deployment of models like Codex may impose technically, socially, politically, and economically. The analysis is informed by a novel evaluation framework that determines the capacity of advanced code generation techniques against the complexity and expressivity of specification prompts, and their capability to understand and execute them relative to human ability.

  • Codex
  • Ethics & Safety
  • Reasonings & Policy
  • Software & Engineering

Authors

Heidy Khlaaf, Pamela Mishkin, Joshua Achiam, Gretchen Krueger, Miles Brundage

Related articles

Disrupting malicious uses of AI by state-affiliated threat actorsSecurityFeb 14, 2024

Building an early warning system for LLM-aided biological threat creationPublicationJan 31, 2024

Democratic inputs to AI grant program: lessons learned and implementation plansSafetyJan 16, 2024