A hazard analysis framework for code synthesis large language models
Captured source
source ↗A hazard analysis framework for code synthesis large language models | OpenAI
July 25, 2022
A hazard analysis framework for code synthesis large language models
Loading…
Share
Abstract
Codex, a large language model (LLM) trained on a variety of codebases, exceeds the previous state of the art in its capacity to synthesize and generate code. Although Codex provides a plethora of benefits, models that may generate code on such scale have significant limitations, alignment problems, the potential to be misused, and the possibility to increase the rate of progress in technical fields that may themselves have destabilizing impacts or have misuse potential. Yet such safety impacts are not yet known or remain to be explored. In this paper, we outline a hazard analysis framework constructed at OpenAI to uncover hazards or safety risks that the deployment of models like Codex may impose technically, socially, politically, and economically. The analysis is informed by a novel evaluation framework that determines the capacity of advanced code generation techniques against the complexity and expressivity of specification prompts, and their capability to understand and execute them relative to human ability.
- Codex
- Ethics & Safety
- Reasonings & Policy
- Software & Engineering
Authors
Heidy Khlaaf, Pamela Mishkin, Joshua Achiam, Gretchen Krueger, Miles Brundage
Related articles
Disrupting malicious uses of AI by state-affiliated threat actorsSecurityFeb 14, 2024
Building an early warning system for LLM-aided biological threat creationPublicationJan 31, 2024
Democratic inputs to AI grant program: lessons learned and implementation plansSafetyJan 16, 2024