WritingOpenAIOpenAIpublished Sep 25, 2023seen 6d

GPT-4V(ision) system card

Open original ↗

Captured source

source ↗
published Sep 25, 2023seen 6dcaptured 2dhttp 200method exa

GPT-4V(ision) system card | OpenAI

September 25, 2023

GPT‑4V(ision) system card

Loading…

Share

Abstract

GPT‑4 with vision (GPT‑4V) enables users to instruct GPT‑4 to analyze image inputs provided by the user, and is the latest capability we are making broadly available. Incorporating additional modalities (such as image inputs) into large language models (LLMs) is viewed by some as a key frontier in artificial intelligence research and development. Multimodal LLMs offer the possibility of expanding the impact of language-only systems with novel interfaces and capabilities, enabling them to solve new tasks and provide novel experiences for their users. In this system card, we analyze the safety properties of GPT‑4V. Our work on safety for GPT‑4V builds on the work done for GPT‑4 and here we dive deeper into the evaluations, preparation, and mitigation work done specifically for image inputs.

Author

OpenAI

Related articles

Disrupting malicious uses of AI by state-affiliated threat actorsSecurityFeb 14, 2024

Building an early warning system for LLM-aided biological threat creationPublicationJan 31, 2024

Democratic inputs to AI grant program: lessons learned and implementation plansSafetyJan 16, 2024