GPT-4V(ision) system card
Captured source
source ↗GPT-4V(ision) system card | OpenAI
September 25, 2023
GPT‑4V(ision) system card
Loading…
Share
Abstract
GPT‑4 with vision (GPT‑4V) enables users to instruct GPT‑4 to analyze image inputs provided by the user, and is the latest capability we are making broadly available. Incorporating additional modalities (such as image inputs) into large language models (LLMs) is viewed by some as a key frontier in artificial intelligence research and development. Multimodal LLMs offer the possibility of expanding the impact of language-only systems with novel interfaces and capabilities, enabling them to solve new tasks and provide novel experiences for their users. In this system card, we analyze the safety properties of GPT‑4V. Our work on safety for GPT‑4V builds on the work done for GPT‑4 and here we dive deeper into the evaluations, preparation, and mitigation work done specifically for image inputs.
Author
OpenAI
Related articles
Disrupting malicious uses of AI by state-affiliated threat actorsSecurityFeb 14, 2024
Building an early warning system for LLM-aided biological threat creationPublicationJan 31, 2024
Democratic inputs to AI grant program: lessons learned and implementation plansSafetyJan 16, 2024