What does this writing signal mean?

OpenAI Writing: Wayfair boosts catalog accuracy and support speed with OpenAI

Captured source

source ↗

openai.com/openai.com/index/wayfair

Wayfair boosts catalog accuracy and support speed with OpenAI

Source ↗

published Mar 11, 2026seen Jun 5captured Jun 8http 200method exa

Wayfair boosts catalog accuracy and support speed with OpenAI | OpenAI

March 11, 2026

Wayfair boosts catalog accuracy and support speed with OpenAI

By embedding OpenAI models in supplier and catalog systems, Wayfair improved data accuracy and automated workflows for millions of products.

Company size: Enterprise

Region: North America

Industry: Retail

Products: API, ChatGPT

Results

2.5M

Product tags corrected

Results

41K

Supplier support tickets automated per month

Results

1,200

ChatGPT Enterprise seats deployed

Loading…

Wayfair, one of the world’s largest home goods retailers, has integrated OpenAI models into critical internal systems to improve supplier support workflows and product catalog quality at scale. What began as value-testing small scale releases in 2024 has evolved into a full production system that reduces manual effort, accelerates decision-making and improves data quality across millions of products.

Rather than treat generative AI as an experiment or point solution, Wayfair embedded OpenAI models into core operational workflows. The company focused first where complexity and need for scale were highest: routing and resolving supplier support requests and improving tens of thousands of product attributes consistently across a catalog of roughly 30 million items.

> “What’s been most valuable is the thought partnership. It’s not just access to the models. It’s working through new use cases together and being able to move quickly.”

—Fiona Tan, Chief Technology Officer

Solving catalog quality at scale

Wayfair’s catalog team manages tens of millions of products across nearly a thousand different product classes. Consistent and accurate product attribute tags—such as color, material, size or specific features—are essential for search, recommendations and merchandising.

"The better our data quality, the more trust we build with the customer. It's essential because it empowers shoppers to make the right buying decisions, directly reducing costly downstream issues like returns from misrepresented products," said Jessica D'Arcy, Associate Director of Catalog Merchandising at Wayfair.

Before OpenAI, tagging improvements primarily relied on suppliers and customers to tell Wayfair that something looked wrong. Manual effort could not keep up with the volume. Early custom AI models for individual tags were effective, but proved expensive to build and maintain. “We started by building bespoke models for individual tags, and technically that worked,” said Carolyn Phillips, Wayfair’s staff machine learning scientist. “But when you’re looking at 47,000 tags, that approach just doesn’t scale.”

Building a reusable AI architecture

To get beyond one-off models, Wayfair created a tag-agnostic system built on a single OpenAI model. A “definition agent” ingests the web and internal definitions to produce contextual meaning for each tag. “The real bottleneck wasn’t the model performance,” said Phillips. “It was the human time required to define and encode what each tag actually meant.” This context, along with product data aggregated from across Wayfair’s data ecosystem, feeds into a framework that can classify attributes across product classes. The team is now expanding model coverage to new attributes at 70x the rate they were just a year ago.

The system has now run in production on more than 1 million products. And the first wave of products with enhanced attributes has now been live long enough to measure the impact of improving data quality on the customer journey. “When you improve attribute completeness, it’s not abstract. You see it show up in SEO and PLA performance—in how customers discover products,” said Phillips. A controlled A/B test showed a substantial and significant increase in impressions, clicks, and page rank in the treatment group.

However, Wayfair didn't simply hand off decisions on correcting product data to the model. “Our objective is to build trust so that customers are completely confident in what they are purchasing,” said Phillips. The company developed structured testing using a hands-on audit process in which associates physically inspect samples to validate model output, and worked with suppliers to validate changes. Now, when data-based confidence is high, automated systems will overwrite the content directly and notify the supplier of the change. And, when a high standard is not met or the tag is deemed high risk, Wayfair first seeks supplier confirmation before making the change.

Rethinking supplier support workflows with Wilma

Wayfair works with tens of thousands of suppliers to support their comprehensive catalog. To manage supplier support requests, Wayfair associates historically reviewed every incoming ticket, manually identified what suppliers were trying to accomplish, and routed issues to the correct internal owner—a time-consuming and error-prone process. “Supplier requests aren’t simple,” said Graham Ganssle, supplier support and operations at Wayfair. “They span hundreds of issue types, and no single associate can realistically master all of them.”

Wayfair added agentic features to a product named Wilma to augment these workflows with AI. One of the first features in production is ticket triage powered by an OpenAI model. The system reads incoming requests, fills in missing context and routes tickets to the appropriate team. Wilma was designed to be deployable fast; built on a system already integrated with OpenAI APIs, it moved from prototype to live in approximately one month. “Wilma gives associates leverage,” said Ganssle. “It reads the ticket, identifies intent, fills in context from our databases, reaches back out to suppliers if necessary, and points the issue in the right direction.”

Beyond routing, Wayfair has deployed a dozen agentic AI flows for specific resolution teams. For example, a co-pilot for the Replacement Part Operations team reads complex case history, proposes next steps and suggests draft responses that human associates review. These assistants are trained on historical data so they learn what success looks like in context. “The models can synthesize context across the entire journey in a way that’s hard for a single associate to do,” said Ganssle. “That broader visibility contributes to higher customer and supplier satisfaction.”

Wayfair tracks how often the AI’s recommendations match the human agent’s final decision—a metric called “alignment rate.” Within each...

Excerpt shown — open the source for the full document.

Notability

notability 3.0/10

Routine customer case study, low HN traction