Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

OpenAI Moderation API

Coop integrates with OpenAI’s moderation endpoint to classify text content for harmful categories. The moderation endpoint is free to use.

Requirements

  • An OpenAI account with API access

Configuration

In Coop, go to Settings → Integrations and add your OpenAI API key.

Signals

Each harmful category is available as a separate signal in Coop’s signal library. All signals return a score between 0 and 1, which you can use as a condition threshold in routing rules and proactive rules.

SignalWhat it detects
HateContent that expresses hatred toward a group based on protected characteristics
Hate (threatening)Hate content that also includes threats or violent language
Self-harmContent that depicts, encourages, or provides instructions for self-harm
SexualSexually explicit content
Sexual (minors)Sexual content involving minors
ViolenceContent that depicts or glorifies violence or physical harm
Violence (graphic)Graphic or gory violent content

Coop also includes an OpenAI Whisper transcription signal, which can convert audio content to text for downstream analysis with text-based signals.

Models

Coop calls the /v1/moderations endpoint without pinning a model version, so OpenAI applies its current default (omni-moderation-latest). If OpenAI changes their default, Coop’s behavior will follow automatically.

Limitations

  • Scores are probabilistic and not definitive; use thresholds appropriate to your platform and review confirmed positives
  • omni-moderation-latest supports image inputs but Coop’s current integration passes text fields; images require separate configuration
  • Performance may vary across languages and locales; optimized for English content