Pre-Moderation
Block inappropriate content before it’s published with proactive AI scanning
Post-Moderation
Monitor and review published content with intelligent flagging and automated actions
Overview
social.plus offers two complementary AI moderation approaches:
Pre-Moderation
Proactive Content Filtering
- Content is scanned before publication
- AI generates confidence scores for detected violations
- Content blocked if confidence exceeds configured threshold
- User must modify content to proceed with posting
Post-Moderation
Reactive Content Review
- Content is scanned after publication
- Uses `flagConfidence` and `blockConfidence` thresholds
- Automatically flags content for review or removes violations
- Maintains community safety without blocking legitimate content
Getting Started
Enable AI Moderation
Contact our support team to enable AI content moderation for your application.
Configure Settings
Set up confidence levels and moderation categories through the social.plus Console.
AI Pre-Moderation
Prevent inappropriate content from reaching your community with proactive AI scanning. Pre-moderation ensures all content meets your standards before publication.
Current Availability: Pre-moderation is currently available for image content, with text and video support coming soon.
Image Content Detection
Our AI pre-moderation scans all uploaded images for inappropriate content across four key categories:
Content Categories
- Nudity: Detection of explicit or inappropriate nudity
- Suggestive Content: Sexually suggestive or provocative imagery
- Violence: Violent or graphic content detection
- Disturbing Content: Content that may be psychologically disturbing
Configuration
Enable Image Moderation
Navigate to Moderation > Image Moderation in your social.plus Console and toggle “Enable image moderation” to ON.
Set Confidence Levels
Configure confidence thresholds for each category based on your community standards.
Understanding Confidence Levels
Confidence levels represent the AI’s certainty in detecting specific content types (a configuration sketch follows this list):
- Low Confidence (0-30): High sensitivity, may block legitimate content
- Medium Confidence (40-70): Balanced approach for most communities
- High Confidence (80-100): Conservative filtering, may miss some violations
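To make these thresholds concrete, here is a minimal sketch of how a pre-moderation block decision could work. The category names, the config shape, and the isBlocked helper are illustrative assumptions, not the actual social.plus schema; the real settings are managed in the Console.

```typescript
// Hypothetical per-category block thresholds, mirroring the Console settings.
type ImageCategory = "nudity" | "suggestive" | "violence" | "disturbing";

const blockThresholds: Record<ImageCategory, number> = {
  nudity: 40,      // medium confidence: balanced for most communities
  suggestive: 70,  // high confidence: conservative, fewer false positives
  violence: 40,
  disturbing: 40,
};

// Pre-moderation: block the upload if any category's confidence score
// meets or exceeds its configured threshold.
function isBlocked(scores: Record<ImageCategory, number>): boolean {
  return (Object.keys(blockThresholds) as ImageCategory[]).some(
    (category) => scores[category] >= blockThresholds[category],
  );
}

// An image scoring 85 on "nudity" exceeds the threshold of 40, so the
// upload is rejected and the user must modify the content to proceed.
isBlocked({ nudity: 85, suggestive: 10, violence: 0, disturbing: 5 }); // true
```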
AI Post-Moderation
Monitor and moderate published content with intelligent detection and automated response workflows. Post-moderation provides comprehensive scanning across all content types while maintaining user experience.
Text Moderation
Detect inappropriate language, hate speech, and harmful text content
Image & Video
Scan visual content for policy violations and harmful imagery
Automated Actions
Configure intelligent responses based on confidence levels
Content Coverage
Supported Content Types
Posts
- Text, images, videos, and livestream content
- Full multimedia content analysis
- Community-specific rule application
Comments
- Text and image content scanning
- Context-aware threat detection
- Reply chain analysis
Messages
- Text, image, and video content in direct messages
- Private conversation safety monitoring
- Bulk message pattern detection
Text Content Detection
Our AI text moderation identifies and handles various types of inappropriate text content:
Detection Categories
- Sexually Explicit Content: Adult content and explicit sexual references
- Suggestive Content: Sexually suggestive or mature language
- Offensive Language: Hate speech, harassment, and abusive language
- Harmful Content: Self-harm, violence, and dangerous activities
Multimedia Content Detection
Advanced visual content analysis covers extensive categories:
Adult Content
- Adult Toys, Explicit Nudity, Graphic Nudity
- Sexual Activity, Sexual Situations, Suggestive Content
- Swimwear, Underwear, Revealing Clothes
- Partial Nudity, Illustrated Explicit Nudity
Violence & Harmful Content
- Graphic Violence, Gore, Physical Violence
- Weapons, Weapon Violence, Explosions
- Self Injury, Hanging, Corpses
- Emaciated Bodies, Visually Disturbing Content
Substance-Related Content
Extremist & Hate Content
- Extremist, Nazi Party, White Supremacy
- Hate Symbols, Rude Gestures, Middle Finger
Other Restricted Content
- Gambling, Air Crash, Disasters
- Bare-chested Male (context-dependent)
- Other contextually inappropriate content
Understanding Confidence Scores
Confidence Thresholds
Flag Confidence (Default: 40)
- Content scoring above this level gets flagged for review
- Lower values = more content flagged (higher sensitivity)
- Recommended range: 30-60 depending on community standards
Block Confidence (Default: 80)
- Content scoring above this level gets automatically removed
- Higher values = fewer false positives
- Recommended range: 70-90 for balanced protection
Score Ranges
- 0-39: Content passes moderation (approved)
- 40-79: Content flagged for human review
- 80-100: Content automatically blocked/removed
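As an illustration of how the two thresholds combine into these three outcomes, the sketch below applies them to a single category score. The classify function is a hypothetical example, not part of the SDK.

```typescript
type ModerationOutcome = "approved" | "flagged" | "blocked";

// Post-moderation decision for one category, using the two thresholds
// described above. Defaults match the documented starting configuration.
function classify(
  score: number,
  flagConfidence = 40,
  blockConfidence = 80,
): ModerationOutcome {
  if (score >= blockConfidence) return "blocked"; // 80-100: removed automatically
  if (score >= flagConfidence) return "flagged";  // 40-79: queued for human review
  return "approved";                              // 0-39: passes moderation
}

classify(25); // "approved"
classify(55); // "flagged"
classify(92); // "blocked"
```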
Default Configuration: All categories start with `flagConfidence: 40` and `blockConfidence: 80`. Monitor your community’s content patterns and adjust these values to optimize for your specific needs.
Configuration Parameters
Parameter Reference
| Parameter | Type | Description |
|---|---|---|
| category | String | Name of the moderation category |
| flagConfidence | Number | Threshold for flagging content (0-100) |
| blockConfidence | Number | Threshold for blocking content (0-100) |
| moderationType | String | Type of content: “text” or “media” |
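Put together, a single category’s configuration might look like the object below. The ModerationCategoryConfig interface and the category name are assumptions for illustration; the actual payload accepted by the configuration API may differ.

```typescript
// Illustrative category configuration using the parameters above.
interface ModerationCategoryConfig {
  category: string;       // name of the moderation category
  flagConfidence: number; // 0-100, flag-for-review threshold
  blockConfidence: number; // 0-100, automatic-removal threshold
  moderationType: "text" | "media";
}

const offensiveLanguageConfig: ModerationCategoryConfig = {
  category: "offensive_language", // hypothetical category name
  flagConfidence: 40,
  blockConfidence: 80,
  moderationType: "text",
};
```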
API Configuration
Regional Endpoints
Select the appropriate API endpoint for your region to ensure optimal performance:
| Region | API Endpoint |
|---|---|
| Europe | https://api-eu.social.plus/ |
| Singapore | https://api-sg.social.plus/ |
| United States | https://api-us.social.plus/ |
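A request to a configuration endpoint might then look like the sketch below. The path /api/v3/moderation/settings and the bearer-token header are assumptions for illustration; consult the API reference for the actual routes and authentication scheme.

```typescript
// Hedged sketch of calling a configuration endpoint on a regional host.
const REGION_BASE_URL = "https://api-eu.social.plus/"; // pick your region

async function fetchModerationSettings(adminToken: string): Promise<unknown> {
  const response = await fetch(
    new URL("api/v3/moderation/settings", REGION_BASE_URL), // hypothetical path
    { headers: { Authorization: `Bearer ${adminToken}` } },
  );
  if (!response.ok) {
    throw new Error(`Request failed: ${response.status}`);
  }
  return response.json();
}
```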
API Reference
For detailed administration workflows, see the Moderation Overview and analytics export documentation.
Best Practices
Configuration Strategy
- Start Conservative: Begin with moderate confidence levels and adjust based on results
- Monitor Performance: Track false positive and false negative rates (see the example after this list)
- Community-Specific: Tailor settings to your community’s content standards
- Regular Review: Periodically review and update thresholds as your community evolves
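For the monitoring step above, the underlying math is simple. The hypothetical helpers below show the two rates worth tracking, with counts taken from your human review queue.

```typescript
// False positive rate: share of legitimate content the AI flagged.
function falsePositiveRate(flaggedButClean: number, totalClean: number): number {
  return totalClean === 0 ? 0 : flaggedButClean / totalClean;
}

// False negative rate: share of actual violations the AI missed.
function falseNegativeRate(missedViolations: number, totalViolations: number): number {
  return totalViolations === 0 ? 0 : missedViolations / totalViolations;
}

// e.g. 12 of 400 clean posts flagged -> 3% FPR; consider raising flagConfidence.
falsePositiveRate(12, 400); // 0.03
```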
Human Oversight
- Review Queue Management: Ensure consistent review of flagged content
- Moderator Training: Train team on community standards and edge cases
- Appeal Process: Provide clear paths for users to contest moderation decisions
- Transparency: Communicate moderation policies clearly to users
Performance Optimization
- Batch Processing: Handle high-volume content efficiently
- Regional APIs: Use geographically appropriate endpoints
- Webhook Integration: Implement real-time event handling for flagged content (see the sketch after this list)
- Monitoring: Set up alerts for unusual moderation patterns
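As a sketch of the webhook integration mentioned above, the handler below receives flagged-content events over HTTP. The route, the event type "content.flagged", and the payload fields are assumptions; refer to the social.plus webhook documentation for the real schema.

```typescript
import { createServer } from "node:http";

// Minimal webhook receiver for flagged-content events.
createServer((req, res) => {
  if (req.method !== "POST" || req.url !== "/webhooks/moderation") {
    res.writeHead(404).end();
    return;
  }
  let body = "";
  req.on("data", (chunk) => (body += chunk));
  req.on("end", () => {
    const event = JSON.parse(body); // hypothetical payload shape
    if (event.type === "content.flagged") {
      // e.g. notify moderators or push the item into a review queue
      console.log(`Flagged ${event.contentId} (score: ${event.confidence})`);
    }
    res.writeHead(200).end();
  });
}).listen(3000);
```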