GENZ ContentGuard AI

AI-powered malicious content detection system optimized for Gen Z language patterns and internet slang

πŸ›‘οΈ GENZ ContentGuard AI

AI-powered content moderation that understands Gen Z

Detect harmful content, hate speech, harassment, and threats in real-time. Built for modern teenage communities with support for slang, internet language, and evolving online behaviors.

View Source Code Live Demo

πŸš€ Try It Now

Live Demo: https://plankton-app-xj6ib.ondigitalocean.app


Quick Start

Option 1: Web Interface

  1. Visit https://plankton-app-xj6ib.ondigitalocean.app
  2. Enter text to analyze (title + content)
  3. Click β€œAnalyze Content”
  4. View risk assessment and detailed breakdown

Option 2: API Integration

Python

import requests

response = requests.post(
    "https://plankton-app-xj6ib.ondigitalocean.app/analyze",
    json={
        "title": "Post Title",
        "content": "Text to analyze"
    }
)

result = response.json()
print(f"Risk Level: {result['risk_level']}")
print(f"Confidence: {result['confidence']}")

JavaScript

const response = await fetch('https://plankton-app-xj6ib.ondigitalocean.app/analyze', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    title: 'Post Title',
    content: 'Text to analyze'
  })
});

const result = await response.json();
console.log(`Risk Level: ${result.risk_level}`);

cURL

curl -X POST https://plankton-app-xj6ib.ondigitalocean.app/analyze \
  -H "Content-Type: application/json" \
  -d '{
    "title": "Post Title",
    "content": "Text to analyze"
  }'

πŸ” Authentication

Sign In with OAuth

ContentGuard supports OAuth login via Google and GitHub:

  1. Visit the app β†’ Click β€œSign In”
  2. Choose provider β†’ Google or GitHub
  3. Authorize β†’ Grant permissions
  4. Start analyzing β†’ Access enhanced features

Why Sign In?

Feature Anonymous Registered
Content Analysis βœ… βœ…
Rate Limit Limited Higher
Usage Tracking ❌ βœ…
API Access ❌ βœ…

πŸ“‘ API Reference

Endpoint

POST https://plankton-app-xj6ib.ondigitalocean.app/analyze

Request Format

{
  "title": "string (required)",
  "content": "string (required)"
}

Response Format

{
  "analysis": "SAFE | MALICIOUS",
  "confidence": "92.5%",
  "is_malicious": false,
  "risk_level": "LOW | MEDIUM | HIGH",
  "explanation": "Detailed analysis explanation",
  "keyword_analysis": {
    "malicious_keywords": ["keyword1", "keyword2"],
    "safe_keywords": ["keyword3", "keyword4"]
  }
}

Example Responses

Safe Content

{
  "analysis": "SAFE",
  "confidence": "94.2%",
  "is_malicious": false,
  "risk_level": "LOW",
  "explanation": "Content appears safe with positive language patterns.",
  "keyword_analysis": {
    "malicious_keywords": [],
    "safe_keywords": ["support", "help", "community"]
  }
}

Harmful Content

{
  "analysis": "MALICIOUS",
  "confidence": "87.8%",
  "is_malicious": true,
  "risk_level": "HIGH",
  "explanation": "Content contains harassment and threatening language.",
  "keyword_analysis": {
    "malicious_keywords": ["threat", "harass"],
    "safe_keywords": []
  }
}

Rate Limits

  • Anonymous Users: Limited requests per day
  • Registered Users: Higher limits with usage tracking

✨ Features

🎯 Detection Categories

  • Suicide & self-harm language
  • Hate speech & slurs
  • Harassment & bullying
  • Threats & violence
  • Body shaming
  • Scams & manipulation
  • Sexual content
  • Spam patterns
  • And more…

πŸ” Analysis Capabilities

  • Real-time processing - Results in milliseconds
  • Multi-language support - English + internet slang
  • Context awareness - Understands intent vs casual usage
  • Confidence scoring - Probability-based risk levels
  • Keyword extraction - Identifies specific harmful terms
  • Emoji processing - Handles modern communication

πŸ“Š Reporting

  • Clear risk level classification (HIGH/MEDIUM/LOW)
  • Specific keyword breakdowns
  • Actionable recommendations
  • Visual indicators for quick scanning

πŸ’‘ Use Cases

Social Media Platforms Automatically moderate user posts and comments at scale

Online Communities Protect members from harassment before it spreads

Educational Platforms Maintain safe learning environments for students

Gaming Communities Detect toxic behavior in chat and forums

Customer Support Flag harmful messages for human review


❓ FAQ

Q: How accurate is the detection? A: Our model provides confidence scores with each analysis. Typical confidence ranges from 85-95% for clear cases.

Q: Does it support languages other than English? A: Currently optimized for English and internet slang. Multi-language support coming soon.

Q: Can I customize the detection categories? A: The current version uses pre-trained categories. Custom models available for enterprise users.

Q: Is my data stored or logged? A: We do not store analyzed content. Only usage metrics are tracked for registered users.

Q: What happens if content is flagged? A: The API returns risk assessment data. Your application decides what action to take (flag, review, remove, etc.).

Q: Can I test it without signing up? A: Yes! Use the web interface or API with limited rate limits as an anonymous user.


πŸ› οΈ Support

Report Bugs Visit our Bug Report Page

Feature Requests Open an issue

Questions Check the FAQ above or create an issue on GitHub