GENZ ContentGuard AI | Andrew Y. Zhao

🛡️ GENZ ContentGuard AI

AI-powered content moderation that understands Gen Z

Detect harmful content, hate speech, harassment, and threats in real-time. Built for modern teenage communities with support for slang, internet language, and evolving online behaviors.

🔗 Project Links

View Source Code Live Demo

🚀 Try It Now

Live Demo: https://plankton-app-xj6ib.ondigitalocean.app

Quick Start

Option 1: Web Interface

Visit https://plankton-app-xj6ib.ondigitalocean.app
Enter text to analyze (title + content)
Click “Analyze Content”
View risk assessment and detailed breakdown

Option 2: API Integration

Python

import requests

response = requests.post(
    "https://plankton-app-xj6ib.ondigitalocean.app/analyze",
    json={
        "title": "Post Title",
        "content": "Text to analyze"
    }
)

result = response.json()
print(f"Risk Level: {result['risk_level']}")
print(f"Confidence: {result['confidence']}")

JavaScript

const response = await fetch('https://plankton-app-xj6ib.ondigitalocean.app/analyze', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    title: 'Post Title',
    content: 'Text to analyze'
  })
});

const result = await response.json();
console.log(`Risk Level: ${result.risk_level}`);

cURL

curl -X POST https://plankton-app-xj6ib.ondigitalocean.app/analyze \
  -H "Content-Type: application/json" \
  -d '{
    "title": "Post Title",
    "content": "Text to analyze"
  }'

🔐 Authentication

ContentGuard supports OAuth login via Google and GitHub:

Visit the app → Click “Sign In”
Choose provider → Google or GitHub
Authorize → Grant permissions
Start analyzing → Access enhanced features

Feature	Anonymous	Registered
Content Analysis	✅	✅
Rate Limit	Limited	Higher
Usage Tracking	❌	✅
API Access	❌	✅

📡 API Reference

Endpoint

POST https://plankton-app-xj6ib.ondigitalocean.app/analyze

Request Format

{
  "title": "string (required)",
  "content": "string (required)"
}

Response Format

{
  "analysis": "SAFE | MALICIOUS",
  "confidence": "92.5%",
  "is_malicious": false,
  "risk_level": "LOW | MEDIUM | HIGH",
  "explanation": "Detailed analysis explanation",
  "keyword_analysis": {
    "malicious_keywords": ["keyword1", "keyword2"],
    "safe_keywords": ["keyword3", "keyword4"]
  }
}

Example Responses

Safe Content

{
  "analysis": "SAFE",
  "confidence": "94.2%",
  "is_malicious": false,
  "risk_level": "LOW",
  "explanation": "Content appears safe with positive language patterns.",
  "keyword_analysis": {
    "malicious_keywords": [],
    "safe_keywords": ["support", "help", "community"]
  }
}

Harmful Content

{
  "analysis": "MALICIOUS",
  "confidence": "87.8%",
  "is_malicious": true,
  "risk_level": "HIGH",
  "explanation": "Content contains harassment and threatening language.",
  "keyword_analysis": {
    "malicious_keywords": ["threat", "harass"],
    "safe_keywords": []
  }
}

Rate Limits

Anonymous Users: Limited requests per day
Registered Users: Higher limits with usage tracking

✨ Features

🔍 Analysis Capabilities

Real-time processing - Results in milliseconds
Multi-language support - English + internet slang
Context awareness - Understands intent vs casual usage
Confidence scoring - Probability-based risk levels
Keyword extraction - Identifies specific harmful terms
Emoji processing - Handles modern communication

📊 Reporting

Clear risk level classification (HIGH/MEDIUM/LOW)
Specific keyword breakdowns
Actionable recommendations
Visual indicators for quick scanning

💡 Use Cases

Social Media Platforms Automatically moderate user posts and comments at scale

Online Communities Protect members from harassment before it spreads

Educational Platforms Maintain safe learning environments for students

Gaming Communities Detect toxic behavior in chat and forums

Customer Support Flag harmful messages for human review

❓ FAQ

Q: How accurate is the detection? A: Our model provides confidence scores with each analysis. Typical confidence ranges from 85-95% for clear cases.

Q: Does it support languages other than English? A: Currently optimized for English and internet slang. Multi-language support coming soon.

Q: Can I customize the detection categories? A: The current version uses pre-trained categories. Custom models available for enterprise users.

Q: Is my data stored or logged? A: We do not store analyzed content. Only usage metrics are tracked for registered users.

Q: What happens if content is flagged? A: The API returns risk assessment data. Your application decides what action to take (flag, review, remove, etc.).

Q: Can I test it without signing up? A: Yes! Use the web interface or API with limited rate limits as an anonymous user.

🛠️ Support

Report Bugs Visit our Bug Report Page

Feature Requests Open an issue

Questions Check the FAQ above or create an issue on GitHub