GENZ ContentGuard AI
AI-powered malicious content detection system optimized for Gen Z language patterns and internet slang
π‘οΈ GENZ ContentGuard AI
AI-powered content moderation that understands Gen Z
Detect harmful content, hate speech, harassment, and threats in real-time. Built for modern teenage communities with support for slang, internet language, and evolving online behaviors.
π Project Links
π Try It Now
Live Demo: https://plankton-app-xj6ib.ondigitalocean.app
Quick Start
Option 1: Web Interface
- Visit https://plankton-app-xj6ib.ondigitalocean.app
- Enter text to analyze (title + content)
- Click βAnalyze Contentβ
- View risk assessment and detailed breakdown
Option 2: API Integration
Python
import requests
response = requests.post(
"https://plankton-app-xj6ib.ondigitalocean.app/analyze",
json={
"title": "Post Title",
"content": "Text to analyze"
}
)
result = response.json()
print(f"Risk Level: {result['risk_level']}")
print(f"Confidence: {result['confidence']}")
JavaScript
const response = await fetch('https://plankton-app-xj6ib.ondigitalocean.app/analyze', {
method: 'POST',
headers: { 'Content-Type': 'application/json' },
body: JSON.stringify({
title: 'Post Title',
content: 'Text to analyze'
})
});
const result = await response.json();
console.log(`Risk Level: ${result.risk_level}`);
cURL
curl -X POST https://plankton-app-xj6ib.ondigitalocean.app/analyze \
-H "Content-Type: application/json" \
-d '{
"title": "Post Title",
"content": "Text to analyze"
}'
π Authentication
Sign In with OAuth
ContentGuard supports OAuth login via Google and GitHub:
- Visit the app β Click βSign Inβ
- Choose provider β Google or GitHub
- Authorize β Grant permissions
- Start analyzing β Access enhanced features
Why Sign In?
| Feature | Anonymous | Registered |
|---|---|---|
| Content Analysis | β | β |
| Rate Limit | Limited | Higher |
| Usage Tracking | β | β |
| API Access | β | β |
π‘ API Reference
Endpoint
POST https://plankton-app-xj6ib.ondigitalocean.app/analyze
Request Format
{
"title": "string (required)",
"content": "string (required)"
}
Response Format
{
"analysis": "SAFE | MALICIOUS",
"confidence": "92.5%",
"is_malicious": false,
"risk_level": "LOW | MEDIUM | HIGH",
"explanation": "Detailed analysis explanation",
"keyword_analysis": {
"malicious_keywords": ["keyword1", "keyword2"],
"safe_keywords": ["keyword3", "keyword4"]
}
}
Example Responses
Safe Content
{
"analysis": "SAFE",
"confidence": "94.2%",
"is_malicious": false,
"risk_level": "LOW",
"explanation": "Content appears safe with positive language patterns.",
"keyword_analysis": {
"malicious_keywords": [],
"safe_keywords": ["support", "help", "community"]
}
}
Harmful Content
{
"analysis": "MALICIOUS",
"confidence": "87.8%",
"is_malicious": true,
"risk_level": "HIGH",
"explanation": "Content contains harassment and threatening language.",
"keyword_analysis": {
"malicious_keywords": ["threat", "harass"],
"safe_keywords": []
}
}
Rate Limits
- Anonymous Users: Limited requests per day
- Registered Users: Higher limits with usage tracking
β¨ Features
π― Detection Categories
- Suicide & self-harm language
- Hate speech & slurs
- Harassment & bullying
- Threats & violence
- Body shaming
- Scams & manipulation
- Sexual content
- Spam patterns
- And moreβ¦
π Analysis Capabilities
- Real-time processing - Results in milliseconds
- Multi-language support - English + internet slang
- Context awareness - Understands intent vs casual usage
- Confidence scoring - Probability-based risk levels
- Keyword extraction - Identifies specific harmful terms
- Emoji processing - Handles modern communication
π Reporting
- Clear risk level classification (HIGH/MEDIUM/LOW)
- Specific keyword breakdowns
- Actionable recommendations
- Visual indicators for quick scanning
π‘ Use Cases
Social Media Platforms Automatically moderate user posts and comments at scale
Online Communities Protect members from harassment before it spreads
Educational Platforms Maintain safe learning environments for students
Gaming Communities Detect toxic behavior in chat and forums
Customer Support Flag harmful messages for human review
β FAQ
Q: How accurate is the detection? A: Our model provides confidence scores with each analysis. Typical confidence ranges from 85-95% for clear cases.
Q: Does it support languages other than English? A: Currently optimized for English and internet slang. Multi-language support coming soon.
Q: Can I customize the detection categories? A: The current version uses pre-trained categories. Custom models available for enterprise users.
Q: Is my data stored or logged? A: We do not store analyzed content. Only usage metrics are tracked for registered users.
Q: What happens if content is flagged? A: The API returns risk assessment data. Your application decides what action to take (flag, review, remove, etc.).
Q: Can I test it without signing up? A: Yes! Use the web interface or API with limited rate limits as an anonymous user.
π οΈ Support
Report Bugs Visit our Bug Report Page
Feature Requests Open an issue
Questions Check the FAQ above or create an issue on GitHub