Subreddit-Configurable AI Slop Filter With Mod-Tunable Detection And Transparent Per-Post Labels
Volunteer subreddit moderators are drowning in AI-generated posts (estimated up to 50% of submissions in major subs) and the third-party detectors they have access to are unreliable and not Reddit-API-aware. Builders should ship a moderator-side, configurable detection + transparent labeling stack that runs on Reddit's mod tools, lets each community tune thresholds, and exposes per-post evidence so mods aren't black-boxing decisions to angry users.
Don't try to be the universal slop oracle. Volunteer mods don't need a 99% F1 score, they need a defensible evidence trail when banned users yell at them. Ship the explanation panel before the classifier.
landscape (3 existing solutions)
Detectors exist but they're consumer-grade SaaS pointed at essays and blog posts, not at the firehose flow of a 200k-member subreddit. Reddit itself has not shipped first-party AI labels, and only ~1% of subreddits have written AI policies. The gap is mod-workflow-native tooling with tunable thresholds and explainable evidence.