Skip to content

Trust & Safety

SCOPE_ITEM: Content moderation, fraud detection, review authenticity, seller verification enforcement, and reporting systems that maintain marketplace integrity and user trust.

Decision Tree

IF: Marketplace allows user-generated content (listings, messages, reviews). THEN: Include content moderation (at minimum: manual review queue + reporting).

IF: Marketplace handles payments >EUR 500 per transaction. THEN: Include fraud detection signals and enhanced seller verification.

IF: Marketplace has >100 active sellers. THEN: Include automated moderation rules to reduce manual review load.

IF: Marketplace operates in EU. THEN: Include P2B Regulation compliance (transparent policies, complaint handling).


Content Moderation

Listing Moderation

SCOPE_ITEM: Ensure listings meet platform quality and policy standards.

Pre-Publish Checks (Automated)

INCLUDES: - Required field validation (title, description, price, category, images). - Prohibited words/phrases filter (configurable blocklist). - Price sanity check (flag if price is 0 or unreasonably high/low for category). - Duplicate detection (flag listings with >90% title similarity from same seller). - Image count minimum (at least 1 image required).

OPTIONAL: - AI image moderation (NSFW detection via cloud API or on-premises model). - AI text moderation (policy violation detection, contact detail extraction). - Category-specific validation rules (e.g., required attributes for electronics). - Plagiarism detection (description copied from other listings).

Manual Review Queue

INCLUDES: - Admin queue for listings flagged by automated checks. - Admin queue for listings reported by users. - Review actions: approve, reject (with reason), request changes. - Rejection notification to seller with specific policy violation cited. - Listing status: pending_review, approved, rejected, changes_requested.

OPTIONAL: - Priority queue (high-value listings, reported listings first). - Reviewer assignment and workload balancing. - Review SLA tracking (review within 24h target).

Message Moderation

SCOPE_ITEM: Prevent abuse and off-platform transaction attempts.

INCLUDES: - Contact detail detection (regex for phone numbers, email addresses, URLs). - Detected contact details masked or flagged (configurable: mask vs warn). - Spam rate limiting (max 20 new conversations per hour per user). - Report message button (with reason: spam, harassment, scam, other).

OPTIONAL: - AI-powered scam detection (known scam patterns, urgency language). - Automated warnings when suspicious content detected. - Admin message review queue for reported conversations.

Review Moderation

SCOPE_ITEM: Ensure review quality and authenticity.

INCLUDES: - Only verified purchasers can leave reviews (order must be completed). - One review per order (prevent review bombing). - Prohibited content filter (profanity, personal attacks). - Report review button (with reason: fake, inappropriate, not relevant). - Admin review queue for reported reviews.

OPTIONAL: - Review edit window (48h after submission, then locked). - Seller response moderation (same content policy applies). - Review removal by admin with documented reason.


Fraud Detection

Account-Level Signals

SCOPE_ITEM: Detect fraudulent accounts at registration and during use.

INCLUDES: - Email domain validation (reject disposable email domains). - Rate limiting on account creation (3 per IP per hour). - Duplicate account detection (same email, phone, or payment details). - Account age factor in trust calculations (new accounts = higher risk).

OPTIONAL: - Device fingerprinting (detect multiple accounts from same device). - IP geolocation anomaly (registration country vs ID country mismatch). - Velocity checks (sudden spike in listings or transactions). - Cross-reference with fraud databases (e.g., SEON, Sift Science).

Transaction Fraud Signals

SCOPE_ITEM: Detect suspicious transaction patterns.

INCLUDES: - Payment failure rate monitoring per buyer (>3 failures = flag). - Unusual transaction patterns (sudden high-value purchases from new account). - Refund rate monitoring per buyer (>30% refund rate = flag). - Dispute rate monitoring per seller (>5% dispute rate = flag).

OPTIONAL: - Real-time risk scoring per transaction (weighted signal combination). - Automatic hold on high-risk transactions pending manual review. - Chargeback pattern detection. - Coordinated fraud detection (multiple accounts, same payment method or shipping address, purchasing from same seller).

Seller-Specific Fraud Signals

INCLUDES: - Listing pricing anomaly (significantly below market for category). - Shipping delay patterns (consistently late or no tracking provided). - Review pattern anomaly (sudden burst of 5-star reviews). - Inventory mismatch (listing shows "in stock" but orders consistently cancelled).

OPTIONAL: - Fake inventory detection (listings never fulfilled). - Counterfeit indicator flags (brand name + unrealistic price). - Shadow profile monitoring (suspended seller creating new account).


Review Authenticity

SCOPE_ITEM: Prevent fake, incentivised, or manipulated reviews.

Prevention Measures

INCLUDES: - Verified purchase requirement (review tied to completed order). - One review per order per reviewer. - Minimum order completion delay before review allowed (e.g., 24h after delivery). - Review text minimum length (50 characters) to prevent low-effort fake reviews.

OPTIONAL: - Incentivised review detection (flag reviews from users who received discounts or free products — requires seller disclosure). - Review timing analysis (burst of reviews in short period = suspicious). - Review text similarity analysis (copy-paste reviews across sellers). - Reviewer reputation score (reviews from established buyers weighted higher). - Self-review prevention (buyer and seller from same IP/device = flag).

Detection Signals

Signal Indicator Action
Review burst >5 reviews for same seller in 24h Flag for manual review
New account reviewer Account <7 days old, first review Lower display priority
Copy-paste text >80% similarity to another review Flag for manual review
Rating mismatch 5-star text with negative sentiment Flag for manual review
Cross-review Buyer and seller review each other Flag for manual review

Seller Verification Enforcement

SCOPE_ITEM: Ongoing verification beyond initial KYC onboarding.

Verification Levels

Level Requirements Badge Benefits
Basic Email verified, terms accepted None Can list, limited visibility
Identity Verified PSP KYC completed (ID + selfie) Checkmark Full listing visibility
Business Verified KYC + CoC/KVK confirmed Business badge Priority in search
Enhanced All above + proof of address + bank verified Gold badge Lowest commission tier

Ongoing Monitoring

INCLUDES: - Periodic re-verification trigger (PSP may request updated documents). - Automatic suspension if PSP verification lapses. - Business registration status check (annual, via KVK API if available).

OPTIONAL: - Professional certification verification (for service marketplaces). - Insurance verification (for high-liability services). - Background check integration (via third-party provider).


Reporting System

SCOPE_ITEM: Enable users to report policy violations.

User Reporting Flow

1. User clicks "Report" on listing / message / review / profile
2. Select reason:
   ├── Spam or misleading
   ├── Counterfeit or illegal
   ├── Offensive content
   ├── Scam or fraud
   ├── Harassment
   └── Other (free text)
3. Optional: additional details (text field)
4. Submit → confirmation message
5. Report enters admin queue
6. Admin reviews → action (warn, remove content, suspend account)
7. Reporter notified of outcome (optional)

Report Data Model

CREATE TABLE reports (
  id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
  reporter_id UUID REFERENCES users(id),
  target_type TEXT NOT NULL,       -- listing, message, review, user
  target_id UUID NOT NULL,
  reason TEXT NOT NULL,
  details TEXT,
  status TEXT DEFAULT 'pending',   -- pending, reviewing, resolved, dismissed
  resolution TEXT,                 -- warned, content_removed, account_suspended, dismissed
  resolved_by UUID REFERENCES users(id),
  resolved_at TIMESTAMPTZ,
  created_at TIMESTAMPTZ DEFAULT now()
);

Admin Moderation Queue

INCLUDES: - Unified queue for all report types. - Priority sorting (fraud/scam reports first). - Report clustering (multiple reports on same target = higher priority). - Quick actions: warn user, remove content, suspend account. - Resolution templates (standard responses for common violations). - Resolution audit trail (who decided what, when).

OPTIONAL: - SLA tracking (resolve reports within 48h target). - Moderation team assignment and workload balancing. - Escalation to senior moderator for edge cases.


Policy Framework

SCOPE_ITEM: Clear policies that define acceptable platform behaviour.

INCLUDES: - Acceptable use policy (what can and cannot be listed). - Community guidelines (expected behaviour for buyers and sellers). - Review policy (what constitutes a fair and authentic review). - Intellectual property policy (counterfeit reporting process). - Consequences matrix (warning → temporary suspension → permanent ban).

COMPLIANCE: EU Platform-to-Business Regulation (P2B). - Transparent terms and conditions for sellers. - Clear explanation of ranking factors. - Clear explanation of any differential treatment. - Complaint handling mechanism for sellers (internal, accessible). - 30-day notice before T&C changes.

COMPLIANCE: Digital Services Act (DSA). - Illegal content reporting mechanism for all users. - Notice-and-action procedure for reported content. - Transparency reporting (annual, if >10M EU users). - Trusted flaggers (if applicable).


Scoping Questions

CHECK: What content types need moderation (listings, messages, reviews)? CHECK: Is AI-assisted content moderation desired or manual-only? CHECK: What are the prohibited listing categories? CHECK: Is seller verification beyond PSP KYC needed? CHECK: What is the dispute resolution process? CHECK: What are the consequences for policy violations (warning, suspension, ban)? CHECK: Is fraud detection beyond basic signals needed? CHECK: Does the platform need to comply with DSA (>10M EU users)? CHECK: What is the expected volume of reports/flags per day?