coefficient-portfolio

Competitor Connector Outage Intelligence

An agent-based intelligence system that monitors competitor product connectors for outages, issues, and disruptions across multiple sources. Uses AI-powered severity scoring and multi-source validation to identify actionable competitive intelligence.

Architecture

┌────────────────────────────────────────────┐
│ Search Layer (Multi-Source Collection)       │
│ ├── Exa MCP (semantic web/news/forum search)   │
│ ├── Reddit MCP (real-time user complaints)     │
│ ├── Browserbase MCP (marketplace review scraping)│
│ └── Firecrawl MCP (backup web search)          │
└────────────────────────────────────────────┘
                      ↓
┌────────────────────────────────────────────┐
│ Intelligence Layer (AI Analysis)             │
│ ├── Issue Summarizer (1-2 sentence summaries)  │
│ ├── Severity Scorer (HIGH/MEDIUM/LOW + reasoning)│
│ └── Content Filter (exclude docs/tutorials)    │
└────────────────────────────────────────────┘
                      ↓
┌────────────────────────────────────────────┐
│ Data Layer (Storage + Dedup)                  │
│ ├── SQLite (seen issues tracking)              │
│ ├── Date/Content filtering                     │
│ └── Quality validation                         │
└────────────────────────────────────────────┘
                      ↓
┌────────────────────────────────────────────┐
│ Export Layer                                  │
│ └── Google Sheets MCP (structured output)      │
└────────────────────────────────────────────┘

Pipeline Stages

Stage 2: Content Filtering

Stage 3: AI-Powered Analysis

Stage 4: Deduplication

Stage 5: Export

Connector Monitoring

The system monitors 7 competitor connectors across 5 data platforms:

Each connector has configured:

Output Schema (Google Sheets)

Column Content
Date Found When the issue was detected
Connector Name Which competitor connector
Data Source Platform affected
Issue Description AI-generated 1-2 sentence summary
Source Type Web / Support / Reddit / Marketplace
URL Link to original source
Issue Date When the issue was reported
Severity HIGH / MEDIUM / LOW
Status ONGOING / LIKELY_RESOLVED / UNKNOWN
Notes Additional context, business impact

Python Module Structure

src/
├── agents/
│   ├── severity_scorer.py    # Claude-powered severity analysis
│   └── issue_summarizer.py   # Claude-powered summarization
├── processors/
│   ├── deduplicator.py       # SQLite-based duplicate tracking
│   └── formatter.py          # Sheets output formatting
├── searchers/
│   ├── exa_searcher.py       # Exa MCP integration
│   └── reddit_searcher.py    # Reddit MCP integration
└── writers/
    └── sheets_writer.py      # Google Sheets MCP output

~1,700 lines of Python across 8 modules.

Results

Test Run (Full 7-Connector Scan):

Projected monthly cost: ~$6.60/month (daily) or ~$0.88/month (weekly)

Stack

Key Decisions

  1. Multi-source validation over single-source — Requires corroboration from 2+ sources for HIGH severity. Eliminates false positives from isolated complaints.
  2. AI severity scoring over keyword matching — Claude analyzes context to distinguish real outages from routine maintenance. Keyword-only approach had too many false positives.
  3. SQLite dedup over in-memory — Persists across runs, so weekly scans don’t re-report known issues.
  4. $0.22/scan cost target — Designed for daily or weekly automated runs without budget concerns.
  5. YAML-configured connectors — Adding new competitors requires only a config change, not code changes.

Visual Direction

Diagram: Four horizontal layers (Search → Intelligence → Data → Export) with the 4 search sources feeding into the intelligence layer. Show severity classification (HIGH/MEDIUM/LOW) as color-coded badges. Right side panel shows a sample Google Sheets output with connector names and severity indicators. Bottom shows cost breakdown pie chart ($0.22 total).