Content Gap Analyzer & Closer
Every step, every tool call, every reasoning. This is the full playbook — from domain intersection to a published WordPress draft, with a persistent gap tracker that proves ROI over time.
What you need
  • DataForSEO MCP (domain intersection, SERP, keyword intelligence, AI optimization, content analysis)
  • Firecrawl MCP (competitor content scraping)
  • Reddit MCP (community research)
  • Google Search Console MCP (routing: refresh vs. create)
  • WordPress REST API (draft publishing)
  • Write tool (file I/O for logging)
The 13-Step Pipeline
1
Step 1: Domain Intersection
Takes your domain and 2–3 competitor domains and returns keywords where competitors rank top 20 but you're absent or significantly weaker. Set volume floor ≥100, competitor rank ≤20. Output: master gap keyword list.
Tool: dataforseo_labs_google_domain_intersection
2
Step 2: Ranked Keywords Comparison
Pulls complete keyword profiles. Finds keywords where you rank 20–100 while competitors are in the top 10. Classify as "weak rank" or "missing content". Sort by opportunity score: search_volume × (1 / competitor_rank_position).
Tool: dataforseo_labs_google_ranked_keywords
3
Step 3: Keyword Intelligence Enrichment
Enriches gaps with volume, CPC, and intent. Priority formula: search_volume × (1/difficulty) × intent_weight × cpc_signal. Intent weights: transactional=3, commercial=2.5, informational=1.
Tools: dataforseo_labs_google_keyword_overview + dataforseo_labs_search_intent
4
Step 4: SERP Analysis
Pulls live SERP (organic results, 3-level PAA questions, featured snippets). PAA questions become your H2/H3 candidates. If SERP is product-heavy, deprioritize for blog content.
Tool: serp_organic_live_advanced
5
Step 5: Competitor Content Scraping
Scrapes top 3–5 pages. Extracts type, headers, word count, schema, and unique angles. Build a matrix to find what zero competitors have — that is your differentiation.
Tools: firecrawl_scrape + on_page_content_parsing (fallback)
6
Step 6: Reddit & Community Research
Extracts pain points, use-case advice, and quotable insights. Extract 3–5 direct quotes for a "What Real Users Say" section — attribute to user/subreddit, never fabricate.
Tools: reddit__search + reddit__get_post_comments + content_analysis_search
7
Step 7: AI Search Visibility Check
Checks visibility on ChatGPT/Perplexity. Analyze response format (lists, tables, Q&A) to align with AEO and ensure LLMs can extract your snippets.
Tools: ai_optimization_chat_gpt_scraper + ai_optimization_llm_response + ai_opt_llm_ment_search
8
Step 8: GSC Check
Route decision: Pos 20–50 = REFRESH; Pos 100+ = CREATE. Prevents keyword cannibalization by ensuring you aren't fighting your own existing pages.
Tool: gsc__search_analytics
9
Step 9: Related Keywords & Clustering
Expands into long-tail variations and semantically connected terms. Group into clusters; each cluster forms sections of your article or supporting internal links.
Tools: keyword_suggestions + related_keywords + keyword_ideas
10
Step 10: Content Brief Generation
Synthesizes research into a structured brief: outline, source annotations, schema plan, and differentiation. Present for customer approval before writing.
Synthesis step (no tool calls)
11
Step 11: Content Drafting
Write article per brief. Embed Reddit quotes as blockquotes. Create HTML markup for comparison tables and FAQ schema. Bottom lines must be concise for AEO.
Claude (no tool calls)
12
Step 12: WordPress Publishing
Pushes to WordPress as draft. Update SEO metadata fields (Yoast/RankMath) based on detected plugin. Never auto-publish; requires manual review.
Tool: WordPress REST API (curl)
13
Step 13: Logging & Gap Tracker
Writes markdown log and updates persistent CSV tracker. Lifecycle: DISCOVERED → DRAFTED → RANKING. Tracks progress and proves ROI over time.
Tool: Write tool (file I/O)
Cross-Cutting Concerns
Prioritization Logic
A change backed by 3+ data sources = High priority. Opportunity score: search_volume × (1/difficulty) × intent_weight × cpc_signal.
Token & Cost Efficiency
Use JSON schema prompts in Firecrawl to extract only what you need. Minimizes API costs across 13 steps.
Human-in-the-Loop Design
Automated research for steps 1–9. Step 10 provides the approval gate. Step 12 creates drafts only, ensuring alignment with brand voice and factual accuracy.
Need help running this end-to-end, managed?