When to use
Any question about geopolitics, conflict, security, supply chain, sanctions, infrastructure, location risk, or narrative around named actors. This is the default GDELT Cloud workflow.The three stages
- Summarize with
summarize_eventsorsummarize_storiesto see baseline volume, geographic and category clustering, and aggregate metrics. Cheap and fast. - Search with
search_eventsorsearch_stories(with focused semanticsearch) to get the citable Story or Event records. - Drill with
get_story_articles,get_entity, andEXTRACT_WEB_PAGESfor the underlying article evidence and second-degree network.
REST equivalent
MCP equivalent
The over-filter trap
Combiningsubcategory + country + semantic search returns sparse or empty results even on well-covered topics. If a query is empty:
- Drop
subcategory, keepcategory. - Drop
country— Stories often live globally even when the actor is national. - Switch axis: if you started on Events, try Stories.
- Run
summarize_stories(group_by=category)to see where the volume actually clusters.
Graph traversal
GDELT Cloud is a graph: Entities ↔ Stories ↔ Events. Most non-trivial questions need 2–3 hops:- Topic → who’s involved:
search_stories→ harvestentity_refs→get_entity. - Actor → what they did:
search_entities→get_entity→ walk linked Events/Stories. - Story → primary evidence:
search_stories→get_story_articles→EXTRACT_WEB_PAGES. - Incident → market reaction:
search_events→ take date/actor → pivot tomacro_finance.TIME_SERIES_DAILY_ADJUSTED.
Cite the structured metric
Every Event and Story carries scores:significance, magnitude, systemic_importance, propagation_potential, market_sensitivity, confidence. Quote them in output — “significance 0.55, propagation_potential 0.35, confidence 0.98” — instead of “this seemed important.” The numbers are the analyst’s value-add.
