Garbage in, garbage out. A geopolitical simulation is only as good as its underlying data. Here is how we approach data quality, sourcing, and maintenance.
Source Hierarchy
Not all data sources are created equal. We maintain a strict hierarchy:
- Primary authoritative sources — World Bank, SIPRI, IISS, FAS, national statistics offices
- Secondary compilation sources — CIA World Factbook, Wikipedia (with citations)
- Estimated/derived data — Calculated from related metrics with documented methodology
Every data point in our seed carries a source URL and a confidence tier. Nothing is asserted without attribution.
The Seed Data Approach
Unlike systems that query live databases, GeopoliticsSim uses a curated seed — a structured dataset that represents the current state of the world. This gives us:
- Reproducibility: The same scenario with the same assumptions produces the same projection
- Quality control: Every update is reviewed before inclusion
- Transparency: The entire seed is version-controlled and auditable
Data Dimensions
For each of our 48 core countries, we maintain:
- GDP and economic indicators (World Bank)
- Military spending and force structure (SIPRI, IISS)
- Nuclear arsenal data where applicable (FAS)
- Current leadership and political orientation
- Alliance memberships and treaty obligations
- Resource deposits with coordinates (USGS, EIA)
Resource Deposits
Our 73+ resource deposits (oil, gas, rare earth elements) each include:
- Precise geographic coordinates
- Daily/annual production figures
- Proven reserve estimates
- Operator/ownership information
- Source citations from geological surveys
Ongoing Maintenance
The seed is continuously updated as new authoritative data becomes available. Each update follows a review process that ensures data quality gates are met before inclusion.