Written in in data

The Data Behind the Map: Sources and Methodology

A deep dive into our data methodology — how we source, validate, and maintain the real-world data that powers GeopoliticsSim's simulations.

Garbage in, garbage out. A geopolitical simulation is only as good as its underlying data. Here is how we approach data quality, sourcing, and maintenance.

Source Hierarchy

Not all data sources are created equal. We maintain a strict hierarchy:

  1. Primary authoritative sources — World Bank, SIPRI, IISS, FAS, national statistics offices
  2. Secondary compilation sources — CIA World Factbook, Wikipedia (with citations)
  3. Estimated/derived data — Calculated from related metrics with documented methodology

Every data point in our seed carries a source URL and a confidence tier. Nothing is asserted without attribution.

The Seed Data Approach

Unlike systems that query live databases, GeopoliticsSim uses a curated seed — a structured dataset that represents the current state of the world. This gives us:

  • Reproducibility: The same scenario with the same assumptions produces the same projection
  • Quality control: Every update is reviewed before inclusion
  • Transparency: The entire seed is version-controlled and auditable

Data Dimensions

For each of our 48 core countries, we maintain:

  • GDP and economic indicators (World Bank)
  • Military spending and force structure (SIPRI, IISS)
  • Nuclear arsenal data where applicable (FAS)
  • Current leadership and political orientation
  • Alliance memberships and treaty obligations
  • Resource deposits with coordinates (USGS, EIA)

Resource Deposits

Our 73+ resource deposits (oil, gas, rare earth elements) each include:

  • Precise geographic coordinates
  • Daily/annual production figures
  • Proven reserve estimates
  • Operator/ownership information
  • Source citations from geological surveys

Ongoing Maintenance

The seed is continuously updated as new authoritative data becomes available. Each update follows a review process that ensures data quality gates are met before inclusion.

Simon Lehmann

This article was written by a Simon Lehmann Co-Founder & Developer specialist at Flabbergasted

Share this article:

Related posts

May 24, 2025

How We Model 48 Countries with Real-World Data

A look inside our data pipeline: how we source, validate, and structure real-world data for 48 countries to power accurate geopolitical simulations.

May 28, 2025

The Data Behind the Map: Sources and Methodology

A deep dive into our data methodology — how we source, validate, and maintain the real-world data that powers GeopoliticsSim's simulations.