Data Pipeline Project
EconGE.
Georgian Economic Data Pipeline
An automated pipeline that collects, processes, and publishes economic data from Georgia's official statistical sources — making it accessible through a browsable data portal and structured API.
Data Sources
Where the Data Comes From
Each source is collected on a schedule, through its API or by HTML scraping, then parsed into structured datasets and loaded into PostgreSQL.
National Bank of Georgia
api+html_scrape: https://nbg.gov.ge
National Statistics Office of Georgia
html_scrape: https://www.geostat.ge
World Bank — Georgia
api: https://data.worldbank.org/country/georgia
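The source list above could be kept as a small registry that the collection code reads. The following is a minimal sketch, not the project's actual configuration: the `Source` class, its field names, and the helper functions are hypothetical, and only the source names, method labels, and URLs come from the list.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Source:
    """One upstream data source (class and field names are illustrative)."""
    name: str
    methods: frozenset[str]  # e.g. {"api", "html_scrape"}
    url: str


def parse_methods(label: str) -> frozenset[str]:
    """Split a label such as 'api+html_scrape' into its individual methods."""
    return frozenset(part.strip() for part in label.split("+") if part.strip())


SOURCES = [
    Source("National Bank of Georgia",
           parse_methods("api+html_scrape"), "https://nbg.gov.ge"),
    Source("National Statistics Office of Georgia",
           parse_methods("html_scrape"), "https://www.geostat.ge"),
    Source("World Bank (Georgia)",
           parse_methods("api"), "https://data.worldbank.org/country/georgia"),
]


def sources_using(method: str) -> list[str]:
    """Return the names of all sources collected with the given method."""
    return [s.name for s in SOURCES if method in s.methods]
```

Calling `sources_using("html_scrape")` returns the two sources that need an HTML parser, which is one way a scheduler could decide which collector to run for each source.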
Technology
How It's Built
A modern data stack running on a single VPS — no cloud vendor lock-in.
Orchestration
Data Collection
Storage
Infrastructure
Pipeline Architecture
┌──────────────┐     ┌──────────────┐     ┌──────────────┐
│ GeoStat API  │     │   NBG API    │     │ Excel Files  │
└──────┬───────┘     └──────┬───────┘     └──────┬───────┘
       │                    │                    │
       └────────────┬───────┴────────────────────┘
                    │
            ┌───────▼────────┐
            │    Dagster     │
            │ (Orchestrator) │
            └───────┬────────┘
                    │
            ┌───────▼────────┐
            │ Python Assets  │
            │  (Extract +    │
            │   Transform)   │
            └───────┬────────┘
                    │
            ┌───────▼────────┐
            │ PostgreSQL 16  │
            │ (econge schema)│
            └───────┬────────┘
                    │
       ┌────────────┴────────────┐
       │                         │
┌──────▼───────┐       ┌─────────▼────────┐
│ Data Portal  │       │  Dagster Web UI  │
│  (FastAPI)   │       │  (Pipeline Ops)  │
└──────────────┘       └──────────────────┘
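The diagram is a dependency graph: each box can only run once the boxes feeding into it have finished. Dagster works out that ordering itself. The sketch below uses the standard library's `graphlib` as a stand-in to show the order the diagram implies. The node names are hypothetical labels for the boxes, not identifiers from the project.

```python
from graphlib import TopologicalSorter

# Each key depends on every node in its set. The names mirror the boxes in
# the diagram and are illustrative only.
DEPENDENCIES = {
    "dagster_orchestrator": {"geostat_api", "nbg_api", "excel_files"},
    "python_assets": {"dagster_orchestrator"},
    "postgresql": {"python_assets"},
    "data_portal": {"postgresql"},
    "dagster_web_ui": {"postgresql"},
}


def run_order(dependencies: dict[str, set[str]]) -> list[str]:
    """Return one valid execution order in which dependencies come first."""
    return list(TopologicalSorter(dependencies).static_order())


if __name__ == "__main__":
    for step in run_order(DEPENDENCIES):
        print(step)
```

The three sources have no ordering among themselves, so any valid order may list them differently. What is guaranteed is that all three appear before the orchestrator node and that PostgreSQL appears before both consumers.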
About the Project
Why This Exists
Georgian economic data is scattered across multiple government websites, published in inconsistent formats — Excel spreadsheets, PDFs, and fragmented APIs. Finding historical time series or comparing indicators across sources is painful.
EconGE automates the collection, normalization, and storage of this data into a single PostgreSQL database with a clean, browsable interface on top. It runs on a schedule, so the data stays fresh without manual intervention.
This is both a practical tool and a portfolio piece — demonstrating end-to-end data engineering: ingestion, transformation, storage, orchestration, and data delivery.
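The normalization and scheduled refresh described above can be sketched as two steps: reshape whatever a source returns into uniform rows, then upsert those rows so that re-running the pipeline updates values instead of duplicating them. This is a minimal sketch under stated assumptions. The table name, column names, and row shape are hypothetical, and `sqlite3` stands in for PostgreSQL so the example runs without a server. The `ON CONFLICT ... DO UPDATE` clause has the same form in PostgreSQL.

```python
import sqlite3


def normalize(source: str, indicator: str, raw: dict) -> list[tuple]:
    """Turn {period: value-as-text} into (source, indicator, period, value) rows."""
    rows = []
    for period, text in raw.items():
        cleaned = str(text).replace(",", "").strip()  # drop thousands separators
        if not cleaned:
            continue  # skip blank cells, as found in spreadsheet exports
        rows.append((source, indicator, str(period).strip(), float(cleaned)))
    return rows


def create_table(conn: sqlite3.Connection) -> None:
    """Create the illustrative observations table if it does not exist."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS observations (
            source    TEXT NOT NULL,
            indicator TEXT NOT NULL,
            period    TEXT NOT NULL,
            value     REAL NOT NULL,
            PRIMARY KEY (source, indicator, period)
        )
        """
    )


def upsert(conn: sqlite3.Connection, rows: list[tuple]) -> None:
    """Insert rows, overwriting the value when the key already exists."""
    conn.executemany(
        """
        INSERT INTO observations (source, indicator, period, value)
        VALUES (?, ?, ?, ?)
        ON CONFLICT (source, indicator, period)
        DO UPDATE SET value = excluded.value
        """,
        rows,
    )
    conn.commit()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_table(conn)
    # Made-up placeholder values, not real statistics.
    upsert(conn, normalize("demo", "example_indicator",
                           {"2024-01": "1,234.5", "2024-02": " "}))
    print(conn.execute("SELECT * FROM observations").fetchall())
```

Keying the table on source, indicator, and period is one possible design choice. It makes each scheduled run idempotent, so a revised figure from the source replaces the old one.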
What It Does
Explore the Data
Browse datasets, view time series charts, or check out the pipeline orchestration.