Skip to main content

Eskom Source Data

Every dataset published on the Eskom Data Portal, flattened from its six sections into one list. Each row links to the portal page, the embedded PowerBI report, and the raw CSV download. Links are re-discovered on every pipeline run, so they stay current even when Eskom moves the files.

PowerBI new data and CSV new data show the last date we received a version of that source we hadn't seen before — i.e. when Eskom actually published new data, not just when we ran. Data up to is the newest data point we hold for that graph, whichever source it came from: a date for time series, Ingested for datasets with no date axis (e.g. outage-interval bins), No data when the source is currently empty (e.g. a broken Eskom link), and when Eskom offers no download.

Download the warehouse

The full scraped dataset is available for download. These files contain historical data that cycles off Eskom's portal and cannot be re-scraped. Pipeline source code: github.com/sixhobbits/v2.unofficialeskom.com.

FileSizeContentsDownload
warehouse/eskom.duckdb~490 MBAll scraped portal data (CSV + PowerBI), staging tables, dashboard viewsDownload
warehouse/media_presentations/index.duckdb~60 MBWeekly media presentations, integrated reports, AFS financials metadataDownload
warehouse/integrated_results/pdfs/~310 MBEskom annual integrated reports and AFS PDFs (2010–2025)Browse
warehouse/media_presentations/pdfs/~110 MBMedia-room presentation PDFsBrowse
sources/eskom.sqlite~40 MBBulk hourly grid metrics 2017–present (built from monthly CSV exports)Download
sources/eskom_metrics.sqlite~40 MBMonthly EAF/PCLF/UCLF/OCLF snapshots from legacy scraperDownload
sources/eskom_metrics_extra.sqlite~3 MBLegacy demand/capacity hourly data, 2022–2026Download

DuckDB files can be opened with DuckDB (duckdb eskom.duckdb) or any DuckDB-compatible client (Python, R, etc.).

Loading…